From: "Bob LeChevalier (lojbab)"
Date: Sat, 14 Jul 2001 00:24:33 -0400
To: lojban list
Subject: Re: [lojban] columns 158-164

At 11:19 PM 07/13/2001 -0500, Michal Wallace wrote:
>I'm looking at the gismu list, and notice two columns of codes right
>after the english definitions and before the cross references. What
>do these mean?

Excellent question! (maybe add it to the FAQ in the brochure book, Nick?)

The number and letter code refer to the original outline for the Lojban textbook (that outline is attached to this message in RTF format, and will be added to the historical file archive on the website when next I update it). The current draft has the lessons broken differently so that the codes are less meaningful, though they still roughly track with the order of presentation in the textbook, up to what used to be lesson 6.

>The first one almost looks like some sort of grouping: blanu, xunre,
>narju all share the code 1a..

The lesson groupings covered the color words all at once in the first part of the first lesson, since one can make lots of simple sentences using those words.

>The second one.. I thought I heard something about word frequency?

Yes. It is a word frequency count over all Lojban text, Lojban List text, and teaching materials through sometime in 1992. It was meant to give another plausible order for studying the words besides the textbook order, one which presumably would support following Lojban List discussions. Lojban grammar terminology was very highly represented because that was much of what people talked about on Lojban List. The web site has current frequency lists, and the grammar terms are still the most used, but I haven't changed the numbers in the gismu list.

>I just wrote a little program to sort the list by that number..
>The top comes out like:
>
>('cusku', 'express ', '1h ', '872')
>('tanru', 'phrase compoun', '1b ', '776')
>('prenu', 'person ', '1k ', '632')
>('gismu', 'root word ', '1b ', '554')
>('djica', 'desire ', '3l ', '500')
>('lujvo', 'affix compound', '1b ', '428')
>('diklo', 'local ', '5d ', '426')
>('klama', 'come ', '1g1', '399')
>('bacru', 'utter ', '1h ', '386')
>('djuno', 'know ', '1h ', '375')
>('sumti', 'argument ', '1b2', '373')
>('drata', 'other ', '2g ', '351')
>('kumfa', 'room ', '2k ', '346')
>('tavla', 'talk ', '1h ', '338')
>('nanmu', 'man ', '1k ', '332')
>('cmalu', 'small ', '1e ', '326')
>('citka', 'eat ', '5c ', '320')
>('barda', 'big ', '1e ', '318')
>
>I find it hard to believe tanru is a more common word than citka or
>barda,

How often do people talk about eating or the size of things online? But they do talk about Lojban grammar.

> but these do seem to be "simple" lojban words..
>But then again,
>the other end came out like:
>
>('gluta', 'glove ', 'ao ', ' 0')
>('pambe', 'pump ', 'a ', ' 0')
>('kanji', 'calculate ', '7e ', ' 0')
>('barja', 'bar ', 'ap ', ' 0')
>('sigja', 'cigar ', 'a ', ' 0')
>('xatsi', '1E-18 ', 'ae ', ' 0')
>('petso', '1E15 ', 'ae ', ' 0')
>('fanri', 'factory ', '8c ', ' 0')
>('barna', 'mark ', 'a ', ' 0')
>('tsina', 'stage ', '5g ', ' 0')
>
>Which definitely seem less common (or more culture-specific).

And indeed the words with no lesson number not only were in no teaching material, but a couple dozen of them were made after the frequency count was done.

lojbab

[Attachment: NEWOUT2.rtf (application/rtf); content not displayed.]

--
lojbab    lojbab@lojban.org
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA    703-385-0273
Artificial language Loglan/Lojban: http://www.lojban.org