Received: from spooler by stryx.demon.co.uk (Mercury/32 v2.01); 28 Sep 98 00:38:03 +0000 Return-path: Received: from punt-11.mail.demon.net (194.217.242.34) by stryx.demon.co.uk (Mercury/32 v2.01); 28 Sep 98 00:37:52 +0000 Received: from punt-1.mail.demon.net by mailstore for ia@stryx.demon.co.uk id 906540951:10:05568:0; Wed, 23 Sep 98 08:55:51 GMT Received: from listserv.cuny.edu ([128.228.100.10]) by punt-1.mail.demon.net id aa1117833; 23 Sep 98 8:55 GMT Received: from listserv (listserv.cuny.edu) by listserv.cuny.edu (LSMTP for Windows NT v1.1b) with SMTP id <0.FFFAD7F0@listserv.cuny.edu>; Wed, 23 Sep 1998 4:57:14 -0400 Date: Wed, 23 Sep 1998 11:50:46 -0700 Reply-To: Robin Turner Sender: Lojban list From: Robin Turner Subject: Re: Word frequency X-To: Lojban List To: Multiple recipients of list LOJBAN Message-ID: <906540948.1117833.0@listserv.cuny.edu> X-PMFLAGS: 33554560 7 1 Y01446.CNM Content-Length: 1731 Lines: 34 la pablov. cu cusku di'e > coi > > Due to my condition of lojban beginner, I would find extremely helpful > to be able to consult a "word frequency" list, since one of the most > important elements in language learning happens to be word memorization. > This process would of course be much more efficient if one could start > memorizing the most used words first. > I know other lojbanists have made similar requests. I don't honestly > know the tools needed for such a task, but I think it shouldn't be neither > difficult, nor excessively time consuming. Although I have an extremely > limited command of the language, I'd gladly help with this. It's actually very easy - you just take every text you have and feed it into a concordancing program. I could do this if people want, but I'm not sure how valid the exercise would be as a way of generating a vocabulary list. The problem is that frequency will be heavily skewed by the things which are translated into Lojban, and the kind of topics people discuss in Lojban on the list (usually Lojban itself). If your aim is to acquire enough vocabulary to read all this, then obviously the word-frequency list would be appropriate, but if you are interested in general communicative competence, then it would probably be more appropriate to use a frequncy count for your native language, then translate this into Lojban. co'o mi'e robin.