Date: Mon, 26 Oct 1998 23:44:31 -0600
Reply-To: hezekiah@CS.UTEXAS.EDU
Sender: Lojban list
From: John_Arley Burns
Subject: additions to web site
To: Multiple recipients of list LOJBAN
Message-ID: <909467374.1022267.0@listserv.cuny.edu>

.ui coi rodo

I've recently updated the translation page and added a 'combined frequency' list. I took the file 'frequencies' from the ftp site and collated the gismu, cmavo, lujvo, etc. categories together. Then I sorted the whole list and put the data on my web page.

In addition, I made a wordlist of the 302 most common words (those with a textual count >=90) in a variety of formats. This is mainly to assist those learning Lojban (including me :-) so that we can focus on memorizing the most common words. It would also be helpful for textbook developers, who could focus on teaching the most common words first.

It is also interesting to look at the complete frequency list, some 11000+ entries. First the cmavo dominate, giving way to gismu, then to simple lujvo, then to complex ones, etc. It is exactly the kind of Zipf curve a natural language would have, which is a testimony to the usability and linguistic efficiency of Lojban.
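
For anyone who wants to repeat the collation, it could be sketched roughly as follows in Python. This is only an illustration, not the script actually used: the layout of the 'frequencies' file is not described here, so the sketch assumes the per-category counts have already been parsed into (word, count) pairs. The function names and the sample counts are made up.

```python
from collections import Counter


def combine_frequencies(categories):
    """Merge per-category (word, count) lists into one list, highest count first.

    `categories` is a hypothetical structure: a dict mapping a category
    name (gismu, cmavo, lujvo, ...) to a list of (word, count) pairs.
    """
    combined = Counter()
    for entries in categories.values():
        for word, count in entries:
            combined[word] += count
    # Sort by descending count; break ties alphabetically.
    return sorted(combined.items(), key=lambda item: (-item[1], item[0]))


def most_common(combined, threshold=90):
    """Keep only the words whose count is at least `threshold`."""
    return [(word, count) for word, count in combined if count >= threshold]


# Made-up sample counts, for illustration only.
sample = {
    "cmavo": [("le", 500), ("nu", 300)],
    "gismu": [("klama", 120), ("prami", 40)],
    "lujvo": [("brivla", 12)],
}

combined = combine_frequencies(sample)
wordlist = most_common(combined, threshold=90)
for word, count in wordlist:
    print(word, count)
```

The threshold defaults to 90 to match the cutoff used for the wordlist; with the real data that cutoff is what yields the 302 words.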
It is also an aid in examining the potential creation of gismu from frequent lujvo, the creation of new cmavo for very frequent cmavo combinations (such as 'lenu'), etc. Of course the list itself will become more accurate with time as more texts are added and analyzed, especially when diverse textual material such as scientific and journalistic texts is added in quantity.

co'omi'e djan.
http://www.cs.utexas.edu/users/hezekiah/lojban