Received: from spooler by stryx.demon.co.uk (Mercury/32 v2.01); 5 Oct 98 00:13:48 +0000 Return-path: Received: from punt-11.mail.demon.net (194.217.242.34) by stryx.demon.co.uk (Mercury/32 v2.01); 5 Oct 98 00:13:46 +0000 Received: from punt-1.mail.demon.net by mailstore for ia@stryx.demon.co.uk id 907430784:10:26879:0; Sat, 03 Oct 98 16:06:24 GMT Received: from listserv.cuny.edu ([128.228.100.10]) by punt-1.mail.demon.net id aa1026835; 3 Oct 98 16:06 GMT Received: from listserv (listserv.cuny.edu) by listserv.cuny.edu (LSMTP for Windows NT v1.1b) with SMTP id <3.FFA60E57@listserv.cuny.edu>; Sat, 3 Oct 1998 12:07:49 -0400 Date: Sat, 3 Oct 1998 12:03:33 -0400 Reply-To: Logical Language Group Sender: Lojban list From: Logical Language Group Subject: word frequency file on FTP site X-To: lojban@cuvmb.cc.columbia.edu To: Multiple recipients of list LOJBAN Message-ID: <907430781.1026835.0@listserv.cuny.edu> X-PMFLAGS: 33554560 7 1 Y03AA1.CNM Content-Length: 2428 Lines: 43 I have finished creating the file with word frequencies, and placed it on the Digex FTP site (where it should also be accessible from the Xiron Web page soon). The filename is "wordfreq.zip" and it is in subdirectory wordlists. The URL is ftp://ftp.access.digex.net/pub/access/lojbab/wordlists/wordfreq.zip (I think that is the correct format). I did some extra mail processing, so that my local text archive now is complete until the end of 1994 (previously I had stopped in Sept 1994), so the archive and now the frequency count now includes some of Jorge's conversations with Goran, which seem to have started in November 1994. There's still a lot of mail processing needed before I can add the later years to my archive and the frequency counts, but this data is better than any previous data we have had. I also have some possibility of tracking down a particular word to find its contextual usage, which could be helpful for dictionaryt work in the longer term. Meanwhile, Nora has gotten her Lojban-to-English glosser program basically running. There are still bugs, amd we still need the current parser, but the program is now outputting usable if not always perfect word-for-word translations of Lojban text, with some minimal grammar recognition. When she gets something she is willing to have people work with, I will upload it. I am also close to having a new version of Nora's random sentence generator ready to upload (the program needed only trivial changes but the data files need to be updtated to the baseline grammar, from their previous state which was back around 1991 Lojban grammar). lojbab ---- lojbab lojbab@access.digex.net Bob LeChevalier, President, The Logical Language Group, Inc. 2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273 Artificial language Loglan/Lojban: ftp.access.digex.net /pub/access/lojbab or see Lojban WWW Server: href="http://xiron.pc.helsinki.fi/lojban/" Order _The Complete Lojban Language_ - see our Web pages or ask me.