Received: from spooler by stryx.demon.co.uk (Mercury/32 v2.01); 5 Oct 98 00:14:11 +0000 Return-path: Received: from punt-21.mail.demon.net (194.217.242.6) by stryx.demon.co.uk (Mercury/32 v2.01); 5 Oct 98 00:13:59 +0000 Received: from punt-2.mail.demon.net by mailstore for ia@stryx.demon.co.uk id 907452538:20:01835:10; Sat, 03 Oct 98 22:08:58 GMT Received: from listserv.cuny.edu ([128.228.100.10]) by punt-2.mail.demon.net id aa2102196; 3 Oct 98 22:08 GMT Received: from listserv (listserv.cuny.edu) by listserv.cuny.edu (LSMTP for Windows NT v1.1b) with SMTP id <3.FFA611B5@listserv.cuny.edu>; Sat, 3 Oct 1998 18:10:17 -0400 Date: Sat, 3 Oct 1998 16:06:07 -0600 Reply-To: Robert Rapplean Sender: Lojban list From: Robert Rapplean Subject: Re: word frequency file on FTP site X-To: Lojban Mailing List To: Multiple recipients of list LOJBAN Message-ID: <907452523.212196.0@listserv.cuny.edu> X-PMFLAGS: 33554560 7 1 Y0758E.CNM Content-Length: 2760 Lines: 51 Thank you, thank you, thank you. I'll post the top 400 on the mailing list in about a week. It may or may not be the 400 most commonly used ones because I want to adjust them a little to complete logical sets (like counting 1 to 10 and such). Rob Rapplean > I have finished creating the file with word frequencies, and placed it > on the Digex FTP site (where it should also be accessible from the > Xiron Web page soon). The filename is "wordfreq.zip" > and it is in subdirectory wordlists. The URL is > ftp://ftp.access.digex.net/pub/access/lojbab/wordlists/wordfreq.zip > (I think that is the correct format). > > I did some extra mail processing, so that my local text archive now is > complete until the end of 1994 (previously I had stopped in Sept 1994), > so the archive and now the frequency count now includes some of Jorge's > conversations with Goran, which seem to have started in November 1994. > There's still a lot of mail processing needed before I can add the later > years to my archive and the frequency counts, but this data is better > than any previous data we have had. I also have some possibility of > tracking down a particular word to find its contextual usage, which could > be helpful for dictionaryt work in the longer term. > > Meanwhile, Nora has gotten her Lojban-to-English glosser program basically > running. There are still bugs, amd we still need the current parser, but > the program is now outputting usable if not always perfect word-for-word > translations of Lojban text, with some minimal grammar recognition. When > she gets something she is willing to have people work with, I will upload > it. I am also close to having a new version of Nora's random sentence > generator ready to upload (the program needed only trivial changes but the > data files need to be updtated to the baseline grammar, from their previous > state which was back around 1991 Lojban grammar). > > lojbab > ---- > lojbab lojbab@access.digex.net > Bob LeChevalier, President, The Logical Language Group, Inc. > 2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273 > Artificial language Loglan/Lojban: ftp.access.digex.net /pub/access/lojbab > or see Lojban WWW Server: href="http://xiron.pc.helsinki.fi/lojban/" > Order _The Complete Lojban Language_ - see our Web pages or ask me.