[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: word frequency file on FTP site
- To: Multiple recipients of list LOJBAN <LOJBAN@CUVMB.BITNET>
- Subject: Re: word frequency file on FTP site
- From: Robert Rapplean <kingcats@EARTHLINK.NET>
- Date: Sat, 3 Oct 1998 16:06:07 -0600
- Reply-to: Robert Rapplean <kingcats@EARTHLINK.NET>
- Sender: Lojban list <LOJBAN@CUVMB.BITNET>
Thank you, thank you, thank you. I'll post the top 400 on the mailing list in
about a week. It may or may not be the 400 most commonly used ones because I
want to adjust them a little to complete logical sets (like counting 1 to 10 and
such).
Rob Rapplean
> I have finished creating the file with word frequencies, and placed it
> on the Digex FTP site (where it should also be accessible from the
> Xiron Web page soon). The filename is "wordfreq.zip"
> and it is in subdirectory wordlists. The URL is
> ftp://ftp.access.digex.net/pub/access/lojbab/wordlists/wordfreq.zip
> (I think that is the correct format).
>
> I did some extra mail processing, so that my local text archive now is
> complete until the end of 1994 (previously I had stopped in Sept 1994),
> so the archive and now the frequency count now includes some of Jorge's
> conversations with Goran, which seem to have started in November 1994.
> There's still a lot of mail processing needed before I can add the later
> years to my archive and the frequency counts, but this data is better
> than any previous data we have had. I also have some possibility of
> tracking down a particular word to find its contextual usage, which could
> be helpful for dictionaryt work in the longer term.
>
> Meanwhile, Nora has gotten her Lojban-to-English glosser program basically
> running. There are still bugs, amd we still need the current parser, but
> the program is now outputting usable if not always perfect word-for-word
> translations of Lojban text, with some minimal grammar recognition. When
> she gets something she is willing to have people work with, I will upload
> it. I am also close to having a new version of Nora's random sentence
> generator ready to upload (the program needed only trivial changes but the
> data files need to be updtated to the baseline grammar, from their previous
> state which was back around 1991 Lojban grammar).
>
> lojbab
> ----
> lojbab lojbab@access.digex.net
> Bob LeChevalier, President, The Logical Language Group, Inc.
> 2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273
> Artificial language Loglan/Lojban: ftp.access.digex.net /pub/access/lojbab
> or see Lojban WWW Server: href="http://xiron.pc.helsinki.fi/lojban/"
> Order _The Complete Lojban Language_ - see our Web pages or ask me.