[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
word frequency file on FTP site
- To: Multiple recipients of list LOJBAN <LOJBAN@CUVMB.BITNET>
- Subject: word frequency file on FTP site
- From: Logical Language Group <lojbab@ACCESS.DIGEX.NET>
- Date: Sat, 3 Oct 1998 12:03:33 -0400
- Reply-to: Logical Language Group <lojbab@ACCESS.DIGEX.NET>
- Sender: Lojban list <LOJBAN@CUVMB.BITNET>
I have finished creating the file with word frequencies, and placed it
on the Digex FTP site (where it should also be accessible from the
Xiron Web page soon). The filename is "wordfreq.zip"
and it is in subdirectory wordlists. The URL is
ftp://ftp.access.digex.net/pub/access/lojbab/wordlists/wordfreq.zip
(I think that is the correct format).
I did some extra mail processing, so that my local text archive now is
complete until the end of 1994 (previously I had stopped in Sept 1994),
so the archive and now the frequency count now includes some of Jorge's
conversations with Goran, which seem to have started in November 1994.
There's still a lot of mail processing needed before I can add the later
years to my archive and the frequency counts, but this data is better
than any previous data we have had. I also have some possibility of
tracking down a particular word to find its contextual usage, which could
be helpful for dictionaryt work in the longer term.
Meanwhile, Nora has gotten her Lojban-to-English glosser program basically
running. There are still bugs, amd we still need the current parser, but
the program is now outputting usable if not always perfect word-for-word
translations of Lojban text, with some minimal grammar recognition. When
she gets something she is willing to have people work with, I will upload
it. I am also close to having a new version of Nora's random sentence
generator ready to upload (the program needed only trivial changes but the
data files need to be updtated to the baseline grammar, from their previous
state which was back around 1991 Lojban grammar).
lojbab
----
lojbab lojbab@access.digex.net
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273
Artificial language Loglan/Lojban: ftp.access.digex.net /pub/access/lojbab
or see Lojban WWW Server: href="http://xiron.pc.helsinki.fi/lojban/"
Order _The Complete Lojban Language_ - see our Web pages or ask me.