Re: word frequency list coming
- To: Multiple recipients of list LOJBAN <LOJBAN@CUVMB.BITNET>
- Subject: Re: word frequency list coming
- From: Robert Rapplean <kingcats@EARTHLINK.NET>
- Date: Thu, 24 Sep 1998 07:38:01 -0600
- Reply-to: Robert Rapplean <kingcats@EARTHLINK.NET>
- Sender: Lojban list <LOJBAN@CUVMB.BITNET>
> Well, I took a look around the Web and found a word-frequency/concordance
> program. I have run it on the entirety of my Lojban text archive (which
> unfortunately has some repetitions in it because of commentaries, quoted
> text and revisions) and am working on filtering out all the garbage.
> This is all the Lojban text that I have up to 10/94; my mail processing
> is so far backlogged that I haven't extracted the Lojban text from my
> logs since then (it takes me around 1-2 hours per month, so don't hold your
> breath %^)
This should be adequate. A couple of quick time trials have convinced me that
I can fit approximately 400 words per hour on the tapes. I'll start with two
half-hour tape sides of the most frequent words, and add to that if there is
enough demand for it.
On the subject of a phonographer, if said individual were able to make WAV
files on their system and mail them to me, they wouldn't actually have to be in
Denver. This would also make sense because it might not be a bad idea to review
the pronunciation of the words before the recordings are released.
Rob Rapplean