[lojban] Sources for luj1999?
http://www.lojban.org/publications/draft-dictionary/Working/luj1999.ZIP
This file contains lujvo that have been automatically excerpted from texts and semi-automatically converted into their canonical forms. It also contains frequency counts of these words.
What I would like to know is which source texts have been used, and if they are available somewhere.
To take a specific example, consider this line:
(2) cevyspe god+married canonical form=ceispe
This apparently means that the word "cevyspe" was used two times in the corpus. But a web search turns up nothing for "cevyspe", save an older word frequency list:
http://www.lojban.org/publications/wordlists/frequencies2.txt
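For reference, the example line above can be split into its fields with a short script. This is only a sketch in Python: it assumes every line in the file follows the same layout as that one example (a count in parentheses, the word as used, a gloss, then "canonical form="), which I have not verified. The names LINE_RE and parse_line are my own.

```python
import re

# Assumed layout, inferred from a single example line:
#   (COUNT) WORD-AS-USED GLOSS canonical form=CANONICAL
LINE_RE = re.compile(r"^\((\d+)\)\s+(\S+)\s+(.*?)\s+canonical form=(\S+)\s*$")

def parse_line(line):
    """Split one list entry into its fields; return None if it does not fit."""
    m = LINE_RE.match(line.strip())
    if m is None:
        return None
    count, used, gloss, canonical = m.groups()
    return {
        "count": int(count),      # how often the word occurred
        "used": used,             # the form found in the source text
        "gloss": gloss,           # component glosses, e.g. "god+married"
        "canonical": canonical,   # the canonical lujvo form
    }

print(parse_line("(2) cevyspe god+married canonical form=ceispe"))
```

Lines that do not match the assumed layout come back as None, so they can be collected and inspected by hand.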
What would I need in order to have the context for every word that occurs in luj1999.zip?
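Given a candidate set of source texts, one way to check whether they cover the list would be to search them for each word in the form it was used. A rough sketch in Python, assuming the texts are already loaded as strings; the function name find_contexts and the sample text are made up for illustration and do not come from any real corpus.

```python
import re

def find_contexts(word, texts):
    """Return (source name, line) pairs where `word` occurs as a whole word."""
    # Treat letters, digits, and the apostrophe as word characters,
    # since Lojban words can contain an apostrophe.
    pattern = re.compile(
        r"(?<![\w'])" + re.escape(word) + r"(?![\w'])", re.IGNORECASE
    )
    hits = []
    for name, text in texts.items():
        for line in text.splitlines():
            if pattern.search(line):
                hits.append((name, line.strip()))
    return hits

# Placeholder input, only to show the call.
sample = {"example.txt": "first line\nxx cevyspe yy\ncevyspecu zz"}
print(find_contexts("cevyspe", sample))
```

A word from the list that yields no hits in any text would then point to a source that is still missing.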
--
Arnt Richard Johansen http://arj.nvg.org/
Keyboard: The Ultimate Input Device