[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[lojban] Sources for luj1999?



http://www.lojban.org/publications/draft-dictionary/Working/luj1999.ZIP

This file contains lujvo that have been automatically excerpted from texts, semi-automatically converted into their canonical forms. It also contains frequency counts of this words.

What I would like to know is which source texts have been used, and if they are available somewhere.

To take a specific example, consider this line:

 (2)         cevyspe                 god+married                                    canonical form=ceispe

This apparently means that the word "cevyspe" was used two times in the corpus. But a web search turns up nothing for "cevyspe", save an older word frequency list:

http://www.lojban.org/publications/wordlists/frequencies2.txt

What do I need to have to make sure that I have the context for every word that occurs in luj1999.zip?

-- 
Arnt Richard Johansen                                http://arj.nvg.org/
Keyboard: The Ultimate Input Device


To unsubscribe from this list, send mail to lojban-list-request@lojban.org
with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if
you're really stuck, send mail to secretary@lojban.org for help.