Em sexta-feira, 12 de abril de 2013 17:57:17 UTC+3, la gleki escreveu:
peeps, i need ur help.
we are gonna have Swype/Swipe feature for MultiLing android keyboard. I need a list of all lojbanic words + frequency of each.
i know of a gismu frequency list. But it seems that not all gismu are there (less than 1342). What about cmavo, fu'ivla?
Of course, most rare words can be given the lowest rating but what are the most frequent words?
Can we rerun the algorithm to count all the occurrencies of all words?
For MultiLing the problem seems to have been solved.
For a bit better frequency list where utterances to/from bots, non Lojban vocatives and other service information is removed see
https://mw.lojban.org/papri/N-grams_of_Lojban_corpus
1-grams is basically frequency list.