[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] Re: More espeak words, please.



Okay, then it must be one of the sumti items. Based on my information, there are 4876 items, which includes the gismu and sumti for place of each gismu. For example, {klama} accounts for six items- {klama}, {lo klama}, {lo se klama}, {lo te klama}, {lo ve klama}, and {lo xe klama}.
 
Subtracting the 1342 gismu from that list leaves 3,534 gismu sumti. The "gismu_places-espeak.zip" file Dag uploaded to http://www.lojban.org/tiki/valsi%20Sound%20Files, which contains only gismu sumti, has 3.527 items, which means either 7 of the items are missing audio, or the list has 7 extra gismu sumti. I checked my list of the 4786 items, and it has exactly 1342 gismu, so the error definitely lies somewhere in the gismu sumti. My mistake.
 
Timos, would you be so kind as to run that comparison again, but this time run it against this list, instead of the gismu list?

On Sun, Mar 7, 2010 at 6:53 AM, Timo Paulssen <timonator@perpetuum-immobile.de> wrote:
moorkids@juno.com wrote:
> If you have a list of just gismu you could separate them into different word
> documents by first letter.  Do a word count for each letter group and compare
> it with a word count of just that letter gismu from a complete list (like the
> one here: http://en.wiktionary.org/wiki/Index:Lojban/gismu ).  Then narrow
> down which gismu is missing alphabetically.  (Theirs probably a much faster
> way to do this but I don't know it)

list all the gismu sound files without .mp3:
 ls | grep '^.....\.mp3' | sed -e 's/.mp3//' > existing.txt
list all the gismu from the gismu list (could've been done easier, i think)
 egrep '^ [a-z]{5}' /usr/share/lojban/gismu.txt | sed -e 's/^ //' -e 's/ .*//'\
 > allgismu.txt
compare the lists:
 diff -n existing.txt allgismu.txt

no output. apparently every gismu is there:

count the number of lines in every textfile:
 wc -l *txt

 1342 allgismu.txt
 1342 existing.txt
 2684 total

mu'o mi'e timos




--
mu'o mi'e .aionys.

.i.a'o.e'e ko klama le bende pe denpa bu