
Re: [lojban] Re: More espeak words, please.

To: lojban-list@lojban.org
Subject: Re: [lojban] Re: More espeak words, please.
From: Timo Paulssen <timonator@perpetuum-immobile.de>
Date: Sun, 07 Mar 2010 14:53:07 +0100
In-reply-to: <20100307.005234.11213.1@webmail02.dca.untd.com>
References: <20100307.005234.11213.1@webmail02.dca.untd.com>
User-agent: Mozilla-Thunderbird 2.0.0.22 (X11/20091109)

moorkids@juno.com wrote:
> If you have a list of just gismu you could separate them into different word
> documents by first letter.  Do a word count for each letter group and compare
> it with a word count of just that letter gismu from a complete list (like the
> one here: http://en.wiktionary.org/wiki/Index:Lojban/gismu ).  Then narrow
> down which gismu is missing alphabetically.  (There's probably a much faster
> way to do this but I don't know it)

list all the gismu sound files without .mp3:
  ls | grep '^.....\.mp3$' | sed -e 's/\.mp3$//' > existing.txt
list all the gismu from the gismu list (could've been done more easily, i think):
  grep -E '^ [a-z]{5}' /usr/share/lojban/gismu.txt | sed -e 's/^ //' -e 's/ .*//' \
  > allgismu.txt
compare the lists:
  diff -n existing.txt allgismu.txt

no output, so apparently every gismu is there.

count the number of lines in every textfile:
  wc -l *txt

 1342 allgismu.txt
 1342 existing.txt
 2684 total
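
if the counts had not matched, the missing words could be listed directly instead of being narrowed down by letter. a minimal sketch, assuming two one-word-per-line lists like the ones above; the function name missing_gismu and the three-word demo files are made up for illustration and are not part of the real data:

```shell
# Hypothetical helper: print every line of the reference list ($2)
# that does not appear in the list of existing recordings ($1).
missing_gismu() {
  # -F: fixed strings, -x: match whole lines only,
  # -v: invert the match, -f: read the patterns from a file
  grep -Fxv -f "$1" "$2"
}

# Tiny made-up demo: the file contents are illustrative only.
demo=$(mktemp -d)
printf '%s\n' bajra klama > "$demo/existing.txt"
printf '%s\n' bajra klama zutse > "$demo/allgismu.txt"

# Lists the one word that has no recording in the demo data.
missing_gismu "$demo/existing.txt" "$demo/allgismu.txt"
```

with the real files this would be missing_gismu existing.txt allgismu.txt. unlike diff, it does not care whether the two lists are sorted the same way. note that grep exits with status 1 when nothing is missing, which matters in scripts that stop on errors.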

mu'o mi'e timos

