From nobody@digitalkingdom.org Sun Jul 27 23:01:49 2008 Received: with ECARTIS (v1.0.0; list lojban-list); Sun, 27 Jul 2008 23:01:49 -0700 (PDT) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69) (envelope-from ) id 1KNLnp-0007ze-34 for lojban-list-real@lojban.org; Sun, 27 Jul 2008 23:01:49 -0700 Received: from ti-out-0910.google.com ([209.85.142.189]) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1KNLni-0007z2-V5 for lojban-list@lojban.org; Sun, 27 Jul 2008 23:01:48 -0700 Received: by ti-out-0910.google.com with SMTP id i7so2305440tid.20 for ; Sun, 27 Jul 2008 23:01:40 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type:references; bh=Z9ztx9oSLvWGuobMuLnt2/H4w1KaCAHvRdgyrQag7xE=; b=DzyRkOkp4BhqmeudUpbFqnU/hw9jvXxRkrTZ3mRfdD9RhqBZtcw7Y2IUBEAsQVCyJ0 TlzpgpK9IbdYkOY47lMspKYIlZxfxlpTzrozh/GuZjLo33mkmXBx/ZUtySSRsJ7NMZd7 SHiLHm0v9idDm5C76ziILpWOaUCQWZU91jmOE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:references; b=qcALpUzijI6ypJLXybKaN8Bl8yWTjghSoLmTnpQnmcKiNBCRgSKGKXS9pQXxDMuKpY pGsMQOlgsLfwzszMx1nstcJD0xnXtdpj4BvpJZNzo0lCRrpYHnLhsbo/UP+YI0IdJ/fV oXGSfcIki03MqyAX6NfxRen5iUgxth/y2TNik= Received: by 10.110.10.16 with SMTP id 16mr5445519tij.15.1217224900587; Sun, 27 Jul 2008 23:01:40 -0700 (PDT) Received: by 10.110.31.11 with HTTP; Sun, 27 Jul 2008 23:01:40 -0700 (PDT) Message-ID: <97f5058c0807272301l2580dd7ds7dca02b72bdf8019@mail.gmail.com> Date: Mon, 28 Jul 2008 14:01:40 +0800 From: Penguino To: lojban-list@lojban.org Subject: [lojban] Re: Lojban Speech Recognition semester-project In-Reply-To: <488CAA0D.8000102@finagle.org> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_43482_29481252.1217224900579" References: <488CAA0D.8000102@finagle.org> X-Spam-Score: 0.0 X-Spam-Score-Int: 0 X-Spam-Bar: / X-archive-position: 14620 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: spheniscine@gmail.com Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list ------=_Part_43482_29481252.1217224900579 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline A Lojban pangram: *.o'i mu xagji sofybakni cu zvati le purdi *(Watch out, five hungry Soviet cows are in the garden) On Mon, Jul 28, 2008 at 1:02 AM, Steve Sloan wrote: > Nico M=F6ller wrote: > >> Unfortunately we discovered that there is very few (usable) lojban audio >> data on the web, but we actually need huge amounts of them to feed our >> training algorithms. It would be really cool if some of you could actual= ly >> send us some audio data we can work with, >> > > Instead of collecting random bits of audio, it occurs to me that the > community could devise a short sample corpus of Lojban text that could th= en > be recorded as spoken by a wide variety of different accents, speech > rhythms, mis-pronunciations, etc. > > A good place to start would be a Lojban pangram[0], but an ideal training > set would include most/all legal two-letter combinations. Would it be cra= zy > to consider the shortest meaningful text that included all cmavo and lujv= o? > Probably ... > > > [0] a short text containing every letter in the alphabet, e.g. > http://en.wikipedia.org/wiki/The_quick_brown_fox > > -- Steve > > > > > To unsubscribe from this list, send mail to lojban-list-request@lojban.or= g > with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if > you're really stuck, send mail to secretary@lojban.org for help. > > ------=_Part_43482_29481252.1217224900579 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline
A Lojban pangram: .o'i mu xagji sofybakni cu zvati = le purdi (Watch out, five hungry Soviet cows are in the garden)

=
------=_Part_43482_29481252.1217224900579-- To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.