[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[lojban] Re: Lojban Speech Recognition semester-project

To: lojban-list@lojban.org
Subject: [lojban] Re: Lojban Speech Recognition semester-project
From: Steve Sloan <steve@finagle.org>
Date: Sun, 27 Jul 2008 10:02:05 -0700
In-reply-to: <bffd72fa0807161429g7121fd9en6b54c90016fcaa65@mail.gmail.com>
References: <bffd72fa0807161429g7121fd9en6b54c90016fcaa65@mail.gmail.com>
Reply-to: lojban-list@lojban.org
Sender: lojban-list-bounce@lojban.org
User-agent: Thunderbird 2.0.0.5 (X11/20070719)

Nico Möller wrote:

Unfortunately we discovered that there is very few (usable) lojban audiodata on the web, but we actually need huge amounts of them to feed ourtraining algorithms. It would be really cool if some of you couldactually send us some audio data we can work with,

Instead of collecting random bits of audio, it occurs to me that thecommunity could devise a short sample corpus of Lojban text that couldthen be recorded as spoken by a wide variety of different accents,speech rhythms, mis-pronunciations, etc.

A good place to start would be a Lojban pangram[0], but an idealtraining set would include most/all legal two-letter combinations.Would it be crazy to consider the shortest meaningful text that includedall cmavo and lujvo? Probably ...

[0] a short text containing every letter in the alphabet, e.g.http://en.wikipedia.org/wiki/The_quick_brown_fox


-- Steve



To unsubscribe from this list, send mail to lojban-list-request@lojban.org
with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if
you're really stuck, send mail to secretary@lojban.org for help.

Follow-Ups:
- [lojban] Re: Lojban Speech Recognition semester-project
  - From: Penguino <spheniscine@gmail.com>

References:
- [lojban] Lojban Speech Recognition semester-project
  - From: "Nico Möller" <nmoeller@uos.de>

Prev by Date: [lojban] Jbonunsla 2009 at Penguicon!
Next by Date: [lojban] Re: ralju ke lujvo bo tarmi
Previous by thread: [lojban] Re: Lojban Speech Recognition semester-project
Next by thread: [lojban] Re: Lojban Speech Recognition semester-project
Index(es):
- Date
- Thread