
[lojban] Lojban Speech Recognition semester-project



Hi guys,

We have a request, and hopefully some of you are willing to help us. We are currently studying cognitive science at the University of Osnabrueck and taking part in a course called "Practical Natural Language Processing", which is a kind of semester project in linguistics. Our group decided to work on speech recognition, and because Lojban has such nice phonetic features, we chose it as our target language. Unfortunately, we discovered that there is very little (usable) Lojban audio data on the web, but we need large amounts of it to feed our training algorithms. It would be really cool if some of you could send us some audio data we can work with. If you do, please provide it in the following format:

- 16-bit mono, 16 kHz
- preferably raw or WAV data files
- one sentence per audio file
- a transcript text file containing one sentence per line, plus the name of the audio file in which that sentence is spoken
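
For anyone who wants to check their recordings before sending them, here is a minimal Python sketch of the format above. It only covers WAV files (raw files carry no header to inspect). The function names and the tab-separated transcript layout (file name, a tab, then the sentence) are our assumptions for illustration; the list above does not fix an exact transcript layout.

```python
import wave


def check_wav(path):
    """Return a list of problems; an empty list means the file is 16-bit mono at 16 kHz."""
    problems = []
    with wave.open(path, "rb") as w:
        if w.getnchannels() != 1:
            problems.append("expected mono, got %d channels" % w.getnchannels())
        if w.getsampwidth() != 2:  # sample width is reported in bytes; 2 bytes = 16 bit
            problems.append("expected 16-bit samples, got %d-bit" % (8 * w.getsampwidth()))
        if w.getframerate() != 16000:
            problems.append("expected 16000 Hz, got %d Hz" % w.getframerate())
    return problems


def parse_transcript(text):
    """Parse an ASSUMED layout of '<audio file name><TAB><sentence>' per line."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        name, sentence = line.split("\t", 1)
        entries[name] = sentence
    return entries
```

Calling check_wav("sentence01.wav") on a hypothetical file returns an empty list if it matches, or a list of messages describing what differs.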

Everybody who sends us usable data will be mentioned by name in our final term paper, which will be published at the end of this month (so, as you can see, we really need the data quickly).

Thanks a lot for your effort,
Nico & Thorben