[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[lojban] Lojban Speech Recognition semester-project
- To: lojban-list@chain.digitalkingdom.org
- Subject: [lojban] Lojban Speech Recognition semester-project
- From: "Nico Möller" <nmoeller@uos.de>
- Date: Wed, 16 Jul 2008 23:29:47 +0200
- Cc: "Thorben Krüger" <thkruege@uos.de>
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:sender :to:subject:cc:mime-version:content-type:x-google-sender-auth; bh=eI5kgqOmUTEx3aEKmig5XThsYAha5nPVhdFSzDr07f0=; b=OkXT3Org8iCQAWt1ohERIjCaWXlsxBFnNYpJ2gyf6Y+T94u+nRO7kd+xJNyW5auaKL +tghk5q2sYMMdcHn5tNaVV319SlWcbEt3s6Ai+FRHcdPHsZwSjhaLX8PQRjFbOOwXBxo azDGGu3EP/7/tPi7at08s9PKkjAY+zAKecEJk=
- Domainkey-signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=message-id:date:from:sender:to:subject:cc:mime-version:content-type :x-google-sender-auth; b=TtbEYSoBAgKtAJrRPk4z6IDLxg2UZaw5DB35VsPcwRN5c6l07nGMQXUzAeXHqA/BiB VAwipCpvYWwuXQZ0CDvkiYR6f0vbSsARf93uSqmvjtbVu9Pw5UxhPXuixqGBBc/peNXe 99FTd/2sLhmwNmKqJr/7ub/Sp6JmjnJAmMmHo=
- Reply-to: lojban-list@lojban.org
- Sender: lojban-list-bounce@lojban.org
Hi guys,
We have got a request a hopefully some of you are willing to help us. We are currently studying cognitive science at the university of osnabrueck and participating in a course called "practical natural language processing", which is some kind of semester project in lingusitics. Our group decided to deal with some speech recognition and because lojban has so nice phonetic features we choose it as our target language, Unfortunately we discovered that there is very few (usable) lojban audio data on the web, but we actually need huge amounts of them to feed our training algorithms. It would be really cool if some of you could actually send us some audio data we can work with, if you do so please provide them in the following format:
- 16bit mono, 16khz
- preferable raw or wav data files
- one sentence per audio file
- a transcript text file containing one sentence per line + the name of the audio file in which the sentence was uttered
Everybody who sends as applicable data will be mentioned by name in our final term paper, which will be published at the end of this month (You see will really need those data quick).
Thanks a lot for your effort,
Nico & Thorben