From nobody@digitalkingdom.org Wed Jul 16 14:29:55 2008 Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 16 Jul 2008 14:29:55 -0700 (PDT) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69) (envelope-from ) id 1KJEZO-0005in-TH for lojban-list-real@lojban.org; Wed, 16 Jul 2008 14:29:55 -0700 Received: from qw-out-1920.google.com ([74.125.92.150]) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1KJEZK-0005iV-9I for lojban-list@chain.digitalkingdom.org; Wed, 16 Jul 2008 14:29:54 -0700 Received: by qw-out-1920.google.com with SMTP id 5so1060865qwf.4 for ; Wed, 16 Jul 2008 14:29:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:sender :to:subject:cc:mime-version:content-type:x-google-sender-auth; bh=eI5kgqOmUTEx3aEKmig5XThsYAha5nPVhdFSzDr07f0=; b=OkXT3Org8iCQAWt1ohERIjCaWXlsxBFnNYpJ2gyf6Y+T94u+nRO7kd+xJNyW5auaKL +tghk5q2sYMMdcHn5tNaVV319SlWcbEt3s6Ai+FRHcdPHsZwSjhaLX8PQRjFbOOwXBxo azDGGu3EP/7/tPi7at08s9PKkjAY+zAKecEJk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=message-id:date:from:sender:to:subject:cc:mime-version:content-type :x-google-sender-auth; b=TtbEYSoBAgKtAJrRPk4z6IDLxg2UZaw5DB35VsPcwRN5c6l07nGMQXUzAeXHqA/BiB VAwipCpvYWwuXQZ0CDvkiYR6f0vbSsARf93uSqmvjtbVu9Pw5UxhPXuixqGBBc/peNXe 99FTd/2sLhmwNmKqJr/7ub/Sp6JmjnJAmMmHo= Received: by 10.142.52.9 with SMTP id z9mr308269wfz.30.1216243787875; Wed, 16 Jul 2008 14:29:47 -0700 (PDT) Received: by 10.143.195.4 with HTTP; Wed, 16 Jul 2008 14:29:47 -0700 (PDT) Message-ID: Date: Wed, 16 Jul 2008 23:29:47 +0200 From: "=?ISO-8859-1?Q?Nico_M=F6ller?=" To: lojban-list@chain.digitalkingdom.org Subject: [lojban] Lojban Speech Recognition semester-project Cc: "=?ISO-8859-1?Q?Thorben_Kr=FCger?=" MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_30019_27180582.1216243787733" X-Google-Sender-Auth: 810841a52f2f6c24 X-Spam-Score: 0.0 X-Spam-Score-Int: 0 X-Spam-Bar: / X-archive-position: 14606 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: nmoeller@uos.de Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list ------=_Part_30019_27180582.1216243787733 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline Hi guys, We have got a request a hopefully some of you are willing to help us. We are currently studying cognitive science at the university of osnabrueck and participating in a course called "practical natural language processing", which is some kind of semester project in lingusitics. Our group decided to deal with some speech recognition and because lojban has so nice phonetic features we choose it as our target language, Unfortunately we discovered that there is very few (usable) lojban audio data on the web, but we actually need huge amounts of them to feed our training algorithms. It would be really cool if some of you could actually send us some audio data we can work with, if you do so please provide them in the following format: - 16bit mono, 16khz - preferable raw or wav data files - one sentence per audio file - a transcript text file containing one sentence per line + the name of the audio file in which the sentence was uttered Everybody who sends as applicable data will be mentioned by name in our final term paper, which will be published at the end of this month (You see will really need those data quick). Thanks a lot for your effort, Nico & Thorben ------=_Part_30019_27180582.1216243787733 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline
Hi guys,

We have got a request a hopefully some of you are willing to help us. We are currently studying cognitive science at the university of osnabrueck and participating in a course called "practical natural language processing", which is some kind of semester project in lingusitics.  Our group decided to deal with some speech recognition and because lojban has so nice phonetic features we choose it as our target language,  Unfortunately we discovered that there is very few (usable) lojban audio data on the web, but we actually need huge amounts of them to feed our training algorithms. It would be really cool if some of you could actually send us some audio data we can work with, if you do so please provide them in the following format:

- 16bit mono, 16khz
- preferable raw or wav data files
- one sentence per audio file
- a transcript text file containing one sentence per line + the name of the audio file in which the sentence was uttered

Everybody who sends as applicable data will be mentioned by name in our final term paper, which will be published at the end of this month (You see will really need those data quick).

Thanks a lot for your effort,
Nico & Thorben
------=_Part_30019_27180582.1216243787733-- To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.