From nobody@digitalkingdom.org Wed Jul 16 14:29:55 2008
Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 16 Jul 2008 14:29:55 -0700 (PDT)
Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69)	(envelope-from <nobody@digitalkingdom.org>)	id 1KJEZO-0005in-TH	for lojban-list-real@lojban.org; Wed, 16 Jul 2008 14:29:55 -0700
Received: from qw-out-1920.google.com ([74.125.92.150])	by chain.digitalkingdom.org with esmtp (Exim 4.69)	(envelope-from <nico.moeller@googlemail.com>)	id 1KJEZK-0005iV-9I	for lojban-list@chain.digitalkingdom.org; Wed, 16 Jul 2008 14:29:54 -0700
Received: by qw-out-1920.google.com with SMTP id 5so1060865qwf.4        for <lojban-list@chain.digitalkingdom.org>; Wed, 16 Jul 2008 14:29:48 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;        d=googlemail.com; s=gamma;        h=domainkey-signature:received:received:message-id:date:from:sender         :to:subject:cc:mime-version:content-type:x-google-sender-auth;        bh=eI5kgqOmUTEx3aEKmig5XThsYAha5nPVhdFSzDr07f0=;        b=OkXT3Org8iCQAWt1ohERIjCaWXlsxBFnNYpJ2gyf6Y+T94u+nRO7kd+xJNyW5auaKL         +tghk5q2sYMMdcHn5tNaVV319SlWcbEt3s6Ai+FRHcdPHsZwSjhaLX8PQRjFbOOwXBxo         azDGGu3EP/7/tPi7at08s9PKkjAY+zAKecEJk=
DomainKey-Signature: a=rsa-sha1; c=nofws;        d=googlemail.com; s=gamma;        h=message-id:date:from:sender:to:subject:cc:mime-version:content-type         :x-google-sender-auth;        b=TtbEYSoBAgKtAJrRPk4z6IDLxg2UZaw5DB35VsPcwRN5c6l07nGMQXUzAeXHqA/BiB         VAwipCpvYWwuXQZ0CDvkiYR6f0vbSsARf93uSqmvjtbVu9Pw5UxhPXuixqGBBc/peNXe         99FTd/2sLhmwNmKqJr/7ub/Sp6JmjnJAmMmHo=
Received: by 10.142.52.9 with SMTP id z9mr308269wfz.30.1216243787875;        Wed, 16 Jul 2008 14:29:47 -0700 (PDT)
Received: by 10.143.195.4 with HTTP; Wed, 16 Jul 2008 14:29:47 -0700 (PDT)
Message-ID: <bffd72fa0807161429g7121fd9en6b54c90016fcaa65@mail.gmail.com>
Date: Wed, 16 Jul 2008 23:29:47 +0200
From: "=?ISO-8859-1?Q?Nico_M=F6ller?=" <nmoeller@uos.de>
To: lojban-list@chain.digitalkingdom.org
Subject: [lojban] Lojban Speech Recognition semester-project
Cc: "=?ISO-8859-1?Q?Thorben_Kr=FCger?=" <thkruege@uos.de>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_Part_30019_27180582.1216243787733"
X-Google-Sender-Auth: 810841a52f2f6c24
X-Spam-Score: 0.0
X-Spam-Score-Int: 0
X-Spam-Bar: /
X-archive-position: 14606
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: nmoeller@uos.de
Precedence: bulk
Reply-to: lojban-list@lojban.org
X-list: lojban-list

------=_Part_30019_27180582.1216243787733
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

Hi guys,

We have a request that hopefully some of you are willing to help us with. We
are currently studying cognitive science at the University of Osnabrueck and
participating in a course called "Practical Natural Language Processing",
which is a kind of semester project in linguistics. Our group decided to
work on speech recognition, and because Lojban has such nice phonetic
features we chose it as our target language. Unfortunately, we discovered
that there is very little (usable) Lojban audio data on the web, but we
need large amounts of it to feed our training algorithms. It would be really
cool if some of you could send us audio data we can work with. If you do,
please provide it in the following format:

- 16-bit mono, 16 kHz
- preferably raw or WAV data files
- one sentence per audio file
- a transcript text file containing one sentence per line, plus the name of
the audio file in which the sentence was uttered
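
If you want to check a WAV recording against the format above before
sending it, something like the following might help. It is only a minimal
sketch using Python's standard wave module; the names check_recording and
write_silence are hypothetical helpers made up for this illustration, and
raw (headerless) files cannot be checked this way.

```python
import wave

# Target format from the list above: 16-bit samples, one channel, 16 kHz.
EXPECTED_CHANNELS = 1
EXPECTED_SAMPLE_WIDTH = 2  # bytes per sample, i.e. 16 bit
EXPECTED_RATE = 16000      # samples per second


def check_recording(path):
    """Return a list of format problems for the WAV file at `path`.

    An empty list means the file matches the requested format.
    (Hypothetical helper, not part of any existing toolchain.)
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append("expected mono, got %d channels" % wav.getnchannels())
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
            problems.append("expected 16-bit samples, got %d-bit" % (8 * wav.getsampwidth()))
        if wav.getframerate() != EXPECTED_RATE:
            problems.append("expected 16000 Hz, got %d Hz" % wav.getframerate())
    return problems


def write_silence(path, channels=1, sample_width=2, rate=16000, seconds=1):
    """Write a silent WAV file, only so the check above can be tried out."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(rate)
        # All-zero bytes are silence for 16-bit signed PCM.
        wav.writeframes(b"\x00" * (channels * sample_width * rate * seconds))
```

Calling check_recording("some_file.wav") on each recording and fixing
anything it reports should be enough to catch the most common mismatches
(stereo input or a 44.1 kHz sample rate).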

Everybody who sends us usable data will be mentioned by name in our
final term paper, which will be published at the end of this month (you see,
we really need the data quickly).

Thanks a lot for your effort,
Nico & Thorben

------=_Part_30019_27180582.1216243787733--

