From nobody@digitalkingdom.org Thu Jul 17 04:49:49 2008
Received: with ECARTIS (v1.0.0; list lojban-list); Thu, 17 Jul 2008 04:49:49 -0700 (PDT)
Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69)	(envelope-from <nobody@digitalkingdom.org>)	id 1KJRzZ-0001Ms-0y	for lojban-list-real@lojban.org; Thu, 17 Jul 2008 04:49:49 -0700
Received: from qw-out-1920.google.com ([74.125.92.145])	by chain.digitalkingdom.org with esmtp (Exim 4.69)	(envelope-from <nico.moeller@googlemail.com>)	id 1KJRzV-0001M7-Hj	for lojban-list@lojban.org; Thu, 17 Jul 2008 04:49:48 -0700
Received: by qw-out-1920.google.com with SMTP id 5so453883qwf.58        for <lojban-list@lojban.org>; Thu, 17 Jul 2008 04:49:44 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;        d=googlemail.com; s=gamma;        h=domainkey-signature:received:received:message-id:date:from:sender         :to:subject:in-reply-to:mime-version:content-type:references         :x-google-sender-auth;        bh=bAumwKWF51HbwCmfIR5a4vM6iwWR9qcHk06dk92eTSE=;        b=Xb98XmEF1G0K0FF+XjpJNHIo/mJVsAZKE+b4Yy7tIjeeAatRKIOMJHjBT4cr1ewQkd         Rf8+qWbYNr2UUr/M1EZ3xEy/7d9cmkAcQeOhb7veUp46wosY80/DhZH8O7/NBEY0pWy0         kZ0HN91r1WubcuqhuL+dZzOPh65tqbM1V6Juc=
DomainKey-Signature: a=rsa-sha1; c=nofws;        d=googlemail.com; s=gamma;        h=message-id:date:from:sender:to:subject:in-reply-to:mime-version         :content-type:references:x-google-sender-auth;        b=WmmWSK/LjDdOq3yhdhPgul1j5Qtm0IRrCu/oHWMvYovAxNf4UIt/Aibd+HJ8FYZZ4r         jp7fs26/lXJ5eO0soG6vtNL1TBGXzaKMpzGayOSMlJ0Ctkh43CQpfRSgDya2mrfPZU7b         ZDUJqhwXBDWVJDh37HtDgllVM7S/jSY27drjk=
Received: by 10.142.177.7 with SMTP id z7mr560057wfe.249.1216295380822;        Thu, 17 Jul 2008 04:49:40 -0700 (PDT)
Received: by 10.143.195.4 with HTTP; Thu, 17 Jul 2008 04:49:40 -0700 (PDT)
Message-ID: <bffd72fa0807170449l9231ea1o93cae81be494cad1@mail.gmail.com>
Date: Thu, 17 Jul 2008 13:49:40 +0200
From: "=?ISO-8859-1?Q?Nico_M=F6ller?=" <nmoeller@uos.de>
To: lojban-list@lojban.org
Subject: [lojban] Re: Lojban Speech Recognition semester-project
In-Reply-To: <e810bafd0807170336h54345fb6u899574233e224616@mail.gmail.com>
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_Part_41197_1723135.1216295380860"
References: <bffd72fa0807161429g7121fd9en6b54c90016fcaa65@mail.gmail.com>	 <e810bafd0807170336h54345fb6u899574233e224616@mail.gmail.com>
X-Google-Sender-Auth: f46e09624ec3e8cd
X-Spam-Score: 0.0
X-Spam-Score-Int: 0
X-Spam-Bar: /
X-archive-position: 14610
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: nmoeller@uos.de
Precedence: bulk
Reply-to: lojban-list@lojban.org
X-list: lojban-list

------=_Part_41197_1723135.1216295380860
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

Random sentences are quite ok, we ourself recorded some sentences from
Alice, but send us whatever you have got, as long we got a transcript of
what was uttered it would be totally sufficient.

I know that uncompressed audio files are quite big, but hey its only 16bit
mono and of course you can compress them using zip, 7z or whatever you like
;). I think then it shold be no Problem to send them via mail. Or you can
use some free filehosting on the web and send us the links. Just be
creative... If none of theses methods should be appropriable just send them
in a format (mp3, etc.) we can convert back into wavs...

Thanks a lot for your help,
Nico

On Thu, Jul 17, 2008 at 12:36 PM, james riley <jimr1603@gmail.com> wrote:

> Random sentences okay or should they be part of a bigger prose? I could
> churn out loads tomorrow (unless something happens), but I'm afk today to
> help out at my uni. My pronunciation needs practise, but is mostly okay.
> Also, wav is very big, how do you want us to send you loads of recordings=
 in
> wav?
>
> 2008/7/16 Nico M=F6ller <nmoeller@uos.de>:
>
> Hi guys,
>>
>> We have got a request a hopefully some of you are willing to help us. We
>> are currently studying cognitive science at the university of osnabrueck=
 and
>> participating in a course called "practical natural language processing"=
,
>> which is some kind of semester project in lingusitics.  Our group decide=
d to
>> deal with some speech recognition and because lojban has so nice phoneti=
c
>> features we choose it as our target language,  Unfortunately we discover=
ed
>> that there is very few (usable) lojban audio data on the web, but we
>> actually need huge amounts of them to feed our training algorithms. It w=
ould
>> be really cool if some of you could actually send us some audio data we =
can
>> work with, if you do so please provide them in the following format:
>>
>> - 16bit mono, 16khz
>> - preferable raw or wav data files
>> - one sentence per audio file
>> - a transcript text file containing one sentence per line + the name of
>> the audio file in which the sentence was uttered
>>
>> Everybody who sends as applicable data will be mentioned by name in our
>> final term paper, which will be published at the end of this month (You =
see
>> will really need those data quick).
>>
>> Thanks a lot for your effort,
>> Nico & Thorben
>>
>
>

------=_Part_41197_1723135.1216295380860
Content-Type: text/html; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
Content-Disposition: inline

<div dir=3D"ltr">Random sentences are quite ok, we ourself recorded some se=
ntences from Alice, but send us whatever you have got, as long we got a tra=
nscript of what was uttered it would be totally sufficient.<br>&nbsp;<br>I =
know that uncompressed audio files are quite big, but hey its only 16bit mo=
no and of course you can compress them using zip, 7z or whatever you like ;=
). I think then it shold be no Problem to send them via mail. Or you can us=
e some free filehosting on the web and send us the links. Just be creative.=
.. If none of theses methods should be appropriable just send them in a for=
mat (mp3, etc.) we can convert back into wavs...<br>
<br>Thanks a lot for your help,<br>Nico<br><blockquote style=3D"margin: 1.5=
em 0pt;"></blockquote><div class=3D"gmail_quote">On Thu, Jul 17, 2008 at 12=
:36 PM, james riley &lt;<a href=3D"mailto:jimr1603@gmail.com" target=3D"_bl=
ank">jimr1603@gmail.com</a>&gt; wrote:<br>
<blockquote class=3D"gmail_quote" style=3D"border-left: 1px solid rgb(204, =
204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
<div dir=3D"ltr">Random sentences okay or should they be part of a bigger p=
rose? I could churn out loads tomorrow (unless something happens), but I=
9;m afk today to help out at my uni. My pronunciation needs practise, but i=
s mostly okay. Also, wav is very big, how do you want us to send you loads =
of recordings in wav?<br>


<br><div class=3D"gmail_quote">2008/7/16 Nico M=F6ller &lt;<a href=3D"mailt=
o:nmoeller@uos.de" target=3D"_blank">nmoeller@uos.de</a>&gt;:<div><div></di=
v><div><br><blockquote class=3D"gmail_quote" style=3D"border-left: 1px soli=
d rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">


<div dir=3D"ltr">Hi guys,<br><br>We have got a request a hopefully some of =
you are willing to help us. We are currently studying cognitive science at =
the university of osnabrueck and participating in a course called &quot;pra=
ctical natural language processing&quot;, which is some kind of semester pr=
oject in lingusitics.&nbsp; Our group decided to deal with some speech reco=
gnition and because lojban has so nice phonetic features we choose it as ou=
r target language,&nbsp; Unfortunately we discovered that there is very few=
 (usable) lojban audio data on the web, but we actually need huge amounts o=
f them to feed our training algorithms. It would be really cool if some of =
you could actually send us some audio data we can work with, if you do so p=
lease provide them in the following format:<br>


<br>- 16bit mono, 16khz<br>- preferable raw or wav data files<br>- one sent=
ence per audio file<br>- a transcript text file containing one sentence per=
 line + the name of the audio file in which the sentence was uttered <br>


<br>Everybody who sends as applicable data will be mentioned by name in our=
 final term paper, which will be published at the end of this month (You se=
e will really need those data quick).<br><br>Thanks a lot for your effort,<=
br>


Nico &amp; Thorben<br></div>
</blockquote></div></div></div><br></div>
</blockquote></div><br></div>

------=_Part_41197_1723135.1216295380860--


To unsubscribe from this list, send mail to lojban-list-request@lojban.org
with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if
you're really stuck, send mail to secretary@lojban.org for help.