From nobody@digitalkingdom.org Thu Jul 17 03:36:25 2008 Received: with ECARTIS (v1.0.0; list lojban-list); Thu, 17 Jul 2008 03:36:25 -0700 (PDT) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69) (envelope-from ) id 1KJQqX-000586-1I for lojban-list-real@lojban.org; Thu, 17 Jul 2008 03:36:25 -0700 Received: from an-out-0708.google.com ([209.85.132.242]) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1KJQqS-00057v-M8 for lojban-list@lojban.org; Thu, 17 Jul 2008 03:36:24 -0700 Received: by an-out-0708.google.com with SMTP id c3so136191ana.61 for ; Thu, 17 Jul 2008 03:36:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type:references; bh=JYRt0eUXmHKQAO7H7e0ZkkM56ZNV6wTjgMUr+5x9iG4=; b=YHk9tjg26YBbgPcFjCXjTRtnGKoZbBMDZjL03OXafEJ05sOX/N7mdms2kf1rLyHXtT qfW1iNpKoLP8SNfyMzVUoE3/ZYsitf9gJA1lmVlGnzfC1BDDw99mHC6oDJKkmZP9mt3J vfBE3TtV8Txxupg5+cRy96RpX0YjpuxC44Ihs= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:references; b=AaJ6bQeiIP7+Jpuxvovjp0hqclebCJW05HQIzFt7GVb1ZyqeAmFtntg6IVKRCZrdIS mF3IzacCjKSlBCH6wI7cPiQCjfrnnhvczzji/5HW9fi9NHsU+9bGFQ47Iz0WTMOXIPLQ L4LyojwXZyzTc7C8sWI7YZz93EEHeSQOu0AKg= Received: by 10.100.120.15 with SMTP id s15mr3823828anc.66.1216290979463; Thu, 17 Jul 2008 03:36:19 -0700 (PDT) Received: by 10.100.248.15 with HTTP; Thu, 17 Jul 2008 03:36:19 -0700 (PDT) Message-ID: Date: Thu, 17 Jul 2008 11:36:19 +0100 From: "james riley" To: lojban-list@lojban.org Subject: [lojban] Re: Lojban Speech Recognition semester-project In-Reply-To: MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_51363_4229742.1216290979459" References: X-Spam-Score: 0.0 X-Spam-Score-Int: 0 X-Spam-Bar: / X-archive-position: 14608 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: jimr1603@gmail.com Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list ------=_Part_51363_4229742.1216290979459 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Random sentences okay or should they be part of a bigger prose? I could churn out loads tomorrow (unless something happens), but I'm afk today to help out at my uni. My pronunciation needs practise, but is mostly okay. Also, wav is very big, how do you want us to send you loads of recordings i= n wav? 2008/7/16 Nico M=F6ller : > Hi guys, > > We have got a request a hopefully some of you are willing to help us. We > are currently studying cognitive science at the university of osnabrueck = and > participating in a course called "practical natural language processing", > which is some kind of semester project in lingusitics. Our group decided= to > deal with some speech recognition and because lojban has so nice phonetic > features we choose it as our target language, Unfortunately we discovere= d > that there is very few (usable) lojban audio data on the web, but we > actually need huge amounts of them to feed our training algorithms. It wo= uld > be really cool if some of you could actually send us some audio data we c= an > work with, if you do so please provide them in the following format: > > - 16bit mono, 16khz > - preferable raw or wav data files > - one sentence per audio file > - a transcript text file containing one sentence per line + the name of t= he > audio file in which the sentence was uttered > > Everybody who sends as applicable data will be mentioned by name in our > final term paper, which will be published at the end of this month (You s= ee > will really need those data quick). > > Thanks a lot for your effort, > Nico & Thorben > ------=_Part_51363_4229742.1216290979459 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline
Random sentences okay or should they be part of a bigger p= rose? I could churn out loads tomorrow (unless something happens), but I= 9;m afk today to help out at my uni. My pronunciation needs practise, but i= s mostly okay. Also, wav is very big, how do you want us to send you loads = of recordings in wav?

2008/7/16 Nico M=F6ller <nmoeller@uos.de>:
Hi guys,

We have got a request a hopefully some of = you are willing to help us. We are currently studying cognitive science at = the university of osnabrueck and participating in a course called "pra= ctical natural language processing", which is some kind of semester pr= oject in lingusitics.  Our group decided to deal with some speech reco= gnition and because lojban has so nice phonetic features we choose it as ou= r target language,  Unfortunately we discovered that there is very few= (usable) lojban audio data on the web, but we actually need huge amounts o= f them to feed our training algorithms. It would be really cool if some of = you could actually send us some audio data we can work with, if you do so p= lease provide them in the following format:

- 16bit mono, 16khz
- preferable raw or wav data files
- one sent= ence per audio file
- a transcript text file containing one sentence per= line + the name of the audio file in which the sentence was uttered

Everybody who sends as applicable data will be mentioned by name in our= final term paper, which will be published at the end of this month (You se= e will really need those data quick).

Thanks a lot for your effort,<= br> Nico & Thorben

------=_Part_51363_4229742.1216290979459-- To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.