From: Timothy Lawrence
To: lojban@googlegroups.com
Subject: Re: [lojban] The Prototype of a Lojban Speech Recognition Tool
Date: Sat, 27 Jun 2015 06:20:59 +0000

I really like this initiative :)


A friend of mine is working on speech recognition (for other purposes) and tried Sphinx but ended up switching to Kaldi (http://kaldi.sourceforge.net/about.html), I believe because he found it to be better at recognising speech and/or faster. I thought I'd mention it in case you're not fully set on Sphinx.



From: lojban@googlegroups.com <lojban@googlegroups.com> on behalf of sorpa'as plat <sorpaas@gmail.com>
Sent: Saturday, 27 June 2015 1:29 PM
To: lojban@googlegroups.com
Subject: [lojban] The Prototype of a Lojban Speech Recognition Tool
 
Hi all,

I'm trying to build a Lojban speech recognition tool called tersku. Instead of building an acoustic model by hand (which would need a lot of manpower and take a long time), the attempt is to take the English acoustic model (which is pretty mature) and adapt it to Lojban sounds.

A running prototype can be found at https://git.null.tl/tersku.git (use git://git.null.tl/tersku.git to clone). The prototype uses an unmodified version of CMU's generic English acoustic model, with only the dictionary and grammar needed to parse the text "le tanxe be le birka cu cpana le tanxe be le botpi". To use it, record yourself reading that text, convert the recording to WAV format, and replace the /resources/org/lojban/tersku/recording.wav file with it. The program will output its best "hypothesis" for the text.

The program does not work very well yet. That means there's a lot of work left, and I would appreciate your help. Below are some details of the things to be done.
About the Program
tersku uses CMU's Sphinx speech recognition engine. You can find Sphinx's tutorials and documentation at http://cmusphinx.sourceforge.net.

Adapt the Acoustic Model
The adaptation requires some 16 kHz single-channel WAV recordings. Help would be appreciated if someone can create a collection of recorded Lojban phrases. Note that such a collection would benefit the whole Lojban community, not just the speech recognition program :)
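
As a rough illustration of the recording requirement above, here is a minimal Python sketch that checks whether a WAV file is 16 kHz and single-channel before it goes into a collection. The function name `check_wav` is hypothetical and not part of tersku; the sketch uses only Python's standard `wave` module, and it does not check sample width because the requirement above does not state one.

```python
import wave


def check_wav(path, rate=16000, channels=1):
    """Return a list of format problems; an empty list means the file matches."""
    problems = []
    with wave.open(path, "rb") as wav:
        # Compare the header values against the expected sample rate and channel count.
        if wav.getframerate() != rate:
            problems.append(
                f"sample rate is {wav.getframerate()} Hz, expected {rate} Hz"
            )
        if wav.getnchannels() != channels:
            problems.append(
                f"file has {wav.getnchannels()} channel(s), expected {channels}"
            )
    return problems
```

Calling `check_wav("recording.wav")` on a candidate file returns the list of mismatches, so a recording that needs resampling or downmixing can be spotted before it is used for adaptation.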

Finish the Dictionary
The dictionary in the prototype is located at resources/org/lojban/tersku/jbo-1.dict. Because we are trying to adapt the English acoustic model, all the phones are represented in Arpabet (https://en.wikipedia.org/wiki/Arpabet). We will need to a) confirm which Arpabet symbol represents which Lojban sound, and b) write a program that generates an entry for every word in the format "[lojban word] [arpabet symbols]". This probably depends on the adaptation of the acoustic model.
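
A possible starting point for step b) is sketched below in Python. Everything in it is an assumption to be corrected once step a) is settled: the letter-to-Arpabet table is only a first guess (in particular, English has no equivalent of Lojban "x", so HH is used as a rough stand-in), the names `LETTERS`, `DIPHTHONGS`, `to_arpabet` and `dict_line` are hypothetical, and the output format is assumed to be one word followed by space-separated phones.

```python
# First-guess mapping from Lojban letters to Arpabet phones (assumption, unverified).
LETTERS = {
    "a": "AA", "e": "EH", "i": "IY", "o": "OW", "u": "UW", "y": "AH",
    "b": "B", "c": "SH", "d": "D", "f": "F", "g": "G", "j": "ZH",
    "k": "K", "l": "L", "m": "M", "n": "N", "p": "P", "r": "R",
    "s": "S", "t": "T", "v": "V", "z": "Z",
    "x": "HH",  # stand-in only: English has no velar fricative
    "'": "HH",
}

# Falling diphthongs are matched before single vowels (also a guess).
DIPHTHONGS = {"ai": "AY", "ei": "EY", "oi": "OY", "au": "AW"}


def to_arpabet(word):
    """Convert one Lojban word to a list of Arpabet phone symbols."""
    word = word.lower()
    phones = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in DIPHTHONGS:
            phones.append(DIPHTHONGS[pair])
            i += 2
        elif word[i] in LETTERS:
            phones.append(LETTERS[word[i]])
            i += 1
        elif word[i] in ".,":
            # Pause and syllable-separator marks produce no phone here.
            i += 1
        else:
            raise ValueError(f"unexpected character {word[i]!r} in {word!r}")
    return phones


def dict_line(word):
    """Format one dictionary entry as '[lojban word] [arpabet symbols]'."""
    return f"{word} {' '.join(to_arpabet(word))}"


if __name__ == "__main__":
    for w in ["le", "tanxe", "be", "birka", "cu", "cpana", "botpi"]:
        print(dict_line(w))
```

Keeping the mapping in plain tables means it can be corrected in one place as the adapted acoustic model shows which symbols actually work for each Lojban sound.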

Finish the Grammar
The grammar needs to be written in JSGF format (http://cmusphinx.sourceforge.net/wiki/tutoriallm). This hasn't been started yet (and needs help!).
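
To give an idea of what a JSGF file looks like, the following Python sketch builds a tiny grammar that accepts only a fixed list of sentences, such as the test sentence used by the prototype. It is not a Lojban grammar: the function name `build_jsgf`, the grammar name and the rule name `<sentence>` are all made up for illustration, and words containing an apostrophe may need extra quoting that this sketch does not handle.

```python
def build_jsgf(name, sentences):
    """Build a JSGF grammar whose public rule matches any one of the given sentences."""
    # Each sentence becomes one parenthesised alternative of the public rule.
    alternatives = " | ".join(
        "(" + " ".join(sentence.split()) + ")" for sentence in sentences
    )
    return (
        "#JSGF V1.0;\n"
        "\n"
        f"grammar {name};\n"
        "\n"
        f"public <sentence> = {alternatives};\n"
    )


if __name__ == "__main__":
    print(build_jsgf(
        "tersku",
        ["le tanxe be le birka cu cpana le tanxe be le botpi"],
    ))
```

A real grammar would replace the fixed alternatives with rules for Lojban's actual syntax; the sketch only shows the header, the grammar declaration and a single public rule.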

Correct Me!
There must be mistakes and errors both in the code and in the recognition details (I'm new to speech recognition!).

Feel free to reach me at this email address or by opening a task at https://phabricator.null.tl. I'm really looking forward to a Lojban speech recognition tool, because it should be one of the features of Lojban :)

Wei
mu'o mi'e la sorpa'as

--
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.
