Received: from mail-oi0-f59.google.com ([209.85.218.59]:32822) by stodi.digitalkingdom.org with esmtps (TLSv1.2:AES128-GCM-SHA256:128) (Exim 4.85) (envelope-from ) id 1Z8go3-0001jk-CG for lojban-list-archive@lojban.org; Fri, 26 Jun 2015 20:29:33 -0700 Received: by oiav1 with SMTP id v1sf30175742oia.0 for ; Fri, 26 Jun 2015 20:29:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:message-id:subject:mime-version:content-type :x-original-sender:reply-to:precedence:mailing-list:list-id :x-spam-checked-in-group:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe; bh=UtXC3ah0VRpPccpiPZ56QI0BSAhqIeyt3Kyh7tA5AAY=; b=k0TvmXc+HkYYoFYiCJXKkUwjQUAJePRDAQJe7UMY0hcrCrA0T+cOYF23coF64Gb37w vuoIcSna4b9RhfUH5kCI6C+xd2X2O8ViwxUaBP1Io8AS5GR+7Amgp0M5+p5QCKsLyVEb sJ7K98j0JCC/EPXgFm8PJyiMzt/MV7XRf0ww/L2Fri6KcddKD0nyiMT6vONyqWE59TwJ Ge8SYiCTBz97jbP6lTd8twdIc7uN5ro2DhQAT5ZSPJ7vAksiPipkfIxQM0b3dISeQ33A 1xpllu1M2T7zITN6ky8Fp8EjT3tn/R996vMCh3zeLsyM38Cx1igYsA4ZrpBjzFXozyDL slxA== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:message-id:subject:mime-version:content-type :x-original-sender:reply-to:precedence:mailing-list:list-id :x-spam-checked-in-group:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe; bh=UtXC3ah0VRpPccpiPZ56QI0BSAhqIeyt3Kyh7tA5AAY=; b=E6760V9y5Rmo7l+FpPspzgYR2c8AveKh3KwZUXVezNkusnHGaOkqXX/3D/2qHv23/g wLHqylqtkdVq09SkQSCSBrtlbUuI5f6Qd1+Lih4ZqPJR6Iopee4aL/3EBSUjuzF5UbGl bsIXhLAefo7EKRlgmT8tK+XSNRSsflEBLgwDyxVnLjZSbJiZ8TzaNo3eKWSGjAJGGx9c SVINkirTzkD94AsBEgGpUl5C1nXtS4K8Jk/2f2NvkAOZOeKKuDe7R00Kxe7oszDuREYP fs4Gj/WRjLafadtMJjuEb8BTfe6dm/vnaxFW79q79AcoagAh+UyOnArqsjp+/nouj2IB KfCw== X-Received: by 10.140.94.166 with SMTP id g35mr65550qge.1.1435375760870; Fri, 26 Jun 2015 20:29:20 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.140.25.248 with SMTP id 111ls2403537qgt.12.gmail; Fri, 26 Jun 2015 20:29:20 -0700 (PDT) X-Received: by 10.140.37.129 with SMTP id r1mr66878qgr.18.1435375760504; Fri, 26 Jun 2015 20:29:20 -0700 (PDT) Date: Fri, 26 Jun 2015 20:29:20 -0700 (PDT) From: sorpa'as plat To: lojban@googlegroups.com Message-Id: Subject: [lojban] The Prototype of a Lojban Speech Recognition Tool MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_2619_1607334825.1435375760016" X-Original-Sender: sorpaas@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Spam-Checked-In-Group: lojban@googlegroups.com X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -2.0 (--) X-Spam_score: -2.0 X-Spam_score_int: -19 X-Spam_bar: -- ------=_Part_2619_1607334825.1435375760016 Content-Type: multipart/alternative; boundary="----=_Part_2620_1279608162.1435375760016" ------=_Part_2620_1279608162.1435375760016 Content-Type: text/plain; charset=UTF-8 Hi all, I'm trying to build a Lojban speech recognition called tersku. Instead of building an acoustic model by hand (which may need many manpower and takes a long time), the attempt is to take the English acoustic model (which is pretty mature) and adapt it for Lojban sounds. A running prototype can be found at https://git.null.tl/tersku.git (use *git://git.null.tl/tersku.git* to clone). The prototype uses a unmodified version of CMU's generic English acoustic model, with only necessary dictionary and grammars to parse the text "le tanxe be le birka cu cpana le tanxe be le botpi". To use it, recording a version of the text "le tanxe be le birka cu cpana le tanxe be le botpi", convert the recording to wav format, and replace the /resources/org/lojban/tersku/recording.wav file with it. The program will output the best "hypothesis" for the text. The program does not work really well. That means there's lots of work and I would appreciate your help. Below are some details of things to be done. *About the Program* tersku uses CMU's Sphinx speech recognition engine. You can find Sphinx's tutorials and documentations at http://cmusphinx.sourceforge.net. *Adapt the Acoustic Model* The adaptation requires some 16KHZ single-channel wav recordings. Help are appreciated if someone can create a Lojban phrase recording collection. Note that a phrase recording collection will benefit the whole Lojban community but not just the speech recognition program :) *Finish the Dictionary* The dictionary in the prototype locates at resources/org/lojban/tersku/jbo-1.dict. Because we are trying to adapt the English acoustic model, all the phones are represented in Arpabet ( *https://en.wikipedia.org/wiki/Arpabet*). We will need to a) confirm which arpabet symbol represents which Lojban sound, and b) write a program that generates all the words in "[lojban word] [arpabet symbols]". This is probably dependent of the adaptation of the acoustic model. *Finish the Grammar* The grammar needs to be written in JSGF format ( http://cmusphinx.sourceforge.net/wiki/tutoriallm). This haven't been started yet (which needs help!). *Correct Me!* There must be mistakes and errors both in the codes and in the recognition details (I'm new to speech recognition!). Feel free to reach me at this email address or by opening an task at https://phabricator.null.tl. I'm really looking forward to a Lojban speech recognition tool, because it should be one of the features of Lojban :) Wei mu'o mi'e la sorpa'as -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. ------=_Part_2620_1279608162.1435375760016 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Hi all,

I'm trying to build a Lojban speech recogni= tion called tersku. Instead of building an acoustic model by hand (which ma= y need many manpower and takes a long time), the attempt is to take the Eng= lish acoustic model (which is pretty mature) and adapt it for Lojban sounds= .

A running prototype can be found at https://git.null.tl/tersku.git (use git://git.null.tl/t= ersku.git to clone). The prototype uses a unmodified version of CMU's g= eneric English acoustic model, with only necessary dictionary and grammars = to parse the text "le tanxe be le birka cu cpana le tanxe be le botpi". To = use it, recording a version of the text "le tanxe be le birka cu cpana le t= anxe be le botpi", convert the recording to wav format, and replace the /re= sources/org/lojban/tersku/recording.wav file with it. The program will outp= ut the best "hypothesis" for the text.

The program does not work rea= lly well. That means there's lots of work and I would appreciate your help.= Below are some details of things to be done.

About the Program
tersku uses CMU's Sphinx speech recognition engine. You can find Sphi= nx's tutorials and documentations at http://cmusphinx.sourceforge.net.

Adapt the Acoustic M= odel
The adaptation requires some 16KHZ single-channel wav recording= s. Help are appreciated if someone can create a Lojban phrase recording col= lection. Note that a phrase recording collection will benefit the whole Loj= ban community but not just the speech recognition program :)

Fini= sh the Dictionary
The dictionary in the prototype locates at resourc= es/org/lojban/tersku/jbo-1.dict. Because we are trying to adapt the English= acoustic model, all the phones are represented in Arpabet (https://en.w= ikipedia.org/wiki/Arpabet). We will need to a) confirm which arpabet sy= mbol represents which Lojban sound, and b) write a program that generates a= ll the words in "[lojban word] [arpabet symbols]". This is probably depende= nt of the adaptation of the acoustic model.

Finish the Grammar
The grammar needs to be written in JSGF format (http://cmusphinx.sourceforge.net/wik= i/tutoriallm). This haven't been started yet (which needs help!).
Correct Me!
There must be mistakes and errors both in the codes= and in the recognition details (I'm new to speech recognition!).

Fe= el free to reach me at this email address or by opening an task at https://phabricator.null.tl. I'm reall= y looking forward to a Lojban speech recognition tool, because it should be= one of the features of Lojban :)

Wei
mu'o mi'e la sorpa'as

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_2620_1279608162.1435375760016-- ------=_Part_2619_1607334825.1435375760016--