
[lojban-beginners] Re: More lojban audio

To: lojban-beginners@lojban.org
Subject: [lojban-beginners] Re: More lojban audio
From: Alex Martini <alexjm@umich.edu>
Date: Sun, 11 Jun 2006 13:42:13 -0400
In-reply-to: <20060610102136.GA9252@haqq.starman.ee>
References: <C009F494-FA80-4D63-9B05-710EEAC19A06@umich.edu> <20060610102136.GA9252@haqq.starman.ee>
Reply-to: lojban-beginners@lojban.org
Sender: lojban-beginners-bounce@lojban.org

coi cizra

Yes, they could be analysed and used as the starting point for a text to speech system. This would entail chopping them up into several hundred bits and doing some magic in Praat on each one. But there really isn't much point in doing that, because most of what we would need is already written in one form or another, so we'd just be re-inventing the wheel.

Speech, like many other problems, can be handled in two ways. There's the straight-through approach, where we write one colossus of a program that takes a document in and spits out natural sounding speech. This is very un-portable, however, and usually difficult to adjust. If we write a speech engine of this nature for English (for example), we have to start all over when we want one for Spanish or Lojban or whatever.

The other way to tackle the problem is called the modular approach. We write parts that do different things. There's a module that takes the written orthography of language X and converts it into some 1:1 representation of phonetics like SAMPA or IPA or whatever. Then we have a module that analyses the structure of the sentences and clauses and produces the tonality (intonation) for the sentence. We could actually do without this module; we would just get much more mechanical sounding output. Finally, we make a module that takes in IPA/SAMPA and a tonality and produces a sound file of the pronunciation.
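
To make the three-module split concrete, here is a minimal sketch in Python. Every name in it (make_tts, flat_intonation, the toy stand-in modules) is made up for illustration and does not come from any existing speech package. The "synthesizer" just returns a text trace rather than audio; the point is only that each stage can be swapped out independently.

```python
from typing import Callable, List

Phonemes = List[str]   # output of the orthography module
Pitch = List[float]    # one relative pitch value per phoneme


def flat_intonation(phonemes: Phonemes) -> Pitch:
    """Stand-in for the optional intonation module: constant pitch throughout."""
    return [1.0 for _ in phonemes]


def make_tts(
    to_phonemes: Callable[[str], Phonemes],
    synthesize: Callable[[Phonemes, Pitch], bytes],
    intonation: Callable[[Phonemes], Pitch] = flat_intonation,
) -> Callable[[str], bytes]:
    """Chain the three modules into one text-in, sound-out function."""
    def speak(text: str) -> bytes:
        phonemes = to_phonemes(text)          # language-specific part
        pitch = intonation(phonemes)          # optional, defaults to flat
        return synthesize(phonemes, pitch)    # language-independent part
    return speak


# Toy stand-ins (hypothetical) so the sketch runs end to end.
def toy_orthography(text: str) -> Phonemes:
    # Pretend every letter is exactly one phoneme.
    return [ch for ch in text.lower() if ch.isalpha()]


def toy_synth(phonemes: Phonemes, pitch: Pitch) -> bytes:
    # A real module would produce audio; this one emits a readable trace.
    trace = " ".join(f"{p}@{f:.1f}" for p, f in zip(phonemes, pitch))
    return trace.encode("utf-8")


speak = make_tts(toy_orthography, toy_synth)
```

Supporting another language would then mean passing a different to_phonemes function and leaving the other two stages alone.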

In the modular method, all that would need to be written is an orthography module for Lojban. Since Lojban doesn't formally define any tonality, we could just use pretty much anything, although Lojban speakers probably just use the one from their first language.
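
As a rough idea of how small such an orthography module could be, here is a sketch in Python. The letter table follows the usual Lojban pronunciation guide as I understand it, picking one common IPA value per letter; the function name lojban_to_ipa is made up. It is deliberately simplified: it ignores stress, treats every "." as a glottal stop, and only knows the four standard diphthongs plus the i/u glides before a vowel.

```python
# One common IPA value per Lojban letter (an assumption; several letters
# allow more than one acceptable pronunciation).
SINGLE = {
    "a": "a", "e": "ɛ", "i": "i", "o": "o", "u": "u", "y": "ə",
    "b": "b", "c": "ʃ", "d": "d", "f": "f", "g": "ɡ", "j": "ʒ",
    "k": "k", "l": "l", "m": "m", "n": "n", "p": "p", "r": "r",
    "s": "s", "t": "t", "v": "v", "x": "x", "z": "z",
    "'": "h",   # apostrophe is an h-like sound between vowels
    ".": "ʔ",   # period marks a pause, rendered here as a glottal stop
}
DIPHTHONGS = {"ai": "aj", "ei": "ɛj", "oi": "oj", "au": "aw"}
VOWELS = "aeiouy"


def lojban_to_ipa(text: str) -> str:
    """Convert Lojban text to a simplified IPA string (no stress marking)."""
    text = text.lower()
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        pair = text[i:i + 2]
        nxt = text[i + 1] if i + 1 < len(text) else ""
        if pair in DIPHTHONGS:
            out.append(DIPHTHONGS[pair])
            i += 2
        elif ch in "iu" and nxt and nxt in VOWELS:
            # i or u directly before a vowel acts as a glide
            out.append("j" if ch == "i" else "w")
            i += 1
        elif ch in SINGLE:
            out.append(SINGLE[ch])
            i += 1
        elif ch == ",":
            i += 1  # comma only separates syllables; it has no sound
        else:
            out.append(ch)  # spaces and anything unknown pass through
            i += 1
    return "".join(out)


print(lojban_to_ipa("coi"))
print(lojban_to_ipa("mi'e .aleks."))
```

The output string could then be handed to whatever existing engine accepts IPA or SAMPA input.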

In short, there are many existing text to speech solutions already available. We are much better served making use of them than attempting to make a new one from the ground up for Lojban. Text to speech is easy -- the Commodore 64 had a text to speech program in the 80's. Quality natural text to speech is much harder.


mu'o mi'e .aleks.

On Jun 10, 2006, at 6:21 AM, elmo@haqq.pri.ee wrote:

On 20:20 Fri 09 Jun     , Alex Martini wrote:

Just finished recording a really long sample of the basic sounds of
Lojban, about 25 minutes in length. It is currently uploading to my

Can these sounds be used in speech synthesis? If I'm not wrong, Festival and friends just concatenate digraphs (disounds?).
cizra
--
GPG public key: http://ttu.masendav.net/~t040673/pubkey
