From lojbanlists@wonderclown.com Thu Jan 03 17:59:04 2002
Return-Path: <randy@wonderclown.com>
X-Sender: randy@wonderclown.com
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-8_0_1_3); 4 Jan 2002 01:59:05 -0000
Received: (qmail 81719 invoked from network); 4 Jan 2002 01:59:05 -0000
Received: from unknown (216.115.97.172)
  by m11.grp.snv.yahoo.com with QMQP; 4 Jan 2002 01:59:05 -0000
Received: from unknown (HELO mail7.nc.rr.com) (24.93.67.54)
  by mta2.grp.snv.yahoo.com with SMTP; 4 Jan 2002 01:59:03 -0000
Received: from aerosol ([66.57.85.122]) by mail7.nc.rr.com with Microsoft SMTPSVC(5.5.1877.687.68);
  Thu, 3 Jan 2002 20:58:59 -0500
Received: from rnortman by aerosol with local (Exim 3.33 #1 (Debian))
  id 16MJdF-0000FU-00; Thu, 03 Jan 2002 20:58:53 -0500
Date: Thu, 3 Jan 2002 20:58:52 -0500
To: lojban@yahoogroups.com
Cc: buzzwyrd <jcrossco@bellsouth.net>
Subject: Re: [lojban] Lojban Text to Speech
Message-ID: <20020104015851.GA545@aerosol>
References: <66522672-0018-11D6-A890-000393074A5A@urbanium.tv> <a127ih+qvf9@eGroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <a127ih+qvf9@eGroups.com>
User-Agent: Mutt/1.3.24i
Sender: Randall Nortman <randy@wonderclown.com>
Return-Path: randy@wonderclown.com
From: "randl. nortmn." <lojbanlists@wonderclown.com>
X-Yahoo-Group-Post: member; u=91794573
X-Yahoo-Profile: uyndrklaun

On Thu, Jan 03, 2002 at 06:23:13PM -0000, buzzwyrd wrote:
[...]
> I have added a link to a wav file that I generated from the Lojban 
> text. It was made with CMU's Festvox TTS system. It's by no means 
> perfect, but it does seem to help me to hear spoken lojban, even if
> it is a little gravelly.
[...]

I'm interested in knowing how you got Festvox to speak Lojban. Did
you actually write the frontend that parses Lojban text into phonemes,
or did you first convert the Lojban to English text, and then send
that through the standard English frontend? If the former, then all
that remains is to record a set of high-quality diphones for Lojban.
(Also, where did you get the fricative 'k' ('x' in Lojban) sound?)

By the way, for those that have commented on the poor quality and
apparent Southern twang in the sound file, both are due to the fact
that very old, poor quality 8kHz diphone samples are being used, and
probably the English ones. With those samples, even English sounds
terrible. I think the Southern twang is just an accident, largely due
to the fact that the samples are not of the pure Lojban vowels, but
the American sliding dipthongy vowels.

> Oh, and believe me, if you want to record all the diphones, I hope 
> you've got a leather larynx.

Agreed! This is why I pawned that off on others to do. And since you
seem to have done my part of the job already, I get to sit back and
enjoy the fruits of others' labor! ;-)

Seriously, good job on both the SHRDLU dialog translation and the
TTS. That translation is much more ambitious than anything I've
undertaken so far.

mu'omi'e randl.