From lojbab@lojban.org Sat Dec 14 04:15:28 2002
Return-Path: <lojbab@lojban.org>
X-Sender: lojbab@lojban.org
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-8_2_3_0); 14 Dec 2002 12:15:28 -0000
Received: (qmail 58159 invoked from network); 14 Dec 2002 12:15:27 -0000
Received: from unknown (66.218.66.217)
  by m7.grp.scd.yahoo.com with QMQP; 14 Dec 2002 12:15:27 -0000
Received: from unknown (HELO lakemtao03.cox.net) (68.1.17.242)
  by mta2.grp.scd.yahoo.com with SMTP; 14 Dec 2002 12:15:27 -0000
Received: from lojban.lojban.org ([68.100.206.153]) by lakemtao03.cox.net
  (InterMail vM.5.01.04.05 201-253-122-122-105-20011231) with ESMTP
  id <20021214121527.CCNT26808.lakemtao03.cox.net@lojban.lojban.org>
  for <lojban@yahoogroups.com>; Sat, 14 Dec 2002 07:15:27 -0500
Message-Id: <5.2.0.9.0.20021214070806.0317d9b0@pop.east.cox.net>
X-Sender: rlechevalier@pop.east.cox.net
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Sat, 14 Dec 2002 07:13:19 -0500
To: <lojban@yahoogroups.com>
Subject: Re: [lojban] Word resolution algorithm so far
In-Reply-To: <0212131723430D.03697@neofelis>
References: <02121314545209.03697@neofelis>
  <02121314545209.03697@neofelis>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
From: Robert LeChevalier <lojbab@lojban.org>
X-Yahoo-Group-Post: member; u=1120595
X-Yahoo-Profile: lojbab

At 05:23 PM 12/13/02 -0500, Pierre Abbat wrote:
>On Friday 13 December 2002 14:54, Pierre Abbat wrote:
> > C. If the piece contains 'y' and no consonant following 'y' is followed
> > two letters later, not counting apostrophes and commas, by a vowel,
> > split it after 'y'. (e.g. ly.Ebucy.Obukybu.DENpabu)
>
>On second thought, maybe that should be "If the piece contains 'y' not
>adjacent to a vowel, and no consonant...". What should the algorithm do with
>such as these?:
>da'ybaba
>doyli
>dyibuku
>by'ama
>byobu
>xayasa

I am assuming you are NOT allowing for alternate orthography.

As text, with no consonant clusters, and no consonant followed by a space, 
they should break before each consonant into cmavo. However, since several 
of the vowel combinations have no defined Lojban pronunciation, they cannot 
be renderings of a Lojban speech stream, and hence are errors that should 
be rejected out of hand as invalid input. The first and the fourth appear 
to be pronounceable, and hence should be cmavo sequences.

lojbab

-- 
lojbab lojbab@lojban.org
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273
Artificial language Loglan/Lojban: http://www.lojban.org



