From: Robert LeChevalier <lojbab@lojban.org>
Date: Sat, 14 Dec 2002 07:13:19 -0500
Subject: Re: [lojban] Word resolution algorithm so far

At 05:23 PM 12/13/02 -0500, Pierre Abbat wrote:
>On Friday 13 December 2002 14:54, Pierre Abbat wrote:
> > C. If the piece contains 'y' and no consonant following 'y' is followed
> > two letters later, not counting apostrophes and commas, by a vowel,
> > split it after 'y'. (e.g. ly.Ebucy.Obukybu.DENpabu)
>
>On second thought, maybe that should be "If the piece contains 'y' not
>adjacent to a vowel, and no consonant...". What should the algorithm do with
>such as these?:
>da'ybaba
>doyli
>dyibuku
>by'ama
>byobu
>xayasa

I am assuming you are NOT allowing for alternate orthography.
As text, with no consonant clusters, and no consonant followed by a space,
they should break before each consonant into cmavo.

However, since several of the vowel combinations have no defined Lojban
pronunciation, they cannot be renderings of a Lojban speech stream, and
hence are errors that should be rejected out of hand as invalid input.

The first and the fourth appear to be pronounceable, and hence should be
cmavo sequences.

lojbab

--
lojbab                                             lojbab@lojban.org
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA                    703-385-0273
Artificial language Loglan/Lojban: http://www.lojban.org
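
The treatment described in the reply above can be sketched roughly as follows. This is only an illustration, not the list's agreed algorithm: the function names are hypothetical, the input is assumed to be lowercase with no consonant clusters, and the test for "no defined pronunciation" is a deliberate simplification that flags any 'y' written directly next to another vowel with no apostrophe between them. That assumption happens to match the six examples in this thread but is not a full statement of Lojban phonology.

```python
# Hypothetical sketch: split a cluster-free piece into cmavo, rejecting
# pieces in which 'y' directly touches another vowel (an assumed,
# simplified stand-in for "no defined Lojban pronunciation").

CONSONANTS = set("bcdfgjklmnprstvxz")
VOWELS = set("aeiouy")


def has_undefined_y_pair(piece):
    """Return True if 'y' is directly adjacent to another vowel."""
    for left, right in zip(piece, piece[1:]):
        if left in VOWELS and right in VOWELS and "y" in (left, right):
            return True
    return False


def split_into_cmavo(piece):
    """Break the piece before each consonant (assumes no clusters)."""
    parts = []
    current = ""
    for ch in piece:
        if ch in CONSONANTS and current:
            parts.append(current)
            current = ""
        current += ch
    if current:
        parts.append(current)
    return parts


def resolve(piece):
    """Return a list of cmavo, or None if the piece is rejected."""
    if has_undefined_y_pair(piece):
        return None
    return split_into_cmavo(piece)


if __name__ == "__main__":
    for word in ["da'ybaba", "doyli", "dyibuku", "by'ama", "byobu", "xayasa"]:
        print(word, "->", resolve(word))
```

Under these assumptions, only da'ybaba and by'ama come back as cmavo sequences (da'y ba ba and by'a ma); the other four are rejected as invalid input.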