From: Pierre Abbat <phma@webjockey.net>
To: lojban@yahoogroups.com
Subject: Re: [lojban] Bug in word break algorithm
Date: Mon, 6 Jan 2003 15:05:16 -0500
Message-Id: <200301061505.16204.phma@webjockey.net>

On Monday 06 January 2003 13:37, Jorge Llambias wrote:
> I don't know what the front-middle method is, but a slinku'i
> is any form starting with a permissible initial cluster such
> that if you remove the initial consonant you are left with a
> lujvo. So if you can recognize a lujvo, you can also easily
> recognize a slinku'i.

Not exactly; a slinku'i is any form that starts with a permissible initial
cluster and is not itself a lujvo, such that prepending CV produces a lujvo.
{branda} and {spa'i} are slinku'i.

The front-middle method is explained in BRKWORDS.TXT. It consists of looking
for a sequence of front-middles of lujvo, such as CCV and CV'V, followed by
one lujvo back, or one long or CVC rafsi, depending on what part of the
algorithm you're in.

phma
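
The slinku'i definition given in the reply can be sketched as a small predicate. This is only an illustration under stated assumptions: `is_slinkuhi` is a hypothetical name, the lujvo recognizer is passed in as a caller-supplied `is_lujvo` function rather than implemented here, and the choice of "pa" as the CV syllable to prepend is arbitrary and not taken from the post or from BRKWORDS.TXT.

```python
# The 48 permissible initial consonant pairs of Lojban.
INITIAL_PAIRS = frozenset("""
bl br cf ck cl cm cn cp cr ct dj dr dz fl fr gl gr
jb jd jg jm jv kl kr ml mr pl pr sf sk sl sm sn sp sr st
tc tr ts vl vr xl xr zb zd zg zm zv
""".split())


def is_slinkuhi(form, is_lujvo, probe="pa"):
    """Return True if `form` is a slinku'i under the definition in the reply.

    `is_lujvo` is a caller-supplied predicate (hypothetical; a real one would
    implement the lujvo morphology rules). `probe` is the CV syllable to
    prepend; the default "pa" is an assumption made for this sketch.
    """
    # 1. The form must start with a permissible initial cluster.
    if form[:2] not in INITIAL_PAIRS:
        return False
    # 2. The form must not itself be a lujvo.
    if is_lujvo(form):
        return False
    # 3. Prepending CV must produce a lujvo.
    return is_lujvo(probe + form)


# Toy stand-in for a lujvo recognizer, used only to exercise the logic.
# It is a lookup in a hand-written set, not a real morphology check.
TOY_LUJVO = {"pabranda", "paspa'i", "smabru"}


def toy_is_lujvo(word):
    return word in TOY_LUJVO


if __name__ == "__main__":
    for word in ("branda", "spa'i", "smabru", "randa"):
        print(word, is_slinkuhi(word, toy_is_lujvo))
```

Passing the recognizer in as a parameter keeps the sketch honest about what it does not cover: all of the difficulty sits in `is_lujvo`, which is the point of the quoted remark that recognizing a lujvo is the prerequisite.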