From phma@ixazon.dynip.com Sun Jan 12 13:06:07 2003 Received: with ECARTIS (v1.0.0; list lojban-list); Sun, 12 Jan 2003 13:06:08 -0800 (PST) Received: from 208-150-110-21-adsl.precisionet.net ([208.150.110.21] helo=blackcat.ixazon.lan) by digitalkingdom.org with esmtp (Exim 4.05) id 18XpIw-00079E-00 for lojban-list@lojban.org; Sun, 12 Jan 2003 13:06:02 -0800 Received: by blackcat.ixazon.lan (Postfix, from userid 1001) id 197D1841A; Sun, 12 Jan 2003 21:05:31 +0000 (UTC) From: Pierre Abbat Organization: dis To: lojban-list@lojban.org (lojban-list@lojban.org) Subject: [lojban] Re: zoizoi Date: Sun, 12 Jan 2003 16:05:30 -0500 User-Agent: KMail/1.5 References: <200301081430.JAA05470@mail2.reutershealth.com> <200301081902.30179.phma@webjockey.net> In-Reply-To: <200301081902.30179.phma@webjockey.net> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Disposition: inline Message-Id: <200301121605.30573.phma@webjockey.net> X-archive-position: 3782 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: phma@webjockey.net Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list On Wednesday 08 January 2003 19:02, Pierre Abbat wrote: > I am currently starting the brivla-end-breaking routine, which is simpler > than the brivla-start-breaking routine. This case, or actually a slight > variant of it, tells me that the cmavo preceding a brivla have to be broken > off before the brivla is broken from what follows it. Consider > /zoiFLAluKAVbu.blableblibloblu.FLAluKAVbu./. If the part after the brivla > is broken off before or at the same time as the part before it, this will > be /zoi/ /FLAlu/ /KAVbu/ /blableblibloblu/ /FLAluKAVbu/ when the {zoi} is > detected, and the lexer will erroneously call {kavbu blableblibloblu} the > quote. On further thought, it's going to be difficult to catch all errors of this sort. Consider /zoikybuladjAn/. This is lexed as follows: It ends in a consonant. Look backward for a cmegadri and break before and after: /zoikybu/ /la/ /djAn/ Look backward for [yC] or [C*V]. The first it finds is /yb/, so break there: /zoiky/ /bu/ /la/ /djAn/ The first part consists entirely of cmavo. Split it: /zoi/ /ky/ /bu/ /la/ /djAn/ At this point it recognizes {zoi} and takes {ky} to be the delimiter, not noticing that it was originally not followed by a pause. phma