From phma@webjockey.net Mon Feb 03 15:05:22 2003 Return-Path: X-Sender: lojban-out@lojban.org X-Apparently-To: lojban@yahoogroups.com Received: (EGP: mail-8_2_3_4); 3 Feb 2003 23:05:22 -0000 Received: (qmail 19411 invoked from network); 3 Feb 2003 23:05:21 -0000 Received: from unknown (66.218.66.217) by m7.grp.scd.yahoo.com with QMQP; 3 Feb 2003 23:05:21 -0000 Received: from unknown (HELO digitalkingdom.org) (204.152.186.175) by mta2.grp.scd.yahoo.com with SMTP; 3 Feb 2003 23:05:21 -0000 Received: from lojban-out by digitalkingdom.org with local (Exim 4.05) id 18fpeT-0006bj-00 for lojban@yahoogroups.com; Mon, 03 Feb 2003 15:05:21 -0800 Received: from digitalkingdom.org ([204.152.186.175] helo=chain) by digitalkingdom.org with esmtp (Exim 4.05) id 18fpeM-0006bL-00; Mon, 03 Feb 2003 15:05:14 -0800 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 03 Feb 2003 15:05:12 -0800 (PST) Received: from 208-150-110-21-adsl.precisionet.net ([208.150.110.21] helo=blackcat.ixazon.lan) by digitalkingdom.org with esmtp (Exim 4.05) id 18fpeA-0006b6-00 for lojban-list@lojban.org; Mon, 03 Feb 2003 15:05:02 -0800 Received: by blackcat.ixazon.lan (Postfix, from userid 1001) id F2F0B905B; Mon, 3 Feb 2003 23:04:36 +0000 (UTC) Organization: dis To: lojban-list@lojban.org Subject: [lojban] Re: Lexing text with {fa'o} and {zoi} in it Date: Mon, 3 Feb 2003 18:04:36 -0500 User-Agent: KMail/1.5 References: <200302020059.24435.phma@webjockey.net> <20030203204620.GD20552@digitalkingdom.org> In-Reply-To: <20030203204620.GD20552@digitalkingdom.org> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Disposition: inline Message-Id: <200302031804.36643.phma@webjockey.net> X-archive-position: 4001 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: phma@webjockey.net Precedence: bulk X-list: lojban-list From: Pierre Abbat Reply-To: phma@webjockey.net X-Yahoo-Group-Post: member; u=92712300 On Monday 03 February 2003 15:46, Robin Lee Powell wrote: > On Sun, Feb 02, 2003 at 12:59:24AM -0500, Pierre Abbat wrote: li'o > > E. If the piece contains a consonant followed two letters later, > > not... > > Followed by what two letters later? By a vowel. If a piece that got this far contains a consonant followed two letters later by a vowel, and there is no 'y' after that, either it contains what looks like a brivla, or someone erroneously pronounced a word of selma'o BY followed by a CVV cmavo without inserting the required pause (such as /kybusO'e/). > I assume all the ellipses are resolved elsewhere? The tarball, which includes the lexing algorithm (which I have some slight changes to make to) and the program, is at http://phma.hn.org/Language/valfendi-0.1.0.tar.gz. > > 6. If the piece is a cmavo of selma'o ZOI and the ZO flag is clear, > > resolve the next piece as the starting delimiter and set the ZOI flag > > to a copy of it. > > > > 7. If the piece is a cmavo of selma'o FAhO and the ZO flag is clear, > > set the FAhO flag. > > > > 8. If the piece is a cmavo of selma'o ZO and the ZO flag is clear, set > > the ZO flag; otherwise clear the ZO flag. > > Seems like it would be simpler if Step 3 was: > > "If the ZO flag is set, lex the piece and clear the ZO flag", or > something like that. > > Then you'd only have to say it once. I think that wouldn't work, because step 5 doesn't always produce a fully resolved piece at the left; e.g. /zo.AisecUskuladjAn/ becomes /zo Aisecusku la djAn/ after two steps. > > 9. If there are any more pieces unresolved, return to step 3. > > > > According to 19:10, if {le'u} appears in a {zoi} quotation which is > > inside a {lo'u} quotation, the {lo'u} quotation is prematurely > > terminated. If the {le'u} is not surrounded by pauses, I don't see how > > that can work. For instance: > > /lo'ugA'ocUskuzoixy.vaiic.alE'umal`asOt.xy.axacuEroc.le'u/ The {le'u} > > inside the {zoi} quotation is embedded in the Hebrew word > > "wayish'alehu", which, being inside a {zoi} quotation, is not > > attempted to lex (and if it did, it would choke on the `ayin of "mah > > l`asot"). > > Wow. Yeah, there's no way that can work. > > > Chapter 21, step 2, says that the word following {zoi} *should* be > > delimited with a pause, not that it *must*. If it is not delimited > > with a pause, this can cause text to be lexed that is part of the > > quotation. For instance: /zoiladjAn.bagram.ladjAn/ is lexed as {zoi la > > djan. bagram. la djan.}. > > I'm sory; what is the delimiter in that case? la? Yes. It's ungrammatical, but the lexer doesn't know that. > > Conversely, two words could be taken as the delimiter, as in > > /zoi.flAlukAvbu.../ that I mentioned before. Should the word be > > changed to "must"? > > Seems like it to me. Likewise, must {fa'o} be followed by a pause? phma