From lojban-out@lojban.org Mon Feb 03 12:46:42 2003 Return-Path: X-Sender: lojban-out@lojban.org X-Apparently-To: lojban@yahoogroups.com Received: (EGP: mail-8_2_3_4); 3 Feb 2003 20:46:42 -0000 Received: (qmail 1967 invoked from network); 3 Feb 2003 20:46:41 -0000 Received: from unknown (66.218.66.216) by m6.grp.scd.yahoo.com with QMQP; 3 Feb 2003 20:46:41 -0000 Received: from unknown (HELO digitalkingdom.org) (204.152.186.175) by mta1.grp.scd.yahoo.com with SMTP; 3 Feb 2003 20:46:41 -0000 Received: from lojban-out by digitalkingdom.org with local (Exim 4.05) id 18fnUH-0005lR-00 for lojban@yahoogroups.com; Mon, 03 Feb 2003 12:46:41 -0800 Received: from digitalkingdom.org ([204.152.186.175] helo=chain) by digitalkingdom.org with esmtp (Exim 4.05) id 18fnU2-0005l4-00; Mon, 03 Feb 2003 12:46:26 -0800 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 03 Feb 2003 12:46:25 -0800 (PST) Received: from rlpowell by digitalkingdom.org with local (Exim 4.05) id 18fnTw-0005kt-00 for lojban-list@lojban.org; Mon, 03 Feb 2003 12:46:20 -0800 Date: Mon, 3 Feb 2003 12:46:20 -0800 To: lojban-list@lojban.org Subject: [lojban] Re: Lexing text with {fa'o} and {zoi} in it Message-ID: <20030203204620.GD20552@digitalkingdom.org> Mail-Followup-To: lojban-list@lojban.org References: <200302020059.24435.phma@webjockey.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200302020059.24435.phma@webjockey.net> User-Agent: Mutt/1.5.3i X-archive-position: 4000 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: rlpowell@digitalkingdom.org Precedence: bulk X-list: lojban-list X-eGroups-From: Robin Lee Powell From: Robin Lee Powell Reply-To: rlpowell@digitalkingdom.org X-Yahoo-Group-Post: member; u=116389790 X-Yahoo-Profile: lojban_out On Sun, Feb 02, 2003 at 12:59:24AM -0500, Pierre Abbat wrote: > Is this a correct way of lexing zoi-quotes?: > > 1. Scan the line from left to right. Convert all spaces to pauses > > 2. Break at all pauses (cannot pause in the middle of a word). > > 3. If the FAhO flag is set, resolve the piece as foreign text and skip > to step 9. > > 4. If the ZOI flag is set: > > A. If the beginning of the current unresolved piece matches the > delimiter (which is the value of the ZOI flag), ignoring commas and > capitalization, break the current piece after the delimiter (if > there is anything after it), resolve it as the ending delimiter, and > clear the ZOI flag. > > 5. Pick the first piece that has not been resolved. > > A. If the piece ends in a consonant... > > B. If the piece ends in 'y'... > > C. If the piece does not end in 'y' or a consonant and has no > consonant... > > D. If the piece contains 'y' and no consonant following the last > 'y' is... > > E. If the piece contains a consonant followed two letters later, > not... Followed by what two letters later? I assume all the ellipses are resolved elsewhere? > 6. If the piece is a cmavo of selma'o ZOI and the ZO flag is clear, > resolve the next piece as the starting delimiter and set the ZOI flag > to a copy of it. > > 7. If the piece is a cmavo of selma'o FAhO and the ZO flag is clear, > set the FAhO flag. > > 8. If the piece is a cmavo of selma'o ZO and the ZO flag is clear, set > the ZO flag; otherwise clear the ZO flag. Seems like it would be simpler if Step 3 was: "If the ZO flag is set, lex the piece and clear the ZO flag", or something like that. Then you'd only have to say it once. > 9. If there are any more pieces unresolved, return to step 3. > > According to 19:10, if {le'u} appears in a {zoi} quotation which is > inside a {lo'u} quotation, the {lo'u} quotation is prematurely > terminated. If the {le'u} is not surrounded by pauses, I don't see how > that can work. For instance: > /lo'ugA'ocUskuzoixy.vaiic.alE'umal`asOt.xy.axacuEroc.le'u/ The {le'u} > inside the {zoi} quotation is embedded in the Hebrew word > "wayish'alehu", which, being inside a {zoi} quotation, is not > attempted to lex (and if it did, it would choke on the `ayin of "mah > l`asot"). Wow. Yeah, there's no way that can work. > Chapter 21, step 2, says that the word following {zoi} *should* be > delimited with a pause, not that it *must*. If it is not delimited > with a pause, this can cause text to be lexed that is part of the > quotation. For instance: /zoiladjAn.bagram.ladjAn/ is lexed as {zoi la > djan. bagram. la djan.}. I'm sory; what is the delimiter in that case? la? > Conversely, two words could be taken as the delimiter, as in > /zoi.flAlukAvbu.../ that I mentioned before. Should the word be > changed to "must"? Seems like it to me. -Robin -- http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin. .i le pamoi velru'e zo'u crepu le plibu taxfu .i le remoi velru'e zo'u mo .i le cimoi velru'e zo'u ba'e prali .uisai http://www.lojban.org/ *** to sa'a cu'u lei pibyta'u cridrnoma toi