From rlpowell@digitalkingdom.org Mon Feb 03 12:46:25 2003 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 03 Feb 2003 12:46:25 -0800 (PST) Received: from rlpowell by digitalkingdom.org with local (Exim 4.05) id 18fnTw-0005kt-00 for lojban-list@lojban.org; Mon, 03 Feb 2003 12:46:20 -0800 Date: Mon, 3 Feb 2003 12:46:20 -0800 To: lojban-list@lojban.org Subject: [lojban] Re: Lexing text with {fa'o} and {zoi} in it Message-ID: <20030203204620.GD20552@digitalkingdom.org> Mail-Followup-To: lojban-list@lojban.org References: <200302020059.24435.phma@webjockey.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200302020059.24435.phma@webjockey.net> User-Agent: Mutt/1.5.3i From: Robin Lee Powell X-archive-position: 4000 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: rlpowell@digitalkingdom.org Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list On Sun, Feb 02, 2003 at 12:59:24AM -0500, Pierre Abbat wrote: > Is this a correct way of lexing zoi-quotes?: > > 1. Scan the line from left to right. Convert all spaces to pauses > > 2. Break at all pauses (cannot pause in the middle of a word). > > 3. If the FAhO flag is set, resolve the piece as foreign text and skip > to step 9. > > 4. If the ZOI flag is set: > > A. If the beginning of the current unresolved piece matches the > delimiter (which is the value of the ZOI flag), ignoring commas and > capitalization, break the current piece after the delimiter (if > there is anything after it), resolve it as the ending delimiter, and > clear the ZOI flag. > > 5. Pick the first piece that has not been resolved. > > A. If the piece ends in a consonant... > > B. If the piece ends in 'y'... > > C. If the piece does not end in 'y' or a consonant and has no > consonant... > > D. If the piece contains 'y' and no consonant following the last > 'y' is... > > E. If the piece contains a consonant followed two letters later, > not... Followed by what two letters later? I assume all the ellipses are resolved elsewhere? > 6. If the piece is a cmavo of selma'o ZOI and the ZO flag is clear, > resolve the next piece as the starting delimiter and set the ZOI flag > to a copy of it. > > 7. If the piece is a cmavo of selma'o FAhO and the ZO flag is clear, > set the FAhO flag. > > 8. If the piece is a cmavo of selma'o ZO and the ZO flag is clear, set > the ZO flag; otherwise clear the ZO flag. Seems like it would be simpler if Step 3 was: "If the ZO flag is set, lex the piece and clear the ZO flag", or something like that. Then you'd only have to say it once. > 9. If there are any more pieces unresolved, return to step 3. > > According to 19:10, if {le'u} appears in a {zoi} quotation which is > inside a {lo'u} quotation, the {lo'u} quotation is prematurely > terminated. If the {le'u} is not surrounded by pauses, I don't see how > that can work. For instance: > /lo'ugA'ocUskuzoixy.vaiic.alE'umal`asOt.xy.axacuEroc.le'u/ The {le'u} > inside the {zoi} quotation is embedded in the Hebrew word > "wayish'alehu", which, being inside a {zoi} quotation, is not > attempted to lex (and if it did, it would choke on the `ayin of "mah > l`asot"). Wow. Yeah, there's no way that can work. > Chapter 21, step 2, says that the word following {zoi} *should* be > delimited with a pause, not that it *must*. If it is not delimited > with a pause, this can cause text to be lexed that is part of the > quotation. For instance: /zoiladjAn.bagram.ladjAn/ is lexed as {zoi la > djan. bagram. la djan.}. I'm sory; what is the delimiter in that case? la? > Conversely, two words could be taken as the delimiter, as in > /zoi.flAlukAvbu.../ that I mentioned before. Should the word be > changed to "must"? Seems like it to me. -Robin -- http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin. .i le pamoi velru'e zo'u crepu le plibu taxfu .i le remoi velru'e zo'u mo .i le cimoi velru'e zo'u ba'e prali .uisai http://www.lojban.org/ *** to sa'a cu'u lei pibyta'u cridrnoma toi