From lojban-out@lojban.org Mon Feb 03 12:46:42 2003
Return-Path: <lojban-out@lojban.org>
X-Sender: lojban-out@lojban.org
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-8_2_3_4); 3 Feb 2003 20:46:42 -0000
Received: (qmail 1967 invoked from network); 3 Feb 2003 20:46:41 -0000
Received: from unknown (66.218.66.216)
  by m6.grp.scd.yahoo.com with QMQP; 3 Feb 2003 20:46:41 -0000
Received: from unknown (HELO digitalkingdom.org) (204.152.186.175)
  by mta1.grp.scd.yahoo.com with SMTP; 3 Feb 2003 20:46:41 -0000
Received: from lojban-out by digitalkingdom.org with local (Exim 4.05)
  id 18fnUH-0005lR-00
  for lojban@yahoogroups.com; Mon, 03 Feb 2003 12:46:41 -0800
Received: from digitalkingdom.org ([204.152.186.175] helo=chain)
  by digitalkingdom.org with esmtp (Exim 4.05)
  id 18fnU2-0005l4-00; Mon, 03 Feb 2003 12:46:26 -0800
Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 03 Feb 2003 12:46:25 -0800 (PST)
Received: from rlpowell by digitalkingdom.org with local (Exim 4.05)
  id 18fnTw-0005kt-00
  for lojban-list@lojban.org; Mon, 03 Feb 2003 12:46:20 -0800
Date: Mon, 3 Feb 2003 12:46:20 -0800
To: lojban-list@lojban.org
Subject: [lojban] Re: Lexing text with {fa'o} and {zoi} in it
Message-ID: <20030203204620.GD20552@digitalkingdom.org>
Mail-Followup-To: lojban-list@lojban.org
References: <200302020059.24435.phma@webjockey.net>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <200302020059.24435.phma@webjockey.net>
User-Agent: Mutt/1.5.3i
X-archive-position: 4000
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: rlpowell@digitalkingdom.org
Precedence: bulk
X-list: lojban-list
X-eGroups-From: Robin Lee Powell <rlpowell@digitalkingdom.org>
From: Robin Lee Powell <lojban-out@lojban.org>
Reply-To: rlpowell@digitalkingdom.org
X-Yahoo-Group-Post: member; u=116389790
X-Yahoo-Profile: lojban_out

On Sun, Feb 02, 2003 at 12:59:24AM -0500, Pierre Abbat wrote:
> Is this a correct way of lexing zoi-quotes?:
> 
> 1. Scan the line from left to right. Convert all spaces to pauses
>
> 2. Break at all pauses (cannot pause in the middle of a word).
>
> 3. If the FAhO flag is set, resolve the piece as foreign text and skip
> to step 9.
>
> 4. If the ZOI flag is set:
>
> A. If the beginning of the current unresolved piece matches the
> delimiter (which is the value of the ZOI flag), ignoring commas and
> capitalization, break the current piece after the delimiter (if
> there is anything after it), resolve it as the ending delimiter, and
> clear the ZOI flag.
>
> 5. Pick the first piece that has not been resolved.
>
> A. If the piece ends in a consonant...
>
> B. If the piece ends in 'y'...
>
> C. If the piece does not end in 'y' or a consonant and has no
> consonant...
>
> D. If the piece contains 'y' and no consonant following the last
> 'y' is...
>
> E. If the piece contains a consonant followed two letters later,
> not...

Followed by what two letters later?

I assume all the ellipses are resolved elsewhere?

> 6. If the piece is a cmavo of selma'o ZOI and the ZO flag is clear,
> resolve the next piece as the starting delimiter and set the ZOI flag
> to a copy of it.
>
> 7. If the piece is a cmavo of selma'o FAhO and the ZO flag is clear,
> set the FAhO flag.
>
> 8. If the piece is a cmavo of selma'o ZO and the ZO flag is clear, set
> the ZO flag; otherwise clear the ZO flag.

Seems like it would be simpler if Step 3 was:

"If the ZO flag is set, lex the piece and clear the ZO flag", or
something like that.

Then you'd only have to say it once.

> 9. If there are any more pieces unresolved, return to step 3.
> 
> According to 19:10, if {le'u} appears in a {zoi} quotation which is
> inside a {lo'u} quotation, the {lo'u} quotation is prematurely
> terminated. If the {le'u} is not surrounded by pauses, I don't see how
> that can work. For instance:
> /lo'ugA'ocUskuzoixy.vaiic.alE'umal`asOt.xy.axacuEroc.le'u/ The {le'u}
> inside the {zoi} quotation is embedded in the Hebrew word
> "wayish'alehu", which, being inside a {zoi} quotation, is not
> attempted to lex (and if it did, it would choke on the `ayin of "mah
> l`asot").

Wow. Yeah, there's no way that can work.

> Chapter 21, step 2, says that the word following {zoi} *should* be
> delimited with a pause, not that it *must*. If it is not delimited
> with a pause, this can cause text to be lexed that is part of the
> quotation. For instance: /zoiladjAn.bagram.ladjAn/ is lexed as {zoi la
> djan. bagram. la djan.}. 

I'm sory; what is the delimiter in that case? la?

> Conversely, two words could be taken as the delimiter, as in
> /zoi.flAlukAvbu.../ that I mentioned before. Should the word be
> changed to "must"?

Seems like it to me.

-Robin

-- 
http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin.
.i le pamoi velru'e zo'u crepu le plibu taxfu
.i le remoi velru'e zo'u mo .i le cimoi velru'e zo'u ba'e prali .uisai
http://www.lojban.org/ *** to sa'a cu'u lei pibyta'u cridrnoma toi