From phma@webjockey.net Mon Feb 03 15:05:22 2003
Return-Path: <lojban-out@lojban.org>
X-Sender: lojban-out@lojban.org
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-8_2_3_4); 3 Feb 2003 23:05:22 -0000
Received: (qmail 19411 invoked from network); 3 Feb 2003 23:05:21 -0000
Received: from unknown (66.218.66.217)
  by m7.grp.scd.yahoo.com with QMQP; 3 Feb 2003 23:05:21 -0000
Received: from unknown (HELO digitalkingdom.org) (204.152.186.175)
  by mta2.grp.scd.yahoo.com with SMTP; 3 Feb 2003 23:05:21 -0000
Received: from lojban-out by digitalkingdom.org with local (Exim 4.05)
  id 18fpeT-0006bj-00
  for lojban@yahoogroups.com; Mon, 03 Feb 2003 15:05:21 -0800
Received: from digitalkingdom.org ([204.152.186.175] helo=chain)
  by digitalkingdom.org with esmtp (Exim 4.05)
  id 18fpeM-0006bL-00; Mon, 03 Feb 2003 15:05:14 -0800
Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 03 Feb 2003 15:05:12 -0800 (PST)
Received: from 208-150-110-21-adsl.precisionet.net ([208.150.110.21] helo=blackcat.ixazon.lan)
  by digitalkingdom.org with esmtp (Exim 4.05)
  id 18fpeA-0006b6-00
  for lojban-list@lojban.org; Mon, 03 Feb 2003 15:05:02 -0800
Received: by blackcat.ixazon.lan (Postfix, from userid 1001)
  id F2F0B905B; Mon, 3 Feb 2003 23:04:36 +0000 (UTC)
Organization: dis
To: lojban-list@lojban.org
Subject: [lojban] Re: Lexing text with {fa'o} and {zoi} in it
Date: Mon, 3 Feb 2003 18:04:36 -0500
User-Agent: KMail/1.5
References: <200302020059.24435.phma@webjockey.net> <20030203204620.GD20552@digitalkingdom.org>
In-Reply-To: <20030203204620.GD20552@digitalkingdom.org>
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"
Content-Disposition: inline
Message-Id: <200302031804.36643.phma@webjockey.net>
X-archive-position: 4001
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: phma@webjockey.net
Precedence: bulk
X-list: lojban-list
From: Pierre Abbat <phma@webjockey.net>
Reply-To: phma@webjockey.net
X-Yahoo-Group-Post: member; u=92712300

On Monday 03 February 2003 15:46, Robin Lee Powell wrote:
> On Sun, Feb 02, 2003 at 12:59:24AM -0500, Pierre Abbat wrote:
li'o
> > E. If the piece contains a consonant followed two letters later,
> > not...
>
> Followed by what two letters later?

By a vowel. If a piece that got this far contains a consonant followed two 
letters later by a vowel, and there is no 'y' after that, either it contains 
what looks like a brivla, or someone erroneously pronounced a word of selma'o 
BY followed by a CVV cmavo without inserting the required pause (such as 
/kybusO'e/).

> I assume all the ellipses are resolved elsewhere?

The tarball, which includes the lexing algorithm (which I have some slight 
changes to make to) and the program, is at 
http://phma.hn.org/Language/valfendi-0.1.0.tar.gz.

> > 6. If the piece is a cmavo of selma'o ZOI and the ZO flag is clear,
> > resolve the next piece as the starting delimiter and set the ZOI flag
> > to a copy of it.
> >
> > 7. If the piece is a cmavo of selma'o FAhO and the ZO flag is clear,
> > set the FAhO flag.
> >
> > 8. If the piece is a cmavo of selma'o ZO and the ZO flag is clear, set
> > the ZO flag; otherwise clear the ZO flag.
>
> Seems like it would be simpler if Step 3 was:
>
> "If the ZO flag is set, lex the piece and clear the ZO flag", or
> something like that.
>
> Then you'd only have to say it once.

I think that wouldn't work, because step 5 doesn't always produce a fully 
resolved piece at the left; e.g. /zo.AisecUskuladjAn/ becomes /zo Aisecusku 
la djAn/ after two steps.

> > 9. If there are any more pieces unresolved, return to step 3.
> >
> > According to 19:10, if {le'u} appears in a {zoi} quotation which is
> > inside a {lo'u} quotation, the {lo'u} quotation is prematurely
> > terminated. If the {le'u} is not surrounded by pauses, I don't see how
> > that can work. For instance:
> > /lo'ugA'ocUskuzoixy.vaiic.alE'umal`asOt.xy.axacuEroc.le'u/ The {le'u}
> > inside the {zoi} quotation is embedded in the Hebrew word
> > "wayish'alehu", which, being inside a {zoi} quotation, is not
> > attempted to lex (and if it did, it would choke on the `ayin of "mah
> > l`asot").
>
> Wow. Yeah, there's no way that can work.
>
> > Chapter 21, step 2, says that the word following {zoi} *should* be
> > delimited with a pause, not that it *must*. If it is not delimited
> > with a pause, this can cause text to be lexed that is part of the
> > quotation. For instance: /zoiladjAn.bagram.ladjAn/ is lexed as {zoi la
> > djan. bagram. la djan.}.
>
> I'm sory; what is the delimiter in that case? la?

Yes. It's ungrammatical, but the lexer doesn't know that.

> > Conversely, two words could be taken as the delimiter, as in
> > /zoi.flAlukAvbu.../ that I mentioned before. Should the word be
> > changed to "must"?
>
> Seems like it to me.

Likewise, must {fa'o} be followed by a pause?

phma




