From rlpowell@digitalkingdom.org Wed Mar 17 14:24:34 2004
Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 17 Mar 2004 14:24:34 -0800 (PST)
Received: from rlpowell by chain.digitalkingdom.org with local (Exim 4.30)
	id 1B3jSe-0004Ry-0f
	for lojban-list@lojban.org; Wed, 17 Mar 2004 14:24:28 -0800
Date: Wed, 17 Mar 2004 14:24:28 -0800
To: lojban-list@lojban.org
Subject: [lojban] Official parser problem?
Message-ID: <20040317222428.GZ11847@digitalkingdom.org>
Mail-Followup-To: lojban-list@lojban.org
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.5.5.1+cvs20040105i
From: Robin Lee Powell <rlpowell@digitalkingdom.org>
X-archive-position: 7220
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: rlpowell@digitalkingdom.org
Precedence: bulk
Reply-to: lojban-list@lojban.org
X-list: lojban-list

The official parser fails on:

    lo'u mi le'u si lo'u mi le'u

If I'm reading the pre-processing description correctly, and I'm pretty
certain I am, that should work just fine.  "lo'u mi le'u" is turned into
one token; si eats any one token.  No problem.

    c. If the Lojban word "lo'u" (selma'o LOhU) is identified, search
    for the closing delimiter "le'u" (selma'o LEhU), ignoring any such
    closing delimiters absorbed by the previous two steps.  The text
    between the delimiters should be treated as the single token
    'any_words_697'.

    ...

    e. If the word "si" (selma'o SI) is identified, erase it and the
    previous word (or token, if the previous text has been condensed
    into a single token by one of the above rules).

Am I missing anything?

-Robin

-- 
Me: http://www.digitalkingdom.org/~rlpowell/  ***   I'm a *male* Robin.
"Constant neocortex override is the only thing that stops us all
from running out and eating all the cookies."  -- Eliezer Yudkowsky
http://www.lojban.org/             ***              .i cimo'o prali .ui