From rlpowell@digitalkingdom.org Wed Mar 17 17:23:34 2004
Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 17 Mar 2004 17:23:34 -0800 (PST)
Received: from rlpowell by chain.digitalkingdom.org with local (Exim 4.30)
	id 1B3mFs-0003hR-W8
	for lojban-list@lojban.org; Wed, 17 Mar 2004 17:23:28 -0800
Date: Wed, 17 Mar 2004 17:23:28 -0800
To: lojban-list@lojban.org
Subject: [lojban] Re: Official parser problem?
Message-ID: <20040318012328.GI11847@digitalkingdom.org>
Mail-Followup-To: lojban-list@lojban.org
References: <20040317222428.GZ11847@digitalkingdom.org> <20040318011211.GH11847@digitalkingdom.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20040318011211.GH11847@digitalkingdom.org>
User-Agent: Mutt/1.5.5.1+cvs20040105i
From: Robin Lee Powell <rlpowell@digitalkingdom.org>
X-archive-position: 7232
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: rlpowell@digitalkingdom.org
Precedence: bulk
Reply-to: lojban-list@lojban.org
X-list: lojban-list

On Wed, Mar 17, 2004 at 05:12:11PM -0800, Robin Lee Powell wrote:
> On Wed, Mar 17, 2004 at 02:24:28PM -0800, Robin Lee Powell wrote:
> > The official parser fails on:
> > 
> >     lo'u mi le'u si lo'u mi le'u
> > 
> > If I'm reading the pre-processing description correctly, and I'm
> > pretty certain I am, that should work just fine.  "lo'u mi le'u" is
> > turned into one token; si eats any one token.  No problem.
> > 
> >     c. If the Lojban word "lo'u" (selma'o LOhU) is identified,
> >     search for the closing delimiter "le'u" (selma'o LEhU), ignoring
> >     any such closing delimiters absorbed by the previous two steps.
> >     The text between the delimiters should be treated as the single
> >     token 'any_words_697'.
> 
> bancus just pointed out that that may not mean what I thought it
> meant. I was taking "The text between the delimiters should be treated
> as the single token 'any_words_697'." to mean "all the text between
> the delimiters, including the delimiters themselves", but that's
> probably not the most natural reading.

But then it gets *BETTER*!  Assuming bancus is right:


To parse: "lo'u mi le'u si lo'u mi le'u"       

You do step "c" of the preprocessing                         

LOhU any_words_697{mi} LEhU si LOhU any_words_697{mi} LEhU   

Then you do step "e", "si" erases LEhU and then you have                                            

LOhU any_words_697{mi} LOhU any_words_697{mi} LEhU           

Now, theoretically this is one big, long lo'u...le'u string.

But the grammar as writen will not accomodate it.


Of course, my grammar will.  :-)

http://www.digitalkingdom.org/~rlpowell/hobbies/lojban/grammar/

-Robin

-- 
Me: http://www.digitalkingdom.org/~rlpowell/  ***   I'm a *male* Robin.
"Constant neocortex override is the only thing that stops us all
from running out and eating all the cookies."  -- Eliezer Yudkowsky
http://www.lojban.org/             ***              .i cimo'o prali .ui