From rlpowell@digitalkingdom.org Wed Mar 17 17:23:34 2004 Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 17 Mar 2004 17:23:34 -0800 (PST) Received: from rlpowell by chain.digitalkingdom.org with local (Exim 4.30) id 1B3mFs-0003hR-W8 for lojban-list@lojban.org; Wed, 17 Mar 2004 17:23:28 -0800 Date: Wed, 17 Mar 2004 17:23:28 -0800 To: lojban-list@lojban.org Subject: [lojban] Re: Official parser problem? Message-ID: <20040318012328.GI11847@digitalkingdom.org> Mail-Followup-To: lojban-list@lojban.org References: <20040317222428.GZ11847@digitalkingdom.org> <20040318011211.GH11847@digitalkingdom.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20040318011211.GH11847@digitalkingdom.org> User-Agent: Mutt/1.5.5.1+cvs20040105i From: Robin Lee Powell X-archive-position: 7232 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: rlpowell@digitalkingdom.org Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list On Wed, Mar 17, 2004 at 05:12:11PM -0800, Robin Lee Powell wrote: > On Wed, Mar 17, 2004 at 02:24:28PM -0800, Robin Lee Powell wrote: > > The official parser fails on: > > > > lo'u mi le'u si lo'u mi le'u > > > > If I'm reading the pre-processing description correctly, and I'm > > pretty certain I am, that should work just fine. "lo'u mi le'u" is > > turned into one token; si eats any one token. No problem. > > > > c. If the Lojban word "lo'u" (selma'o LOhU) is identified, > > search for the closing delimiter "le'u" (selma'o LEhU), ignoring > > any such closing delimiters absorbed by the previous two steps. > > The text between the delimiters should be treated as the single > > token 'any_words_697'. > > bancus just pointed out that that may not mean what I thought it > meant. I was taking "The text between the delimiters should be treated > as the single token 'any_words_697'." to mean "all the text between > the delimiters, including the delimiters themselves", but that's > probably not the most natural reading. But then it gets *BETTER*! Assuming bancus is right: To parse: "lo'u mi le'u si lo'u mi le'u" You do step "c" of the preprocessing LOhU any_words_697{mi} LEhU si LOhU any_words_697{mi} LEhU Then you do step "e", "si" erases LEhU and then you have LOhU any_words_697{mi} LOhU any_words_697{mi} LEhU Now, theoretically this is one big, long lo'u...le'u string. But the grammar as writen will not accomodate it. Of course, my grammar will. :-) http://www.digitalkingdom.org/~rlpowell/hobbies/lojban/grammar/ -Robin -- Me: http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin. "Constant neocortex override is the only thing that stops us all from running out and eating all the cookies." -- Eliezer Yudkowsky http://www.lojban.org/ *** .i cimo'o prali .ui