From rlpowell@digitalkingdom.org Wed Mar 17 14:24:34 2004 Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 17 Mar 2004 14:24:34 -0800 (PST) Received: from rlpowell by chain.digitalkingdom.org with local (Exim 4.30) id 1B3jSe-0004Ry-0f for lojban-list@lojban.org; Wed, 17 Mar 2004 14:24:28 -0800 Date: Wed, 17 Mar 2004 14:24:28 -0800 To: lojban-list@lojban.org Subject: [lojban] Official parser problem? Message-ID: <20040317222428.GZ11847@digitalkingdom.org> Mail-Followup-To: lojban-list@lojban.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline User-Agent: Mutt/1.5.5.1+cvs20040105i From: Robin Lee Powell X-archive-position: 7220 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: rlpowell@digitalkingdom.org Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list The official parser fails on: lo'u mi le'u si lo'u mi le'u If I'm reading the pre-processing description correctly, and I'm pretty certain I am, that should work just fine. "lo'u mi le'u" is turned into one token; si eats any one token. No problem. c. If the Lojban word "lo'u" (selma'o LOhU) is identified, search for the closing delimiter "le'u" (selma'o LEhU), ignoring any such closing delimiters absorbed by the previous two steps. The text between the delimiters should be treated as the single token 'any_words_697'. ... e. If the word "si" (selma'o SI) is identified, erase it and the previous word (or token, if the previous text has been condensed into a single token by one of the above rules). Am I missing anything? -Robin -- Me: http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin. "Constant neocortex override is the only thing that stops us all from running out and eating all the cookies." -- Eliezer Yudkowsky http://www.lojban.org/ *** .i cimo'o prali .ui