From rlpowell@digitalkingdom.org Fri May 07 13:40:30 2004 Received: with ECARTIS (v1.0.0; list lojban-list); Fri, 07 May 2004 13:40:30 -0700 (PDT) Received: from rlpowell by chain.digitalkingdom.org with local (Exim 4.31) id 1BMC8t-0008Ku-Tv for lojban-list@lojban.org; Fri, 07 May 2004 13:40:24 -0700 Date: Fri, 7 May 2004 13:40:23 -0700 To: lojban-list@lojban.org Subject: [lojban] Re: My parser, SI, SA, and ZOI Message-ID: <20040507204023.GA27947@digitalkingdom.org> Mail-Followup-To: lojban-list@lojban.org References: <20040507185509.GN7020@digitalkingdom.org> <20040507193128.20639.qmail@web41903.mail.yahoo.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20040507193128.20639.qmail@web41903.mail.yahoo.com> User-Agent: Mutt/1.5.5.1+cvs20040105i From: Robin Lee Powell X-archive-position: 7684 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: rlpowell@digitalkingdom.org Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list On Fri, May 07, 2004 at 12:31:28PM -0700, Jorge Llamb?as wrote: > > --- Robin Lee Powell wrote: > > > What does {da zo si si} do? > > > > It *should* result in just 'da', because zo is defined as turning > > itself and the next argument into a single word. zoi is *not* so > > defined. > > Is there a justification for that difference? Huh. I'm sorry, I thought that that difference was explicitely stated somewhere, but I can't find it. The Red Book doesn't seem to say whether or not "zo da" is treated as a single word at all. So unless I'm missing something, all we have to go on is grammar.300, which says: a. If the Lojban word "zoi" (selma'o ZOI) is identified, take the following Lojban word (which should be end delimited with a pause for separation from the following non-Lojban text) as an opening delimiter. Treat all text following that delimiter, until that delimiter recurs *after a pause*, as grammatically a single token (labelled 'anything_699' in this grammar). There is no need for processing within this text except as necessary to find the closing delimiter. b. If the Lojban word "zo" (selma'o ZO) is identified, treat the following Lojban word as a token labelled 'any_word_698', instead of lexing it by its normal grammatical function. So "zoi da de da" is turned into four tokens, "zoi da anything_699 da" and "zo da" is turned into the single token "any_word_698". It is this behaviour that I am trying to emulate, without being a slave to YACC restrictions (which, for example, make it so that "zoi da weeble da si si si si" works, but no lesser number of "si" after the zoi have any effect. -Robin -- http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin. "Many philosophical problems are caused by such things as the simple inability to shut up." -- David Stove, liberally paraphrased. http://www.lojban.org/ *** loi pimlu na srana .i ti rokci morsi