Received: from mail-oa0-f61.google.com ([209.85.219.61]:63131) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1Wxo1o-0000t1-PI for lojban-list-archive@lojban.org; Thu, 19 Jun 2014 18:54:17 -0700 Received: by mail-oa0-f61.google.com with SMTP id o6sf708036oag.6 for ; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe:content-type; bh=+oj0MCv5VtJJvNNIdz0LHZiya0PzSL47T6K7Xo4geaA=; b=DmUmhV1D2FP8CGGuENTijHVkcUAtFYgFeUPEFavmvZ55Tsfxuo3mVvaIvxPhgfqr7Y ZbCYs2rD9GPNNswgSo7hJKBff3tn9yq2vX+Mq/UpC2IbJ5q5AxRc2FAroOudPIG1e9oi FPDH5VuPmBu8xrP8BEcFwQbmyUnZsk77RoMfmJcYfovgogYdr+GTH8wyzB+qPnEIcJwT JDJtVBmHx+qkpowwgTY9pw61fW1xk25S790n7fI3fCo/PQ6k/0kzOvtM7DDkFiG2oz5j QOdqSzQLn8uz9e5K5s90bSgepsYh+3XpaGVt6exdjg3XQ1Qy66SS3zu7ApumH1hhareR j/QQ== X-Received: by 10.140.49.227 with SMTP id q90mr334qga.22.1403229242665; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.140.87.38 with SMTP id q35ls784786qgd.98.gmail; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) X-Received: by 10.236.162.9 with SMTP id x9mr139324yhk.17.1403229242426; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) Received: from mail-ve0-x235.google.com (mail-ve0-x235.google.com [2607:f8b0:400c:c01::235]) by gmr-mx.google.com with ESMTPS id xn6si315846vdc.2.2014.06.19.18.54.02 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Thu, 19 Jun 2014 18:54:02 -0700 (PDT) Received-SPF: pass (google.com: domain of jjllambias@gmail.com designates 2607:f8b0:400c:c01::235 as permitted sender) client-ip=2607:f8b0:400c:c01::235; Received: by mail-ve0-f181.google.com with SMTP id db11so3048239veb.40 for ; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) MIME-Version: 1.0 X-Received: by 10.221.27.8 with SMTP id ro8mr193873vcb.30.1403229242353; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) Received: by 10.220.170.73 with HTTP; Thu, 19 Jun 2014 18:54:02 -0700 (PDT) In-Reply-To: <96882b42-e167-4a7b-88d2-038324354074@googlegroups.com> References: <96882b42-e167-4a7b-88d2-038324354074@googlegroups.com> Date: Thu, 19 Jun 2014 22:54:02 -0300 Message-ID: Subject: Re: [lojban] Does the lojban language include the empty string? From: =?UTF-8?Q?Jorge_Llamb=C3=ADas?= To: lojban@googlegroups.com X-Original-Sender: jjllambias@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of jjllambias@gmail.com designates 2607:f8b0:400c:c01::235 as permitted sender) smtp.mail=jjllambias@gmail.com; dkim=pass header.i=@gmail.com; dmarc=pass (p=NONE dis=NONE) header.from=gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary=001a11336baa876ded04fc3ac43e X-Spam-Score: -1.9 (-) X-Spam_score: -1.9 X-Spam_score_int: -18 X-Spam_bar: - --001a11336baa876ded04fc3ac43e Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On Thu, Jun 19, 2014 at 10:19 PM, Riley Martinez-Lynch wrote: > In the course of working on a script to extract grammatical lojban texts > from a stream of heterogenous input, I discovered that camxes considers a= n > empty string to be a valid lojban text. I tried jbofihe and > yacc/grammar.300, and got the same result. > > Is the empty string (=CE=B5) really intended to be included in the lojban > language, or is this a quirk of the machine grammar implementations? > > *xu lo'u le'u te gerna lo jbobau* > > It was done on purpose. It would be trivial to change the grammar to disallow it. If we disallow it, however, would we also have to consider the text "valsi si" ungrammatical? > If it is included, as the parsers indicate, null lojban texts are to be > found everywhere that you can't parse the beginning of a non-null lojban > text. > > Thoughts? > I'm not sure I follow. The grammar can only parse one text at a time, it can't parse a string of texts, so I'm not sure I see what the problem is. A non-null text is just one text, not a sequence of null texts followed by a non-null text, just like "broda brode" is one single text, not text "broda" followed by second text "brode". mu'o mi'e xorxes --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. --001a11336baa876ded04fc3ac43e Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable

= On Thu, Jun 19, 2014 at 10:19 PM, Riley Martinez-Lynch &l= t;shunpiker@gmail.= com> wrote:
In the course of worki= ng on a script to extract grammatical lojban texts from a stream of heterog= enous input, I discovered that camxes considers an empty string to be a val= id lojban text. I tried jbofihe and yacc/grammar.300, and got the same resu= lt.

Is the empty string (=CE=B5) really intended to be incl= uded in the lojban language, or is this a quirk of the machine grammar impl= ementations?

xu lo'u le'u te gerna lo jbobau

It was done on purpose. It would be trivi= al to change the grammar to disallow it. If we disallow it, however, would = we also have to consider the text "valsi si" ungrammatical? =C2= =A0
=C2=A0
If it is included, as the parsers indicate, null lojban texts are to = be found everywhere that you can't parse the beginning of a non-null lo= jban text.

Thoughts?

I= 'm not sure I follow. The grammar can only parse one text at a time, it= can't parse a string of texts, so I'm not sure I see what the prob= lem is. A non-null text is just one text, not a sequence of null texts foll= owed by a non-null text, just like "broda brode" is one single te= xt, not text "broda" followed by second text "brode".

mu'o mi'e xorxes
=C2=A0

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
--001a11336baa876ded04fc3ac43e--