Received: from mail-ie0-f192.google.com ([209.85.223.192]:35733) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1W1gz1-0001as-1l for lojban-list-archive@lojban.org; Fri, 10 Jan 2014 10:39:15 -0800 Received: by mail-ie0-f192.google.com with SMTP id x13sf1068898ief.9 for ; Fri, 10 Jan 2014 10:38:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe:content-type; bh=cZI3XBfHCrHw3FwX3zxhk4Yhn9kbjaip9XRmW6pv+nU=; b=g7JS6dTHoIW4UnDAokCpYUWPpK0jUaarg2Z6IXffbQS618aW0okBEYNwN+Ho2exSMZ Nu9xMYexLEv0cTcIftdYjTlcaCTsEsEJAShNUz1G/GuGADfFfvHLWtzKrYaEcWACDiYm nnvjsQCKHBTogKn99BsCc+gjXCljhwwkRnGLbSseYbRUOfWgUxdK4VailFRhxIjZPWWr L94786GnxQoWpC4bvbf7ThbnVIqag46utjMAr2zNducfPB/nH0JuIhzyLkl9+QHmVJ0J iFUXjEltsteiJ6Qiieieye2SUmjx25sRA0tjUdmv7z9ZMCLxkzMN4oRBRNhhrHQ72NeK fh0g== X-Received: by 10.50.30.69 with SMTP id q5mr82085igh.15.1389379137006; Fri, 10 Jan 2014 10:38:57 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.50.37.140 with SMTP id y12ls434390igj.32.gmail; Fri, 10 Jan 2014 10:38:56 -0800 (PST) X-Received: by 10.66.218.70 with SMTP id pe6mr4159087pac.33.1389379136308; Fri, 10 Jan 2014 10:38:56 -0800 (PST) Received: from mail-ve0-x236.google.com (mail-ve0-x236.google.com [2607:f8b0:400c:c01::236]) by gmr-mx.google.com with ESMTPS id h4si2172291qch.3.2014.01.10.10.38.56 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Fri, 10 Jan 2014 10:38:56 -0800 (PST) Received-SPF: pass (google.com: domain of lytlesw@gmail.com designates 2607:f8b0:400c:c01::236 as permitted sender) client-ip=2607:f8b0:400c:c01::236; Received: by mail-ve0-f182.google.com with SMTP id jy13so3795865veb.27 for ; Fri, 10 Jan 2014 10:38:56 -0800 (PST) X-Received: by 10.58.201.169 with SMTP id kb9mr3188406vec.42.1389379135940; Fri, 10 Jan 2014 10:38:55 -0800 (PST) MIME-Version: 1.0 Received: by 10.58.186.136 with HTTP; Fri, 10 Jan 2014 10:38:25 -0800 (PST) In-Reply-To: <20140110185840.22249c17@aol.com> References: <20140110185840.22249c17@aol.com> From: MorphemeAddict Date: Fri, 10 Jan 2014 13:38:25 -0500 Message-ID: Subject: Re: [lojban] Quoting with ZOI can be ambigious To: lojban@googlegroups.com X-Original-Sender: lytlesw@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of lytlesw@gmail.com designates 2607:f8b0:400c:c01::236 as permitted sender) smtp.mail=lytlesw@gmail.com; dkim=pass header.i=@gmail.com; dmarc=pass (p=NONE dis=NONE) header.from=gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary=047d7bd6b800db2bb604efa20956 X-Spam-Score: 0.0 (/) X-Spam_score: 0.0 X-Spam_score_int: 0 X-Spam_bar: / --047d7bd6b800db2bb604efa20956 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable I thought {zoi} plus delimiters was only for words, which would exclude bare punctuation. stevo On Fri, Jan 10, 2014 at 12:58 PM, Wuzzy wrote: > TL;DR: Quoting with ZOI can be ambigious. I point out the flaws and show > how the Reference Grammar does not help here. Then I propose a change=97o= r > more precisely: a clarification=97to the grammar which hopefully fixes > the flaws. > > Full text: > I noticed that you run into trouble when your ZOI quote has trailing > or leading whitespace, or trailing or leading periods. Also you run > into trouble if you want to decode a ZOI quote because you can=92t tell > if the text truly has trailing/leading whitespace/periods or not. > > Just for extra clarity: In this e-mail I will make use of the so-called > logical quotation style so you know exactly what I=92m talking about and > because the British quotation style is just damn awful. > Logical quotation style is basically this: All characters which are > between the quotation marks are quoted. No magic! ;-) > > > Let=92s go! > Just look at this: > What does =93zoi gy..gy=94 quote? > I could think of these answers: > (1) =93=94 (empty string (string =3D sequence of characters)) > (2) =93..=94 (two periods) > (3) =93.=94 (one period) > > All three answers seem plausible to me. Remember: according to the > Reference Grammar, all periods are optional: > =93Technically, the period is an optional reminder to the reader of a > mandatory pause that is dictated by the rules of the language; > because these rules are unambiguous, a missing period can be inferred > from otherwise correct text. Periods are included only as an aid to > the reader.=94=97Reference Grammar, chapter 3, section 3. > > (1) would be true if both periods are considered to be part of the > delimeter, therefore, they are part of the Lojban text and not of the > quoted text. > (2) would be true if the =93pause periods=94 of Lojban have > been ommitted. Therefore, the periods must be part of the quoted text. > (3) is a weird case which could be true if one of the =93pause periods=94 > has been ommited, but not the other one. Therefore, there is one period > left which is therefore part of the quoted text. > > All three possibilities are plausible to me under the rules. But > under these rules, I can=92t tell which is the correct one. > > There is another problem, with whitespace: > =93zoi gy gy=94 =3D =93=94 or =93 =94? > > Here are more examples (look closely at spaces and periods): > =93zoi gy. .gy=94 =3D =93 =94, =93=94, =93. =94, =93.=94, =93. =94? > =93zoi gy.Peter Smith.gy=94 =3D =93Peter Smith=94, =93.Peter Smith=94, = =93Peter Smith.=94, > =93.Peter Smith.=94? > =93zoi gy. Peter Smith .gy=94 =3D =93Peter Smith=94, =93 Peter Smith=94, > =93Peter Smith =94, =93 Peter Smith =94? > With or without periods? > =93zoi gy.pdf.gy=94 =3D =93pdf=94, =93.pdf=94, =93.pdf.=94, =93pdf.=94? > =93zoi gy..pdf.gy=94 =3D =93pdf=94, =93.pdf=94, =93.pdf.=94, =93pdf.=94, = =93..pdf=94 or even > =93..pdf.=94? > > Now here comes the real problem: > The Reference Grammar does NOT have an answer, > so any of these answers I suggested could be true. > > Here is what the grammar chapter of the Reference Grammar says: > =93a. If the Lojban word =91zoi=92 (selma'o ZOI) is identified, take the > following Lojban word (which should be end delimited with a pause for > separation from the following non-Lojban text) as an opening delimiter. > Treat all text following that delimiter, until that delimiter recurs > after a pause, as grammatically a single token (labelled =91anything_699= =92 > in this grammar). There is no need for processing within this text > except as necessary to find the closing delimiter.=94 > This text does not explicitly state wheather to write down the periods. > I only know all periods are optional, leading to the situation > I pointed out before (pun intended). > The section on ZO and ZOI is of no help either. > > The point I want to make here is that the Reference Grammar does NOT say > which possiblity to choose. So please don=92t argue about what you think > would be the correct choice here. There is no correct choice, the > Reference Grammer does not define these borderline cases at all. Which > means: It is impossible to reliably quote any string with whitespace or > a period at the beginning or end. > > Since the Reference Grammar is disappointing me here, there must be a > clear definition for these cases. > > I therefore suggest the following rule changes to be applied to the > Reference Grammar: > - Written Lojban: > - periods are MANDATORY at the following places: Right after the > opening delimeter word and right before the closing delimeter word. > The mandatory period is considered part of the delimeter. > - The period of the opening and closing delimeter may not be merged > into one period. > And just to be sure: > - The quote starts after the period of the opening delimeter. > - The quote ends before the period of the closing delimeter. > - Everything, even whitespace between those two periods is considered > quoted. > - Spoken Lojban: > - The sound of the delimeter word may not appear in the quoted text, > no matter if there=92s a glottal stop before or after it. > > Here are some valid and invalid examples under these proposed rules: > =93zoi gy..gy=94 =3D =93=94. The opening delmeter is = =93gy.=94, > the closing delimeter is =93.gy=94. > =93zoi gy.gy=94 is an error! 2nd delimeter missing > =93zoi gygy=94 is an error! 1st delimeter missing > =93zoi gy. .gy=94 =3D =93 =94 > =93zoi gy..pdf.gy=94 =3D =93.pdf=94 > =93zoi gy.pdf.gy=94 =3D =93pdf=94 > =93zoi gy.Peter Smith.gy.=94 =3D =93Peter Smith=94 > =93zoi gy.Peter Smith.gy=94 =3D =93Peter Smith=94 (note the perio= d > AFTER the closing delimeter is still > optional) > =93zoi gy.Wuzzy eats Pizza..gy.=94 =3D =93Wuzzy eats Pizza.=94 > =93zoi gy. Peter Smith .gy.=94 =3D =93 Peter Smith =94 > =93zoi lo..lo.lo=94 =3D =93=94 followed by an article > =93zoi gy...gy=94 =3D =93.=94 > =93zoi gy. . .gy=94 =3D =93 . =94 > =93zoi gy.....gy=94 =3D =93...=94 > > Here are some borderline cases: > =93zoi gy.gy.gy=94 violates the spoken Lojban rule iff the 2nd > =93gy=94 has the sound of Lojban=92s =93gy=94. > =93zoi gy.gy.gy=94 is valid and is =93gy=94 iff the 2nd =93gy=94 has NOT = the sound > of Lojban=92s =93gy=94. > =93zoi gy.gyrations.gy=94: same as previous ones. > > Note the last three cases are borderline cases and should be used with > care. The pronounciation of the quoted text must be known for this to > work. While these may be valid, at best such borderline cases are > avoided by choosing another delimeter. Because you aren=92t going to > tortue the recipient, aren=92t you? ;-) > > That=92s it. > I have made some research and it seems nobody has pointed out these > problems before. Anyhow, this looks like serious trouble and > certainly must be fixed. > It is embarrassing that I am able to quote more precisely with English > than with Lojban. ;-) ;-) ;-) > > Please say wheather you agree with me in general and wheather you think > the proposed change is okay. If my proposal or reasoning has flaws in > is, please point them out. > > Greetings! > -- > Wuzzy > XMPP: Wuzzy2@jabber.ccc.de > E-Mail: wuzzy2@mail.ru > > -- > You received this message because you are subscribed to the Google Groups > "lojban" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to lojban+unsubscribe@googlegroups.com. > To post to this group, send email to lojban@googlegroups.com. > Visit this group at http://groups.google.com/group/lojban. > For more options, visit https://groups.google.com/groups/opt_out. > --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/groups/opt_out. --047d7bd6b800db2bb604efa20956 Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable
I thought {zoi} plus delimiters was only for words, which wo= uld exclude bare punctuation.=A0

stevo


On Fri, Jan 10, 2014 at 12:58 PM, Wuzzy <<= a href=3D"mailto:almikes@aol.com" target=3D"_blank">almikes@aol.com>= wrote:
TL;DR: Quoting with ZOI can be ambigious. I = point out the flaws and show
how the Reference Grammar does not help here. Then I propose a change=97or<= br> more precisely: a clarification=97to the grammar which hopefully fixes
the flaws.

Full text:
I noticed that you run into trouble when your ZOI quote has trailing
or leading whitespace, or trailing or leading periods. Also you run
into trouble if you want to decode a ZOI quote because you can=92t tell
if the text truly has trailing/leading whitespace/periods or not.

Just for extra clarity: In this e-mail I will make use of the so-called
logical quotation style so you know exactly what I=92m talking about and because the British quotation style is just damn awful.
Logical quotation style is basically this: All characters which are
between the quotation marks are quoted. No magic! ;-)
<https://en.wikipedia.org/wiki/Quotation_mark#Punctuation>

Let=92s go!
Just look at this:
What does =93zoi gy..gy=94 quote?
I could think of these answers:
(1) =93=94 (empty string (string =3D sequence of characters))
(2) =93..=94 (two periods)
(3) =93.=94 (one period)

All three answers seem plausible to me. Remember: according to the
Reference Grammar, all periods are optional:
=93Technically, the period is an optional reminder to the reader of a
mandatory pause that is dictated by the rules of the language;
because these rules are unambiguous, a missing period can be inferred
from otherwise correct text. Periods are included only as an aid to
the reader.=94=97Reference Grammar, chapter 3, section 3.

(1) would be true if both periods are considered to be part of the
delimeter, therefore, they are part of the Lojban text and not of the
quoted text.
(2) would be true if the =93pause periods=94 of Lojban have
been ommitted. Therefore, the periods must be part of the quoted text.
(3) is a weird case which could be true if one of the =93pause periods=94 has been ommited, but not the other one. Therefore, there is one period
left which is therefore part of the quoted text.

All three possibilities are plausible to me under the rules. But
under these rules, I can=92t tell which is the correct one.

There is another problem, with whitespace:
=93zoi gy gy=94 =3D =93=94 or =93 =94?

Here are more examples (look closely at spaces and periods):
=93zoi gy. .gy=94 =3D =93 =94, =93=94, =93. =94, =93.=94, =93. =94?
=93zoi gy.Peter Smith.gy=94 =3D =93Peter Smith=94, =93.Peter Smith=94, =93P= eter Smith.=94,
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =93.Peter Smith.=94? =93zoi gy. Peter Smith .gy=94 =3D =93Peter Smith=94, =93 Peter Smith=94, =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =93Peter Smith =94,= =93 Peter Smith =94?
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 With or without per= iods?
=93zoi
gy.pdf.gy=94 =3D = =93pdf=94, =93.pdf=94, =93.pdf.=94, =93pdf.=94?
=93zoi gy..pdf.gy=94 =3D = =93pdf=94, =93.pdf=94, =93.pdf.=94, =93pdf.=94, =93..pdf=94 or even
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=93..pdf.=94?

Now here comes the real problem:
The Reference Grammar does NOT have an answer,
so any of these answers I suggested could be true.

Here is what the grammar chapter of the Reference Grammar says:
=93a. If the Lojban word =91zoi=92 (selma'o ZOI) is identified, take th= e
following Lojban word (which should be end delimited with a pause for
separation from the following non-Lojban text) as an opening delimiter.
Treat all text following that delimiter, until that delimiter recurs
after a pause, as grammatically a single token (labelled =91anything_699=92=
in this grammar). There is no need for processing within this text
except as necessary to find the closing delimiter.=94
This text does not explicitly state wheather to write down the periods.
I only know all periods are optional, leading to the situation
I pointed out before (pun intended).
The section on ZO and ZOI is of no help either.

The point I want to make here is that the Reference Grammar does NOT say which possiblity to choose. So please don=92t argue about what you think would be the correct choice here. There is no correct choice, the
Reference Grammer does not define these borderline cases at all. Which
means: It is impossible to reliably quote any string with whitespace or
a period at the beginning or end.

Since the Reference Grammar is disappointing me here, there must be a
clear definition for these cases.

I therefore suggest the following rule changes to be applied to the
Reference Grammar:
- Written Lojban:
=A0 - periods are MANDATORY at the following places: Right after the
=A0 =A0 opening delimeter word and right before the closing delimeter word.=
=A0 =A0 The mandatory period is considered part of the delimeter.
=A0 - The period of the opening and closing delimeter may not be merged
=A0 =A0 into one period.
=A0 And just to be sure:
=A0 - The quote starts after the period of the opening delimeter.
=A0 - The quote ends before the period of the closing delimeter.
=A0 - Everything, even whitespace between those two periods is considered =A0 =A0 quoted.
- Spoken Lojban:
=A0 - The sound of the delimeter word may not appear in the quoted text, =A0 =A0 no matter if there=92s a glottal stop before or after it.

Here are some valid and invalid examples under these proposed rules:
=93zoi gy..gy=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=3D =93=94. The ope= ning delmeter is =93gy.=94,
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 the closing= delimeter is =93.gy=94.
=93zoi gy.gy=94 =A0 =A0 =A0 = =A0 =A0 =A0 =A0 =A0 =A0 =A0 is an error! 2nd delimeter missing
=93zoi gygy=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0is an error! 1st = delimeter missing
=93zoi gy. .gy=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =3D =93 =94
=93zoi gy..pdf.gy=94 =A0 = =A0 =A0 =A0 =A0 =A0 =A0 =A0=3D =93.pdf=94
=93zoi gy.pdf.gy=94 =A0 = =A0 =A0 =A0 =A0 =A0 =A0 =A0 =3D =93pdf=94
=93zoi gy.Peter Smith.gy.=94 =A0 =A0 =A0 =A0=3D =93Peter Smith=94
=93zoi gy.Peter Smith.gy=94 =A0 =A0 =A0 =A0 =3D =93Peter Smith=94 (note the= period
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 AFTER the c= losing delimeter is still
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 optional) =93zoi gy.Wuzzy eats Pizza..gy.=94 =A0=3D =93Wuzzy eats Pizza.=94
=93zoi gy. Peter Smith .gy.=94 =A0 =A0 =A0=3D =93 Peter Smith =94
=93zoi lo..lo.lo=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =3D =93=94 followed by = an article
=93zoi gy...gy=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =3D =93.=94
=93zoi gy. . .gy=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =3D =93 . =94
=93zoi gy.....gy=94 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =3D =93...=94

Here are some borderline cases:
=93zoi gy.gy.gy=94 violat= es the spoken Lojban rule iff the 2nd
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0=93gy=94 has the sound of Lojban=92s =93gy= =94.
=93zoi gy.gy.gy=94 is val= id and is =93gy=94 iff the 2nd =93gy=94 has NOT the sound
=A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0of Lojban=92s =93gy=94.
=93zoi gy.gyrations.gy= =94: same as previous ones.

Note the last three cases are borderline cases and should be used with
care. The pronounciation of the quoted text must be known for this to
work. While these may be valid, at best such borderline cases are
avoided by choosing another delimeter. Because you aren=92t going to
tortue the recipient, aren=92t you? ;-)

That=92s it.
I have made some research and it seems nobody has pointed out these
problems before. Anyhow, this looks like serious trouble and
certainly must be fixed.
It is embarrassing that I am able to quote more precisely with English
than with Lojban. ;-) ;-) ;-)

Please say wheather you agree with me in general and wheather you think
the proposed change is okay. If my proposal or reasoning has flaws in
is, please point them out.

Greetings!
--
Wuzzy
XMPP: Wuzzy2@jabber.ccc.de
E-Mail: wuzzy2@mail.ru

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+uns= ubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit https://groups.google.com/groups/opt_out.
--047d7bd6b800db2bb604efa20956--