Received-SPF: pass (google.com: domain of jjllambias@gmail.com designates 2607:f8b0:400c:c03::234 as permitted sender) client-ip=2607:f8b0:400c:c03::234;
MIME-Version: 1.0
In-Reply-To: <5416B55B.9030302@gmx.de>
References: <5415B8C0.4030003@gmx.de>
	<CAO7tK2f4j-UBvERLsqfjfbaMAEkQGO2FX4QBGvot+VdgDfQxGw@mail.gmail.com>
	<5416B55B.9030302@gmx.de>
Date: Mon, 15 Sep 2014 09:56:43 -0300
Message-ID: <CAO7tK2dU6hqhft2Btqxau2pGi1F6zVB9s42+=Stf+nXkX=UA2w@mail.gmail.com>
Subject: Re: [lojban] The White Knight (Through the Looking Glass)
From: =?UTF-8?Q?Jorge_Llamb=C3=ADas?= <jjllambias@gmail.com>
To: lojban@googlegroups.com
Reply-To: lojban@googlegroups.com
Precedence: list
Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
Sender: lojban@googlegroups.com
Content-Type: multipart/alternative; boundary=089e011847dea99dc705031a2ae3
X-Spam_score: -1.9
X-Spam_score_int: -18
X-Spam_bar: -

--089e011847dea99dc705031a2ae3
Content-Type: text/plain; charset=UTF-8

On Mon, Sep 15, 2014 at 6:46 AM, selpa'i <seladwa@gmx.de> wrote:

>
> Right. I think I first had {... go nai klaku gi co'e li'u}, but while that
> maintains grammaticality it doesn't really correspond to being interrupted
> mid-speech (or just stopping etc). Just like you I don't know what the best
> general solution would be; I feel like there should be a way to have an
> ungrammatical chunk inside grammatical text without making the entire text
> parsefail. I don't know how one would parse human speech otherwise, which
> is going to be full of incomplete sentences.


Right, human speech is obviously not parsed as a single chunk. The Lojban
parser is somewhat unnatural in that sense.


> One thing that could help is to add a lot more productions to the fragment
> rule of the grammar. Another solution I pondered involved giving EOF some
> magic powers so that it can make incomplete sentences parse up to the
> failure part and just treat the remainder as some sort of meaningless
> left-over. What's important is that the grammatical part of such a sentence
> still gets parsed properly.
>

It wouldn't be EOF though, because we still want to keep parsing what comes
after the incomplete sentence. The PEG can be modified so as to allow
incomplete sentences, but it means adding a lot of rules.

I'm not sure about the equally likely case where the mistake happens in the
> middle of a sentence. Perhaps an external statistical analyser would have
> to guess what was meant and make corrections accordingly...
>

I would say mistakes are different from interruptions, so they probably
require different treatments.

lo'u-le'u doesn't satisfy me, because 1) it requires you to know in advance
> that a sentence or text will be ungrammatical (and it's an ugly give-away
> in a written story), and 2) because text in error quotes does not get
> parsed, so there is no way to extract meaning from what is said.


Right.

 The second comment is about "ba'e" in:
>>
>> -.i ri cmene lo selsa'a ku xu
>> - na go'i .i do na jimpe .i ra ba'e cmene lo cmene
>>
>> Assuming the emphasis marks the rheme/comment as opposed to the
>> theme/topic, I would expect the "ba'e" on the second cmene. The first
>> cmene just repeats Alice's sentence, so it's not what the White Knight
>> is correcting. I understand the sentence structure is somewhat different
>> in Lojban than in the original, but the "ba'e" there just sounds off to
>> me.
>>
>
> I know exactly what you mean. When I read the Lojban I had the same
> feeling, so I went over to the English and found that it was "backwards" as
> well. {ba'e} on the second {cmene} definitely feels better, I just wasn't
> sure if I should make a "correction" to the original or if it fit the
> general weirdness in Alice.


I don't think the original has the same problem, because in the English you
have "the name" and "is called", and the White Knight's "the name" does
repeat Alice's "the name", and "is called" is the new information. The
problem with the Lojban is that there's two "cmene", and the one that is
new information comes first and in the same position as Alice's "cmene". If
it was a different word, say:

-.i ri cmene lo selsa'a ku xu
- na go'i .i do na jimpe .i ra ba'e sinxa lo cmene

or:

-.i ri cmene lo selsa'a ku xu
- na go'i .i do na jimpe .i ra lo cmene cu ba'e sinxa

then it would be easier to follow, because it would be more clear that "lo
cmene" is "lo cmene be lo selsa'a". (I'm not saying it would be a better
translation though.)

Also the way you have "xu" questioning "lo selsa'a" may add to the
garden-pathing. I think "vau xu" would correspond more closely to the
original.

mu'o mi'e xorxes

-- 
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--089e011847dea99dc705031a2ae3
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><div class=3D"gmail_extra"><div class=3D"gmail_quote">=
On Mon, Sep 15, 2014 at 6:46 AM, selpa&#39;i <span dir=3D"ltr">&lt;<a href=
=3D"mailto:seladwa@gmx.de" target=3D"_blank">seladwa@gmx.de</a>&gt;</span> =
wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8=
ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-sty=
le:solid;padding-left:1ex"><span class=3D""><br></span>
Right. I think I first had {... go nai klaku gi co&#39;e li&#39;u}, but whi=
le that maintains grammaticality it doesn&#39;t really correspond to being =
interrupted mid-speech (or just stopping etc). Just like you I don&#39;t kn=
ow what the best general solution would be; I feel like there should be a w=
ay to have an ungrammatical chunk inside grammatical text without making th=
e entire text parsefail. I don&#39;t know how one would parse human speech =
otherwise, which is going to be full of incomplete sentences. </blockquote>=
<div><br></div><div>Right, human speech is obviously not parsed as a single=
 chunk. The Lojban parser is somewhat unnatural in that sense.</div><div>=
=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0=
.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-s=
tyle:solid;padding-left:1ex">One thing that could help is to add a lot more=
 productions to the fragment rule of the grammar. Another solution I ponder=
ed involved giving EOF some magic powers so that it can make incomplete sen=
tences parse up to the failure part and just treat the remainder as some so=
rt of meaningless left-over. What&#39;s important is that the grammatical p=
art of such a sentence still gets parsed properly.<br></blockquote><div><br=
></div><div>It wouldn&#39;t be EOF though, because we still want to keep pa=
rsing what comes after the incomplete sentence. The PEG can be modified so =
as to allow incomplete sentences, but it means adding a lot of rules.=C2=A0=
</div><div><br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);bord=
er-left-style:solid;padding-left:1ex">
I&#39;m not sure about the equally likely case where the mistake happens in=
 the middle of a sentence. Perhaps an external statistical analyser would h=
ave to guess what was meant and make corrections accordingly...<br></blockq=
uote><div><br></div><div>I would say mistakes are different from interrupti=
ons, so they probably require different treatments.</div><div><br></div><bl=
ockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-lef=
t-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padd=
ing-left:1ex">
lo&#39;u-le&#39;u doesn&#39;t satisfy me, because 1) it requires you to kno=
w in advance that a sentence or text will be ungrammatical (and it&#39;s an=
 ugly give-away in a written story), and 2) because text in error quotes do=
es not get parsed, so there is no way to extract meaning from what is said.=
</blockquote><div><br></div><div>Right.</div><div><br></div><blockquote cla=
ss=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;=
border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex=
"><span class=3D"">
<blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-=
left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;p=
adding-left:1ex">
The second comment is about &quot;ba&#39;e&quot; in:<br>
<br>
-.i ri cmene lo selsa&#39;a ku xu<br>
- na go&#39;i .i do na jimpe .i ra ba&#39;e cmene lo cmene<br>
<br>
Assuming the emphasis marks the rheme/comment as opposed to the<br>
theme/topic, I would expect the &quot;ba&#39;e&quot; on the second cmene. T=
he first<br>
cmene just repeats Alice&#39;s sentence, so it&#39;s not what the White Kni=
ght<br>
is correcting. I understand the sentence structure is somewhat different<br=
>
in Lojban than in the original, but the &quot;ba&#39;e&quot; there just sou=
nds off to me.<br>
</blockquote>
<br></span>
I know exactly what you mean. When I read the Lojban I had the same feeling=
, so I went over to the English and found that it was &quot;backwards&quot;=
 as well. {ba&#39;e} on the second {cmene} definitely feels better, I just =
wasn&#39;t sure if I should make a &quot;correction&quot; to the original o=
r if it fit the general weirdness in Alice.</blockquote><div><br></div><div=
>I don&#39;t think the original has the same problem, because in the Englis=
h you have &quot;the name&quot; and &quot;is called&quot;, and the White Kn=
ight&#39;s &quot;the name&quot; does repeat Alice&#39;s &quot;the name&quot=
;, and &quot;is called&quot; is the new information. The problem with the L=
ojban is that there&#39;s two &quot;cmene&quot;, and the one that is new in=
formation comes first and in the same position as Alice&#39;s &quot;cmene&q=
uot;. If it was a different word, say:</div><div><br></div><div>-.i ri cmen=
e lo selsa&#39;a ku xu<br>- na go&#39;i .i do na jimpe .i ra ba&#39;e sinxa=
 lo cmene<br></div><div><br></div><div>or:</div><div><br></div><div>-.i ri =
cmene lo selsa&#39;a ku xu<br>- na go&#39;i .i do na jimpe .i ra lo cmene c=
u ba&#39;e sinxa<br></div><div><br></div><div>then it would be easier to fo=
llow, because it would be more clear that &quot;lo cmene&quot; is &quot;lo =
cmene be lo selsa&#39;a&quot;. (I&#39;m not saying it would be a better tra=
nslation though.)</div><div><br></div><div>Also the way you have &quot;xu&q=
uot; questioning &quot;lo selsa&#39;a&quot; may add to the garden-pathing. =
I think &quot;vau xu&quot; would correspond more closely to the original.=
=C2=A0</div><div><br></div><div>mu&#39;o mi&#39;e xorxes</div><div><br></di=
v></div></div></div>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups &=
quot;lojban&quot; group.<br />
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:lojban+unsubscribe@googlegroups.com">lojban+unsub=
scribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href=3D"mailto:lojban@googlegroups.=
com">lojban@googlegroups.com</a>.<br />
Visit this group at <a href=3D"http://groups.google.com/group/lojban">http:=
//groups.google.com/group/lojban</a>.<br />
For more options, visit <a href=3D"https://groups.google.com/d/optout">http=
s://groups.google.com/d/optout</a>.<br />

--089e011847dea99dc705031a2ae3--