Received-SPF: pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c05::229 as permitted sender) client-ip=2a00:1450:400c:c05::229;
MIME-Version: 1.0
In-Reply-To: <e157ff58-8ad1-4ca0-afc9-18cf7d2f352e@googlegroups.com>
References: <CAO7bV+hZTwiquY0GcTA0xVjg2t+QnRt_DcY+XafDJYfa10L7ow@mail.gmail.com>
 <20150204124517.GA1243@kuebelreiter.informatik.Uni-Osnabrueck.DE>
 <CAO7bV+jCrZy4cO62r=jJXbdM67W9RZw=k5HBvML2ZgferZiJcQ@mail.gmail.com> <e157ff58-8ad1-4ca0-afc9-18cf7d2f352e@googlegroups.com>
From: Gleki Arxokuna <gleki.is.my.name@gmail.com>
Date: Mon, 9 Feb 2015 13:54:19 +0300
Message-ID: <CAO7bV+gV_p8w-de=N1DcfSt8CzkmSZ=bHp6H1AV3Fqm8wYTh0A@mail.gmail.com>
Subject: Re: [lojban] the myth of monoparsing
To: "lojban@googlegroups.com" <lojban@googlegroups.com>
Content-Type: multipart/alternative; boundary=047d7ba97234cf9a69050ea598c7
Reply-To: lojban@googlegroups.com
Precedence: list
Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
Sender: lojban@googlegroups.com
X-Spam_score: 0.8
X-Spam_score_int: 8
X-Spam_bar: /
X-Spam-Report: Spam detection software, running on the system "stodi.digitalkingdom.org",
 has NOT identified this incoming email as spam.  The original
 message has been attached to this so you can view it or label
 similar future email.  If you have any questions, see
 @@CONTACT_ADDRESS@@ for details.
 
 Content preview:  2015-02-08 4:34 GMT+03:00 ianek <janek37@gmail.com>: > > >
    On Friday, February 6, 2015 at 8:13:30 AM UTC+1, la gleki wrote: >> >> >>
    >> 2015-02-04 15:45 GMT+03:00 v4hn <m...@v4hn.de>: >> >>> On Tue, Feb 03,
    2015 at 11:42:32AM +0300, Gleki Arxokuna wrote: >>> > "Fred saw a plane flying
    over Zurich" can have several meanings >>> >>> Yes. >>> However, for me,
   the issue here is that we (hopefully..) agree >>> that there are different
    parse trees (which yield the different >>> meanings). >>> >> >> No, several
    trees arise after you interpret the sentence. >> > > But if you had an English
    parser, it would yield several trees without any > interpreting. > [...] 
 
 Content analysis details:   (0.8 points, 5.0 required)
 
  pts rule name              description
 ---- ---------------------- --------------------------------------------------
  0.0 URIBL_BLOCKED          ADMINISTRATOR NOTICE: The query to URIBL was blocked.
                             See
                             http://wiki.apache.org/spamassassin/DnsBlocklists#dnsbl-block
                              for more information.
                             [URIs: googlegroups.com]
  2.7 DNS_FROM_AHBL_RHSBL    RBL: Envelope sender listed in dnsbl.ahbl.org
                             [listed in googlegroups.com.rhsbl.ahbl.org.	IN]
                             [A]
 -0.0 RCVD_IN_MSPIKE_H2      RBL: Average reputation (+2)
                             [209.85.215.64 listed in wl.mailspike.net]
  0.0 T_HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail
                             domains are different
 -0.0 SPF_PASS               SPF: sender matches SPF record
  0.0 FREEMAIL_FROM          Sender email is commonly abused enduser mail provider
                             (gleki.is.my.name[at]gmail.com)
  0.0 DKIM_ADSP_CUSTOM_MED   No valid author signature, adsp_override is
                             CUSTOM_MED
  0.0 HTML_MESSAGE           BODY: HTML included in message
 -1.9 BAYES_00               BODY: Bayes spam probability is 0 to 1%
                             [score: 0.0000]
 -0.1 DKIM_VALID             Message has at least one valid DKIM or DK signature
  0.1 DKIM_SIGNED            Message has a DKIM or DK signature, not necessarily valid
  0.0 T_FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and
                             EnvelopeFrom freemail headers are different

--047d7ba97234cf9a69050ea598c7
Content-Type: text/plain; charset=UTF-8

2015-02-08 4:34 GMT+03:00 ianek <janek37@gmail.com>:

>
>
> On Friday, February 6, 2015 at 8:13:30 AM UTC+1, la gleki wrote:
>>
>>
>>
>> 2015-02-04 15:45 GMT+03:00 v4hn <m...@v4hn.de>:
>>
>>> On Tue, Feb 03, 2015 at 11:42:32AM +0300, Gleki Arxokuna wrote:
>>> > "Fred saw a plane flying over Zurich" can have several meanings
>>>
>>> Yes.
>>> However, for me, the issue here is that we (hopefully..) agree
>>> that there are different parse trees (which yield the different
>>> meanings).
>>>
>>
>> No, several trees arise after you interpret the sentence.
>>
>
> But if you had an English parser, it would yield several trees without any
> interpreting.
>

Sure! Because English parsers lack the ability to find something common in
all of the parse trees.


> Like this:
>
> "Fred saw a plane flying over Zurich"
> NAME VERB-PAST ARTICLE COUNTABLE-NOUN VERB-ING PREPOSITION NAME
>
> Some (much simplified) rules could be:
>
> Sentence ::= Noun-Phrase Verb Noun-Phrase
> Sentence ::= Noun-Phrase Verb Noun-Phrase Adverbial-Phrase
> Noun-Phrase ::= NAME | ARTICLE COUNTABLE-NOUN | Noun-Phrase VERB-ING
> Prepositional-Clause
> Verb ::= VERB-PAST
> Adverbial-Phrase ::= VERB-ING Preposition-Clause
> Preposition-Clause ::= PREPOSITION Noun-Phrase
>
> This simple grammar yields two parse trees for that sentence:
>
> Sentence
> ----Noun-Phrase
> --------NAME
> ------------Fred
> ----Verb
> --------VERB-PAST
> ------------saw
> ----Noun-Phrase
> --------Noun-Phrase
> ------------ARTICLE
> ----------------a
> ------------NOUN
> ----------------plane
> --------VERB-ING
> ------------flying
> --------Prepositional-Clause
> ------------PROPOSITION
> ----------------over
> ------------Noun-Phrase
> ----------------NAME
> --------------------Zurich
>
> Sentence
> ----Noun-Phrase
> --------NAME
> ------------Fred
> ----Verb
> --------VERB-PAST
> ------------saw
> ----Noun-Phrase
> --------Noun-Phrase
> ------------ARTICLE
> ----------------a
> ------------NOUN
> ----------------plane
> ----Adverbial-Phrase
> --------VERB-ING
> ------------flying
> --------Prepositional-Clause
> ------------PROPOSITION
> ----------------over
> ------------Noun-Phrase
> ----------------NAME
> --------------------Zurich
>
> Formal grammars for natural languages do exist, although they're not
> perfect, but the problem with multiple grammatically sensible parses (often
> millions of trees and more) is much greater than the problem with
> nonsensible trees or correct sentences that don't parse at all.
>
> Lojban was carefully designed to avoid this problem. And it doesn't have
> anything to do with {xi PA}. The Lojban grammar specifies XI clauses
> unambiguously. Parse trees are unique. Monoparsing is not a myth. XI
> clauses may add semantic ambiguity on a different level then, say, simple
> {zo'e}, but it doesn't have anything to do with syntactic ambiguity.
>

It specifies to which head a clause should attach. And since it's {mo'e
zo'e} it's vague to which head it attaches. If the parser you use doesn't
allow for that the only thing that can be done is to provide several
possible trees.


>
{la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi vofli ga'u
> la tsurix} has only one syntax tree, regardless of the number of possible
> semantic interpretations.
>

If you applied {mo'e zo'e} to the English sentence you will still get the
only syntax tree.

>
> In English you can have sentences that are semantically ambiguous due to
> syntactic ambiguity. In Lojban you can have sentences with (roughly) the
> same semantic ambiguity as the English ones, but syntactically unambiguous.
>
>
>>
>>> > {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi vofli
>>> ga'u
>>> > la tsurix}
>>>
>>> camxes only produces one parse tree for that.
>>>
>>
>> And for English you don't provide any parses at all.
>> May be someone should just parse the original English sentence as camxes
>> does for Lojban one?
>> I won't be surprised if such parser for English doesn't exist since those
>> who write them might mix parsing and interpretation of it. The latter would
>> be replacing {mo'e zo'e} with some PA which will immediately lead to
>> several syntactic trees.
>>
>> So I both disagree and agree with you on whether English sentence has
>> several syntactic trees. If using one term for two operations is stopped
>> the contradiction disappears.
>>
>>
>>
>>> If you think it should produce more then one, raise a bug report.
>>>
>>
>> I'm not aware of any Lojban parsers that perform interpretation
>> operation. In most cases you just need context and one interpretation. But
>> this is semantic analysis. Producing all possible syntactic trees is a task
>> needed more seldom.
>>
>
> Camxes is intended to produce all possible syntactic trees, and there's
> only one of them for any valid sentence.
>

You may invent a Lojban parser that won't be able to parse {mo'e zo'e}.
Then you will need workarounds to output several trees.


>
> mu'o mi'e ianek
>
> --
> You received this message because you are subscribed to the Google Groups
> "lojban" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to lojban+unsubscribe@googlegroups.com.
> To post to this group, send email to lojban@googlegroups.com.
> Visit this group at http://groups.google.com/group/lojban.
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--047d7ba97234cf9a69050ea598c7
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><div class=3D"gmail_extra"><br><div class=3D"gmail_quo=
te">2015-02-08 4:34 GMT+03:00 ianek <span dir=3D"ltr">&lt;<a href=3D"mailto=
:janek37@gmail.com" target=3D"_blank">janek37@gmail.com</a>&gt;</span>:<br>=
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><br><br>On Friday, February=
 6, 2015 at 8:13:30 AM UTC+1, la gleki wrote:<span class=3D""><blockquote c=
lass=3D"gmail_quote" style=3D"margin:0;margin-left:0.8ex;border-left:1px #c=
cc solid;padding-left:1ex"><div dir=3D"ltr"><br><div><br><div class=3D"gmai=
l_quote">2015-02-04 15:45 GMT+03:00 v4hn <span dir=3D"ltr">&lt;<a rel=3D"no=
follow">m...@v4hn.de</a>&gt;</span>:<br><blockquote class=3D"gmail_quote" s=
tyle=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rg=
b(204,204,204);border-left-style:solid;padding-left:1ex"><span>On Tue, Feb =
03, 2015 at 11:42:32AM +0300, Gleki Arxokuna wrote:<br>
&gt; &quot;Fred saw a plane flying over Zurich&quot; can have several meani=
ngs<br>
<br>
</span>Yes.<br>
However, for me, the issue here is that we (hopefully..) agree<br>
that there are different parse trees (which yield the different meanings).<=
br></blockquote><div><br></div><div>No, several trees arise after you inter=
pret the sentence.</div></div></div></div></blockquote></span><div><br>But =
if you had an English parser, it would yield several trees without any inte=
rpreting.</div></div></blockquote><div><br></div><div>Sure! Because English=
 parsers lack the ability to find something common in all of the parse tree=
s.</div><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:=
0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><d=
iv> Like this:<br><br><div><span class=3D"">&quot;Fred saw a plane flying o=
ver Zurich&quot;<br></span>NAME VERB-PAST ARTICLE COUNTABLE-NOUN VERB-ING P=
REPOSITION NAME<br><br>Some (much simplified) rules could be:<br><br>Senten=
ce ::=3D Noun-Phrase Verb Noun-Phrase<br>Sentence ::=3D Noun-Phrase Verb No=
un-Phrase Adverbial-Phrase<br>Noun-Phrase ::=3D NAME | ARTICLE COUNTABLE-NO=
UN | Noun-Phrase VERB-ING Prepositional-Clause<br>Verb ::=3D VERB-PAST<br>A=
dverbial-Phrase ::=3D VERB-ING Preposition-Clause<br>Preposition-Clause ::=
=3D PREPOSITION Noun-Phrase<br><br>This simple grammar yields two parse tre=
es for that sentence:<br><br>Sentence<br>----Noun-Phrase<br>--------NAME<br=
>------------Fred<br>----Verb<br>--------VERB-PAST<br>------------saw<br>--=
--Noun-Phrase<br>--------Noun-Phrase<br>------------ARTICLE<br>------------=
----a<br>------------NOUN<br>----------------plane<br>--------VERB-ING<br>-=
-----------flying<br>--------Prepositional-Clause<br>------------PROPOSITIO=
N<br>----------------over<br>------------Noun-Phrase<br>----------------NAM=
E<br>--------------------Zurich<br><br>Sentence<br>----Noun-Phrase<br>-----=
---NAME<br>------------Fred<br>----Verb<br>--------VERB-PAST<br>-----------=
-saw<br>----Noun-Phrase<br>--------Noun-Phrase<br>------------ARTICLE<br>--=
--------------a<br>------------NOUN<br>----------------plane<br>----Adverbi=
al-Phrase<br>--------VERB-ING<br>------------flying<br>--------Prepositiona=
l-Clause<br>------------PROPOSITION<br>----------------over<br>------------=
Noun-Phrase<br>----------------NAME<br>--------------------Zurich<br><br>Fo=
rmal grammars for natural languages do exist, although they&#39;re not perf=
ect, but the problem with multiple grammatically sensible parses (often mil=
lions of trees and more) is much greater than the problem with nonsensible =
trees or correct sentences that don&#39;t parse at all.<br><br>Lojban was c=
arefully designed to avoid this problem. And it doesn&#39;t have anything t=
o do with {xi PA}. The Lojban grammar specifies XI clauses unambiguously. P=
arse trees are unique. Monoparsing is not a myth. XI clauses may add semant=
ic ambiguity on a different level then, say, simple {zo&#39;e}, but it does=
n&#39;t have anything to do with syntactic ambiguity.</div></div></div></bl=
ockquote><div><br></div><div>It specifies to which head a clause should att=
ach. And since it&#39;s {mo&#39;e zo&#39;e} it&#39;s vague to which head it=
 attaches. If the parser you use doesn&#39;t allow for that the only thing =
that can be done is to provide several possible trees.</div><div><br></div>=
<blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1p=
x #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div><div>=C2=A0</div></div=
></div></blockquote><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0=
 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div><d=
iv>{la fred pu viska lo vinji do&#39;e lo se xi vei mo&#39;e zo&#39;e nei p=
oi vofli ga&#39;u la tsurix} has only one syntax tree, regardless of the nu=
mber of possible semantic interpretations.<br></div></div></div></blockquot=
e><div><br></div><div>If you applied {mo&#39;e zo&#39;e} to the English sen=
tence you will still get the only syntax tree.</div><blockquote class=3D"gm=
ail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-le=
ft:1ex"><div dir=3D"ltr"><div><div><br>In English you can have sentences th=
at are semantically ambiguous due to syntactic ambiguity. In Lojban you can=
 have sentences with (roughly) the same semantic ambiguity as the English o=
nes, but syntactically unambiguous.<br>=C2=A0</div></div><span class=3D""><=
blockquote class=3D"gmail_quote" style=3D"margin:0;margin-left:0.8ex;border=
-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div><div class=3D"=
gmail_quote"><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px =
0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-=
style:solid;padding-left:1ex">
<span><br>
&gt; {la fred pu viska lo vinji do&#39;e lo se xi vei mo&#39;e zo&#39;e nei=
 poi vofli ga&#39;u<br>
&gt; la tsurix}<br>
<br>
</span>camxes only produces one parse tree for that.<br></blockquote><div><=
br></div><div>And for English you don&#39;t provide any parses at all.</div=
><div>May be someone should just parse the original English sentence as cam=
xes does for Lojban one?</div><div>I won&#39;t be surprised if such parser =
for English doesn&#39;t exist since those who write them might mix parsing =
and interpretation of it. The latter would be replacing {mo&#39;e zo&#39;e}=
 with some PA which will immediately lead to several syntactic trees.</div>=
<div><br></div><div>So I both disagree and agree with you on whether Englis=
h sentence has several syntactic trees. If using one term for two operation=
s is stopped the contradiction disappears.</div><div><br></div><div>=C2=A0<=
/div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bo=
rder-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:so=
lid;padding-left:1ex">
If you think it should produce more then one, raise a bug report.<br></bloc=
kquote><div><br></div><div>I&#39;m not aware of any Lojban parsers that per=
form interpretation operation. In most cases you just need context and one =
interpretation. But this is semantic analysis. Producing all possible synta=
ctic trees is a task needed more seldom.</div></div></div></div></blockquot=
e></span><div><br>Camxes is intended to produce all possible syntactic tree=
s, and there&#39;s only one of them for any valid sentence.<br></div></div>=
</blockquote><div><br></div><div>You may invent a Lojban parser that won=
9;t be able to parse {mo&#39;e zo&#39;e}. Then you will need workarounds to=
 output several trees.</div><div>=C2=A0</div><blockquote class=3D"gmail_quo=
te" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"=
><div dir=3D"ltr"><div><br>mu&#39;o mi&#39;e ianek<br></div></div><div clas=
s=3D"HOEnZb"><div class=3D"h5">

<p></p>

-- <br>
You received this message because you are subscribed to the Google Groups &=
quot;lojban&quot; group.<br>
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:lojban+unsubscribe@googlegroups.com" target=3D"_b=
lank">lojban+unsubscribe@googlegroups.com</a>.<br>
To post to this group, send email to <a href=3D"mailto:lojban@googlegroups.=
com" target=3D"_blank">lojban@googlegroups.com</a>.<br>
Visit this group at <a href=3D"http://groups.google.com/group/lojban" targe=
t=3D"_blank">http://groups.google.com/group/lojban</a>.<br>
For more options, visit <a href=3D"https://groups.google.com/d/optout" targ=
et=3D"_blank">https://groups.google.com/d/optout</a>.<br>
</div></div></blockquote></div><br></div></div>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups &=
quot;lojban&quot; group.<br />
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:lojban+unsubscribe@googlegroups.com">lojban+unsub=
scribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href=3D"mailto:lojban@googlegroups.=
com">lojban@googlegroups.com</a>.<br />
Visit this group at <a href=3D"http://groups.google.com/group/lojban">http:=
//groups.google.com/group/lojban</a>.<br />
For more options, visit <a href=3D"https://groups.google.com/d/optout">http=
s://groups.google.com/d/optout</a>.<br />

--047d7ba97234cf9a69050ea598c7--