Date: Thu, 12 Feb 2015 13:04:03 -0800 (PST)
From: ianek <janek37@gmail.com>
To: lojban@googlegroups.com
Message-Id: <8b2f2e8a-a89f-4544-9ee7-d5189bd4a07b@googlegroups.com>
In-Reply-To: <CAO7bV+gSL3Q-Gg3t7k=Tts7U_RkpFdpi246xfG-c=hArAz-+6A@mail.gmail.com>
References: <CAO7bV+hZTwiquY0GcTA0xVjg2t+QnRt_DcY+XafDJYfa10L7ow@mail.gmail.com>
 <20150204124517.GA1243@kuebelreiter.informatik.Uni-Osnabrueck.DE>
 <CAO7bV+jCrZy4cO62r=jJXbdM67W9RZw=k5HBvML2ZgferZiJcQ@mail.gmail.com>
 <e157ff58-8ad1-4ca0-afc9-18cf7d2f352e@googlegroups.com> <CAO7bV+gV_p8w-de=N1DcfSt8CzkmSZ=bHp6H1AV3Fqm8wYTh0A@mail.gmail.com>
 <c04eb516-981e-4338-9728-a7e852523b4a@googlegroups.com> <CAO7bV+geB+=fq33PeF0duOhwFs-D93gYnyxTZo_SFK9rif-m=g@mail.gmail.com>
 <50d5006f-f02b-4a28-9894-6608729585fc@googlegroups.com> <CAO7bV+gosqaCDxXeT4aqFHwOWWZFPfHPgfrhwck2KKg6VQh0Rg@mail.gmail.com>
 <c8161fe3-4b73-4b40-bade-41ba5993700c@googlegroups.com>
 <CAO7bV+gSL3Q-Gg3t7k=Tts7U_RkpFdpi246xfG-c=hArAz-+6A@mail.gmail.com>
Subject: Re: [lojban] the myth of monoparsing
MIME-Version: 1.0
Content-Type: multipart/mixed; 
	boundary="----=_Part_899_315185249.1423775043982"
Reply-To: lojban@googlegroups.com
Precedence: list
Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
Sender: lojban@googlegroups.com
X-Spam_score: 0.7
X-Spam_score_int: 7
X-Spam_bar: /
X-Spam-Report: Spam detection software, running on the system "stodi.digitalkingdom.org",
 has NOT identified this incoming email as spam.  The original
 message has been attached to this so you can view it or label
 similar future email.  If you have any questions, see
 @@CONTACT_ADDRESS@@ for details.
 
 Content preview:  On Thursday, February 12, 2015 at 8:00:17 PM UTC+1, la gleki
    wrote: > > > 2015-02-12 21:42 GMT+03:00 ianek <jan...@gmail.com <javascript:>>:
    > >> >> >> On Thursday, February 12, 2015 at 7:11:12 AM UTC+1, la gleki wrote:
    >> >>> >>> >>> 2015-02-12 1:20 GMT+03:00 ianek <jan...@gmail.com>: >>> >>>>
    >>>> >>>> On Wednesday, February 11, 2015 at 1:50:49 PM UTC+1, la gleki wrote:
    >>>> >>>>> >>>>> >>>>> 2015-02-09 23:22 GMT+03:00 ianek <jan...@gmail.com>:
    >>>>> >>>>>> >>>>>> >>>>>> On Monday, February 9, 2015 at 11:54:41 AM UTC+1,
    la gleki wrote: >>>>>>> >>>>>>> >>>>>>> >>>>>>> 2015-02-08 4:34 GMT+03:00
    ianek <jan...@gmail.com>: >>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Friday, February
    6, 2015 at 8:13:30 AM UTC+1, la gleki wrote: >>>>>>>>> >>>>>>>>> >>>>>>>>>
    >>>>>>>>> 2015-02-04 15:45 GMT+03:00 v4hn <m...@v4hn.de>: >>>>>>>>> >>>>>>>>>>
    On Tue, Feb 03, 2015 at 11:42:32AM +0300, Gleki Arxokuna wrote: >>>>>>>>>>
    > "Fred saw a plane flying over Zurich" can have several meanings >>>>>>>>>>
    >>>>>>>>>> Yes. >>>>>>>>>> However, for me, the issue here is that we (hopefully..)
    agree >>>>>>>>>> that there are different parse trees (which yield the different
    >>>>>>>>>> meanings). >>>>>>>>>> >>>>>>>>> >>>>>>>>> No, several trees arise
    after you interpret the sentence. >>>>>>>>> >>>>>>>> >>>>>>>> But if you
   had an English parser, it would yield several trees >>>>>>>> without any interpreting.
    >>>>>>>> >>>>>>> >>>>>>> Sure! Because English parsers lack the ability to
    find something >>>>>>> common in all of the parse trees. >>>>>>> >>>>>> >>>>>>
    No. It's because words in an English sentence can be parsed as >>>>>> different
    syntactic structures. That's what parsing means: determining >>>>>> structures
    formed by words. Not "finding something common". >>>>>> >>>>> >>>>> You yourself
    just showed several parses of the same sentence. >>>>> This is how usual
   English parsers are constructed. >>>>> >>>>> However, there is another option
    to monoparse this English sentence. >>>>> >>>>> You mix English language
   and one current theory of how to parse [...] 
 
 Content analysis details:   (0.7 points, 5.0 required)
 
  pts rule name              description
 ---- ---------------------- --------------------------------------------------
  0.0 URIBL_BLOCKED          ADMINISTRATOR NOTICE: The query to URIBL was blocked.
                             See
                             http://wiki.apache.org/spamassassin/DnsBlocklists#dnsbl-block
                              for more information.
                             [URIs: googlegroups.com]
  2.7 DNS_FROM_AHBL_RHSBL    RBL: Envelope sender listed in dnsbl.ahbl.org
                             [listed in googlegroups.com.rhsbl.ahbl.org.	IN]
                             [A]
  0.0 T_HEADER_FROM_DIFFERENT_DOMAINS From and EnvelopeFrom 2nd level mail
                             domains are different
 -0.0 SPF_PASS               SPF: sender matches SPF record
  0.0 FREEMAIL_FROM          Sender email is commonly abused enduser mail provider
                             (janek37[at]gmail.com)
  0.0 HTML_MESSAGE           BODY: HTML included in message
 -1.9 BAYES_00               BODY: Bayes spam probability is 0 to 1%
                             [score: 0.0000]
 -0.1 DKIM_VALID             Message has at least one valid DKIM or DK signature
 -0.1 DKIM_VALID_AU          Message has a valid DKIM or DK signature from author's
                             domain
  0.1 DKIM_SIGNED            Message has a DKIM or DK signature, not necessarily valid
  0.0 T_FREEMAIL_FORGED_FROMDOMAIN 2nd level domains in From and
                             EnvelopeFrom freemail headers are different

------=_Part_899_315185249.1423775043982
Content-Type: multipart/alternative; 
	boundary="----=_Part_900_1623806441.1423775043982"

------=_Part_900_1623806441.1423775043982
Content-Type: text/plain; charset=UTF-8


On Thursday, February 12, 2015 at 8:00:17 PM UTC+1, la gleki wrote:
>
>
> 2015-02-12 21:42 GMT+03:00 ianek <jan...@gmail.com <javascript:>>:
>
>>
>>
>> On Thursday, February 12, 2015 at 7:11:12 AM UTC+1, la gleki wrote:
>>
>>>
>>>
>>> 2015-02-12 1:20 GMT+03:00 ianek <jan...@gmail.com>:
>>>
>>>>
>>>>
>>>> On Wednesday, February 11, 2015 at 1:50:49 PM UTC+1, la gleki wrote:
>>>>
>>>>>
>>>>>
>>>>> 2015-02-09 23:22 GMT+03:00 ianek <jan...@gmail.com>:
>>>>>
>>>>>>
>>>>>>
>>>>>> On Monday, February 9, 2015 at 11:54:41 AM UTC+1, la gleki wrote:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> 2015-02-08 4:34 GMT+03:00 ianek <jan...@gmail.com>:
>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>> On Friday, February 6, 2015 at 8:13:30 AM UTC+1, la gleki wrote:
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> 2015-02-04 15:45 GMT+03:00 v4hn <m...@v4hn.de>:
>>>>>>>>>
>>>>>>>>>> On Tue, Feb 03, 2015 at 11:42:32AM +0300, Gleki Arxokuna wrote:
>>>>>>>>>> > "Fred saw a plane flying over Zurich" can have several meanings
>>>>>>>>>>
>>>>>>>>>> Yes.
>>>>>>>>>> However, for me, the issue here is that we (hopefully..) agree
>>>>>>>>>> that there are different parse trees (which yield the different 
>>>>>>>>>> meanings).
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>> No, several trees arise after you interpret the sentence.
>>>>>>>>>
>>>>>>>>
>>>>>>>> But if you had an English parser, it would yield several trees 
>>>>>>>> without any interpreting.
>>>>>>>>
>>>>>>>
>>>>>>> Sure! Because English parsers lack the ability to find something 
>>>>>>> common in all of the parse trees.
>>>>>>>
>>>>>>
>>>>>> No. It's because words in an English sentence can be parsed as 
>>>>>> different syntactic structures. That's what parsing means: determining 
>>>>>> structures formed by words. Not "finding something common".
>>>>>>
>>>>>
>>>>> You yourself just showed several parses of the same sentence.
>>>>> This is how usual English parsers are constructed. 
>>>>>
>>>>> However, there is another option to monoparse this English sentence.
>>>>>
>>>>> You mix English language and one current theory of how to parse it.
>>>>>
>>>>>  
>>>>>>
>>>>>>>  
>>>>>>>
>>>>>>>> Like this:
>>>>>>>>
>>>>>>>> "Fred saw a plane flying over Zurich"
>>>>>>>> NAME VERB-PAST ARTICLE COUNTABLE-NOUN VERB-ING PREPOSITION NAME
>>>>>>>>
>>>>>>>> Some (much simplified) rules could be:
>>>>>>>>
>>>>>>>> Sentence ::= Noun-Phrase Verb Noun-Phrase
>>>>>>>> Sentence ::= Noun-Phrase Verb Noun-Phrase Adverbial-Phrase
>>>>>>>> Noun-Phrase ::= NAME | ARTICLE COUNTABLE-NOUN | Noun-Phrase 
>>>>>>>> VERB-ING Prepositional-Clause
>>>>>>>> Verb ::= VERB-PAST
>>>>>>>> Adverbial-Phrase ::= VERB-ING Preposition-Clause
>>>>>>>> Preposition-Clause ::= PREPOSITION Noun-Phrase
>>>>>>>>
>>>>>>>> This simple grammar yields two parse trees for that sentence:
>>>>>>>>
>>>>>>>> Sentence
>>>>>>>> ----Noun-Phrase
>>>>>>>> --------NAME
>>>>>>>> ------------Fred
>>>>>>>> ----Verb
>>>>>>>> --------VERB-PAST
>>>>>>>> ------------saw
>>>>>>>> ----Noun-Phrase
>>>>>>>> --------Noun-Phrase
>>>>>>>> ------------ARTICLE
>>>>>>>> ----------------a
>>>>>>>> ------------NOUN
>>>>>>>> ----------------plane
>>>>>>>> --------VERB-ING
>>>>>>>> ------------flying
>>>>>>>> --------Prepositional-Clause
>>>>>>>> ------------PROPOSITION
>>>>>>>> ----------------over
>>>>>>>> ------------Noun-Phrase
>>>>>>>> ----------------NAME
>>>>>>>> --------------------Zurich
>>>>>>>>
>>>>>>>> Sentence
>>>>>>>> ----Noun-Phrase
>>>>>>>> --------NAME
>>>>>>>> ------------Fred
>>>>>>>> ----Verb
>>>>>>>> --------VERB-PAST
>>>>>>>> ------------saw
>>>>>>>> ----Noun-Phrase
>>>>>>>> --------Noun-Phrase
>>>>>>>> ------------ARTICLE
>>>>>>>> ----------------a
>>>>>>>> ------------NOUN
>>>>>>>> ----------------plane
>>>>>>>> ----Adverbial-Phrase
>>>>>>>> --------VERB-ING
>>>>>>>> ------------flying
>>>>>>>> --------Prepositional-Clause
>>>>>>>> ------------PROPOSITION
>>>>>>>> ----------------over
>>>>>>>> ------------Noun-Phrase
>>>>>>>> ----------------NAME
>>>>>>>> --------------------Zurich
>>>>>>>>
>>>>>>>> Formal grammars for natural languages do exist, although they're 
>>>>>>>> not perfect, but the problem with multiple grammatically sensible parses 
>>>>>>>> (often millions of trees and more) is much greater than the problem with 
>>>>>>>> nonsensible trees or correct sentences that don't parse at all.
>>>>>>>>
>>>>>>>> Lojban was carefully designed to avoid this problem. And it doesn't 
>>>>>>>> have anything to do with {xi PA}. The Lojban grammar specifies XI clauses 
>>>>>>>> unambiguously. Parse trees are unique. Monoparsing is not a myth. XI 
>>>>>>>> clauses may add semantic ambiguity on a different level then, say, simple 
>>>>>>>> {zo'e}, but it doesn't have anything to do with syntactic ambiguity.
>>>>>>>>
>>>>>>>
>>>>>>> It specifies to which head a clause should attach. And since it's 
>>>>>>> {mo'e zo'e} it's vague to which head it attaches. If the parser you use 
>>>>>>> doesn't allow for that the only thing that can be done is to provide 
>>>>>>> several possible trees.
>>>>>>>
>>>>>>
>>>>>> It's a feature of a language, not a parser. If English had a pronoun, 
>>>>>> say, 'lar', which would mean 'the subject or the object of the main 
>>>>>> sentence', you could say "Fred saw a plane as lar flew over Zurich", which 
>>>>>> would be ambiguous semantically, but not syntactically.
>>>>>>
>>>>>
>>>>> Even in current English theory there are a lot of zero morphemes. What 
>>>>> I'm proposing is just another zero morpheme.
>>>>>
>>>>  
>>>>
>>>>>
>>>>> This is what And agreed with me.
>>>>>
>>>>>
>>>>>>
>>>>>>>  
>>>>>>>>
>>>>>>> {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi vofli 
>>>>>>>> ga'u la tsurix} has only one syntax tree, regardless of the number of 
>>>>>>>> possible semantic interpretations.
>>>>>>>>
>>>>>>>
>>>>>>> If you applied {mo'e zo'e} to the English sentence you will still 
>>>>>>> get the only syntax tree.
>>>>>>>
>>>>>>
>>>>>> You can't "apply" {mo'e zo'e} to the English sentence, because it's 
>>>>>> not there. Likewise you don't "apply" {mo'e zo'e} to the Lojban sentence. 
>>>>>> You just parse it, because it's there.
>>>>>> In English you can have phrases like 'X of Y of Z' which could be 
>>>>>> parsed as '(X of Y) of Z' or 'X of (Y of Z)'. In Lojban it's not possible, 
>>>>>> but you can say ''either (X of Y) of Z or X of (Y of Z)", which is not 
>>>>>> syntactically ambiguous. You can't apply "either... or" to the English 
>>>>>> sentence, because you can't parse words which aren't there.
>>>>>>
>>>>>
>>>>> As I just said English parsers use this "add words that aren't there" 
>>>>>  all the time.
>>>>>
>>>>
>>>> I was searching, but I haven't found any English parser (but I know a 
>>>> Polish one). What parsers do you refer to?
>>>>
>>>
>>> Probably most. Since this concept (of adding words and morphemes of zero 
>>> length) is present in most modern theories:
>>> https://en.wikipedia.org/wiki/Zero_(linguistics) 
>>>
>>
>> This doesn't answer my question. Name at least one working English 
>> parser. I haven't found any.
>>
>
> Which requirements do you need? Take Stanford's parser.
> But if you want an English parser that would insert zero morpheme to reach 
> vague syntax I'm not aware of any although it's obvious (I hope so) that 
> it's possible to create one (although probably useless since no one 
> including me suggested any possible advantages apart from purely 
> theoretical ones).
>

It's far from obvious. Natural languages have countless types of syntactic 
ambiguity, and I'm not sure all of them could be overcome on the parser 
level.
In Lojban the natural grammar yields a monoparsing parser (no artificial 
zero words etc.). To make a polyparsing one, you'd have to do some weird 
stretches. In English it's the other way around.
 

>
> It of course results in the inability of a fair comparison of Lojban and 
> English parsers. But that's acceptable.
>
>>  
>>
>>>  
>>>>
>>>>>
>>>>>
>>>>>>
>>>>>>>
>>>>>>>> In English you can have sentences that are semantically ambiguous 
>>>>>>>> due to syntactic ambiguity. In Lojban you can have sentences with (roughly) 
>>>>>>>> the same semantic ambiguity as the English ones, but syntactically 
>>>>>>>> unambiguous.
>>>>>>>>  
>>>>>>>>
>>>>>>>>>
>>>>>>>>>> > {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi 
>>>>>>>>>> vofli ga'u
>>>>>>>>>> > la tsurix}
>>>>>>>>>>
>>>>>>>>>> camxes only produces one parse tree for that.
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>> And for English you don't provide any parses at all.
>>>>>>>>> May be someone should just parse the original English sentence as 
>>>>>>>>> camxes does for Lojban one?
>>>>>>>>> I won't be surprised if such parser for English doesn't exist 
>>>>>>>>> since those who write them might mix parsing and interpretation of it. The 
>>>>>>>>> latter would be replacing {mo'e zo'e} with some PA which will immediately 
>>>>>>>>> lead to several syntactic trees.
>>>>>>>>>
>>>>>>>>> So I both disagree and agree with you on whether English sentence 
>>>>>>>>> has several syntactic trees. If using one term for two operations is 
>>>>>>>>> stopped the contradiction disappears.
>>>>>>>>>
>>>>>>>>>  
>>>>>>>>>
>>>>>>>>>> If you think it should produce more then one, raise a bug report.
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>> I'm not aware of any Lojban parsers that perform interpretation 
>>>>>>>>> operation. In most cases you just need context and one interpretation. But 
>>>>>>>>> this is semantic analysis. Producing all possible syntactic trees is a task 
>>>>>>>>> needed more seldom.
>>>>>>>>>
>>>>>>>>
>>>>>>>> Camxes is intended to produce all possible syntactic trees, and 
>>>>>>>> there's only one of them for any valid sentence.
>>>>>>>>
>>>>>>>
>>>>>>> You may invent a Lojban parser that won't be able to parse {mo'e 
>>>>>>> zo'e}. Then you will need workarounds to output several trees.
>>>>>>>
>>>>>>
>>>>>> XI clauses have an ambiguous syntax, so I don't see how I'd need 
>>>>>> workarounfds and several trees. Of course, I could invent a Lojban parser 
>>>>>> that won't be able to parse anything, but what's the point? {mo'e zo'e} 
>>>>>> from the parser's view is just MOhE KOhA. If I can't parse it, then I have 
>>>>>> an incomplete parser.
>>>>>>
>>>>>
>>>>> And this is what I state for English: its current parsers are 
>>>>> incomplete and further improvements will make polyparsed sentences 
>>>>> monoparsed.
>>>>>  
>>>>>
>>>>>>
>>>>>> What you mean sounds rather like a semantic analyzer, which is 
>>>>>> extremely hard for any language, including Lojban.
>>>>>>
>>>>>> mu'o mi'e ianek
>>>>>>  
>>>>>>
>>>>>>>  
>>>>>>>
>>>>>>>>
>>>>>>>> mu'o mi'e ianek
>>>>>>>>
>>>>>>>> -- 
>>>>>>>> You received this message because you are subscribed to the Google 
>>>>>>>> Groups "lojban" group.
>>>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>>>> send an email to lojban+un...@googlegroups.com.
>>>>>>>> To post to this group, send email to loj...@googlegroups.com.
>>>>>>>> Visit this group at http://groups.google.com/group/lojban.
>>>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>>>
>>>>>>>
>>>>>>>  -- 
>>>>>> You received this message because you are subscribed to the Google 
>>>>>> Groups "lojban" group.
>>>>>> To unsubscribe from this group and stop receiving emails from it, 
>>>>>> send an email to lojban+un...@googlegroups.com.
>>>>>> To post to this group, send email to loj...@googlegroups.com.
>>>>>> Visit this group at http://groups.google.com/group/lojban.
>>>>>> For more options, visit https://groups.google.com/d/optout.
>>>>>>
>>>>>
>>>>>  -- 
>>>> You received this message because you are subscribed to the Google 
>>>> Groups "lojban" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send 
>>>> an email to lojban+un...@googlegroups.com.
>>>> To post to this group, send email to loj...@googlegroups.com.
>>>> Visit this group at http://groups.google.com/group/lojban.
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>>  -- 
>> You received this message because you are subscribed to the Google Groups 
>> "lojban" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to lojban+un...@googlegroups.com <javascript:>.
>> To post to this group, send email to loj...@googlegroups.com 
>> <javascript:>.
>> Visit this group at http://groups.google.com/group/lojban.
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>

-- 
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

------=_Part_900_1623806441.1423775043982
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><br>On Thursday, February 12, 2015 at 8:00:17 PM UTC+1=
, la gleki wrote:<blockquote class=3D"gmail_quote" style=3D"margin: 0;margi=
n-left: 0.8ex;border-left: 1px #ccc solid;padding-left: 1ex;"><div dir=3D"l=
tr"><div><br><div class=3D"gmail_quote">2015-02-12 21:42 GMT+03:00 ianek <s=
pan dir=3D"ltr">&lt;<a href=3D"javascript:" target=3D"_blank" gdf-obfuscate=
d-mailto=3D"t3HX4hpUfS0J" rel=3D"nofollow" onmousedown=3D"this.href=3D'java=
script:';return true;" onclick=3D"this.href=3D'javascript:';return true;">j=
an...@gmail.com</a>&gt;</span>:<br><blockquote class=3D"gmail_quote" style=
=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=
=3D"ltr"><br><br>On Thursday, February 12, 2015 at 7:11:12 AM UTC+1, la gle=
ki wrote:<div><div><blockquote class=3D"gmail_quote" style=3D"margin:0;marg=
in-left:0.8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"=
><br><div><br><div class=3D"gmail_quote">2015-02-12 1:20 GMT+03:00 ianek <s=
pan dir=3D"ltr">&lt;<a rel=3D"nofollow">jan...@gmail.com</a>&gt;</span>:<br=
><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border=
-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;=
padding-left:1ex"><div dir=3D"ltr"><br><br>On Wednesday, February 11, 2015 =
at 1:50:49 PM UTC+1, la gleki wrote:<div><div><blockquote class=3D"gmail_qu=
ote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-co=
lor:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D"=
ltr"><br><div><br><div class=3D"gmail_quote">2015-02-09 23:22 GMT+03:00 ian=
ek <span dir=3D"ltr">&lt;<a rel=3D"nofollow">jan...@gmail.com</a>&gt;</span=
>:<br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;b=
order-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:s=
olid;padding-left:1ex"><div dir=3D"ltr"><br><br>On Monday, February 9, 2015=
 at 11:54:41 AM UTC+1, la gleki wrote:<span><blockquote class=3D"gmail_quot=
e" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-colo=
r:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D"lt=
r"><br><div><br><div class=3D"gmail_quote">2015-02-08 4:34 GMT+03:00 ianek =
<span dir=3D"ltr">&lt;<a rel=3D"nofollow">jan...@gmail.com</a>&gt;</span>:<=
br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bord=
er-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:soli=
d;padding-left:1ex"><div dir=3D"ltr"><br><br>On Friday, February 6, 2015 at=
 8:13:30 AM UTC+1, la gleki wrote:<span><blockquote class=3D"gmail_quote" s=
tyle=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rg=
b(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><=
br><div><br><div class=3D"gmail_quote">2015-02-04 15:45 GMT+03:00 v4hn <spa=
n dir=3D"ltr">&lt;<a rel=3D"nofollow">m...@v4hn.de</a>&gt;</span>:<br><bloc=
kquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-=
width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;paddin=
g-left:1ex"><span>On Tue, Feb 03, 2015 at 11:42:32AM +0300, Gleki Arxokuna =
wrote:<br>
&gt; "Fred saw a plane flying over Zurich" can have several meanings<br>
<br>
</span>Yes.<br>
However, for me, the issue here is that we (hopefully..) agree<br>
that there are different parse trees (which yield the different meanings).<=
br></blockquote><div><br></div><div>No, several trees arise after you inter=
pret the sentence.</div></div></div></div></blockquote></span><div><br>But =
if you had an English parser, it would yield several trees without any inte=
rpreting.</div></div></blockquote><div><br></div><div>Sure! Because English=
 parsers lack the ability to find something common in all of the parse tree=
s.</div></div></div></div></blockquote></span><div><br>No. It's because wor=
ds in an English sentence can be parsed as different syntactic structures. =
That's what parsing means: determining structures formed by words. Not "fin=
ding something common".<br></div></div></blockquote><div><br></div><div>You=
 yourself just showed several parses of the same sentence.</div><div>This i=
s how usual English parsers are constructed.&nbsp;</div><div><br></div><div=
>However, there is another option to monoparse this English sentence.</div>=
<div><br></div><div>You mix English language and one current theory of how =
to parse it.</div><div><br></div><blockquote class=3D"gmail_quote" style=3D=
"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,2=
04,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div>&nb=
sp;</div><div><div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0p=
x 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border=
-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div><div class=3D"gma=
il_quote"><div>&nbsp;</div><blockquote class=3D"gmail_quote" style=3D"margi=
n:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204=
);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div> Like thi=
s:<br><br><div><span>"Fred saw a plane flying over Zurich"<br></span>NAME V=
ERB-PAST ARTICLE COUNTABLE-NOUN VERB-ING PREPOSITION NAME<br><br>Some (much=
 simplified) rules could be:<br><br>Sentence ::=3D Noun-Phrase Verb Noun-Ph=
rase<br>Sentence ::=3D Noun-Phrase Verb Noun-Phrase Adverbial-Phrase<br>Nou=
n-Phrase ::=3D NAME | ARTICLE COUNTABLE-NOUN | Noun-Phrase VERB-ING Preposi=
tional-Clause<br>Verb ::=3D VERB-PAST<br>Adverbial-Phrase ::=3D VERB-ING Pr=
eposition-Clause<br>Preposition-Clause ::=3D PREPOSITION Noun-Phrase<br><br=
>This simple grammar yields two parse trees for that sentence:<br><br>Sente=
nce<br>----Noun-Phrase<br>--------NAME<br>------------Fred<br>----Verb<br>-=
-------VERB-PAST<br>------------saw<br>----Noun-Phrase<br>--------Noun-Phra=
se<br>------------ARTICLE<br>----------------a<br>------------NOUN<br>-----=
-----------plane<br>--------VERB-ING<br>------------flying<br>--------Prepo=
sitional-Clause<br>------------PROPOSITION<br>----------------over<br>-----=
-------Noun-Phrase<br>----------------NAME<br>--------------------Zurich<br=
><br>Sentence<br>----Noun-Phrase<br>--------NAME<br>------------Fred<br>---=
-Verb<br>--------VERB-PAST<br>------------saw<br>----Noun-Phrase<br>-------=
-Noun-Phrase<br>------------ARTICLE<br>----------------a<br>------------NOU=
N<br>----------------plane<br>----Adverbial-Phrase<br>--------VERB-ING<br>-=
-----------flying<br>--------Prepositional-Clause<br>------------PROPOSITIO=
N<br>----------------over<br>------------Noun-Phrase<br>----------------NAM=
E<br>--------------------Zurich<br><br>Formal grammars for natural language=
s do exist, although they're not perfect, but the problem with multiple gra=
mmatically sensible parses (often millions of trees and more) is much great=
er than the problem with nonsensible trees or correct sentences that don't =
parse at all.<br><br>Lojban was carefully designed to avoid this problem. A=
nd it doesn't have anything to do with {xi PA}. The Lojban grammar specifie=
s XI clauses unambiguously. Parse trees are unique. Monoparsing is not a my=
th. XI clauses may add semantic ambiguity on a different level then, say, s=
imple {zo'e}, but it doesn't have anything to do with syntactic ambiguity.<=
/div></div></div></blockquote><div><br></div><div>It specifies to which hea=
d a clause should attach. And since it's {mo'e zo'e} it's vague to which he=
ad it attaches. If the parser you use doesn't allow for that the only thing=
 that can be done is to provide several possible trees.<br></div></div></di=
v></div></blockquote></div></div><div><br>It's a feature of a language, not=
 a parser. If English had a pronoun, say, 'lar', which would mean 'the subj=
ect or the object of the main sentence', you could say "Fred saw a plane as=
 lar flew over Zurich", which would be ambiguous semantically, but not synt=
actically.<br></div></div></blockquote><div><br></div><div>Even in current =
English theory there are a lot of zero morphemes. What I'm proposing is jus=
t another zero morpheme.</div></div></div></div></blockquote><div>&nbsp;</d=
iv><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bord=
er-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:soli=
d;padding-left:1ex"><div dir=3D"ltr"><div><div class=3D"gmail_quote"><div><=
br></div><div>This is what And agreed with me.</div><div><br></div><blockqu=
ote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-wid=
th:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-l=
eft:1ex"><div dir=3D"ltr"><div><br></div><span><blockquote class=3D"gmail_q=
uote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-c=
olor:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D=
"ltr"><div><div class=3D"gmail_quote"><div></div><div><br></div><blockquote=
 class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:=
1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left=
:1ex"><div dir=3D"ltr"><div><div>&nbsp;</div></div></div></blockquote><bloc=
kquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-=
width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;paddin=
g-left:1ex"><div dir=3D"ltr"><div><div>{la fred pu viska lo vinji do'e lo s=
e xi vei mo'e zo'e nei poi vofli ga'u la tsurix} has only one syntax tree, =
regardless of the number of possible semantic interpretations.<br></div></d=
iv></div></blockquote><div><br></div><div>If you applied {mo'e zo'e} to the=
 English sentence you will still get the only syntax tree.<br></div></div><=
/div></div></blockquote></span><div><br>You can't "apply" {mo'e zo'e} to th=
e English sentence, because it's not there. Likewise you don't "apply" {mo'=
e zo'e} to the Lojban sentence. You just parse it, because it's there.<br>I=
n English you can have phrases like 'X of Y of Z' which could be parsed as =
'(X of Y) of Z' or 'X of (Y of Z)'. In Lojban it's not possible, but you ca=
n say ''either (X of Y) of Z or X of (Y of Z)", which is not syntactically =
ambiguous. You can't apply "either... or" to the English sentence, because =
you can't parse words which aren't there.<br></div></div></blockquote><div>=
<br></div><div>As I just said English parsers use this "add words that aren=
't there" &nbsp;all the time.</div></div></div></div></blockquote></div></d=
iv><div><br>I was searching, but I haven't found any English parser (but I =
know a Polish one). What parsers do you refer to?<br></div></div></blockquo=
te><div><br></div><div>Probably most. Since this concept (of adding words a=
nd morphemes of zero length) is present in most modern theories:</div><div>=
<a href=3D"https://en.wikipedia.org/wiki/Zero_(linguistics)" rel=3D"nofollo=
w" target=3D"_blank" onmousedown=3D"this.href=3D'https://www.google.com/url=
?q\75https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FZero_(linguistics)\46sa\75D\46=
sntz\0751\46usg\75AFQjCNEZgg1Lx5IFI1jsZq0KMU8eq-Q9-w';return true;" onclick=
=3D"this.href=3D'https://www.google.com/url?q\75https%3A%2F%2Fen.wikipedia.=
org%2Fwiki%2FZero_(linguistics)\46sa\75D\46sntz\0751\46usg\75AFQjCNEZgg1Lx5=
IFI1jsZq0KMU8eq-Q9-w';return true;">https://en.wikipedia.org/wiki/<u></u><w=
br>Zero_(linguistics)</a>&nbsp;</div></div></div></div></blockquote></div><=
/div><div><br>This doesn't answer my question. Name at least one working En=
glish parser. I haven't found any.<br></div></div></blockquote><div><br></d=
iv><div>Which requirements do you need? Take Stanford's parser.</div><div>B=
ut if you want an English parser that would insert zero morpheme to reach v=
ague syntax I'm not aware of any although it's obvious (I hope so) that it'=
s possible to create one (although probably useless since no one including =
me suggested any possible advantages apart from purely theoretical ones).</=
div></div></div></div></blockquote><div><br>It's far from obvious. Natural =
languages have countless types of syntactic ambiguity, and I'm not sure all=
 of them could be overcome on the parser level.<br>In Lojban the natural gr=
ammar yields a monoparsing parser (no artificial zero words etc.). To make =
a polyparsing one, you'd have to do some weird stretches. In English it's t=
he other way around.<br>&nbsp;</div><blockquote class=3D"gmail_quote" style=
=3D"margin: 0;margin-left: 0.8ex;border-left: 1px #ccc solid;padding-left: =
1ex;"><div dir=3D"ltr"><div><div class=3D"gmail_quote"><div><br></div><div>=
It of course results in the inability of a fair comparison of Lojban and En=
glish parsers. But that's acceptable.</div><blockquote class=3D"gmail_quote=
" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><=
div dir=3D"ltr"><div>&nbsp;</div><div><div><blockquote class=3D"gmail_quote=
" style=3D"margin:0;margin-left:0.8ex;border-left:1px #ccc solid;padding-le=
ft:1ex"><div dir=3D"ltr"><div><div class=3D"gmail_quote"><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;bo=
rder-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">=
<div dir=3D"ltr"><div>&nbsp;</div><div><div><blockquote class=3D"gmail_quot=
e" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-colo=
r:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D"lt=
r"><div><div class=3D"gmail_quote"><div><br></div><blockquote class=3D"gmai=
l_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-lef=
t-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=
=3D"ltr"><div><br></div><span><blockquote class=3D"gmail_quote" style=3D"ma=
rgin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,=
204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div><div c=
lass=3D"gmail_quote"><div><br></div><blockquote class=3D"gmail_quote" style=
=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(20=
4,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div>=
<div><br>In English you can have sentences that are semantically ambiguous =
due to syntactic ambiguity. In Lojban you can have sentences with (roughly)=
 the same semantic ambiguity as the English ones, but syntactically unambig=
uous.<br>&nbsp;</div></div><span><blockquote class=3D"gmail_quote" style=3D=
"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,2=
04,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div><di=
v class=3D"gmail_quote"><blockquote class=3D"gmail_quote" style=3D"margin:0=
px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);b=
order-left-style:solid;padding-left:1ex">
<span><br>
&gt; {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi vofli g=
a'u<br>
&gt; la tsurix}<br>
<br>
</span>camxes only produces one parse tree for that.<br></blockquote><div><=
br></div><div>And for English you don't provide any parses at all.</div><di=
v>May be someone should just parse the original English sentence as camxes =
does for Lojban one?</div><div>I won't be surprised if such parser for Engl=
ish doesn't exist since those who write them might mix parsing and interpre=
tation of it. The latter would be replacing {mo'e zo'e} with some PA which =
will immediately lead to several syntactic trees.</div><div><br></div><div>=
So I both disagree and agree with you on whether English sentence has sever=
al syntactic trees. If using one term for two operations is stopped the con=
tradiction disappears.</div><div><br></div><div>&nbsp;</div><blockquote cla=
ss=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;=
border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex=
">
If you think it should produce more then one, raise a bug report.<br></bloc=
kquote><div><br></div><div>I'm not aware of any Lojban parsers that perform=
 interpretation operation. In most cases you just need context and one inte=
rpretation. But this is semantic analysis. Producing all possible syntactic=
 trees is a task needed more seldom.</div></div></div></div></blockquote></=
span><div><br>Camxes is intended to produce all possible syntactic trees, a=
nd there's only one of them for any valid sentence.<br></div></div></blockq=
uote><div><br></div><div>You may invent a Lojban parser that won't be able =
to parse {mo'e zo'e}. Then you will need workarounds to output several tree=
s.</div></div></div></div></blockquote></span><div><br>XI clauses have an a=
mbiguous syntax, so I don't see how I'd need workarounfds and several trees=
. Of course, I could invent a Lojban parser that won't be able to parse any=
thing, but what's the point? {mo'e zo'e} from the parser's view is just MOh=
E KOhA. If I can't parse it, then I have an incomplete parser.<br></div></d=
iv></blockquote><div><br></div><div>And this is what I state for English: i=
ts current parsers are incomplete and further improvements will make polypa=
rsed sentences monoparsed.</div><div>&nbsp;</div><blockquote class=3D"gmail=
_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left=
-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir=
=3D"ltr"><div><br>What you mean sounds rather like a semantic analyzer, whi=
ch is extremely hard for any language, including Lojban.<span><br><br>mu'o =
mi'e ianek<br>&nbsp;</span></div><blockquote class=3D"gmail_quote" style=3D=
"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,2=
04,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div><di=
v class=3D"gmail_quote"><div>&nbsp;</div><blockquote class=3D"gmail_quote" =
style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:r=
gb(204,204,204);border-left-style:solid;padding-left:1ex"><span><div dir=3D=
"ltr"><div><br>mu'o mi'e ianek<br></div></div></span><div><div><span>

<p></p>

-- <br>
You received this message because you are subscribed to the Google Groups "=
lojban" group.<br></span>
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a rel=3D"nofollow">lojban+un...@<u></u>googlegroups.com</a>.<br>
To post to this group, send email to <a rel=3D"nofollow">loj...@googlegroup=
s.com</a>.<span><br>
Visit this group at <a href=3D"http://groups.google.com/group/lojban" rel=
=3D"nofollow" target=3D"_blank" onmousedown=3D"this.href=3D'http://groups.g=
oogle.com/group/lojban';return true;" onclick=3D"this.href=3D'http://groups=
.google.com/group/lojban';return true;">http://groups.google.com/<u></u>gro=
up<u></u><u></u><wbr>/lojban</a>.<br>
For more options, visit <a href=3D"https://groups.google.com/d/optout" rel=
=3D"nofollow" target=3D"_blank" onmousedown=3D"this.href=3D'https://groups.=
google.com/d/optout';return true;" onclick=3D"this.href=3D'https://groups.g=
oogle.com/d/optout';return true;">https://groups.google.com/d/<u></u>op<u><=
/u><u></u><wbr>tout</a>.<br>
</span></div></div></blockquote></div><br></div></div>
</blockquote></div><div><div>

<p></p>

-- <br>
You received this message because you are subscribed to the Google Groups "=
lojban" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a rel=3D"nofollow">lojban+un...@<u></u>googlegroups.com</a>.<br>
To post to this group, send email to <a rel=3D"nofollow">loj...@googlegroup=
s.com</a>.<br>
Visit this group at <a href=3D"http://groups.google.com/group/lojban" rel=
=3D"nofollow" target=3D"_blank" onmousedown=3D"this.href=3D'http://groups.g=
oogle.com/group/lojban';return true;" onclick=3D"this.href=3D'http://groups=
.google.com/group/lojban';return true;">http://groups.google.com/<u></u>gro=
up<u></u><wbr>/lojban</a>.<br>
For more options, visit <a href=3D"https://groups.google.com/d/optout" rel=
=3D"nofollow" target=3D"_blank" onmousedown=3D"this.href=3D'https://groups.=
google.com/d/optout';return true;" onclick=3D"this.href=3D'https://groups.g=
oogle.com/d/optout';return true;">https://groups.google.com/d/<u></u>op<u><=
/u><wbr>tout</a>.<br>
</div></div></blockquote></div><br></div></div>
</blockquote></div></div></div><div><div>

<p></p>

-- <br>
You received this message because you are subscribed to the Google Groups "=
lojban" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a rel=3D"nofollow">lojban+un...@<u></u>googlegroups.com</a>.<br>
To post to this group, send email to <a rel=3D"nofollow">loj...@googlegroup=
s.com</a>.<br>
Visit this group at <a href=3D"http://groups.google.com/group/lojban" rel=
=3D"nofollow" target=3D"_blank" onmousedown=3D"this.href=3D'http://groups.g=
oogle.com/group/lojban';return true;" onclick=3D"this.href=3D'http://groups=
.google.com/group/lojban';return true;">http://groups.google.com/<u></u>gro=
up<wbr>/lojban</a>.<br>
For more options, visit <a href=3D"https://groups.google.com/d/optout" rel=
=3D"nofollow" target=3D"_blank" onmousedown=3D"this.href=3D'https://groups.=
google.com/d/optout';return true;" onclick=3D"this.href=3D'https://groups.g=
oogle.com/d/optout';return true;">https://groups.google.com/d/<u></u>op<wbr=
>tout</a>.<br>
</div></div></blockquote></div><br></div></div>
</blockquote></div></div></div><div><div>

<p></p>

-- <br>
You received this message because you are subscribed to the Google Groups "=
lojban" group.<br>
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"javascript:" target=3D"_blank" gdf-obfuscated-mailto=3D"=
t3HX4hpUfS0J" rel=3D"nofollow" onmousedown=3D"this.href=3D'javascript:';ret=
urn true;" onclick=3D"this.href=3D'javascript:';return true;">lojban+un...@=
<wbr>googlegroups.com</a>.<br>
To post to this group, send email to <a href=3D"javascript:" target=3D"_bla=
nk" gdf-obfuscated-mailto=3D"t3HX4hpUfS0J" rel=3D"nofollow" onmousedown=3D"=
this.href=3D'javascript:';return true;" onclick=3D"this.href=3D'javascript:=
';return true;">loj...@googlegroups.com</a>.<br>
Visit this group at <a href=3D"http://groups.google.com/group/lojban" targe=
t=3D"_blank" rel=3D"nofollow" onmousedown=3D"this.href=3D'http://groups.goo=
gle.com/group/lojban';return true;" onclick=3D"this.href=3D'http://groups.g=
oogle.com/group/lojban';return true;">http://groups.google.com/<wbr>group/l=
ojban</a>.<br>
For more options, visit <a href=3D"https://groups.google.com/d/optout" targ=
et=3D"_blank" rel=3D"nofollow" onmousedown=3D"this.href=3D'https://groups.g=
oogle.com/d/optout';return true;" onclick=3D"this.href=3D'https://groups.go=
ogle.com/d/optout';return true;">https://groups.google.com/d/<wbr>optout</a=
>.<br>
</div></div></blockquote></div><br></div></div>
</blockquote></div>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups &=
quot;lojban&quot; group.<br />
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:lojban+unsubscribe@googlegroups.com">lojban+unsub=
scribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href=3D"mailto:lojban@googlegroups.=
com">lojban@googlegroups.com</a>.<br />
Visit this group at <a href=3D"http://groups.google.com/group/lojban">http:=
//groups.google.com/group/lojban</a>.<br />
For more options, visit <a href=3D"https://groups.google.com/d/optout">http=
s://groups.google.com/d/optout</a>.<br />

------=_Part_900_1623806441.1423775043982--
------=_Part_899_315185249.1423775043982--