Received: from mail-la0-f55.google.com ([209.85.215.55]:37281) by stodi.digitalkingdom.org with esmtps (TLSv1.2:AES128-GCM-SHA256:128) (Exim 4.80.1) (envelope-from ) id 1YPrcR-0002bH-7r for lojban-list-archive@lojban.org; Mon, 23 Feb 2015 03:56:17 -0800 Received: by labgq15 with SMTP id gq15sf3697791lab.4 for ; Mon, 23 Feb 2015 03:56:04 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:sender:list-subscribe:list-unsubscribe; bh=ZBdE/ghIMKoTqE7Cj1rPrGOZ1K3Eo2Rte85a/+gW73E=; b=yHD8HTkMR07yJlWJlnnAiJEulE2C3TLFUBf3IvMFgLafNYoUGuYgpSwf9BiG2TyfMp J+JpisqLP0xJE9/HhO9EW2K6YKnQZ9jSVP7f+R76Jvlap5POgvDcv7F374oUIF8Vgn29 r9ECHwZpUy89jt7SOt6W9SjsvRZQHAGSeNlyLfb9f+mHj8Rgw+P5Z34aBN23HZbSsNhT j+4NAcjQGZfpKdozc8veG8MFNmA++o3rZXwX7Gvs7PiDscpqP2jOUmMyWP9SIYwANvY+ 6+WUUhP2AvGXekYhX/R1MQsPOu5m33x36Rbhm0fMTo9ZgC2+MSasnoH10XHBzhFbcnqy JhGA== X-Received: by 10.180.106.135 with SMTP id gu7mr48103wib.7.1424692564215; Mon, 23 Feb 2015 03:56:04 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.180.102.232 with SMTP id fr8ls538844wib.19.gmail; Mon, 23 Feb 2015 03:56:03 -0800 (PST) X-Received: by 10.181.29.66 with SMTP id ju2mr1212048wid.1.1424692563604; Mon, 23 Feb 2015 03:56:03 -0800 (PST) Received: from mail-wg0-x22c.google.com (mail-wg0-x22c.google.com. [2a00:1450:400c:c00::22c]) by gmr-mx.google.com with ESMTPS id ev8si657173wib.3.2015.02.23.03.56.03 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 23 Feb 2015 03:56:03 -0800 (PST) Received-SPF: pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::22c as permitted sender) client-ip=2a00:1450:400c:c00::22c; Received: by mail-wg0-x22c.google.com with SMTP id k14so25215248wgh.3 for ; Mon, 23 Feb 2015 03:56:03 -0800 (PST) X-Received: by 10.180.206.98 with SMTP id ln2mr19799417wic.94.1424692563404; Mon, 23 Feb 2015 03:56:03 -0800 (PST) MIME-Version: 1.0 Received: by 10.194.86.200 with HTTP; Mon, 23 Feb 2015 03:55:42 -0800 (PST) In-Reply-To: References: <20150204124517.GA1243@kuebelreiter.informatik.Uni-Osnabrueck.DE> <50d5006f-f02b-4a28-9894-6608729585fc@googlegroups.com> <8b2f2e8a-a89f-4544-9ee7-d5189bd4a07b@googlegroups.com> From: Gleki Arxokuna Date: Mon, 23 Feb 2015 14:55:42 +0300 Message-ID: Subject: Re: [lojban] the myth of monoparsing To: "lojban@googlegroups.com" Content-Type: multipart/alternative; boundary=001a11c381ce287dc8050fc01660 X-Original-Sender: gleki.is.my.name@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::22c as permitted sender) smtp.mail=gleki.is.my.name@gmail.com; dkim=pass header.i=@gmail.com; dmarc=pass (p=NONE dis=NONE) header.from=gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -1.7 (-) X-Spam_score: -1.7 X-Spam_score_int: -16 X-Spam_bar: - --001a11c381ce287dc8050fc01660 Content-Type: text/plain; charset=UTF-8 2015-02-21 2:32 GMT+03:00 ianek : > > > On Friday, February 20, 2015 at 2:24:22 PM UTC+1, la gleki wrote: > >> >> >> 2015-02-13 0:04 GMT+03:00 ianek : >> >>> >>> >>> On Thursday, February 12, 2015 at 8:00:17 PM UTC+1, la gleki wrote: >>> >>>> >>>> 2015-02-12 21:42 GMT+03:00 ianek : >>>> >>>>> >>>>> >>>>> On Thursday, February 12, 2015 at 7:11:12 AM UTC+1, la gleki wrote: >>>>> >>>>>> >>>>>> >>>>>> 2015-02-12 1:20 GMT+03:00 ianek : >>>>>> >>>>>>> >>>>>>> >>>>>>> On Wednesday, February 11, 2015 at 1:50:49 PM UTC+1, la gleki wrote: >>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> 2015-02-09 23:22 GMT+03:00 ianek : >>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Monday, February 9, 2015 at 11:54:41 AM UTC+1, la gleki wrote: >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> 2015-02-08 4:34 GMT+03:00 ianek : >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On Friday, February 6, 2015 at 8:13:30 AM UTC+1, la gleki wrote: >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> 2015-02-04 15:45 GMT+03:00 v4hn : >>>>>>>>>>>> >>>>>>>>>>>>> On Tue, Feb 03, 2015 at 11:42:32AM +0300, Gleki Arxokuna wrote: >>>>>>>>>>>>> > "Fred saw a plane flying over Zurich" can have several >>>>>>>>>>>>> meanings >>>>>>>>>>>>> >>>>>>>>>>>>> Yes. >>>>>>>>>>>>> However, for me, the issue here is that we (hopefully..) agree >>>>>>>>>>>>> that there are different parse trees (which yield the >>>>>>>>>>>>> different meanings). >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> No, several trees arise after you interpret the sentence. >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> But if you had an English parser, it would yield several trees >>>>>>>>>>> without any interpreting. >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> Sure! Because English parsers lack the ability to find something >>>>>>>>>> common in all of the parse trees. >>>>>>>>>> >>>>>>>>> >>>>>>>>> No. It's because words in an English sentence can be parsed as >>>>>>>>> different syntactic structures. That's what parsing means: determining >>>>>>>>> structures formed by words. Not "finding something common". >>>>>>>>> >>>>>>>> >>>>>>>> You yourself just showed several parses of the same sentence. >>>>>>>> This is how usual English parsers are constructed. >>>>>>>> >>>>>>>> However, there is another option to monoparse this English sentence. >>>>>>>> >>>>>>>> You mix English language and one current theory of how to parse it. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> Like this: >>>>>>>>>>> >>>>>>>>>>> "Fred saw a plane flying over Zurich" >>>>>>>>>>> NAME VERB-PAST ARTICLE COUNTABLE-NOUN VERB-ING PREPOSITION NAME >>>>>>>>>>> >>>>>>>>>>> Some (much simplified) rules could be: >>>>>>>>>>> >>>>>>>>>>> Sentence ::= Noun-Phrase Verb Noun-Phrase >>>>>>>>>>> Sentence ::= Noun-Phrase Verb Noun-Phrase Adverbial-Phrase >>>>>>>>>>> Noun-Phrase ::= NAME | ARTICLE COUNTABLE-NOUN | Noun-Phrase >>>>>>>>>>> VERB-ING Prepositional-Clause >>>>>>>>>>> Verb ::= VERB-PAST >>>>>>>>>>> Adverbial-Phrase ::= VERB-ING Preposition-Clause >>>>>>>>>>> Preposition-Clause ::= PREPOSITION Noun-Phrase >>>>>>>>>>> >>>>>>>>>>> This simple grammar yields two parse trees for that sentence: >>>>>>>>>>> >>>>>>>>>>> Sentence >>>>>>>>>>> ----Noun-Phrase >>>>>>>>>>> --------NAME >>>>>>>>>>> ------------Fred >>>>>>>>>>> ----Verb >>>>>>>>>>> --------VERB-PAST >>>>>>>>>>> ------------saw >>>>>>>>>>> ----Noun-Phrase >>>>>>>>>>> --------Noun-Phrase >>>>>>>>>>> ------------ARTICLE >>>>>>>>>>> ----------------a >>>>>>>>>>> ------------NOUN >>>>>>>>>>> ----------------plane >>>>>>>>>>> --------VERB-ING >>>>>>>>>>> ------------flying >>>>>>>>>>> --------Prepositional-Clause >>>>>>>>>>> ------------PROPOSITION >>>>>>>>>>> ----------------over >>>>>>>>>>> ------------Noun-Phrase >>>>>>>>>>> ----------------NAME >>>>>>>>>>> --------------------Zurich >>>>>>>>>>> >>>>>>>>>>> Sentence >>>>>>>>>>> ----Noun-Phrase >>>>>>>>>>> --------NAME >>>>>>>>>>> ------------Fred >>>>>>>>>>> ----Verb >>>>>>>>>>> --------VERB-PAST >>>>>>>>>>> ------------saw >>>>>>>>>>> ----Noun-Phrase >>>>>>>>>>> --------Noun-Phrase >>>>>>>>>>> ------------ARTICLE >>>>>>>>>>> ----------------a >>>>>>>>>>> ------------NOUN >>>>>>>>>>> ----------------plane >>>>>>>>>>> ----Adverbial-Phrase >>>>>>>>>>> --------VERB-ING >>>>>>>>>>> ------------flying >>>>>>>>>>> --------Prepositional-Clause >>>>>>>>>>> ------------PROPOSITION >>>>>>>>>>> ----------------over >>>>>>>>>>> ------------Noun-Phrase >>>>>>>>>>> ----------------NAME >>>>>>>>>>> --------------------Zurich >>>>>>>>>>> >>>>>>>>>>> Formal grammars for natural languages do exist, although they're >>>>>>>>>>> not perfect, but the problem with multiple grammatically sensible parses >>>>>>>>>>> (often millions of trees and more) is much greater than the problem with >>>>>>>>>>> nonsensible trees or correct sentences that don't parse at all. >>>>>>>>>>> >>>>>>>>>>> Lojban was carefully designed to avoid this problem. And it >>>>>>>>>>> doesn't have anything to do with {xi PA}. The Lojban grammar specifies XI >>>>>>>>>>> clauses unambiguously. Parse trees are unique. Monoparsing is not a myth. >>>>>>>>>>> XI clauses may add semantic ambiguity on a different level then, say, >>>>>>>>>>> simple {zo'e}, but it doesn't have anything to do with syntactic ambiguity. >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> It specifies to which head a clause should attach. And since it's >>>>>>>>>> {mo'e zo'e} it's vague to which head it attaches. If the parser you use >>>>>>>>>> doesn't allow for that the only thing that can be done is to provide >>>>>>>>>> several possible trees. >>>>>>>>>> >>>>>>>>> >>>>>>>>> It's a feature of a language, not a parser. If English had a >>>>>>>>> pronoun, say, 'lar', which would mean 'the subject or the object of the >>>>>>>>> main sentence', you could say "Fred saw a plane as lar flew over Zurich", >>>>>>>>> which would be ambiguous semantically, but not syntactically. >>>>>>>>> >>>>>>>> >>>>>>>> Even in current English theory there are a lot of zero morphemes. >>>>>>>> What I'm proposing is just another zero morpheme. >>>>>>>> >>>>>>> >>>>>>> >>>>>>>> >>>>>>>> This is what And agreed with me. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi >>>>>>>>>>> vofli ga'u la tsurix} has only one syntax tree, regardless of the number of >>>>>>>>>>> possible semantic interpretations. >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> If you applied {mo'e zo'e} to the English sentence you will still >>>>>>>>>> get the only syntax tree. >>>>>>>>>> >>>>>>>>> >>>>>>>>> You can't "apply" {mo'e zo'e} to the English sentence, because >>>>>>>>> it's not there. Likewise you don't "apply" {mo'e zo'e} to the Lojban >>>>>>>>> sentence. You just parse it, because it's there. >>>>>>>>> In English you can have phrases like 'X of Y of Z' which could be >>>>>>>>> parsed as '(X of Y) of Z' or 'X of (Y of Z)'. In Lojban it's not possible, >>>>>>>>> but you can say ''either (X of Y) of Z or X of (Y of Z)", which is not >>>>>>>>> syntactically ambiguous. You can't apply "either... or" to the English >>>>>>>>> sentence, because you can't parse words which aren't there. >>>>>>>>> >>>>>>>> >>>>>>>> As I just said English parsers use this "add words that aren't >>>>>>>> there" all the time. >>>>>>>> >>>>>>> >>>>>>> I was searching, but I haven't found any English parser (but I know >>>>>>> a Polish one). What parsers do you refer to? >>>>>>> >>>>>> >>>>>> Probably most. Since this concept (of adding words and morphemes of >>>>>> zero length) is present in most modern theories: >>>>>> https://en.wikipedia.org/wiki/Zero_(linguistics) >>>>>> >>>>> >>>>> This doesn't answer my question. Name at least one working English >>>>> parser. I haven't found any. >>>>> >>>> >>>> Which requirements do you need? Take Stanford's parser. >>>> But if you want an English parser that would insert zero morpheme to >>>> reach vague syntax I'm not aware of any although it's obvious (I hope so) >>>> that it's possible to create one (although probably useless since no one >>>> including me suggested any possible advantages apart from purely >>>> theoretical ones). >>>> >>> >>> It's far from obvious. Natural languages have countless types of >>> syntactic ambiguity, and I'm not sure all of them could be overcome on the >>> parser level. >>> >> >> That's why I asked for examples. For those that by far been presented I >> used an alternative understanding. >> > > But what if there could be infinitely many examples? It's possible, as > there's no upper limit to the length of a sentence. You can't have infinite > grammar (technically you could, but it's pointless). > Yes, technically it's unfalsifialbe since you can't be sure that example No. 1256 won't fail to prove or disprove monoparsing in English. However, note that 1. I don't consider syntactic ambiguity and vague syntax (vague '=' operator) as methods complementing each other. In fact they are rival methods. I consider both of them fine. But this is what can make the claim that Lojban is the only monoparsing language false. It of course means that parsers of Lojban are monoparsing. 2. The grammar of English although is large probably not infinite. > > >> >> Of course, I respect hundreds of years grammarians put into studying >> English. >> >> But what I respect more is the English language itself that has thousands >> of years of development. >> >> >> >>> In Lojban the natural grammar yields a monoparsing parser (no artificial >>> zero words etc.). To make a polyparsing one, you'd have to do some weird >>> stretches. In English it's the other way around. >>> >> >> When one says "I like Lojban for monoparsing" that person doesn't love >> Lojban but loves the way it is presented. >> > > Monoparsing is natural in Lojban. You'd have to be really stubborn to > polyparse Lojban. > No, it won't be stubborn and monoparsing isn't natural to Lojban. It's natural to textbooks, learning resources and parsers of this language. Presenting a language as monoparsing creates an illusion that this language is inherently different from others. However, this is not the time to fight Chomsky's ideas. OTOH, polyparsing is much more natural for English than monoparsing (it > it's even possible, as you claim). > No, again it's only how grammarians describe this language. > When you say in Lojban {xi mo'e zo'e}, you conciously express ambiguity, > using unambiguous syntax. When you say an English sentence with >1000 > natural parses, you're not concious of them and you're usually not willing > to express ambiguity. > All of that is really obvious. > So you reduce all your reasoning to zero morphemes? This is the same argument pycyn put earlier. Different languages have different sets of zeros. However, as shown earlier by Ilmen even without {xi vei mo'e zo'e} a simple {broda i brode} creates a wide range of meanings. Note that {zo'e} here are implicit. It's also highly doubtful what is said consciously and what is not. Are zeros the only parts of the language expressed unconsciously according to you? > >> Language is one thing and its description is another one. >> >> At the moment I can't see polyparsing in English except in polysemy but >> that is said not to be relevant here. >> > > I guess that your votum is separatum. > I don't care how many people for and against here. E.g And Rosta seems to have a different understanding what grammar is and how it is connected to parsing although Parsing Expression Grammar hints at what it is for. > >> >> >>> >>> >>>> >>>> It of course results in the inability of a fair comparison of Lojban >>>> and English parsers. But that's acceptable. >>>> >>>>> >>>>> >>>>>> >>>>>>> >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>>> In English you can have sentences that are semantically >>>>>>>>>>> ambiguous due to syntactic ambiguity. In Lojban you can have sentences with >>>>>>>>>>> (roughly) the same semantic ambiguity as the English ones, but >>>>>>>>>>> syntactically unambiguous. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> > {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei >>>>>>>>>>>>> poi vofli ga'u >>>>>>>>>>>>> > la tsurix} >>>>>>>>>>>>> >>>>>>>>>>>>> camxes only produces one parse tree for that. >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> And for English you don't provide any parses at all. >>>>>>>>>>>> May be someone should just parse the original English sentence >>>>>>>>>>>> as camxes does for Lojban one? >>>>>>>>>>>> I won't be surprised if such parser for English doesn't exist >>>>>>>>>>>> since those who write them might mix parsing and interpretation of it. The >>>>>>>>>>>> latter would be replacing {mo'e zo'e} with some PA which will immediately >>>>>>>>>>>> lead to several syntactic trees. >>>>>>>>>>>> >>>>>>>>>>>> So I both disagree and agree with you on whether English >>>>>>>>>>>> sentence has several syntactic trees. If using one term for two operations >>>>>>>>>>>> is stopped the contradiction disappears. >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>>> If you think it should produce more then one, raise a bug >>>>>>>>>>>>> report. >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>>> I'm not aware of any Lojban parsers that perform interpretation >>>>>>>>>>>> operation. In most cases you just need context and one interpretation. But >>>>>>>>>>>> this is semantic analysis. Producing all possible syntactic trees is a task >>>>>>>>>>>> needed more seldom. >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Camxes is intended to produce all possible syntactic trees, and >>>>>>>>>>> there's only one of them for any valid sentence. >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> You may invent a Lojban parser that won't be able to parse {mo'e >>>>>>>>>> zo'e}. Then you will need workarounds to output several trees. >>>>>>>>>> >>>>>>>>> >>>>>>>>> XI clauses have an ambiguous syntax, so I don't see how I'd need >>>>>>>>> workarounfds and several trees. Of course, I could invent a Lojban parser >>>>>>>>> that won't be able to parse anything, but what's the point? {mo'e zo'e} >>>>>>>>> from the parser's view is just MOhE KOhA. If I can't parse it, then I have >>>>>>>>> an incomplete parser. >>>>>>>>> >>>>>>>> >>>>>>>> And this is what I state for English: its current parsers are >>>>>>>> incomplete and further improvements will make polyparsed sentences >>>>>>>> monoparsed. >>>>>>>> >>>>>>>> >>>>>>>>> >>>>>>>>> What you mean sounds rather like a semantic analyzer, which is >>>>>>>>> extremely hard for any language, including Lojban. >>>>>>>>> >>>>>>>>> mu'o mi'e ianek >>>>>>>>> >>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> mu'o mi'e ianek >>>>>>>>>>> >>>>>>>>>>> -- >>>>>>>>>>> You received this message because you are subscribed to the >>>>>>>>>>> Google Groups "lojban" group. >>>>>>>>>>> To unsubscribe from this group and stop receiving emails from >>>>>>>>>>> it, send an email to lojban+un...@googlegroups.com. >>>>>>>>>>> To post to this group, send email to loj...@googlegroups.com. >>>>>>>>>>> Visit this group at http://groups.google.com/group/lojban. >>>>>>>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>>>>>>> >>>>>>>>>> >>>>>>>>>> -- >>>>>>>>> You received this message because you are subscribed to the Google >>>>>>>>> Groups "lojban" group. >>>>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>>>> send an email to lojban+un...@googlegroups.com. >>>>>>>>> To post to this group, send email to loj...@googlegroups.com. >>>>>>>>> Visit this group at http://groups.google.com/group/lojban. >>>>>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>> You received this message because you are subscribed to the Google >>>>>>> Groups "lojban" group. >>>>>>> To unsubscribe from this group and stop receiving emails from it, >>>>>>> send an email to lojban+un...@googlegroups.com. >>>>>>> To post to this group, send email to loj...@googlegroups.com. >>>>>>> Visit this group at http://groups.google.com/group/lojban. >>>>>>> For more options, visit https://groups.google.com/d/optout. >>>>>>> >>>>>> >>>>>> -- >>>>> You received this message because you are subscribed to the Google >>>>> Groups "lojban" group. >>>>> To unsubscribe from this group and stop receiving emails from it, send >>>>> an email to lojban+un...@googlegroups.com. >>>>> To post to this group, send email to loj...@googlegroups.com. >>>>> Visit this group at http://groups.google.com/group/lojban. >>>>> For more options, visit https://groups.google.com/d/optout. >>>>> >>>> >>>> -- >>> You received this message because you are subscribed to the Google >>> Groups "lojban" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to lojban+un...@googlegroups.com. >>> To post to this group, send email to loj...@googlegroups.com. >>> Visit this group at http://groups.google.com/group/lojban. >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- > You received this message because you are subscribed to the Google Groups > "lojban" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to lojban+unsubscribe@googlegroups.com. > To post to this group, send email to lojban@googlegroups.com. > Visit this group at http://groups.google.com/group/lojban. > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. --001a11c381ce287dc8050fc01660 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


2015-02-21 2:32 GMT+03:00 ianek <janek37@gmail.com>:
=


On Friday, February= 20, 2015 at 2:24:22 PM UTC+1, la gleki wrote:


2015-02-13 0:04 GMT+03:00 ianek <jan...@gmail.com>:


On Thursday, February 12, 2015 at 8:00:17 = PM UTC+1, la gleki wrote:

2015-02-12 21:42 GMT+= 03:00 ianek <jan...@gmail.com&= gt;:


On T= hursday, February 12, 2015 at 7:11:12 AM UTC+1, la gleki wrote:


2015-02-12 1:20 GMT+03:00 ianek <<= a rel=3D"nofollow">jan...@gmail.com>:


On Wednesday, February 11, 2015 at 1:50:49 PM UTC+1, = la gleki wrote:


2015-02-09 23:22 GMT+03:00 ianek = <jan...@gmail.com>:


On Monday, February 9, 2015 at 11:54:41 AM UTC+1= , la gleki wrote:


2015-02-08 4:34 GMT+03:00 ianek <= ;jan...@gmail.com>:
=


On Friday, February 6, 2015 at 8:13:30 AM UTC+1, l= a gleki wrote:


2015-02-04 15:45 GMT+03:00 v4hn <m...@v4hn.de>:
On= Tue, Feb 03, 2015 at 11:42:32AM +0300, Gleki Arxokuna wrote:
> "Fred saw a plane flying over Zurich" can have several meani= ngs

Yes.
However, for me, the issue here is that we (hopefully..) agree
that there are different parse trees (which yield the different meanings).<= br>

No, several trees arise after you inter= pret the sentence.

But = if you had an English parser, it would yield several trees without any inte= rpreting.

Sure! Because English= parsers lack the ability to find something common in all of the parse tree= s.

No. It's because= words in an English sentence can be parsed as different syntactic structur= es. That's what parsing means: determining structures formed by words. = Not "finding something common".
=
You yourself just showed several parses of the same sentence= .
This is how usual English parsers are constructed.=C2=A0
<= div>
However, there is another option to monoparse this Engli= sh sentence.

You mix English language and one curr= ent theory of how to parse it.

=C2=A0
=C2=A0
Like this:

"Fred saw a plane flying over Z= urich"
NAME VERB-PAST ARTICLE COUNTABLE-NOUN VERB-ING PREPOS= ITION NAME

Some (much simplified) rules could be:

Sentence ::= =3D Noun-Phrase Verb Noun-Phrase
Sentence ::=3D Noun-Phrase Verb Noun-Ph= rase Adverbial-Phrase
Noun-Phrase ::=3D NAME | ARTICLE COUNTABLE-NOUN | = Noun-Phrase VERB-ING Prepositional-Clause
Verb ::=3D VERB-PAST
Adverb= ial-Phrase ::=3D VERB-ING Preposition-Clause
Preposition-Clause ::=3D PR= EPOSITION Noun-Phrase

This simple grammar yields two parse trees for= that sentence:

Sentence
----Noun-Phrase
--------NAME
-----= -------Fred
----Verb
--------VERB-PAST
------------saw
----Noun= -Phrase
--------Noun-Phrase
------------ARTICLE
----------------a<= br>------------NOUN
----------------plane
--------VERB-ING
-------= -----flying
--------Prepositional-Clause
------------PROPOSITION
-= ---------------over
------------Noun-Phrase
----------------NAME
-= -------------------Zurich

Sentence
----Noun-Phrase
--------NAM= E
------------Fred
----Verb
--------VERB-PAST
------------saw----Noun-Phrase
--------Noun-Phrase
------------ARTICLE
--------= --------a
------------NOUN
----------------plane
----Adverbial-Phr= ase
--------VERB-ING
------------flying
--------Prepositional-Clau= se
------------PROPOSITION
----------------over
------------Noun-P= hrase
----------------NAME
--------------------Zurich

Formal g= rammars for natural languages do exist, although they're not perfect, b= ut the problem with multiple grammatically sensible parses (often millions = of trees and more) is much greater than the problem with nonsensible trees = or correct sentences that don't parse at all.

Lojban was careful= ly designed to avoid this problem. And it doesn't have anything to do w= ith {xi PA}. The Lojban grammar specifies XI clauses unambiguously. Parse t= rees are unique. Monoparsing is not a myth. XI clauses may add semantic amb= iguity on a different level then, say, simple {zo'e}, but it doesn'= t have anything to do with syntactic ambiguity.

It specifies to which head a clause should attach. A= nd since it's {mo'e zo'e} it's vague to which head it attac= hes. If the parser you use doesn't allow for that the only thing that c= an be done is to provide several possible trees.

It's a feature of a language, not a = parser. If English had a pronoun, say, 'lar', which would mean '= ;the subject or the object of the main sentence', you could say "F= red saw a plane as lar flew over Zurich", which would be ambiguous sem= antically, but not syntactically.

Even in current English theory there are a lot of zero morphemes. Wh= at I'm proposing is just another zero morpheme.
=
=C2=A0

This is what And agreed with me.


<= blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-l= eft-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;pa= dding-left:1ex">

=C2=A0
{la fred = pu viska lo vinji do'e lo se xi vei mo'e zo'e nei poi vofli ga&= #39;u la tsurix} has only one syntax tree, regardless of the number of poss= ible semantic interpretations.

<= /div>
If you applied {mo'e zo'e} to the English sentence you wi= ll still get the only syntax tree.
=

You can't "apply" {mo'e zo'e} to the = English sentence, because it's not there. Likewise you don't "= apply" {mo'e zo'e} to the Lojban sentence. You just parse it, = because it's there.
In English you can have phrases like 'X of Y= of Z' which could be parsed as '(X of Y) of Z' or 'X of (Y= of Z)'. In Lojban it's not possible, but you can say ''eit= her (X of Y) of Z or X of (Y of Z)", which is not syntactically ambigu= ous. You can't apply "either... or" to the English sentence, = because you can't parse words which aren't there.

As I just said English parsers use this &quo= t;add words that aren't there" =C2=A0all the time.

I was searching, but I haven'= t found any English parser (but I know a Polish one). What parsers do you r= efer to?

Probably most. Sin= ce this concept (of adding words and morphemes of zero length) is present i= n most modern theories:

This doesn't answer my questi= on. Name at least one working English parser. I haven't found any.
<= /div>

Which requirements do you need?= Take Stanford's parser.
But if you want an English parser th= at would insert zero morpheme to reach vague syntax I'm not aware of an= y although it's obvious (I hope so) that it's possible to create on= e (although probably useless since no one including me suggested any possib= le advantages apart from purely theoretical ones).
<= /blockquote>

It's far from obvious. Natural languag= es have countless types of syntactic ambiguity, and I'm not sure all of= them could be overcome on the parser level.

That's why I asked for examples. For those that by fa= r been presented I used an alternative understanding.

But what if there could be infinitely m= any examples? It's possible, as there's no upper limit to the lengt= h of a sentence. You can't have infinite grammar (technically you could= , but it's pointless).

= Yes, technically it's unfalsifialbe since you can't be sure that ex= ample No. 1256 won't fail to prove or disprove monoparsing in English.<= /div>

However, note that
1. I don't consid= er syntactic ambiguity and vague syntax (vague '=3D' operator) as m= ethods complementing each other. In fact they are rival methods. I consider= both of them fine. But this is what can make the claim that Lojban is the = only monoparsing language false. It of course means that parsers of Lojban = are monoparsing.
2. The grammar of English although is large prob= ably not infinite.
=C2=A0

Of course,= I respect hundreds of years grammarians put into studying English.

But what I respect more is the English language itself th= at has thousands of years of development.

=C2=A0
In Lojban the natu= ral grammar yields a monoparsing parser (no artificial zero words etc.). To= make a polyparsing one, you'd have to do some weird stretches. In Engl= ish it's the other way around.

When one says "I like Lojban for monoparsing" that person= doesn't love Lojban but loves the way it is presented.

Monoparsing is natural in Lojban. You&= #39;d have to be really stubborn to polyparse Lojban.

No, it won't be stubborn and monoparsing isn'= ;t natural to Lojban.
It's natural to textbooks, learning res= ources and parsers of this language.

Presenting a = language as monoparsing creates an illusion that this language is inherentl= y different from others.

However, this is not the = time to fight Chomsky's ideas.

OTOH, polyparsing is much more natural fo= r English than monoparsing (it it's even possible, as you claim).
=

No, again it's only how grammari= ans describe this language.
=C2=A0
When you say in Lojban {xi mo'e zo'e},= you conciously express ambiguity, using unambiguous syntax. When you say a= n English sentence with >1000 natural parses, you're not concious of= them and you're usually not willing to express ambiguity.
All of th= at is really obvious.

So yo= u reduce all your reasoning to zero morphemes? This is the same argument py= cyn put earlier. Different languages have different sets of zeros.

However, as shown earlier by Ilmen even without {xi vei mo= 'e zo'e} a simple {broda i brode} creates a wide range of meanings.= Note that {zo'e} here are implicit.

It's = also highly doubtful what is said consciously and what is not. Are zeros th= e only parts of the language expressed unconsciously according to you?



Language is one thin= g and its description is another one.

At the momen= t I can't see polyparsing in English except in polysemy but that is sai= d not to be relevant here.
=
I guess that your votum is separatum.
=
I don't care how many people for and against here.
=
E.g And Rosta seems to have a different understanding what grammar is = and how it is connected to parsing although Parsing Expression Grammar hint= s at what it is for.

=C2=A0
= =C2=A0
=C2=A0
<= div class=3D"gmail_quote">

It of course results in the i= nability of a fair comparison of Lojban and English parsers. But that's= acceptable.
=C2= =A0
=C2= =A0


<= span>


In English you c= an have sentences that are semantically ambiguous due to syntactic ambiguit= y. In Lojban you can have sentences with (roughly) the same semantic ambigu= ity as the English ones, but syntactically unambiguous.
=C2=A0
<= blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-l= eft-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;pa= dding-left:1ex">
> {la fred pu viska lo vinji do'e lo se xi vei mo'e zo'e nei= poi vofli ga'u
> la tsurix}

camxes only produces one parse tree for that.
<= br>
And for English you don't provide any parses at all.
May be someone should just parse the original English sentence as cam= xes does for Lojban one?
I won't be surprised if such parser = for English doesn't exist since those who write them might mix parsing = and interpretation of it. The latter would be replacing {mo'e zo'e}= with some PA which will immediately lead to several syntactic trees.
=

So I both disagree and agree with you on whether Englis= h sentence has several syntactic trees. If using one term for two operation= s is stopped the contradiction disappears.

=C2=A0<= /div>
If you think it should produce more then one, raise a bug report.

I'm not aware of any Lojban parsers that per= form interpretation operation. In most cases you just need context and one = interpretation. But this is semantic analysis. Producing all possible synta= ctic trees is a task needed more seldom.

Camxes is intended to produce all possible syntactic tree= s, and there's only one of them for any valid sentence.
=

You may invent a Lojban parser that won= 9;t be able to parse {mo'e zo'e}. Then you will need workarounds to= output several trees.

= XI clauses have an ambiguous syntax, so I don't see how I'd need wo= rkarounfds and several trees. Of course, I could invent a Lojban parser tha= t won't be able to parse anything, but what's the point? {mo'e = zo'e} from the parser's view is just MOhE KOhA. If I can't pars= e it, then I have an incomplete parser.
And this is what I state for English: its current parsers are = incomplete and further improvements will make polyparsed sentences monopars= ed.
=C2=A0

What y= ou mean sounds rather like a semantic analyzer, which is extremely hard for= any language, including Lojban.

mu'o mi'e ianek
= =C2=A0
=C2=A0
mu'o mi'e ianek

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+un...@googlegroups.com.
To post to this group, send email to loj...@googlegroup= s.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+un...@googlegroups.com.
To post to this group, send email to loj...@googlegroup= s.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+un...@googlegroups.com.
To post to this group, send email to loj...@googlegroup= s.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+un...@googlegroups.com.
To post to this group, send email to loj...@googlegroup= s.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+un...@googlegroups.com.
To post to this group, send email to loj...@googlegroup= s.com.
Visit this group at http://groups.google.com/group/lojba= n.
For more options, visit https://groups.google.com/d/optout.


--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
--001a11c381ce287dc8050fc01660--