From: Ilmen
Date: Thu, 12 Feb 2015 17:24:47 +0100
To: lojban@googlegroups.com
Subject: Re: [lojban] the myth of monoparsing

On 12/02/2015 16:10, Gleki Arxokuna wrote:

> 2015-02-12 18:00 GMT+03:00 v4hn <me@v4hn.de>:
>> Everything you proposed in this thread to make lojban "polyparse"
>> is about semantic analysis, not about parsing.
>
> Adding polyparsing to Lojban is possible by removing some parts of its grammar and it has nothing to do with semantic analysis.

As for the "Fred saw a plane flying over Zurich" sentence, it's certainly possible to get a monoparse if we allow more than two branches per node, as in the following rule:

sentence ← noun-clause finite-verb noun-clause verb-ing preposition noun-clause
(Here the node "sentence" has six branches.)
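
As a rough illustration of why the flat rule yields exactly one parse, here is a minimal Python sketch. The lexicon, and the pre-chunking of "a plane" into a single noun-clause, are assumptions made for the example; the rule itself does not specify them.

```python
# Hypothetical lexicon: these chunk-to-category assignments are assumptions
# made for illustration only.
LEXICON = {
    "Fred": "noun-clause",
    "saw": "finite-verb",
    "a plane": "noun-clause",
    "flying": "verb-ing",
    "over": "preposition",
    "Zurich": "noun-clause",
}

# The single flat rule: one "sentence" node with six branches.
FLAT_RULE = [
    "noun-clause", "finite-verb", "noun-clause",
    "verb-ing", "preposition", "noun-clause",
]


def flat_parses(chunks):
    """Return every tree the flat rule licenses for the given chunks."""
    categories = [LEXICON[chunk] for chunk in chunks]
    if categories == FLAT_RULE:
        # All six chunks hang directly under "sentence"; no grouping choice
        # is ever made, so there is nothing to be ambiguous about.
        return [("sentence", list(chunks))]
    return []


chunks = ["Fred", "saw", "a plane", "flying", "over", "Zurich"]
trees = flat_parses(chunks)
print(len(trees), trees)
```

The tree is unique only because it says nothing about what "flying" attaches to.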

But I guess that a true (and useful) parse analysis should not allow more than two branches per node, so that for each word we know which other word it attaches to. Since the -ing verb form can represent either an adjectival present participle (which attaches to the noun on its left) or an adverbial present participle (which attaches to the verb, I think), it's not possible to produce a single *binary* parse tree.

Here's a sketchy binary+unary branching grammar:

sentence ← noun-phrase verb-clause
verb-clause ← verb-clause-2 adverbial-phrase
verb-clause-2 ← verb-phrase noun-phrase
verb-phrase ← verb-past
adverbial-phrase ← verb-ing preposition-clause
noun-phrase ← name | article noun | noun-phrase adjectival-present-participle-clause
adjectival-present-participle-clause ← adjectival-present-participle-clause preposition-clause | verb-ing
preposition-clause ← preposition noun-phrase
...

If I made no mistake, the above grammar can parse the Zurich sentence as
• (Fred [{saw <a plane>} {flying <over Zurich>}])  --  adverbial present participle interpretation
or as
• (Fred [saw {<a plane> <flying (over Zurich)>}])  --  adjectival present participle interpretation
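
The two readings can be checked mechanically. Below is a minimal Python sketch that enumerates every parse of the Zurich sentence under the sketched grammar, reading each arrow as a plain context-free production (an assumption; under PEG-style ordered choice the result would differ). The word categories are a hypothetical lexicon, `adj-clause` abbreviates adjectival-present-participle-clause, and the bare `verb-clause-2` alternative under `verb-clause` is an added assumption, needed so that a sentence with no adverbial phrase can be derived.

```python
from functools import lru_cache

# Assumed lexicon for the example sentence (not given in the grammar).
WORDS = ["Fred", "saw", "a", "plane", "flying", "over", "Zurich"]
TAGS = ["name", "verb-past", "article", "noun", "verb-ing", "preposition", "name"]

# Each right-hand side has one or two symbols (unary or binary branching).
# ASSUMPTION: ("verb-clause-2",) under "verb-clause" is not in the grammar
# as written; it is added so the adjectival reading is derivable.
GRAMMAR = {
    "sentence": [("noun-phrase", "verb-clause")],
    "verb-clause": [("verb-clause-2", "adverbial-phrase"), ("verb-clause-2",)],
    "verb-clause-2": [("verb-phrase", "noun-phrase")],
    "verb-phrase": [("verb-past",)],
    "adverbial-phrase": [("verb-ing", "preposition-clause")],
    "noun-phrase": [("name",), ("article", "noun"), ("noun-phrase", "adj-clause")],
    "adj-clause": [("adj-clause", "preposition-clause"), ("verb-ing",)],
    "preposition-clause": [("preposition", "noun-phrase")],
}


@lru_cache(maxsize=None)
def parses(symbol, start, end):
    """Return all trees deriving WORDS[start:end] from symbol."""
    trees = []
    # A word category covers exactly one word.
    if end - start == 1 and TAGS[start] == symbol:
        trees.append(WORDS[start])
    for rhs in GRAMMAR.get(symbol, []):
        if len(rhs) == 1:
            # Unary rule: same span, different label.
            trees += [(symbol, t) for t in parses(rhs[0], start, end)]
        else:
            # Binary rule: try every split point inside the span.
            for mid in range(start + 1, end):
                for left in parses(rhs[0], start, mid):
                    for right in parses(rhs[1], mid, end):
                        trees.append((symbol, left, right))
    return tuple(trees)


def render(tree):
    """Bracket a tree, collapsing unary nodes so only groupings remain."""
    if isinstance(tree, str):
        return tree
    if len(tree) == 2:
        return render(tree[1])
    return "(" + render(tree[1]) + " " + render(tree[2]) + ")"


results = [render(t) for t in parses("sentence", 0, len(WORDS))]
for bracketing in results:
    print(bracketing)
```

Under these assumptions the enumeration finds two trees, one grouping "flying over Zurich" with "saw a plane" and one grouping it with "a plane".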

So I guess that the relevant concept here is "binary parsing", that is, grouping the words so that each node has at most two child nodes.
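
To make the "at most two child nodes" condition concrete, here is a small Python sketch that measures the widest node of a tree. Representing trees as nested tuples of words is an assumption chosen for brevity.

```python
def max_branching(tree):
    """Return the largest number of children under any node of the tree."""
    if isinstance(tree, str):  # a bare word has no children
        return 0
    return max([len(tree)] + [max_branching(child) for child in tree])


def is_binary(tree):
    """True if no node has more than two children."""
    return max_branching(tree) <= 2


# The flat six-branch analysis versus the two binary groupings.
flat = ("Fred", "saw", "a plane", "flying", "over", "Zurich")
adverbial = ("Fred", (("saw", ("a", "plane")), ("flying", ("over", "Zurich"))))
adjectival = ("Fred", ("saw", (("a", "plane"), ("flying", ("over", "Zurich")))))

print(max_branching(flat), is_binary(adverbial), is_binary(adjectival))
```

The flat tree fails the check, while both binary groupings pass it, which is why the sentence has one flat parse but two binary ones.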

Just my two cents.

mi'e la .ilmen. mu'o

