Received: from mail-la0-f62.google.com ([209.85.215.62]:34627) by stodi.digitalkingdom.org with esmtps (TLSv1.2:AES128-GCM-SHA256:128) (Exim 4.80.1) (envelope-from ) id 1Yf4XU-0003CQ-EA; Mon, 06 Apr 2015 03:46:01 -0700 Received: by lamq1 with SMTP id q1sf5068999lam.1; Mon, 06 Apr 2015 03:45:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=mime-version:from:date:message-id:subject:to:content-type :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe; bh=CnoWCWVyhEtsi5Oez7qz5b/5YYgTBlXJ85ZHybFB9tg=; b=TBlf8mCSS0k7TnNNTG9P4JtaWyzmedTqgvQbAYFJyC2hl+Qd4wyYT/3QYXcOBDtHdG gVAxFNilpkhgTOVaxb2bkPRh+XKYNm2uuGnE8DRGJYdsQKbUL60+PhtwWcrkeuq3ZVs/ REpnUt5taG2wJlDR5iqlEU4RZYqKRcVoC6XC2oRsCtz+pjzgzLLmObnug/TKffikJd0K /0oMy/c7DHBzIjt+pTv8iVMo5W/xE2Iz7dDhJOEPSm7ZjgJybBSsbmi2wIazNYbPPRFn olMTMWmk2W/cdglgVlRs3h6JRQ8F1+RS59LUpqT3klZ3lCmIK6A8AT55w+/yT7qYosf9 BtUg== X-Received: by 10.152.36.34 with SMTP id n2mr133483laj.30.1428317149275; Mon, 06 Apr 2015 03:45:49 -0700 (PDT) X-BeenThere: bpfk-list@googlegroups.com Received: by 10.152.36.163 with SMTP id r3ls548854laj.70.gmail; Mon, 06 Apr 2015 03:45:48 -0700 (PDT) X-Received: by 10.152.5.168 with SMTP id t8mr2967201lat.3.1428317148706; Mon, 06 Apr 2015 03:45:48 -0700 (PDT) Received: from mail-wg0-x22b.google.com (mail-wg0-x22b.google.com. [2a00:1450:400c:c00::22b]) by gmr-mx.google.com with ESMTPS id p4si235125wiz.0.2015.04.06.03.45.48 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Mon, 06 Apr 2015 03:45:48 -0700 (PDT) Received-SPF: pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::22b as permitted sender) client-ip=2a00:1450:400c:c00::22b; Received: by wgin8 with SMTP id n8so23710532wgi.0 for ; Mon, 06 Apr 2015 03:45:48 -0700 (PDT) X-Received: by 10.194.93.165 with SMTP id cv5mr30450801wjb.24.1428317148570; Mon, 06 Apr 2015 03:45:48 -0700 (PDT) MIME-Version: 1.0 Received: by 10.194.240.197 with HTTP; Mon, 6 Apr 2015 03:45:27 -0700 (PDT) From: Gleki Arxokuna Date: Mon, 6 Apr 2015 13:45:27 +0300 Message-ID: Subject: [bpfk] Proposal: sumti must always be tagged with "tag" even if it's elidible. To: bpfk-list@googlegroups.com Content-Type: multipart/alternative; boundary=047d7bb708c6446ab605130c00b3 X-Original-Sender: gleki.is.my.name@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::22b as permitted sender) smtp.mail=gleki.is.my.name@gmail.com; dkim=pass header.i=@gmail.com; dmarc=pass (p=NONE dis=NONE) header.from=gmail.com Reply-To: bpfk-list@googlegroups.com Precedence: list Mailing-list: list bpfk-list@googlegroups.com; contact bpfk-list+owners@googlegroups.com List-ID: X-Google-Group-Id: 972099695765 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -1.7 (-) X-Spam_score: -1.7 X-Spam_score_int: -16 X-Spam_bar: - --047d7bb708c6446ab605130c00b3 Content-Type: text/plain; charset=UTF-8 Here is what the experimental grammar had (the official grammar is very similar): abs_term_1 = expr:(abs_tag_term / sumti / termset) {return _node("abs_term_1", expr);} abs_tag_term = expr:((!gek tag free* !selbri !gek_sentence (sumti / joik_ek sumti_3 / KU_elidible free*) / !gek (sumti free*) CO_clause tag free* / NA_clause free* KU_clause free* / !selbri !gek_sentence !ek !joik_jek !gihek NA_clause free* KU_elidible free* / SOI_clause free* subsentence SEhU_elidible free*) (joik_jek abs_tag_term)*) {return _node("abs_tag_term", expr);} Clearly, you can see that "sumti" occurs two times and what is unfortunate is that it appears on another level of the tree if a bare non-tagged sumti is used (the first, "abs_term_1 = " line above). More precisely, when FA is not used sumti is the direct child of abs_term_1 node whereas with FA it becomes a direct child of abs_tag_term which is a direct child of abs_term_1. So {i fa mi prami} and {i mi prami} produce syntactic trees with a different hierarchy! I think that no matter whether sumti is tagged with FA or not it should appear on the same level. Existing tricks with PEG make the notion of sumti obscure. Every sumti should take a certain position. Not only we need a unambiguous grammar, we need a cleancut hierarchy. So this is what I propose: abs_term_1 = expr:(abs_tag_term / termset) {return _node("abs_term_1", expr);} abs_tag_term = expr:((!gek tag free* !selbri !gek_sentence (sumti / joik_ek sumti_3 / KU_elidible free*) / !gek (tag / FA_elidible) free* !selbri !gek_sentence (sumti) / !gek (sumti free*) CO_clause tag free* / NA_clause free* KU_clause free* / !selbri !gek_sentence !ek !joik_jek !gihek NA_clause free* KU_elidible free* / SOI_clause free* subsentence FA_elidible = expr:(FA_clause?) {return (expr == "") ? ["FA"] : _node("FA", expr);} As you can see, "sumti" disappeared. I added "FA_elidible" which can fire only if a sumti is present (but not KU_elidible otherwise the grammar will crash). Probably, " joik_ek sumti_3 " should also be copied to this FA_elidible part, right? I'm not an expert in all these things. -- You received this message because you are subscribed to the Google Groups "BPFK" group. To unsubscribe from this group and stop receiving emails from it, send an email to bpfk-list+unsubscribe@googlegroups.com. To post to this group, send email to bpfk-list@googlegroups.com. Visit this group at http://groups.google.com/group/bpfk-list. For more options, visit https://groups.google.com/d/optout. --047d7bb708c6446ab605130c00b3 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Here is what the experimental grammar had (the o= fficial grammar is very similar):

abs_term_1 =3D e= xpr:(abs_tag_term / sumti / termset) {return _node("abs_term_1", = expr);}

abs_tag_term =3D expr:((!gek tag free* !se= lbri !gek_sentence (sumti / joik_ek sumti_3 / KU_elidible free*) / !gek (su= mti free*) CO_clause tag free* / NA_clause free* KU_clause free* / !selbri = !gek_sentence !ek !joik_jek !gihek NA_clause free* KU_elidible free* / SOI_= clause free* subsentence SEhU_elidible free*) (joik_jek abs_tag_term)*) {re= turn _node("abs_tag_term", expr);}


Clearly, you can see that "sumti" occurs two times and w= hat is unfortunate is that it appears on another level of the tree if a bar= e non-tagged sumti is used (the first, "abs_term_1 =C2=A0=3D " li= ne above). More precisely, when FA is not used sumti is the direct child of= =C2=A0abs_term_1 node whereas with FA it becomes a direct child of=C2=A0abs= _tag_term which is a direct child of=C2=A0abs_term_1.

<= div>So {i fa mi prami} and {i mi prami} produce syntactic trees with a diff= erent hierarchy!

I think that no matter whether su= mti is tagged with FA or not it should appear on the same level.
<= div>Existing tricks with PEG make the notion of sumti obscure.
Ev= ery sumti should take a certain position.
Not only we need a unam= biguous grammar, we need a cleancut hierarchy.

So = this is what I propose:

abs_term_1 =3D expr:(abs_t= ag_term / termset) {return _node("abs_term_1", expr);}
<= div>
abs_tag_term =3D expr:((!gek tag free* !selbri !gek_sent= ence (sumti / joik_ek sumti_3 / KU_elidible free*) / !gek (tag / FA_elidibl= e) free* !selbri !gek_sentence (sumti) =C2=A0/ !gek (sumti free*) CO_clause= tag free* / NA_clause free* KU_clause free* / !selbri !gek_sentence !ek !j= oik_jek !gihek NA_clause free* KU_elidible free* / SOI_clause free* subsent= ence=C2=A0

FA_elidible =3D expr:(FA_clause?) {= return (expr =3D=3D "") ? ["FA"] : _node("FA"= , expr);}


As you can see, "sumti"= ; disappeared. I added "FA_elidible" which can fire only if a sum= ti is present (but not KU_elidible otherwise the grammar will crash).
=

Probably, "=C2=A0joik_ek sumti_3=C2=A0" shoul= d also be copied to this FA_elidible part, right? I'm not an expert in = all these things.

--
You received this message because you are subscribed to the Google Groups &= quot;BPFK" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to bpfk-list= +unsubscribe@googlegroups.com.
To post to this group, send email to bpfk-list@googlegroups.com.
Visit this group at ht= tp://groups.google.com/group/bpfk-list.
For more options, visit http= s://groups.google.com/d/optout.
--047d7bb708c6446ab605130c00b3--