Received-SPF: pass (google.com: domain of jjllambias@gmail.com designates 2a00:1450:400c:c05::22e as permitted sender) client-ip=2a00:1450:400c:c05::22e;
MIME-Version: 1.0
In-Reply-To: <CAO7bV+gPzU7qCy6BsKqM_HXgeqqSBSJYtDARMENd8mfh3D4tJQ@mail.gmail.com>
References: <CAO7bV+je-d2vPHm-7AzaQgmgb2K91_Ye2V4ODPEv4oMGmUZAXQ@mail.gmail.com>
	<CAO7tK2eVfXoFUhqM9jEEGk51_GYOgv4xDu8utyccTV=-pWFM0w@mail.gmail.com>
	<CAO7bV+jTJ3m=sUuSvdc1cz9bvKcEiaxQXSVa-iLQ9vsKBxDPQw@mail.gmail.com>
	<CAO7tK2dJ49WvJ9vJQqonfOVx2eXyJCoi-xS6B5RXEru-bhDkaw@mail.gmail.com>
	<CAO7bV+gwQSz-eb38rgsXkGwAU+FLq+deyogWvkxxW5BV4eVshw@mail.gmail.com>
	<CAO7bV+gPzU7qCy6BsKqM_HXgeqqSBSJYtDARMENd8mfh3D4tJQ@mail.gmail.com>
Date: Fri, 27 Mar 2015 19:21:17 -0300
Message-ID: <CAO7tK2eJ8KM4BmNgm4fBDWz7t9SjsHdfzhtO4D_nwKieBiyyzA@mail.gmail.com>
Subject: Re: [bpfk] Improvements to fragments in ilmentufa parser
From: =?UTF-8?Q?Jorge_Llamb=C3=ADas?= <jjllambias@gmail.com>
To: bpfk-list@googlegroups.com
Content-Type: multipart/alternative; boundary=047d7bb0410215b19205124c8d54
Reply-To: bpfk-list@googlegroups.com
Precedence: list
Mailing-list: list bpfk-list@googlegroups.com; contact bpfk-list+owners@googlegroups.com
Sender: bpfk-list@googlegroups.com
X-Spam_score: -1.7
X-Spam_score_int: -16
X-Spam_bar: -

--047d7bb0410215b19205124c8d54
Content-Type: text/plain; charset=UTF-8

On Fri, Mar 27, 2015 at 10:34 AM, Gleki Arxokuna <gleki.is.my.name@gmail.com
> wrote:
>
> 2015-03-27 11:00 GMT+03:00 Gleki Arxokuna <gleki.is.my.name@gmail.com>:
>


> What methods do you use or want to make the development process happen
>> faster?
>> A web tool that would allow to insert a complete PEG file, compile it and
>> test it online?
>>
>
I used this one when debugging the morphology recently:
http://pegjs.org/online

A presentation of the grammar without the javascript would make it much
more readable for me.

Also, a grammar without SA is much more readable than one with SA. I find
the SA-rules extremely annoying.

 bridi-tail-3 <- selbri? tail-terms / gek-sentence
>>>
>>
>> Hard for me to determine what is the cause but this breaks {mi zo'u mi
>> mo}.
>>
>
> Probably because it thinks that prenex is a selbri.
>

Adding !ZOhU-clause at the end of tail-terms might fix that:

 tail-terms <- terms? VAU-clause? free* !ZOhU-clause

What are the minimal requirements to restore a bridi if not from terms or
> from bridi_tail ? Probably it can be restored from isolated {i} or {ni'o}
> but since this already works, then other types of restoration should be
> discussed separately since they don't touch anything here.
>

Other than fragments, I think everything else in a text is either a
sentence, a sentence connective, or the initial indicators, free modifiers,
and the strange initial bare cmevla. I think only fragments require
"restoration".


> 2. We could also add this if "fragment" is removed from the grammar:
>
> tanru_unit_1 = tanru_unit_2 linkargs? / linkargs? tanru_unit_2 */
> GOhA_elidible linkargs*
>
> This makes {i be mi} parse as (i [CU {COhE <be mi BEhO>} VAU])
>

If you're going to do that, why put it in tanru-unit-1 and not in
tanru-unit-2?

If you allow  (i [CU {COhE <be mi BEhO>} VAU]), why not (i [CU {na'e
<COhE>} VAU]), or (i [CU {jai <COhE>} VAU]) for example?

However, selpa'i's examples don't work here.
>
> Should {noi mo} a). be restored into {noi mo cu co'e}
>

I missed the part where "noi mo cu co'e" became grammatical.


> implying {fa xi xo'e zo'e noi mo cu co'e} or b). should it instead be
> considered a continuation of the previous clause said by another speaker
> like with selpa'i's example with {be ma}?
>

I would have said to "zo'e noi mo cu co'e"

Both solutions seem reasonable. Maybe take option b). and treat a discourse
> split between several people as one sentence with special FUhE .. FUhO
> markers?
>
> mi viska lo pendo FUhE [B asks] be ma [FUhO]
> mi viska lo pendo FUhE [B asks] noi mo [FUhO]
>
> A: - I see a friend.
> B: - Of whom?
>
> A: - I see a friend.
> B: - Who does what?
>
> This would reformulate fragments as parts of discourse so that we can
> remove them from the grammar. Of course, this would require somehow
> preparing existing texts by marking them with those FUhE ... FUhO so that
> we can parse them.
>

It depends on how you define "text". Is a dialogue one text, or a
succession of texts? The usual take is that it's a succession of texts,
since otherwise a lot of lojban dialogues that seem to parse would not
parse. For example the irc logs

I also allowed relative clauses in sumti without their heads. If fragments
> are removed from the grammar then similar things can be useful:
>
> sumti_4 = expr:(sumti_5 / *relative_clauses / *gek sumti gik sumti_4)
> {return _node("sumti_4", expr);}
>

This could be dangerous, as it makes "ta prenu poi do sisku" grammatical,
but not with the expected meaning. Also things lika {da poi prenu ku'o noi
melbi".


> This results in {fa noi pendo mi cu melbi} (in fact it may even make {be}
> useless except when used stylistically).
>

I think it's safer to require a "lo" for bare relative clauses: "lo noi
pendo cu melbi" (this was also discussed as a good alternative to "poi'i"
in many cases).


> At some point there was talk of making the selbri of a sumti-tail elidable
>>> as well, so that "lo ku" would be a valid sumti.
>>>
>>
> I almost never use {ku} in this sense (LE-terminator). Besides, some
> people think that {cu} should mark the beginning of a bridi tail. In this
> case I don't understand how to treat {lo cu broda}. Should it be {lo COhE
> KU cu broda} or {lo cu broda KU CU COhE} ?
>

Since a bridi-tail is not part of a sumti-tail, it can only be the first.

mu'o mi'e xorxes

-- 
You received this message because you are subscribed to the Google Groups "BPFK" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bpfk-list+unsubscribe@googlegroups.com.
To post to this group, send email to bpfk-list@googlegroups.com.
Visit this group at http://groups.google.com/group/bpfk-list.
For more options, visit https://groups.google.com/d/optout.

--047d7bb0410215b19205124c8d54
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_extra"><br><div class=3D"gmail_quote">=
On Fri, Mar 27, 2015 at 10:34 AM, Gleki Arxokuna <span dir=3D"ltr">&lt;<a h=
ref=3D"mailto:gleki.is.my.name@gmail.com" target=3D"_blank">gleki.is.my.nam=
e@gmail.com</a>&gt;</span> wrote:<blockquote class=3D"gmail_quote" style=3D=
"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,2=
04,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div cla=
ss=3D"gmail_extra"><div class=3D"gmail_quote"><span class=3D"">2015-03-27 1=
1:00 GMT+03:00 Gleki Arxokuna <span dir=3D"ltr">&lt;<a href=3D"mailto:gleki=
.is.my.name@gmail.com" target=3D"_blank">gleki.is.my.name@gmail.com</a>&gt;=
</span>:</span></div></div></div></blockquote><div>=C2=A0</div><blockquote =
class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1=
px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:=
1ex"><div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote"=
><span class=3D""><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px=
 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-=
left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_ext=
ra"><div class=3D"gmail_quote"><div>What methods do you use or want to make=
 the development process happen faster?</div><div>A web tool that would all=
ow to insert a complete PEG file, compile it and test it online?</div></div=
></div></div></blockquote></span></div></div></div></blockquote><div><br></=
div><div>I used this one when debugging the morphology recently: <a href=3D=
"http://pegjs.org/online">http://pegjs.org/online</a></div><div><br></div><=
div>A presentation of the grammar without the javascript would make it much=
 more readable for me.</div><div><br></div><div>Also, a grammar without SA =
is much more readable than one with SA. I find the SA-rules extremely annoy=
ing.</div><div><br></div><blockquote class=3D"gmail_quote" style=3D"margin:=
0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);=
border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gm=
ail_extra"><div class=3D"gmail_quote"><span class=3D""><blockquote class=3D=
"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;borde=
r-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><di=
v dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote"><span><=
blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-l=
eft-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;pa=
dding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"g=
mail_quote"><div>=C2=A0<span style=3D"color:rgb(0,0,0);white-space:pre-wrap=
">bridi-tail-3 &lt;- selbri? tail-terms / gek-sentence</span></div></div></=
div></div></blockquote><div><br></div></span><div>Hard for me to determine =
what is the cause but this breaks {mi zo&#39;u mi mo}.</div></div></div></d=
iv></blockquote><div><br></div></span><div>Probably because it thinks that =
prenex is a selbri.=C2=A0</div></div></div></div></blockquote><div><br></di=
v><div>Adding !ZOhU-clause at the end of tail-terms might fix that:</div><d=
iv><br></div><div>=C2=A0<span style=3D"color:rgb(0,0,0);white-space:pre-wra=
p">tail-terms &lt;- terms? VAU-clause? free* !ZOhU-clause</span></div><div>=
<span style=3D"color:rgb(0,0,0);white-space:pre-wrap"><br></span></div><blo=
ckquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left=
-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;paddi=
ng-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmai=
l_quote"><div>What are the minimal requirements to restore a bridi if not f=
rom terms or from bridi_tail ? Probably it can be restored from isolated {i=
} or {ni&#39;o} but since this already works, then other types of restorati=
on should be discussed separately since they don&#39;t touch anything here.=
</div></div></div></div></blockquote><div><br></div><div>Other than fragmen=
ts, I think everything else in a text is either a sentence, a sentence conn=
ective, or the initial indicators, free modifiers, and the strange initial =
bare cmevla. I think only fragments require &quot;restoration&quot;.</div><=
div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0=
px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-le=
ft-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra=
"><div class=3D"gmail_quote"><div>2.=C2=A0We could also add this if &quot;f=
ragment&quot; is removed from the grammar:</div><div><br></div><div><div>ta=
nru_unit_1 =3D tanru_unit_2 linkargs? / linkargs? tanru_unit_2 <b>/ GOhA_el=
idible linkargs</b></div></div><div><br></div><div>This makes {i be mi} par=
se as=C2=A0(i [CU {COhE &lt;be mi BEhO&gt;} VAU])=C2=A0</div></div></div></=
div></blockquote><div><br></div><div>If you&#39;re going to do that, why pu=
t it in tanru-unit-1 and not in tanru-unit-2?=C2=A0</div><div><br></div><di=
v>If you allow =C2=A0(i [CU {COhE &lt;be mi BEhO&gt;} VAU]), why not (i [CU=
 {na&#39;e &lt;COhE&gt;} VAU]), or (i [CU {jai &lt;COhE&gt;} VAU]) for exam=
ple?</div><div><br></div><blockquote class=3D"gmail_quote" style=3D"margin:=
0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);=
border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gm=
ail_extra"><div class=3D"gmail_quote"><div>However, selpa&#39;i&#39;s examp=
les don&#39;t work here.</div><div><br></div><div>Should {noi mo} a). be re=
stored into {noi mo cu co&#39;e} </div></div></div></div></blockquote><div>=
<br></div><div>I missed the part where &quot;noi mo cu co&#39;e&quot; becam=
e grammatical.</div><div>=C2=A0</div><blockquote class=3D"gmail_quote" styl=
e=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(2=
04,204,204);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div=
 class=3D"gmail_extra"><div class=3D"gmail_quote"><div>implying {fa xi xo&#=
39;e zo&#39;e noi mo cu co&#39;e} or b). should it instead be considered a =
continuation of the previous clause said by another speaker like with selpa=
&#39;i&#39;s example with {be ma}?</div></div></div></div></blockquote><div=
><br></div><div>I would have said to &quot;zo&#39;e noi mo cu co&#39;e&quot=
;=C2=A0</div><div><br></div><blockquote class=3D"gmail_quote" style=3D"marg=
in:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,20=
4);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D=
"gmail_extra"><div class=3D"gmail_quote"><div>Both solutions seem reasonabl=
e. Maybe take option b). and treat a discourse split between several people=
 as one sentence with special FUhE .. FUhO markers?</div><div><br></div><di=
v>mi viska lo pendo FUhE [B asks] be ma [FUhO]</div><div>mi viska lo pendo =
FUhE [B asks] noi mo [FUhO]<br></div><div><br></div><div>A: - I see a frien=
d.</div><div>B: - Of whom?</div><div><br></div><div>A: - I see a friend.</d=
iv><div>B: - Who does what?</div><div><br></div><div>This would reformulate=
 fragments as parts of discourse so that we can remove them from the gramma=
r. Of course, this would require somehow preparing existing texts by markin=
g them with those FUhE ... FUhO so that we can parse them.</div></div></div=
></div></blockquote><div><br></div><div>It depends on how you define &quot;=
text&quot;. Is a dialogue one text, or a succession of texts? The usual tak=
e is that it&#39;s a succession of texts, since otherwise a lot of lojban d=
ialogues that seem to parse would not parse. For example the irc logs=C2=A0=
</div><div><br></div><blockquote class=3D"gmail_quote" style=3D"margin:0px =
0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);bord=
er-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_=
extra"><div class=3D"gmail_quote"><div>I also allowed relative clauses in s=
umti without their heads. If fragments are removed from the grammar then si=
milar things can be useful:</div><div><br></div><div><div>sumti_4 =3D expr:=
(sumti_5 / <b>relative_clauses / </b>gek sumti gik sumti_4) {return _node(&=
quot;sumti_4&quot;, expr);}</div></div></div></div></div></blockquote><div>=
<br></div><div>This could be dangerous, as it makes &quot;ta prenu poi do s=
isku&quot; grammatical, but not with the expected meaning. Also things lika=
 {da poi prenu ku&#39;o noi melbi&quot;.</div><div>=C2=A0</div><blockquote =
class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1=
px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:=
1ex"><div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote"=
><div>This results in {fa noi pendo mi cu melbi} (in fact it may even make =
{be} useless except when used stylistically).</div></div></div></div></bloc=
kquote><div><br></div><div>I think it&#39;s safer to require a &quot;lo&quo=
t; for bare relative clauses: &quot;lo noi pendo cu melbi&quot; (this was a=
lso discussed as a good alternative to &quot;poi&#39;i&quot; in many cases)=
.<br></div><div>=C2=A0</div><blockquote class=3D"gmail_quote" style=3D"marg=
in:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,20=
4);border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><div class=3D=
"gmail_extra"><div class=3D"gmail_quote"><span class=3D""><blockquote class=
=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1px;bo=
rder-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">=
<div dir=3D"ltr"><div class=3D"gmail_extra"><div class=3D"gmail_quote"><spa=
n><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;borde=
r-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid=
;padding-left:1ex"><div dir=3D"ltr"><div class=3D"gmail_extra"><div class=
=3D"gmail_quote"><div><span style=3D"color:rgb(0,0,0);white-space:pre-wrap"=
></span></div><div><span style=3D"color:rgb(0,0,0);white-space:pre-wrap">At=
 some point there was talk of making the selbri of a sumti-tail elidable as=
 well, so that &quot;lo ku&quot; would be a valid sumti.</span></div></div>=
</div></div></blockquote></span></div></div></div></blockquote><div><br></d=
iv></span><div>I almost never use {ku} in this sense (LE-terminator). Besid=
es, some people think that {cu} should mark the beginning of a bridi tail. =
In this case I don&#39;t understand how to treat {lo cu broda}. Should it b=
e {lo COhE KU cu broda} or {lo cu broda KU CU COhE} ?</div></div></div></di=
v></blockquote><div><br></div><div>Since a bridi-tail is not part of a sumt=
i-tail, it can only be the first.</div><div>=C2=A0</div><div>mu&#39;o mi=
9;e xorxes</div><div><br></div></div></div></div>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups &=
quot;BPFK&quot; group.<br />
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:bpfk-list+unsubscribe@googlegroups.com">bpfk-list=
+unsubscribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href=3D"mailto:bpfk-list@googlegrou=
ps.com">bpfk-list@googlegroups.com</a>.<br />
Visit this group at <a href=3D"http://groups.google.com/group/bpfk-list">ht=
tp://groups.google.com/group/bpfk-list</a>.<br />
For more options, visit <a href=3D"https://groups.google.com/d/optout">http=
s://groups.google.com/d/optout</a>.<br />

--047d7bb0410215b19205124c8d54--