Received: from mail-ig0-f189.google.com ([209.85.213.189]:33403) by stodi.digitalkingdom.org with esmtps (TLSv1.2:AES128-GCM-SHA256:128) (Exim 4.80.1) (envelope-from ) id 1Yahja-00059J-9L; Wed, 25 Mar 2015 02:36:27 -0700 Received: by igbhn15 with SMTP id hn15sf9340016igb.0; Wed, 25 Mar 2015 02:36:15 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=mime-version:from:date:message-id:subject:to:content-type :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe; bh=8fGcVM11kLKVlaBg6m3m9S51iMu2wehhB55dnawvZEQ=; b=XH/UZcWwgG0yMrvRENKKAKYs3xDoBV+4s2E1yNPpMKrBdGb/tJJ2NdDNOxC6rfBmrB pdf9pbYQH18evdSWM9G+d7Hh7hvvDCCEFEQsj3ZXHHh7QxXc3DQWqew8h4Gy59rkwwIK DBEPiAKj2kMb1r7BxPgehA3DJ/plwNGMOPrs264liSQCG6+bzWoYSrAxGgk/Nwvml0sM uuX1P/eGhV0SpFzhTSsfsSEqn8Eg/lQvuJCtRI25QiIONyMU5BgULucjsmfrGOI4XHqL 4iTmn5oQy50nO8ibgdJrAtO18z0DlI5vq3l+ibFZOBGOzNsqUl67QcDrYih7EJ/WLI25 s6qA== X-Received: by 10.180.126.41 with SMTP id mv9mr142813wib.0.1427276174523; Wed, 25 Mar 2015 02:36:14 -0700 (PDT) X-BeenThere: bpfk-list@googlegroups.com Received: by 10.180.21.166 with SMTP id w6ls1226575wie.3.canary; Wed, 25 Mar 2015 02:36:14 -0700 (PDT) X-Received: by 10.180.75.232 with SMTP id f8mr4075744wiw.0.1427276174202; Wed, 25 Mar 2015 02:36:14 -0700 (PDT) Received: from mail-wg0-x234.google.com (mail-wg0-x234.google.com. [2a00:1450:400c:c00::234]) by gmr-mx.google.com with ESMTPS id gj20si882930wic.1.2015.03.25.02.36.14 for (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Wed, 25 Mar 2015 02:36:14 -0700 (PDT) Received-SPF: pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::234 as permitted sender) client-ip=2a00:1450:400c:c00::234; Received: by mail-wg0-x234.google.com with SMTP id m6so20156608wgd.2 for ; Wed, 25 Mar 2015 02:36:14 -0700 (PDT) X-Received: by 10.194.21.193 with SMTP id x1mr16046305wje.144.1427276174052; Wed, 25 Mar 2015 02:36:14 -0700 (PDT) MIME-Version: 1.0 Received: by 10.194.240.197 with HTTP; Wed, 25 Mar 2015 02:35:53 -0700 (PDT) From: Gleki Arxokuna Date: Wed, 25 Mar 2015 12:35:53 +0300 Message-ID: Subject: [bpfk] Improvements to fragments in ilmentufa parser To: bpfk-list@googlegroups.com Content-Type: multipart/alternative; boundary=047d7b5d561059ed84051219a10d X-Original-Sender: gleki.is.my.name@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::234 as permitted sender) smtp.mail=gleki.is.my.name@gmail.com; dkim=pass header.i=@gmail.com; dmarc=pass (p=NONE dis=NONE) header.from=gmail.com Reply-To: bpfk-list@googlegroups.com Precedence: list Mailing-list: list bpfk-list@googlegroups.com; contact bpfk-list+owners@googlegroups.com List-ID: X-Google-Group-Id: 972099695765 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -1.7 (-) X-Spam_score: -1.7 X-Spam_score_int: -16 X-Spam_bar: - Content-Length: 5926 --047d7b5d561059ed84051219a10d Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable The following problems were addressed while analyzing ilmentufa parser based on camxes.js by Masato Hagiwara based on the original camxes: 1. {ca ma} parses as "fragment" 2. {mi zo'u ca ma} doesn't parse. 3. {mi djica lo nu lo plise} doesn't parse. 4. {mi ije ma} doesn't parse 5. {mi poi ca ma vau mo} doesn't parse. So I made 2. parse as a "fragment" and made {lo plise} in 3. parse as a "fragment". Besides I added in fragment elidible GOhA_elidible which returns a node labelled "COhE": 4. and 5. are now sentences falling back to fragments with COhE autorestored. ([mi zo'u] [ca ma] COhE VAU) (mi [CU {djica VAU}]) ([CU {mo VAU}] [i je {ma COhE VAU}] ([mi COhE VAU] [i je {ma COhE VAU}]) Notice that this grammar is somewhat experimental, e.g. it treats {cu} as bridi tail starter. Justification: 2. why not being able to parse it? Why force people to use {co'e} as the selbri while we are allowed to omit {zo'e}? 3. {tu'a} in most cases can be replaced with {lonu} now without adding more syllables but without learning tricky additional words like {tu'a}. You can test raw output here: http://vrici.lojban.org/~gleki/mediawiki-1.19.2/extensions/ilmentufa/camxes= -exp.html The exact changes implementing this are here: https://github.com/Ilmen-vodhr/ilmentufa/compare/05a7193368eaeb9198f77533ae= 6f4ab3391fa4ad...8d5d4c3fea2201eba868eb9aa678b46a3dbfaae5 (ignore javascript litter in strings, it's unavoidable) As you can see "fragment" now restores COhE in certain cases, "sentence" falls back to this renewed "fragment". Objections? E.g. I suppose COhE_elidible should be inserted to more constructs. but I haven't looked at where it can be done. What else apart from what has been done can be autocorrected to bridi status? --=20 You received this message because you are subscribed to the Google Groups "= BPFK" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to bpfk-list+unsubscribe@googlegroups.com. To post to this group, send email to bpfk-list@googlegroups.com. Visit this group at http://groups.google.com/group/bpfk-list. For more options, visit https://groups.google.com/d/optout. --047d7b5d561059ed84051219a10d Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
The following problems were addressed while anal= yzing ilmentufa parser based on camxes.js by Masato Hagiwara based on the o= riginal camxes:

1. {ca ma} parses as "fragmen= t"
2. {mi zo'u ca ma} doesn't parse.
3. {m= i djica lo nu lo plise} doesn't parse.
4. {mi ije ma} doesn&#= 39;t parse
5. {mi poi ca ma vau mo} doesn't parse.
=
So I made 2. parse as a "fragment" and made {lo pl= ise} in 3. parse as a "fragment".
Besides I added in fr= agment elidible GOhA_elidible which returns a node labelled "COhE"= ;:
4. and 5. are now sentences falling back to fragments= with COhE autorestored.

([mi zo'u] [ca = ma] COhE VAU)=C2=A0

(mi [CU {djica <lo (=C2= =B9nu [{lo plise KU} COhE VAU] KEI=C2=B9) KU> VAU}])=C2=A0
([CU {mo VAU}] [i je {ma COhE VAU}]

([mi COhE VAU] [i je= {ma COhE VAU}])=C2=A0

Notice that this grammar is somewhat experime= ntal, e.g. it treats {cu} as bridi tail starter.

J= ustification:=C2=A0
2. why not being able to parse it? Why fo= rce people to use {co'e} as the selbri while we are allowed to omit {zo= 'e}?
3. {tu'a} in most cases can be replaced with {lonu} = now without adding more syllables but without learning tricky additional wo= rds like {tu'a}.


<= /div>
(ignore javascript litter in strings, it's unavoidable)
=

As you can see "fragment" now restores COhE i= n certain cases, "sentence" falls back to this renewed "frag= ment".

Objections?

E.g.=C2=A0I suppose COhE_elidible should be inserted to more constructs.= but I haven't looked at where it can be done. What else apart from wha= t has been done can be autocorrected to bridi status?

--
You received this message because you are subscribed to the Google Groups &= quot;BPFK" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to bpfk-list= +unsubscribe@googlegroups.com.
To post to this group, send email to bpfk-list@googlegroups.com.
Visit this group at ht= tp://groups.google.com/group/bpfk-list.
For more options, visit http= s://groups.google.com/d/optout.
--047d7b5d561059ed84051219a10d--