Received: from mail-pf0-f191.google.com ([209.85.192.191]:32973) by stodi.digitalkingdom.org with esmtps (TLSv1.2:AES128-GCM-SHA256:128) (Exim 4.85) (envelope-from ) id 1aCSuw-0006b2-9r for lojban-list-archive@lojban.org; Fri, 25 Dec 2015 06:00:36 -0800 Received: by mail-pf0-f191.google.com with SMTP id q63sf10431043pfb.0 for ; Fri, 25 Dec 2015 06:00:26 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :content-type:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:x-spam-checked-in-group :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe; bh=2U6xsHRLI8rnQ51Nc1qhIG4wOy5f2th6zztL5QMQ0+I=; b=jp9Z17CczpMEJB/TXRG7k5e39N6dNx3I4ycSPT3HOAT9TQ+uSf8dkiuLn0dKXJUqfR CQZN71PDGOb7uNJKYXSUVrSw6YUwBoIdLdHp9+4aledCdYGMwlW5K9oQQMkfVQEFtCwL YazPa9EAJlAeTBA9walZTIAiLXjYZxH960Qz7zQXnnr13RRtYU55Py67/3n3avP+c0bk Am6EZB1Bxzc0zFnUjaIHtpx7du0RzmQkfYsZYZmNULuaf53O5IUPWG2RbgbFJEE8epIi n6uzSfqOtIRTBA2eFocOrzSn8Hkdep9D4anf5zifmLXc1ERDXF88XubhqZjDMAk+sWCx u7Ng== X-Received: by 10.28.47.151 with SMTP id v145mr40862wmv.21.1451052019523; Fri, 25 Dec 2015 06:00:19 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.28.170.71 with SMTP id t68ls1504129wme.21.canary; Fri, 25 Dec 2015 06:00:18 -0800 (PST) X-Received: by 10.194.110.233 with SMTP id id9mr4117644wjb.5.1451052018706; Fri, 25 Dec 2015 06:00:18 -0800 (PST) Received: from mail-wm0-x236.google.com (mail-wm0-x236.google.com. [2a00:1450:400c:c09::236]) by gmr-mx.google.com with ESMTPS id b62si1117267wmc.0.2015.12.25.06.00.18 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 25 Dec 2015 06:00:18 -0800 (PST) Received-SPF: pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c09::236 as permitted sender) client-ip=2a00:1450:400c:c09::236; Received: by mail-wm0-x236.google.com with SMTP id p187so201104065wmp.0 for ; Fri, 25 Dec 2015 06:00:18 -0800 (PST) X-Received: by 10.28.0.79 with SMTP id 76mr10638190wma.27.1451052018531; Fri, 25 Dec 2015 06:00:18 -0800 (PST) MIME-Version: 1.0 Received: by 10.28.92.206 with HTTP; Fri, 25 Dec 2015 05:59:39 -0800 (PST) In-Reply-To: References: From: Gleki Arxokuna Date: Fri, 25 Dec 2015 16:59:39 +0300 Message-ID: Subject: Re: [lojban] la cmaxes, a minimal morphology parser To: "lojban@googlegroups.com" Content-Type: multipart/alternative; boundary=001a113c70e21d8e4a0527b9602b X-Original-Sender: gleki.is.my.name@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c09::236 as permitted sender) smtp.mailfrom=gleki.is.my.name@gmail.com; dmarc=pass (p=NONE dis=NONE) header.from=gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Spam-Checked-In-Group: lojban@googlegroups.com X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -1.7 (-) X-Spam_score: -1.7 X-Spam_score_int: -16 X-Spam_bar: - --001a113c70e21d8e4a0527b9602b Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable 2015-12-25 15:37 GMT+03:00 Jorge Llamb=C3=ADas : > > > On Fri, Dec 25, 2015 at 9:01 AM, Gleki Arxokuna < > gleki.is.my.name@gmail.com> wrote: > >> >> 3. can help you study lojban morphology from PEG >> , which is >> easier to grasp when everything else is removed. >> > > bgv =3D [bgv] hgu > > jz =3D [jz] hgu > > cs =3D [cs] hgv !cs !x > > oops, the website wasn't updated. I will fix later. Or you can just clear appcache for it. " !x" isn't necessary here at all. i removed it: http://mw.lojban.org/extensions/ilmentufa/morfologi.js.peg > > pf =3D [pf] hgv > > > Unfortunately, you can't do this. The !x after cs is wrong because it wil= l > reject for example "vasxu". But more importantly no consonant follows the > same rules of any other consonant. You removed the restriction against > double consonants, so "babba" will parse as a gismu. > > The only two letters that share identical rules are e and o. > Indeed, thanks for noticing. I need to explain this parser better because it changes something in ideology. Namely, it preprocesses input using a bunch or regexes. So {zk} turns into {zyk}, {bb} into {byb} etc. The idea is that the parser expects correct language in its input and determine word classes, but not show mistakes in the input. > > mu'o mi'e xorxes > > -- > You received this message because you are subscribed to the Google Groups > "lojban" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to lojban+unsubscribe@googlegroups.com. > To post to this group, send email to lojban@googlegroups.com. > Visit this group at https://groups.google.com/group/lojban. > For more options, visit https://groups.google.com/d/optout. > --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at https://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. --001a113c70e21d8e4a0527b9602b Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable


2015-12-25 15:37 GMT+03:00 Jorge Llamb=C3=ADas <jjllambias@gmail.co= m>:


On Fri, Dec 25, = 2015 at 9:01 AM, Gleki Arxokuna <gleki.is.my.name@gmail.com&g= t; wrote:

3. can help you study lojban morpholog= y from PEG, which is easier to grasp when everything else is removed.

bgv =3D [bgv] hgu

jz =3D [jz] hgu

cs =3D [cs] hgv !cs !x
oops,= the website wasn't updated. I will fix later. Or you can just clear ap= pcache for it.
" !x" isn't necessary here at all. i= removed it:
http://mw.lojban.org/extensions/ilmentufa/morfologi.js.peg<= br>

=C2=A0
=
pf =3D [pf] hgv

Unfortunately, you can't= do this. The !x after cs is wrong because it will reject for example "= ;vasxu". But more importantly no consonant follows the same rules of a= ny other consonant. You removed the restriction against double consonants, = so "babba" will parse as a gismu.

The on= ly two letters that share identical rules are e and o.

Indeed, thanks for noticing. I need to = explain this parser better because it changes something in ideology.
<= div>
Namely, it preprocesses input using a bunch or regexes.<= /div>
So {zk} turns into {zyk}, {bb} into {byb} etc.
The idea= is that the parser expects correct language in its input and determine wor= d classes, but not show mistakes in the input.

=C2= =A0

mu'o mi'= e xorxes

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at https://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http= s://groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
--001a113c70e21d8e4a0527b9602b--