From seidensticker@msn.com Fri Mar 09 21:14:40 2001
Return-Path: <Seidensticker@msn.com>
X-Sender: Seidensticker@msn.com
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-7_0_4); 10 Mar 2001 05:14:39 -0000
Received: (qmail 50528 invoked from network); 10 Mar 2001 05:14:38 -0000
Received: from unknown (10.1.10.142) by l7.egroups.com with QMQP; 10 Mar 2001 05:14:38 -0000
Received: from unknown (HELO bilbo.w-link.net) (206.98.114.20) by mta3 with SMTP; 10 Mar 2001 06:15:43 -0000
Received: from bobsprimary (dhcp130-dsl-gte3.w-link.net [206.129.86.130]) by bilbo.w-link.net (8.9.3/8.9.3) with SMTP id VAA04914 for <lojban@yahoogroups.com>; Fri, 9 Mar 2001 21:12:12 -0800 (PST)
Message-ID: <002d01c0a920$b720c680$825681ce@wlink.net>
To: <lojban@yahoogroups.com>
Subject: How do you parse lujvo into the component rafsi?
Date: Fri, 9 Mar 2001 21:12:33 -0800
Organization: Microsoft Corporation
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----=_NextPart_000_002A_01C0A8DD.A8898F00"
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 5.50.4522.1200
X-MimeOLE: Produced By Microsoft MimeOLE V5.50.4522.1200
From: "seidensticker" <seidensticker@msn.com>

------=_NextPart_000_002A_01C0A8DD.A8898F00
Content-Type: text/plain;
charset="Windows-1252"
Content-Transfer-Encoding: quoted-printable

I'm working on an algorithm for breaking a lujvo into its component parts. =
(My goal: given an unknown lujvo, break it up into parts and display the d=
efinitions of each of those parts.) Chapter 4, section 11 of the grammar b=
ook ("The lujvo-making algorithm") talks about creating lujvo, but my quest=
ion is about the reverse. Is there a place where this is simply described?

If there's not, let me try this: I've tried to compose a grammar that defin=
es a lujvo. Could someone critique it?

lujvo =3D InitialRafsi TermainlRafsi
InitialRafsi =3D Rafsi InitialRafsi | <null>
Rafsi =3D 4Rafsi | 3Rafsi

TerminalRafsi =3D CCV | CVV | CVCCV | CCVCV
4Rafsi =3D CVCCy | CCVCy
3Rafsi =3D CVV | CCV | CVVr | CVVn | CVC | CVCy

Must the parsing of the unknown lujvo begin from the right? or left? or i=
s it unambiguous regardless? Given a 4Rafsi of the form CVCCy or CCVCy, I'=
m assuming that there's only one gismu with those first 4 letters -- right?=
=20=20

Any other suggestions for how to do the parsing?

Thanks.

------=_NextPart_000_002A_01C0A8DD.A8898F00
Content-Type: text/html;
charset="Windows-1252"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=3DContent-Type content=3D"text/html; charset=3Dwindows-125=
2">
<META content=3D"MSHTML 5.50.4522.1800" name=3DGENERATOR>
<STYLE></STYLE>
</HEAD>
<BODY bgColor=3D#e0e0ff>
<DIV><FONT face=3D"Times New Roman">I'm working on an algorithm for breakin=
g a=20
lujvo into its component parts.&nbsp; (My goal: given an unknown lujvo, bre=
ak it=20
up into parts and display the definitions of each of those parts.)&nbsp; Ch=
apter=20
4, section 11 of the grammar book ("The lujvo-making algorithm") talks abou=
t=20
creating lujvo, but my question is about the reverse.&nbsp; Is there a plac=
e=20
where this is simply described?</FONT></DIV>
<DIV><FONT face=3D"Times New Roman"></FONT>&nbsp;</DIV>
<DIV><FONT face=3D"Times New Roman">If there's not, let me try this: I've t=
ried to=20
compose a grammar that defines a lujvo.&nbsp; Could someone critique=20
it?</FONT></DIV>
<DIV><FONT face=3D"Times New Roman"></FONT>&nbsp;</DIV>
<DIV><FONT face=3D"Times New Roman">lujvo&nbsp; =3D &nbsp;InitialRafsi&nbsp=
;=20
TermainlRafsi</FONT></DIV>
<DIV><FONT face=3D"Times New Roman">InitialRafsi&nbsp; =3D&nbsp; Rafsi&nbsp=
;=20
InitialRafsi&nbsp; | &nbsp;&lt;null&gt;</FONT></DIV>
<DIV><FONT face=3D"Times New Roman">Rafsi&nbsp; =3D&nbsp; 4Rafsi&nbsp; |&nb=
sp;=20
3Rafsi</FONT></DIV>
<DIV><FONT face=3D"Times New Roman"></FONT>&nbsp;</DIV>
<DIV><FONT face=3D"Times New Roman">TerminalRafsi&nbsp; =3D&nbsp; CCV | CVV=
| CVCCV=20
| CCVCV</FONT></DIV>
<DIV><FONT face=3D"Times New Roman">4Rafsi&nbsp; =3D &nbsp;CVCCy |=20
CCVCy</FONT></DIV>
<DIV><FONT face=3D"Times New Roman">3Rafsi&nbsp; =3D&nbsp; CVV | CCV | CVVr=
| CVVn |=20
CVC | CVCy</FONT></DIV>
<DIV><FONT face=3D"Times New Roman"></FONT>&nbsp;</DIV>
<DIV><FONT face=3D"Times New Roman">Must the parsing of the unknown lujvo b=
egin=20
from the right?&nbsp; or left?&nbsp; or is it unambiguous regardless?&nbsp;=
=20
Given a 4Rafsi of the form CVCCy or CCVCy, I'm assuming that there's only o=
ne=20
gismu with those first 4 letters -- right?&nbsp; </FONT></DIV>
<DIV><FONT face=3D"Times New Roman"></FONT>&nbsp;</DIV>
<DIV><FONT face=3D"Times New Roman">Any other suggestions for how to do the=
=20
parsing?</FONT></DIV>
<DIV><FONT face=3D"Times New Roman"></FONT>&nbsp;</DIV>
<DIV><FONT face=3D"Times New Roman">Thanks.</FONT></DIV></BODY></HTML>

------=_NextPart_000_002A_01C0A8DD.A8898F00--

