[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] Re: Beta Release of PEG-based Lojban parser.



On Thu, Apr 08, 2004 at 10:51:57PM -0400, Pierre Abbat wrote:
> On Thu, Apr 08, 2004 at 05:28:58PM -0700, Robin Lee Powell wrote:
> > I am considering an extension to allow 'si' or 'sa' at the beginning
> > of text (presumably to erase stuff from the proceeding utterance).
> > The morphology needs massive amounts of work, and ideally I'd like
> > to get Nora and Pierre's full algorithm encoded. 
> 
> Do you mean the algorithm for breaking a speech stream into words, or
> the algorithm for telling whether the words are valid? The former is
> done, except for the option that always requires a pause before a
> cmevla and being tested by someone else; but the latter I'm not
> finished encoding myself.

Both.  The point would be to have a single program that can take any
mixture of space-seperated and stress-added character streams and output
a parse tree (assuming a successful parse is possible), including
identifying what word type each word is.

Doing all of the above entirely in a properly formalized language is my
ultimate goal with this project[1].

> I am working, when I can find time for it, on the complete brivla
> validity test. I think I got the tosmabru test working right, but it
> is claiming that {lekybumlatu} and {stanybrulspa} are valid. (The
> former is {le kybu mlatu} run together; 

I had to do Special Things for BU to work properly; it couldn't just be
treated as a normal cmavo.

-Robin

[1]: Well, actually, my ultimate goal is to create a program that will
accept mekso and output LaTeX, but I seem to have become a tad bit
sidetracked.  :-)

-- 
http://www.digitalkingdom.org/~rlpowell/  ***  I'm a *male* Robin.
"Many philosophical problems are caused by such things as the simple
inability to shut up." -- David Stove, liberally paraphrased.
http://www.lojban.org/  ***  loi pimlu na srana .i ti rocki morsi