[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] Parsing lujvo



> Date: Fri, 9 Mar 2001 21:12:33 -0800
> From: "seidensticker" <seidensticker@msn.com>
> Subject: How do you parse lujvo into the component rafsi?
> 
> I'm working on an algorithm for breaking a lujvo into its
> component parts.  (My goal: given an unknown lujvo, break
> it up into parts and display the definitions of each of
> those parts.)

I have a set of programs which, given a (grammatically correct)
lojban utterance, generates a gloss of it. This includes having
the bracketing of the original to see the grammatical structure,
and looking up the word-for-word "translations" of the words,
including breaking up lujvo and finding the definitions of each
component.  It was heavily critised by everyone who tried it,
apparently because it's text based, doesn't have pretty colours,
and runs under DOS. However, I now have versions that run under
Linux, NetBSD, RiscOS and Solaris, although it's still entirely
text based. I use it all the time, the main problem being that
it doesn't cope at all gracefully with grammatically incorrect
material, and much of what's written on this list is.

Anyway, the code is yours if you want it.  It's mostly C, but
with some script or batch files to glue together the separate 
components.


cdw
-- 
\\//  ze'uku ko jmive gi'e snada