[lojban] Beta Release of PEG-based Lojban parser.
My PEG-based parser now works on almost everything I've thrown at it.
Known limitations (from the web page):
- Does not handle zoi or la'o, and likely will not handle them in the near
future.
- Currently its morphology knowledge is very poor. In particular, it
does not accept fu'ivla starting with a vowel at this time, nor
capital letters in brivla.
The parser, information on how it was made, the PEG it was built from,
and many other things are at
http://www.digitalkingdom.org/~rlpowell/hobbies/lojban/grammar/index.html
The Future:
I am considering an extension to allow 'si' or 'sa' at the beginning of
text (presumably to erase stuff from the preceding utterance). The
morphology needs massive amounts of work, and ideally I'd like to get
Nora and Pierre's full algorithm encoded. I may also hack an extremely
minimal pre-processor to do zoi. At some point the parser needs to be
taught to output something more useful than just the text it succeeded
at parsing, but I'm really hoping someone with actual Java experience
will look at that.
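
For anyone curious what an "extremely minimal pre-processor" for zoi
might look like, here is a rough sketch. It is only an illustration,
not the code the parser uses or will use. It is written in Python for
brevity rather than Java, and it assumes whitespace-separated words, a
delimiter word that may carry pause dots on either side, and a made-up
placeholder token (ZOIQUOTE) that the grammar would have to be taught
to accept. The function name strip_zoi is likewise hypothetical.

```python
# Hypothetical stand-in token; not a real Lojban word. The grammar would
# need a rule that accepts it wherever a zoi/la'o quotation may appear.
PLACEHOLDER = "ZOIQUOTE"

QUOTE_WORDS = ("zoi", "la'o")


def _norm(word):
    """Drop pause dots and case so that 'gy.' and '.gy' compare equal."""
    return word.strip(".").lower()


def strip_zoi(text):
    """Replace each `zoi X ... X` (or `la'o X ... X`) span with PLACEHOLDER.

    Returns (rewritten_text, quotes), where quotes is a list of
    (quote_word, quoted_text) pairs in order of appearance.
    An unterminated quotation is left untouched.
    """
    words = text.split()
    out = []
    quotes = []
    i = 0
    while i < len(words):
        word = words[i]
        if _norm(word) in QUOTE_WORDS and i + 1 < len(words):
            delim = _norm(words[i + 1])
            # Scan forward for the next occurrence of the delimiter word.
            j = i + 2
            while j < len(words) and _norm(words[j]) != delim:
                j += 1
            if j < len(words):
                quotes.append((_norm(word), " ".join(words[i + 2:j])))
                out.append(PLACEHOLDER)
                i = j + 1  # skip past the closing delimiter
                continue
        out.append(word)
        i += 1
    return " ".join(out), quotes
```

Running strip_zoi on the text before handing it to the parser, then
re-attaching the saved quotes to the parse result afterwards, would
keep the non-Lojban text out of the PEG entirely. Splitting on
whitespace does lose the original spacing inside the quotation, which
a real version would probably want to preserve.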
-Robin
--
http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin.
"Many philosophical problems are caused by such things as the simple
inability to shut up." -- David Stove, liberally paraphrased.
http://www.lojban.org/ *** loi pimlu na srana .i ti rocki morsi