Re: [lojban] compound cmavo classification in cmavo.txt
On Tue, Jan 11, 2011 at 05:02:54PM -0500, Bob LeChevalier wrote:
> .alyn.post. wrote:
> >The following file:
> >
> > http://www.lojban.org/publications/wordlists/cmavo.txt
> >
> >Is a list of cmavo. I believe it is the canonical list, please
> >correct me if that is a misunderstanding.
> >
> >This file includes compound cmavo, like "le go'i", but only
> >includes a single selma'o class, even when the compound cmavo
> >consists of cmavo in more than one selma'o.
>
> The use of the * indicates that it is NOT a member of that selma'o, but
> is being grouped together (with others having the same *) for some
> pedagogical reason. The list was originally designed as a teaching
> tool, but became a reference text in lieu of an actual dictionary.
>
> >I've loaded all of the entries in cmavo.txt into the parser
>
> which parser?
>
My work-in-progress parser, jbogenturfa'i:
http://wiki.call-cc.org/eggref/4/jbogenturfahi
I've got the morphology file working and tested, and the grammar
parses what I've thrown at it so far. I'm now cleaning up the
resulting parse tree so that it is usable in other applications,
and adding test cases to exercise the grammar parser more rigorously.
It uses the PEG grammar developed by camgusmis and xorxes.
The parser is written in Scheme, and this is, to my knowledge, the
first time someone has built tools for working with Lojban in Scheme.
I've certainly written the best PEG parser available for Scheme,
because I tried the available one before writing my own. ;-)
My near-term goal is to have a camxes-level feature set and to
maintain a second PEG parser alongside camxes, with the two sharing
as nearly identical a PEG grammar as possible. So far, this work has
resulted in a satisfying level of cleanup of the PEG grammar, which
I would like to see become the official grammar for Lojban.
To that end I've been collecting the available test data and will
be extending my test suite to include it. I currently have 6176
tests covering the morphology.
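
As a minimal sketch of the * convention described in the quoted reply
above, a loader could separate the selma'o name from the marker and
record whether the entry is a true member or only grouped there. This
is Python for illustration, not code from jbogenturfa'i. The function
name split_selmaho is hypothetical, and the sketch assumes the selma'o
label has already been extracted as a string such as "LE" or "LE*";
it makes no assumption about the column layout of cmavo.txt.

```python
def split_selmaho(label: str) -> tuple[str, bool]:
    """Split a selma'o label into (name, grouped_only).

    grouped_only is True when the label carries a '*', meaning the
    entry is listed with that selma'o but is not a member of it.
    Hypothetical helper; assumes the label is already a bare string.
    """
    grouped_only = "*" in label
    # Drop the marker so both kinds of entry share the same name.
    name = label.replace("*", "")
    return name, grouped_only


# Illustrative labels only, not read from cmavo.txt.
for label in ["LE", "LE*"]:
    name, grouped_only = split_selmaho(label)
    status = "grouped with" if grouped_only else "member of"
    print(f"{label}: {status} {name}")
```

Keeping the flag separate from the name would let a parser skip the
grouped-only entries when checking that each cmavo is accepted under
its listed selma'o.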
-Alan
--
.i ko djuno fi le do sevzi