
Re: [lojban-beginners] vlastezba: First beta version released!



Hi Alan,

That would indeed be an interesting experiment, and I'd be quite keen to see the results myself.

Right now, if you just call

   java -jar vlastezba.jar test.txt

with some Lojban text (legal or otherwise) in test.txt, it will print one valsi per line on stdout.  So "coirodo" would result in:
   coi
   ro
   do
(You can make it look up the definitions by passing a second parameter, but that just adds junk to the output that I don't think you'd want.)
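
If you would rather have the list form you mentioned, the one-valsi-per-line output is easy to convert. Here is a minimal sketch, not part of vlastezba itself: the class name ValsiList and its two methods are made up for illustration, and it assumes the output really is just one valsi per line, possibly indented, with no definitions mixed in.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Scanner;

// Hypothetical helper, not part of vlastezba: turns one-valsi-per-line
// text into a parenthesised list of quoted strings.
public class ValsiList {

    // Collect the non-blank lines, with surrounding whitespace removed.
    static List<String> readValsi(Scanner in) {
        List<String> valsi = new ArrayList<String>();
        while (in.hasNextLine()) {
            String line = in.nextLine().trim();
            if (!line.isEmpty()) {
                valsi.add(line);
            }
        }
        return valsi;
    }

    // Render the valsi as ("a" "b" "c"). No escaping is done, on the
    // assumption that a valsi never contains a double quote.
    static String toList(List<String> valsi) {
        StringBuilder sb = new StringBuilder("(");
        for (String v : valsi) {
            if (sb.length() > 1) {
                sb.append(' ');
            }
            sb.append('"').append(v).append('"');
        }
        return sb.append(')').toString();
    }

    public static void main(String[] args) {
        // Reads the one-valsi-per-line text from stdin.
        System.out.println(toList(readValsi(new Scanner(System.in))));
    }
}
```

Piping the output above through it (java -jar vlastezba.jar test.txt | java ValsiList) should give ("coi" "ro" "do") for "coirodo".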

Right now it doesn't check grammar at all, so you can throw any random collection of words at it.  I don't intend for it ever to check grammar; there are tools out there that are far better at that than I could ever hope to make it.

It also won't give you a classification of valsi - it doesn't "know" when it's dealing with a cmavo (or indeed what class), or a gismu, or a lujvo.  This I DO intend to fix.

I want to add other output formats anyway, so if you want me to do something specific to make your comparison easier, let me know.  Now would be a good time, as I'm going away on holiday for a week and want to spend at least a little bit of time on vlastezba.

In fact, if you are comfortable with Java, feel free to make it do what you need: the source code is on sourceforge.net (http://sourceforge.net/projects/vlastezba/) and is GPL'ed :-)

mu'o mi'e iu'an



On Wed, Apr 20, 2011 at 4:29 PM, .alyn.post. <alyn.post@lodockikumazvati.org> wrote:
Do you have an external representation for your valsi parsing
result?  If I hand you the string "coirodo" is there a print
form of that along the lines of ("coi" "ro" "do")?

I would be interested in seeing the result from processing a large
data set of words and phrases and comparing that to jbogenturfa'i.
In order to do this I'd need some output format from your program
that I could parse.

jbogenturfa'i uses the morphology PEG grammar that xorxes developed,
so it contains code which I think is similar (and should be
identical in result) to what you are doing:

 $ echo "coirodo"|jbogenturfahi --rafske
 ((cmavo (COI "coi")) (cmavo (PA "ro")) (cmavo (KOhA "do")))

I'd be curious to know whether they are in fact producing identical
results.
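
For the comparison, the bare words can be pulled out of the --rafske
output and checked against the other tool's word list. A minimal sketch,
assuming the output always has the shape of the single sample above, so
that every double-quoted string is one word; the class name RafskeWords
is made up for illustration, and other word types may need more care:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: extracts the quoted words from an s-expression
// shaped like the --rafske sample, ignoring the class labels around them.
public class RafskeWords {

    // Matches one double-quoted string and captures its contents.
    private static final Pattern QUOTED = Pattern.compile("\"([^\"]*)\"");

    static List<String> words(String sexp) {
        List<String> out = new ArrayList<String>();
        Matcher m = QUOTED.matcher(sexp);
        while (m.find()) {
            out.add(m.group(1));
        }
        return out;
    }

    public static void main(String[] args) {
        String sample =
            "((cmavo (COI \"coi\")) (cmavo (PA \"ro\")) (cmavo (KOhA \"do\")))";
        System.out.println(words(sample)); // prints [coi, ro, do]
    }
}
```

Two word lists built this way can then be compared with List.equals.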

-Alan

On Wed, Apr 20, 2011 at 11:02:28AM +0200, Johan Pretorius wrote:
>    Hi all
>
>    You can download it from here:
>    [1]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>
>    I have completed the cmavo cluster breakout code, and tested it as far as
>    I was able.
>
>    It should be easy enough to run if you have Java 1.6 installed, just go
>    java -jar vlastezba.jar and it will print out usage instructions.
>
>    Please download it and test to pieces! I'd love all your feedback.
>
>    Note that it doesn't get very smart at this stage - for instance, it won't
>    know what to do if you feed it a string of lojban that doesn't have any
>    spaces in. The only clever bit is that it's able to break apart cmavo
>    clusters if they don't have any spaces.
>
>    Regards,
>    Johan
>
>    --
>    Johan Pretorius
>    Cell: 0829268327
>    [2]pretoriusjf@gmail.com
>
>    --
>    You received this message because you are subscribed to the Google Groups
>    "Lojban Beginners" group.
>    To post to this group, send email to lojban-beginners@googlegroups.com.
>    To unsubscribe from this group, send email to
>    lojban-beginners+unsubscribe@googlegroups.com.
>    For more options, visit this group at
>    http://groups.google.com/group/lojban-beginners?hl=en.
>
> References
>
>    Visible links
>    1. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>    2. mailto:pretoriusjf@gmail.com

--
.i ma'a lo bradi ku penmi gi'e du
