[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [lojban-beginners] vlastezba: First beta version released!
On 20/04/2011, Johan Pretorius <pretoriusjf@gmail.com> wrote:
> Okay, the licensing is fixed now.
>
> Alan, The fact that you know that's a problem puts you in 1% of the
> population :-)
>
> Anyway, I'm not diametrically opposed to XML just for the sake of being
> opposed... it's worth looking at, especially, as you say, for
> interoperability.
>
> Do you think it's necessary to include the input string? I foresee
> vlastezba being used for large bodies of text, anyway that's how I intend to
> use it for myself: I feed it the terry the tiger story and let it build me
> something I can print out, which means my sucky vocabulary does not stop me
> reading the story, albeit slowly.
>
> Maybe it's a good idea to make that configurable.
>
> -Johan
>
>
> On Wed, Apr 20, 2011 at 5:12 PM, .alyn.post. <alyn.post@lodockikumazvati.org
>> wrote:
>
>> I can more-or-less work with the what it does now, so that is
>> sufficient experimentation.
>>
>> I routinely write code like |if(var=="foo")| when I mean
>> |if(var.equals("foo"))|, my Java isn't what it could be.
>>
>> I'm able to parse XML for tree-structured data, which is probably
>> the easiest choice for interoperability:
>>
>> XML:
>>
>> <pruce>
>> <selruhe>coi ro do</selruhe>
>> <teryruhe>
>> <cmavo selmaho="COI">coi</cmavo>
>> <cmavo selmaho="PA">ro</cmavo>
>> <cmavo selmaho="KOhA">do</cmavo>
>> </teryruhe>
>> </pruce>
>>
>> If this makes you cringe, then how about:
>>
>> csv:
>>
>> klesi,valsi
>> COI,coi
>> PA,ro
>> KOhA,do
>>
>> Which unfortunately doesn't include the input string; I don't see a
>> simple way to do that that is normal (as in normal form).
>>
>> -Alan
>>
>> On Wed, Apr 20, 2011 at 04:51:51PM +0200, Johan Pretorius wrote:
>> > Hi Alan,
>> >
>> > That would indeed be an interesting experiment, I'd be quite keen to
>> see
>> > the results myself.
>> >
>> > Right now, if you just call
>> >
>> > java -jar vlastezba.jar test.txt
>> >
>> > with some Lojban text (legal or otherwise) in test.txt, it will
>> > return
>> (on
>> > stdout), one valsi per line. So "coirodo" would result in:
>> > coi
>> > ro
>> > do
>> > (you can make it go look up the definitions by passing a second
>> parameter,
>> > but it will just add junk to the output that I don't think you'd
>> > want)
>> >
>> > Right now it doesn't check grammar at all, so you can throw any
>> > random
>> > collection of words at it (I don't intend for it to ever do this,
>> there
>> > are tools out there that are far better at this than I could ever
>> > hope
>> to
>> > make it).
>> >
>> > It also won't give you a classification of valsi - it doesn't "know"
>> when
>> > it's dealing with a cmavo (or indeed what class), or a gismu, or a
>> lujvo.
>> > This I DO intend to fix.
>> >
>> > I want to add other output formats anyway, so if you want me to do
>> > something specific to make your comparison easier, let me know. Now
>> would
>> > be a good time, as I'm going away on holiday for a week, and wanted
>> > to
>> > spend at least a little bit of time on vlastezba.
>> >
>> > In fact, if you are comfortable with Java, feel free to make it do
>> what
>> > you need, the source code is on [1]sourceforge.net
>> > ([2]http://sourceforge.net/projects/vlastezba/), and is GPL'ed :-)
>> >
>> > mu'o mi'e iu'an
>> >
>> > On Wed, Apr 20, 2011 at 4:29 PM, .alyn.post.
>> > <[3]alyn.post@lodockikumazvati.org> wrote:
>> >
>> > Do you have an external representation for your valsi parsing
>> > result? If I hand you the string "coirodo" is there a print
>> > form of that along the lines of ("coi" "ro" "do")?
>> >
>> > I would be interested seeing the result from processing a large
>> > data set of words and phrases and comparing that to jbogenturfa'i.
>> > In order to do this I'd need some output format from your program
>> > that I could parse.
>> >
>> > jbogenturfa'i uses the morphology PEG grammar that xorxes
>> > developed,
>> > so it contains code which I think is similar (and should be
>> > identical in result) to what you are doing:
>> >
>> > $ echo "coirodo"|jbogenturfahi --rafske
>> > ((cmavo (COI "coi")) (cmavo (PA "ro")) (cmavo (KOhA "do")))
>> >
>> > I'd be curious to know whether they are in fact producing identical
>> > results.
>> >
>> > -Alan
>> > On Wed, Apr 20, 2011 at 11:02:28AM +0200, Johan Pretorius wrote:
>> > > Hi all
>> > >
>> > > You can download it from here:
>> > >
>> > [1][4]
>> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>> > >
>> > > I have completed the cmavo cluster breakout code, and tested it
>> > as
>> far
>> > as
>> > > I was able.
>> > >
>> > > It should be easy enough to run if you have Java 1.6 installed,
>> just
>> > go
>> > > java -jar vlastezba.jar and it will print out usage instructions.
>> > >
>> > > Please download it and test to pieces! I'd love all your
>> > feedback.
>> > >
>> > > Not that it doesn't get very smart at this stage - for instance,
>> it
>> > won't
>> > > know what to do if you feed it a string of lojban that doesn't
>> have
>> > any
>> > > spaces in. The only clever bit is that it's able to break apart
>> cmavo
>> > > clusters if they don't have any spaces.
>> > >
>> > > Regards,
>> > > Johan
>> > >
>> > > --
>> > > Johan Pretorius
>> > > Cell: 0829268327
>> > > [2][5]pretoriusjf@gmail.com
>> > >
>> > > --
>> > > You received this message because you are subscribed to the
>> > Google
>> > Groups
>> > > "Lojban Beginners" group.
>> > > To post to this group, send email to
>> > [6]lojban-beginners@googlegroups.com.
>> > > To unsubscribe from this group, send email to
>> > > [7]lojban-beginners+unsubscribe@googlegroups.com.
>> > > For more options, visit this group at
>> > > [8]http://groups.google.com/group/lojban-beginners?hl=en.
>> > >
>> > > References
>> > >
>> > > Visible links
>> > > 1.
>> > [9]
>> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>> > > 2. mailto:[10]pretoriusjf@gmail.com
>> >
>> > --
>> > .i ma'a lo bradi ku penmi gi'e du
>> > --
>> > You received this message because you are subscribed to the Google
>> > Groups "Lojban Beginners" group.
>> > To post to this group, send email to
>> > [11]lojban-beginners@googlegroups.com.
>> > To unsubscribe from this group, send email to
>> > [12]lojban-beginners+unsubscribe@googlegroups.com.
>> > For more options, visit this group at
>> > [13]http://groups.google.com/group/lojban-beginners?hl=en.
>> >
>> > --
>> > Johan Pretorius
>> > Cell: 0829268327
>> > [14]pretoriusjf@gmail.com
>> >
>> > --
>> > You received this message because you are subscribed to the Google
>> Groups
>> > "Lojban Beginners" group.
>> > To post to this group, send email to
>> lojban-beginners@googlegroups.com.
>> > To unsubscribe from this group, send email to
>> > lojban-beginners+unsubscribe@googlegroups.com.
>> > For more options, visit this group at
>> > http://groups.google.com/group/lojban-beginners?hl=en.
>> >
>> > References
>> >
>> > Visible links
>> > 1. http://sourceforge.net/
>> > 2. http://sourceforge.net/projects/vlastezba/
>> > 3. mailto:alyn.post@lodockikumazvati.org
>> > 4.
>> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>> > 5. mailto:pretoriusjf@gmail.com
>> > 6. mailto:lojban-beginners@googlegroups.com
>> > 7. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com
>> > 8. http://groups.google.com/group/lojban-beginners?hl=en
>> > 9.
>> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>> > 10. mailto:pretoriusjf@gmail.com
>> > 11. mailto:lojban-beginners@googlegroups.com
>> > 12. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com
>> > 13. http://groups.google.com/group/lojban-beginners?hl=en
>> > 14. mailto:pretoriusjf@gmail.com
>>
>> --
>> .i ma'a lo bradi ku penmi gi'e du
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "Lojban Beginners" group.
>> To post to this group, send email to lojban-beginners@googlegroups.com.
>> To unsubscribe from this group, send email to
>> lojban-beginners+unsubscribe@googlegroups.com.
>> For more options, visit this group at
>> http://groups.google.com/group/lojban-beginners?hl=en.
>>
>>
>
>
> --
> Johan Pretorius
> Cell: 0829268327
> pretoriusjf@gmail.com
>
> --
> You received this message because you are subscribed to the Google Groups
> "Lojban Beginners" group.
> To post to this group, send email to lojban-beginners@googlegroups.com.
> To unsubscribe from this group, send email to
> lojban-beginners+unsubscribe@googlegroups.com.
> For more options, visit this group at
> http://groups.google.com/group/lojban-beginners?hl=en.
>
>
--
You received this message because you are subscribed to the Google Groups "Lojban Beginners" group.
To post to this group, send email to lojban-beginners@googlegroups.com.
To unsubscribe from this group, send email to lojban-beginners+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.