[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban-beginners] vlastezba: First beta version released!



I can more-or-less work with the what it does now, so that is
sufficient experimentation.

I routinely write code like |if(var=="foo")| when I mean
|if(var.equals("foo"))|, my Java isn't what it could be.

I'm able to parse XML for tree-structured data, which is probably
the easiest choice for interoperability:

XML:

  <pruce>
    <selruhe>coi ro do</selruhe>
    <teryruhe>
      <cmavo selmaho="COI">coi</cmavo>
      <cmavo selmaho="PA">ro</cmavo>
      <cmavo selmaho="KOhA">do</cmavo>
    </teryruhe>
  </pruce>

If this makes you cringe, then how about:

csv:

  klesi,valsi
  COI,coi
  PA,ro
  KOhA,do

Which unfortunately doesn't include the input string; I don't see a
simple way to do that that is normal (as in normal form).

-Alan

On Wed, Apr 20, 2011 at 04:51:51PM +0200, Johan Pretorius wrote:
>    Hi Alan,
> 
>    That would indeed be an interesting experiment, I'd be quite keen to see
>    the results myself.
> 
>    Right now, if you just call
> 
>    java -jar vlastezba.jar test.txt
> 
>    with some Lojban text (legal or otherwise) in test.txt, it will return (on
>    stdout), one valsi per line. So "coirodo" would result in:
>    coi
>    ro
>    do
>    (you can make it go look up the definitions by passing a second parameter,
>    but it will just add junk to the output that I don't think you'd want)
> 
>    Right now it doesn't check grammar at all, so you can throw any random
>    collection of words at it (I don't intend for it to ever do this, there
>    are tools out there that are far better at this than I could ever hope to
>    make it).
> 
>    It also won't give you a classification of valsi - it doesn't "know" when
>    it's dealing with a cmavo (or indeed what class), or a gismu, or a lujvo.
>    This I DO intend to fix.
> 
>    I want to add other output formats anyway, so if you want me to do
>    something specific to make your comparison easier, let me know. Now would
>    be a good time, as I'm going away on holiday for a week, and wanted to
>    spend at least a little bit of time on vlastezba.
> 
>    In fact, if you are comfortable with Java, feel free to make it do what
>    you need, the source code is on [1]sourceforge.net
>    ([2]http://sourceforge.net/projects/vlastezba/), and is GPL'ed :-)
> 
>    mu'o mi'e iu'an
> 
>    On Wed, Apr 20, 2011 at 4:29 PM, .alyn.post.
>    <[3]alyn.post@lodockikumazvati.org> wrote:
> 
>      Do you have an external representation for your valsi parsing
>      result? If I hand you the string "coirodo" is there a print
>      form of that along the lines of ("coi" "ro" "do")?
> 
>      I would be interested seeing the result from processing a large
>      data set of words and phrases and comparing that to jbogenturfa'i.
>      In order to do this I'd need some output format from your program
>      that I could parse.
> 
>      jbogenturfa'i uses the morphology PEG grammar that xorxes developed,
>      so it contains code which I think is similar (and should be
>      identical in result) to what you are doing:
> 
>      $ echo "coirodo"|jbogenturfahi --rafske
>      ((cmavo (COI "coi")) (cmavo (PA "ro")) (cmavo (KOhA "do")))
> 
>      I'd be curious to know whether they are in fact producing identical
>      results.
> 
>      -Alan
>      On Wed, Apr 20, 2011 at 11:02:28AM +0200, Johan Pretorius wrote:
>      > Hi all
>      >
>      > You can download it from here:
>      >
>      [1][4]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>      >
>      > I have completed the cmavo cluster breakout code, and tested it as far
>      as
>      > I was able.
>      >
>      > It should be easy enough to run if you have Java 1.6 installed, just
>      go
>      > java -jar vlastezba.jar and it will print out usage instructions.
>      >
>      > Please download it and test to pieces! I'd love all your feedback.
>      >
>      > Not that it doesn't get very smart at this stage - for instance, it
>      won't
>      > know what to do if you feed it a string of lojban that doesn't have
>      any
>      > spaces in. The only clever bit is that it's able to break apart cmavo
>      > clusters if they don't have any spaces.
>      >
>      > Regards,
>      > Johan
>      >
>      > --
>      > Johan Pretorius
>      > Cell: 0829268327
>      > [2][5]pretoriusjf@gmail.com
>      >
>      > --
>      > You received this message because you are subscribed to the Google
>      Groups
>      > "Lojban Beginners" group.
>      > To post to this group, send email to
>      [6]lojban-beginners@googlegroups.com.
>      > To unsubscribe from this group, send email to
>      > [7]lojban-beginners+unsubscribe@googlegroups.com.
>      > For more options, visit this group at
>      > [8]http://groups.google.com/group/lojban-beginners?hl=en.
>      >
>      > References
>      >
>      > Visible links
>      > 1.
>      [9]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>      > 2. mailto:[10]pretoriusjf@gmail.com
> 
>      --
>      .i ma'a lo bradi ku penmi gi'e du
>      --
>      You received this message because you are subscribed to the Google
>      Groups "Lojban Beginners" group.
>      To post to this group, send email to
>      [11]lojban-beginners@googlegroups.com.
>      To unsubscribe from this group, send email to
>      [12]lojban-beginners+unsubscribe@googlegroups.com.
>      For more options, visit this group at
>      [13]http://groups.google.com/group/lojban-beginners?hl=en.
> 
>    --
>    Johan Pretorius
>    Cell: 0829268327
>    [14]pretoriusjf@gmail.com
> 
>    --
>    You received this message because you are subscribed to the Google Groups
>    "Lojban Beginners" group.
>    To post to this group, send email to lojban-beginners@googlegroups.com.
>    To unsubscribe from this group, send email to
>    lojban-beginners+unsubscribe@googlegroups.com.
>    For more options, visit this group at
>    http://groups.google.com/group/lojban-beginners?hl=en.
> 
> References
> 
>    Visible links
>    1. http://sourceforge.net/
>    2. http://sourceforge.net/projects/vlastezba/
>    3. mailto:alyn.post@lodockikumazvati.org
>    4. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>    5. mailto:pretoriusjf@gmail.com
>    6. mailto:lojban-beginners@googlegroups.com
>    7. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com
>    8. http://groups.google.com/group/lojban-beginners?hl=en
>    9. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download
>   10. mailto:pretoriusjf@gmail.com
>   11. mailto:lojban-beginners@googlegroups.com
>   12. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com
>   13. http://groups.google.com/group/lojban-beginners?hl=en
>   14. mailto:pretoriusjf@gmail.com

-- 
.i ma'a lo bradi ku penmi gi'e du

-- 
You received this message because you are subscribed to the Google Groups "Lojban Beginners" group.
To post to this group, send email to lojban-beginners@googlegroups.com.
To unsubscribe from this group, send email to lojban-beginners+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.