Received: from mail-gx0-f189.google.com ([209.85.161.189]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1QCZYN-0006BT-IO; Wed, 20 Apr 2011 08:43:00 -0700 Received: by gxk3 with SMTP id 3sf1951688gxk.16 for ; Wed, 20 Apr 2011 08:42:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:x-beenthere:received-spf:date:from:to:subject :message-id:mail-followup-to:references:mime-version:in-reply-to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-google-group-id:list-post :list-help:list-archive:sender:list-subscribe:list-unsubscribe :content-type:content-disposition; bh=nakwj3b6dOXZhxRSTViRTu7jUbiHLUM8V5k8kNvuNME=; b=lB/BYBRNdFvMICPTqEu65trXoj3Lezdafl/yQsrJHuvBxG568/sp8eRiTXpp9u+4Lx /ZY0GFGRTUbv/tUf5OaqHGkVTNfkBbXd5wdbb0X7fE6OHbkEmh3zcubO21YiyyMXQqeu JN2UmYAxLuTE7aCOxNjKW55SFfMW/3D8zx7Fg= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:date:from:to:subject:message-id :mail-followup-to:references:mime-version:in-reply-to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-google-group-id:list-post :list-help:list-archive:sender:list-subscribe:list-unsubscribe :content-type:content-disposition; b=dAvErEEUxqzAgxObNMYsSV7K/of7QfSu3DmuP7qZ1Q0jMz/VsEjC/UVPD6bfxvWoS8 t8f2zMm3ISfDLN2v+CelvDg3Xd2SyI9N3OFR179vPRvFLBRinTcSdVW0TMsXhGbEqJXm idzLbuaw8QLdsqW0UmWB93YlQ/+THiN9L7rJ4= Received: by 10.151.76.22 with SMTP id d22mr988710ybl.36.1303314164982; Wed, 20 Apr 2011 08:42:44 -0700 (PDT) X-BeenThere: lojban-beginners@googlegroups.com Received: by 10.150.228.9 with SMTP id a9ls42757ybh.6.gmail; Wed, 20 Apr 2011 08:42:43 -0700 (PDT) Received: by 10.236.103.140 with SMTP id f12mr1247920yhg.78.1303314163120; Wed, 20 Apr 2011 08:42:43 -0700 (PDT) Received: by 10.236.103.140 with SMTP id f12mr1247919yhg.78.1303314163082; Wed, 20 Apr 2011 08:42:43 -0700 (PDT) Received: from mail-gx0-f180.google.com (mail-gx0-f180.google.com [209.85.161.180]) by gmr-mx.google.com with ESMTPS id l18si415274ybn.2.2011.04.20.08.42.41 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 20 Apr 2011 08:42:41 -0700 (PDT) Received-SPF: neutral (google.com: 209.85.161.180 is neither permitted nor denied by best guess record for domain of alanpost@sunflowerriver.org) client-ip=209.85.161.180; Received: by gxk10 with SMTP id 10so409657gxk.11 for ; Wed, 20 Apr 2011 08:42:41 -0700 (PDT) Received: by 10.100.104.1 with SMTP id b1mr1116788anc.163.1303314161594; Wed, 20 Apr 2011 08:42:41 -0700 (PDT) Received: from sunflowerriver.org (173-10-243-253-Albuquerque.hfc.comcastbusiness.net [173.10.243.253]) by mx.google.com with ESMTPS id c4sm980446ana.23.2011.04.20.08.42.39 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 20 Apr 2011 08:42:40 -0700 (PDT) Date: Wed, 20 Apr 2011 09:42:36 -0600 From: ".alyn.post." To: lojban-beginners@googlegroups.com Subject: Re: [lojban-beginners] vlastezba: First beta version released! Message-ID: <20110420154236.GE49678@alice.local> Mail-Followup-To: lojban-beginners@googlegroups.com References: <20110420142911.GB49678@alice.local> <20110420151214.GC49678@alice.local> Mime-Version: 1.0 In-Reply-To: X-Original-Sender: alyn.post@lodockikumazvati.org X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: 209.85.161.180 is neither permitted nor denied by best guess record for domain of alanpost@sunflowerriver.org) smtp.mail=alanpost@sunflowerriver.org Reply-To: lojban-beginners@googlegroups.com Precedence: list Mailing-list: list lojban-beginners@googlegroups.com; contact lojban-beginners+owners@googlegroups.com List-ID: X-Google-Group-Id: 300742228892 List-Post: , List-Help: , List-Archive: Sender: lojban-beginners@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 Content-Disposition: inline Content-Length: 12568 I had not considered the use case of a large story, I was thinking of individual test strings and my need to know how input was paired with output. Particularly, I didn't want erroneous input in one test case to cause another input to parse incorrectly. I can (and should, really) work around this by calling the program multiple times. BTW, what result does your program produce for: ba'e ba'er ba'ercatra That should be something like: ((cmavo (BAhE "ba'e")) (cmene "ba'er") (lujvo "ba'ercatra")) With different results being produced depending on whether the spaces are there or not. I'm curious if you're handling that correctly. -Alan On Wed, Apr 20, 2011 at 05:24:31PM +0200, Johan Pretorius wrote: > Okay, the licensing is fixed now. > > Alan, The fact that you know that's a problem puts you in 1% of the > population :-) > > Anyway, I'm not diametrically opposed to XML just for the sake of being > opposed... it's worth looking at, especially, as you say, for > interoperability. > > Do you think it's necessary to include the input string? I foresee > vlastezba being used for large bodies of text, anyway that's how I intend > to use it for myself: I feed it the terry the tiger story and let it build > me something I can print out, which means my sucky vocabulary does not > stop me reading the story, albeit slowly. > > Maybe it's a good idea to make that configurable. > > -Johan > > On Wed, Apr 20, 2011 at 5:12 PM, .alyn.post. > <[1]alyn.post@lodockikumazvati.org> wrote: > > I can more-or-less work with the what it does now, so that is > sufficient experimentation. > > I routinely write code like |if(var=="foo")| when I mean > |if(var.equals("foo"))|, my Java isn't what it could be. > > I'm able to parse XML for tree-structured data, which is probably > the easiest choice for interoperability: > > XML: > > > coi ro do > > coi > ro > do > > > > If this makes you cringe, then how about: > > csv: > > klesi,valsi > COI,coi > PA,ro > KOhA,do > > Which unfortunately doesn't include the input string; I don't see a > simple way to do that that is normal (as in normal form). > > -Alan > On Wed, Apr 20, 2011 at 04:51:51PM +0200, Johan Pretorius wrote: > > Hi Alan, > > > > That would indeed be an interesting experiment, I'd be quite keen to > see > > the results myself. > > > > Right now, if you just call > > > > java -jar vlastezba.jar test.txt > > > > with some Lojban text (legal or otherwise) in test.txt, it will return > (on > > stdout), one valsi per line. So "coirodo" would result in: > > coi > > ro > > do > > (you can make it go look up the definitions by passing a second > parameter, > > but it will just add junk to the output that I don't think you'd want) > > > > Right now it doesn't check grammar at all, so you can throw any random > > collection of words at it (I don't intend for it to ever do this, > there > > are tools out there that are far better at this than I could ever hope > to > > make it). > > > > It also won't give you a classification of valsi - it doesn't "know" > when > > it's dealing with a cmavo (or indeed what class), or a gismu, or a > lujvo. > > This I DO intend to fix. > > > > I want to add other output formats anyway, so if you want me to do > > something specific to make your comparison easier, let me know. Now > would > > be a good time, as I'm going away on holiday for a week, and wanted to > > spend at least a little bit of time on vlastezba. > > > > In fact, if you are comfortable with Java, feel free to make it do > what > > you need, the source code is on [1][2]sourceforge.net > > ([2][3]http://sourceforge.net/projects/vlastezba/), and is GPL'ed :-) > > > > mu'o mi'e iu'an > > > > On Wed, Apr 20, 2011 at 4:29 PM, .alyn.post. > > <[3][4]alyn.post@lodockikumazvati.org> wrote: > > > > Do you have an external representation for your valsi parsing > > result? If I hand you the string "coirodo" is there a print > > form of that along the lines of ("coi" "ro" "do")? > > > > I would be interested seeing the result from processing a large > > data set of words and phrases and comparing that to jbogenturfa'i. > > In order to do this I'd need some output format from your program > > that I could parse. > > > > jbogenturfa'i uses the morphology PEG grammar that xorxes developed, > > so it contains code which I think is similar (and should be > > identical in result) to what you are doing: > > > > $ echo "coirodo"|jbogenturfahi --rafske > > ((cmavo (COI "coi")) (cmavo (PA "ro")) (cmavo (KOhA "do"))) > > > > I'd be curious to know whether they are in fact producing identical > > results. > > > > -Alan > > On Wed, Apr 20, 2011 at 11:02:28AM +0200, Johan Pretorius wrote: > > > Hi all > > > > > > You can download it from here: > > > > > > [1][4][5]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > > > > > > I have completed the cmavo cluster breakout code, and tested it as > far > > as > > > I was able. > > > > > > It should be easy enough to run if you have Java 1.6 installed, just > > go > > > java -jar vlastezba.jar and it will print out usage instructions. > > > > > > Please download it and test to pieces! I'd love all your feedback. > > > > > > Not that it doesn't get very smart at this stage - for instance, it > > won't > > > know what to do if you feed it a string of lojban that doesn't have > > any > > > spaces in. The only clever bit is that it's able to break apart > cmavo > > > clusters if they don't have any spaces. > > > > > > Regards, > > > Johan > > > > > > -- > > > Johan Pretorius > > > Cell: 0829268327 > > > [2][5][6]pretoriusjf@gmail.com > > > > > > -- > > > You received this message because you are subscribed to the Google > > Groups > > > "Lojban Beginners" group. > > > To post to this group, send email to > > [6][7]lojban-beginners@googlegroups.com. > > > To unsubscribe from this group, send email to > > > [7][8]lojban-beginners+unsubscribe@googlegroups.com. > > > For more options, visit this group at > > > [8][9]http://groups.google.com/group/lojban-beginners?hl=en. > > > > > > References > > > > > > Visible links > > > 1. > > > [9][10]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > > > 2. mailto:[10][11]pretoriusjf@gmail.com > > > > -- > > .i ma'a lo bradi ku penmi gi'e du > > -- > > You received this message because you are subscribed to the Google > > Groups "Lojban Beginners" group. > > To post to this group, send email to > > [11][12]lojban-beginners@googlegroups.com. > > To unsubscribe from this group, send email to > > [12][13]lojban-beginners+unsubscribe@googlegroups.com. > > For more options, visit this group at > > [13][14]http://groups.google.com/group/lojban-beginners?hl=en. > > > > -- > > Johan Pretorius > > Cell: 0829268327 > > [14][15]pretoriusjf@gmail.com > > > > -- > > You received this message because you are subscribed to the Google > Groups > > "Lojban Beginners" group. > > To post to this group, send email to > [16]lojban-beginners@googlegroups.com. > > To unsubscribe from this group, send email to > > [17]lojban-beginners+unsubscribe@googlegroups.com. > > For more options, visit this group at > > [18]http://groups.google.com/group/lojban-beginners?hl=en. > > > > References > > > > Visible links > > 1. [19]http://sourceforge.net/ > > 2. [20]http://sourceforge.net/projects/vlastezba/ > > 3. mailto:[21]alyn.post@lodockikumazvati.org > > 4. > [22]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > > 5. mailto:[23]pretoriusjf@gmail.com > > 6. mailto:[24]lojban-beginners@googlegroups.com > > 7. mailto:[25]lojban-beginners%2Bunsubscribe@googlegroups.com > > 8. [26]http://groups.google.com/group/lojban-beginners?hl=en > > 9. > [27]http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > > 10. mailto:[28]pretoriusjf@gmail.com > > 11. mailto:[29]lojban-beginners@googlegroups.com > > 12. mailto:[30]lojban-beginners%2Bunsubscribe@googlegroups.com > > 13. [31]http://groups.google.com/group/lojban-beginners?hl=en > > 14. mailto:[32]pretoriusjf@gmail.com > -- > .i ma'a lo bradi ku penmi gi'e du > > -- > You received this message because you are subscribed to the Google > Groups "Lojban Beginners" group. > To post to this group, send email to > [33]lojban-beginners@googlegroups.com. > To unsubscribe from this group, send email to > [34]lojban-beginners+unsubscribe@googlegroups.com. > For more options, visit this group at > [35]http://groups.google.com/group/lojban-beginners?hl=en. > > -- > Johan Pretorius > Cell: 0829268327 > [36]pretoriusjf@gmail.com > > -- > You received this message because you are subscribed to the Google Groups > "Lojban Beginners" group. > To post to this group, send email to lojban-beginners@googlegroups.com. > To unsubscribe from this group, send email to > lojban-beginners+unsubscribe@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/lojban-beginners?hl=en. > > References > > Visible links > 1. mailto:alyn.post@lodockikumazvati.org > 2. http://sourceforge.net/ > 3. http://sourceforge.net/projects/vlastezba/ > 4. mailto:alyn.post@lodockikumazvati.org > 5. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > 6. mailto:pretoriusjf@gmail.com > 7. mailto:lojban-beginners@googlegroups.com > 8. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com > 9. http://groups.google.com/group/lojban-beginners?hl=en > 10. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > 11. mailto:pretoriusjf@gmail.com > 12. mailto:lojban-beginners@googlegroups.com > 13. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com > 14. http://groups.google.com/group/lojban-beginners?hl=en > 15. mailto:pretoriusjf@gmail.com > 16. mailto:lojban-beginners@googlegroups.com > 17. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com > 18. http://groups.google.com/group/lojban-beginners?hl=en > 19. http://sourceforge.net/ > 20. http://sourceforge.net/projects/vlastezba/ > 21. mailto:alyn.post@lodockikumazvati.org > 22. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > 23. mailto:pretoriusjf@gmail.com > 24. mailto:lojban-beginners@googlegroups.com > 25. mailto:lojban-beginners%252Bunsubscribe@googlegroups.com > 26. http://groups.google.com/group/lojban-beginners?hl=en > 27. http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download > 28. mailto:pretoriusjf@gmail.com > 29. mailto:lojban-beginners@googlegroups.com > 30. mailto:lojban-beginners%252Bunsubscribe@googlegroups.com > 31. http://groups.google.com/group/lojban-beginners?hl=en > 32. mailto:pretoriusjf@gmail.com > 33. mailto:lojban-beginners@googlegroups.com > 34. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com > 35. http://groups.google.com/group/lojban-beginners?hl=en > 36. mailto:pretoriusjf@gmail.com -- .i ma'a lo bradi ku penmi gi'e du -- You received this message because you are subscribed to the Google Groups "Lojban Beginners" group. To post to this group, send email to lojban-beginners@googlegroups.com. To unsubscribe from this group, send email to lojban-beginners+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.