From lojban-beginners+bncCMPA7-ejFhDZ-LvtBBoEZQMM0w@googlegroups.com Wed Apr 20 08:31:52 2011 Received: from mail-gy0-f189.google.com ([209.85.160.189]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1QCZNc-0002GY-Hf; Wed, 20 Apr 2011 08:31:52 -0700 Received: by gyf1 with SMTP id 1sf1909157gyf.16 for ; Wed, 20 Apr 2011 08:31:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:x-beenthere:received-spf:mime-version :in-reply-to:references:date:message-id:subject:from:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-google-group-id:list-post :list-help:list-archive:sender:list-subscribe:list-unsubscribe :content-type; bh=sF8w1wJLjFowP2TDLa3A5kxGIrM07vI6nVqqfk0+F8A=; b=qF9Xbg8paL4ivTAq4pMqzeuLGj10afmC6y8jB5nAtmMDMXqDp8VlLApvoVmWejul5/ /Hvie/xRWjLPaCVsgP1M9fyYwV3XJYIGdJ+3AlY4b8c68cM8bdJdLoQEnHyZNN0IF+bk VIhWFUfo+5y+m9b8Oq3e52ih5d/a5xijdYszQ= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:mime-version:in-reply-to:references:date :message-id:subject:from:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type; b=hiy62pK1T18q31Phqg8517SbF69MQAL8vOLwSl7lSqTHexz5wE5J+XZ9mo4PzwzYo7 AaulCPJdQfDwpY6g3zfnN7aKuqnJ3Z2st/z8sHFwBkRCG47g8Lq/t+fizQQ6yxlJ9r+P RfYMbIBmNLYdpZmkyFVx9aDMtWpQ5xbMj0x5c= Received: by 10.101.206.33 with SMTP id i33mr751985anq.35.1303313497750; Wed, 20 Apr 2011 08:31:37 -0700 (PDT) X-BeenThere: lojban-beginners@googlegroups.com Received: by 10.231.8.234 with SMTP id i42ls1531152ibi.3.gmail; Wed, 20 Apr 2011 08:31:36 -0700 (PDT) Received: by 10.42.99.78 with SMTP id v14mr2406905icn.64.1303313496932; Wed, 20 Apr 2011 08:31:36 -0700 (PDT) Received: by 10.42.99.78 with SMTP id v14mr2406904icn.64.1303313496920; Wed, 20 Apr 2011 08:31:36 -0700 (PDT) Received: from mail-iy0-f176.google.com (mail-iy0-f176.google.com [209.85.210.176]) by gmr-mx.google.com with ESMTPS id xk8si144703icb.5.2011.04.20.08.31.35 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 20 Apr 2011 08:31:35 -0700 (PDT) Received-SPF: pass (google.com: domain of teapot.philosopher@googlemail.com designates 209.85.210.176 as permitted sender) client-ip=209.85.210.176; Received: by iym1 with SMTP id 1so1041512iym.21 for ; Wed, 20 Apr 2011 08:31:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.42.141.9 with SMTP id m9mr1760474icu.61.1303313495504; Wed, 20 Apr 2011 08:31:35 -0700 (PDT) Received: by 10.42.213.74 with HTTP; Wed, 20 Apr 2011 08:31:35 -0700 (PDT) In-Reply-To: References: <20110420142911.GB49678@alice.local> <20110420151214.GC49678@alice.local> Date: Wed, 20 Apr 2011 16:31:35 +0100 Message-ID: Subject: Re: [lojban-beginners] vlastezba: First beta version released! From: Brian Shannon To: lojban-beginners@googlegroups.com X-Original-Sender: teapot.philosopher@googlemail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of teapot.philosopher@googlemail.com designates 209.85.210.176 as permitted sender) smtp.mail=teapot.philosopher@googlemail.com; dkim=pass (test mode) header.i=@googlemail.com Reply-To: lojban-beginners@googlegroups.com Precedence: list Mailing-list: list lojban-beginners@googlegroups.com; contact lojban-beginners+owners@googlegroups.com List-ID: X-Google-Group-Id: 300742228892 List-Post: , List-Help: , List-Archive: Sender: lojban-beginners@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 On 20/04/2011, Johan Pretorius wrote: > Okay, the licensing is fixed now. > > Alan, The fact that you know that's a problem puts you in 1% of the > population :-) > > Anyway, I'm not diametrically opposed to XML just for the sake of being > opposed... it's worth looking at, especially, as you say, for > interoperability. > > Do you think it's necessary to include the input string? I foresee > vlastezba being used for large bodies of text, anyway that's how I intend to > use it for myself: I feed it the terry the tiger story and let it build me > something I can print out, which means my sucky vocabulary does not stop me > reading the story, albeit slowly. > > Maybe it's a good idea to make that configurable. > > -Johan > > > On Wed, Apr 20, 2011 at 5:12 PM, .alyn.post. > wrote: > >> I can more-or-less work with the what it does now, so that is >> sufficient experimentation. >> >> I routinely write code like |if(var=="foo")| when I mean >> |if(var.equals("foo"))|, my Java isn't what it could be. >> >> I'm able to parse XML for tree-structured data, which is probably >> the easiest choice for interoperability: >> >> XML: >> >> >> coi ro do >> >> coi >> ro >> do >> >> >> >> If this makes you cringe, then how about: >> >> csv: >> >> klesi,valsi >> COI,coi >> PA,ro >> KOhA,do >> >> Which unfortunately doesn't include the input string; I don't see a >> simple way to do that that is normal (as in normal form). >> >> -Alan >> >> On Wed, Apr 20, 2011 at 04:51:51PM +0200, Johan Pretorius wrote: >> > Hi Alan, >> > >> > That would indeed be an interesting experiment, I'd be quite keen to >> see >> > the results myself. >> > >> > Right now, if you just call >> > >> > java -jar vlastezba.jar test.txt >> > >> > with some Lojban text (legal or otherwise) in test.txt, it will >> > return >> (on >> > stdout), one valsi per line. So "coirodo" would result in: >> > coi >> > ro >> > do >> > (you can make it go look up the definitions by passing a second >> parameter, >> > but it will just add junk to the output that I don't think you'd >> > want) >> > >> > Right now it doesn't check grammar at all, so you can throw any >> > random >> > collection of words at it (I don't intend for it to ever do this, >> there >> > are tools out there that are far better at this than I could ever >> > hope >> to >> > make it). >> > >> > It also won't give you a classification of valsi - it doesn't "know" >> when >> > it's dealing with a cmavo (or indeed what class), or a gismu, or a >> lujvo. >> > This I DO intend to fix. >> > >> > I want to add other output formats anyway, so if you want me to do >> > something specific to make your comparison easier, let me know. Now >> would >> > be a good time, as I'm going away on holiday for a week, and wanted >> > to >> > spend at least a little bit of time on vlastezba. >> > >> > In fact, if you are comfortable with Java, feel free to make it do >> what >> > you need, the source code is on [1]sourceforge.net >> > ([2]http://sourceforge.net/projects/vlastezba/), and is GPL'ed :-) >> > >> > mu'o mi'e iu'an >> > >> > On Wed, Apr 20, 2011 at 4:29 PM, .alyn.post. >> > <[3]alyn.post@lodockikumazvati.org> wrote: >> > >> > Do you have an external representation for your valsi parsing >> > result? If I hand you the string "coirodo" is there a print >> > form of that along the lines of ("coi" "ro" "do")? >> > >> > I would be interested seeing the result from processing a large >> > data set of words and phrases and comparing that to jbogenturfa'i. >> > In order to do this I'd need some output format from your program >> > that I could parse. >> > >> > jbogenturfa'i uses the morphology PEG grammar that xorxes >> > developed, >> > so it contains code which I think is similar (and should be >> > identical in result) to what you are doing: >> > >> > $ echo "coirodo"|jbogenturfahi --rafske >> > ((cmavo (COI "coi")) (cmavo (PA "ro")) (cmavo (KOhA "do"))) >> > >> > I'd be curious to know whether they are in fact producing identical >> > results. >> > >> > -Alan >> > On Wed, Apr 20, 2011 at 11:02:28AM +0200, Johan Pretorius wrote: >> > > Hi all >> > > >> > > You can download it from here: >> > > >> > [1][4] >> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download >> > > >> > > I have completed the cmavo cluster breakout code, and tested it >> > as >> far >> > as >> > > I was able. >> > > >> > > It should be easy enough to run if you have Java 1.6 installed, >> just >> > go >> > > java -jar vlastezba.jar and it will print out usage instructions. >> > > >> > > Please download it and test to pieces! I'd love all your >> > feedback. >> > > >> > > Not that it doesn't get very smart at this stage - for instance, >> it >> > won't >> > > know what to do if you feed it a string of lojban that doesn't >> have >> > any >> > > spaces in. The only clever bit is that it's able to break apart >> cmavo >> > > clusters if they don't have any spaces. >> > > >> > > Regards, >> > > Johan >> > > >> > > -- >> > > Johan Pretorius >> > > Cell: 0829268327 >> > > [2][5]pretoriusjf@gmail.com >> > > >> > > -- >> > > You received this message because you are subscribed to the >> > Google >> > Groups >> > > "Lojban Beginners" group. >> > > To post to this group, send email to >> > [6]lojban-beginners@googlegroups.com. >> > > To unsubscribe from this group, send email to >> > > [7]lojban-beginners+unsubscribe@googlegroups.com. >> > > For more options, visit this group at >> > > [8]http://groups.google.com/group/lojban-beginners?hl=en. >> > > >> > > References >> > > >> > > Visible links >> > > 1. >> > [9] >> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download >> > > 2. mailto:[10]pretoriusjf@gmail.com >> > >> > -- >> > .i ma'a lo bradi ku penmi gi'e du >> > -- >> > You received this message because you are subscribed to the Google >> > Groups "Lojban Beginners" group. >> > To post to this group, send email to >> > [11]lojban-beginners@googlegroups.com. >> > To unsubscribe from this group, send email to >> > [12]lojban-beginners+unsubscribe@googlegroups.com. >> > For more options, visit this group at >> > [13]http://groups.google.com/group/lojban-beginners?hl=en. >> > >> > -- >> > Johan Pretorius >> > Cell: 0829268327 >> > [14]pretoriusjf@gmail.com >> > >> > -- >> > You received this message because you are subscribed to the Google >> Groups >> > "Lojban Beginners" group. >> > To post to this group, send email to >> lojban-beginners@googlegroups.com. >> > To unsubscribe from this group, send email to >> > lojban-beginners+unsubscribe@googlegroups.com. >> > For more options, visit this group at >> > http://groups.google.com/group/lojban-beginners?hl=en. >> > >> > References >> > >> > Visible links >> > 1. http://sourceforge.net/ >> > 2. http://sourceforge.net/projects/vlastezba/ >> > 3. mailto:alyn.post@lodockikumazvati.org >> > 4. >> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download >> > 5. mailto:pretoriusjf@gmail.com >> > 6. mailto:lojban-beginners@googlegroups.com >> > 7. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com >> > 8. http://groups.google.com/group/lojban-beginners?hl=en >> > 9. >> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download >> > 10. mailto:pretoriusjf@gmail.com >> > 11. mailto:lojban-beginners@googlegroups.com >> > 12. mailto:lojban-beginners%2Bunsubscribe@googlegroups.com >> > 13. http://groups.google.com/group/lojban-beginners?hl=en >> > 14. mailto:pretoriusjf@gmail.com >> >> -- >> .i ma'a lo bradi ku penmi gi'e du >> >> -- >> You received this message because you are subscribed to the Google Groups >> "Lojban Beginners" group. >> To post to this group, send email to lojban-beginners@googlegroups.com. >> To unsubscribe from this group, send email to >> lojban-beginners+unsubscribe@googlegroups.com. >> For more options, visit this group at >> http://groups.google.com/group/lojban-beginners?hl=en. >> >> > > > -- > Johan Pretorius > Cell: 0829268327 > pretoriusjf@gmail.com > > -- > You received this message because you are subscribed to the Google Groups > "Lojban Beginners" group. > To post to this group, send email to lojban-beginners@googlegroups.com. > To unsubscribe from this group, send email to > lojban-beginners+unsubscribe@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/lojban-beginners?hl=en. > > -- You received this message because you are subscribed to the Google Groups "Lojban Beginners" group. To post to this group, send email to lojban-beginners@googlegroups.com. To unsubscribe from this group, send email to lojban-beginners+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.