Received: from mail-gw0-f61.google.com ([74.125.83.61]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1QCYvZ-0005mp-W3; Wed, 20 Apr 2011 08:02:56 -0700 Received: by gwb11 with SMTP id 11sf1846616gwb.16 for ; Wed, 20 Apr 2011 08:02:43 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:x-beenthere:received-spf:mime-version :in-reply-to:references:date:message-id:subject:from:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-google-group-id:list-post :list-help:list-archive:sender:list-subscribe:list-unsubscribe :content-type; bh=Ayiz36C9KapnHPPCGnEyqRMT+G5i32TON/o/a90nL+E=; b=sv2tWUKqbFmr7JPDEuTnyHR+7OEgMZ4ftFqSyQpFOd2tpKqgsipxG9jPkhRtTKgpkN I9HbDkPaIiGE8w68vnrW7iiuzIrvsKMxwfu1KaQneLYcDHlPWJYKp/lkV8IlKKvyW14c f3gI2l59E7NbUSs5Ptl3oNwmm8wdVJsxobzH4= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:mime-version:in-reply-to:references:date :message-id:subject:from:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type; b=BU0wXZ8SCaP13ubNlJ0F+reNxp6p2lsZnfRi9ezDNizIBlQcAHATRuHwjM4wnxmPcz tfnLO5eZgwtzIY4yETMmygsKfZaqgMVWd4z298YAeXUUgweTeTfOh/ZC7aUOAXAaMfQo AvT5KrwsVQng2i2zZ7q/7+h90YgMrUV69NgtQ= Received: by 10.236.189.67 with SMTP id b43mr590074yhn.19.1303311758834; Wed, 20 Apr 2011 08:02:38 -0700 (PDT) X-BeenThere: lojban-beginners@googlegroups.com Received: by 10.231.8.234 with SMTP id i42ls1513248ibi.3.gmail; Wed, 20 Apr 2011 08:02:37 -0700 (PDT) Received: by 10.231.185.153 with SMTP id co25mr3589131ibb.10.1303311757909; Wed, 20 Apr 2011 08:02:37 -0700 (PDT) Received: by 10.231.185.153 with SMTP id co25mr3589130ibb.10.1303311757873; Wed, 20 Apr 2011 08:02:37 -0700 (PDT) Received: from mail-iw0-f175.google.com (mail-iw0-f175.google.com [209.85.214.175]) by gmr-mx.google.com with ESMTPS id r31si238961ibu.6.2011.04.20.08.02.36 (version=TLSv1/SSLv3 cipher=OTHER); Wed, 20 Apr 2011 08:02:36 -0700 (PDT) Received-SPF: pass (google.com: domain of teapot.philosopher@googlemail.com designates 209.85.214.175 as permitted sender) client-ip=209.85.214.175; Received: by mail-iw0-f175.google.com with SMTP id 10so802844iwn.20 for ; Wed, 20 Apr 2011 08:02:36 -0700 (PDT) MIME-Version: 1.0 Received: by 10.42.19.194 with SMTP id d2mr9414880icb.463.1303311756483; Wed, 20 Apr 2011 08:02:36 -0700 (PDT) Received: by 10.42.213.74 with HTTP; Wed, 20 Apr 2011 08:02:36 -0700 (PDT) In-Reply-To: References: <20110420142911.GB49678@alice.local> Date: Wed, 20 Apr 2011 16:02:36 +0100 Message-ID: Subject: Re: [lojban-beginners] vlastezba: First beta version released! From: Brian Shannon To: lojban-beginners@googlegroups.com X-Original-Sender: teapot.philosopher@googlemail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of teapot.philosopher@googlemail.com designates 209.85.214.175 as permitted sender) smtp.mail=teapot.philosopher@googlemail.com; dkim=pass (test mode) header.i=@googlemail.com Reply-To: lojban-beginners@googlegroups.com Precedence: list Mailing-list: list lojban-beginners@googlegroups.com; contact lojban-beginners+owners@googlegroups.com List-ID: X-Google-Group-Id: 300742228892 List-Post: , List-Help: , List-Archive: Sender: lojban-beginners@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 Content-Length: 5521 "This software may not be copied or distributed in any form without the written permission of Postilion." This is *not* GPL'ed. Assuming you are the sole copyright holder, follow the GNU guide to license your software under the GPL. http://www.gnu.org/licenses/gpl-howto.html On 20/04/2011, Johan Pretorius wrote: > Hi Alan, > > That would indeed be an interesting experiment, I'd be quite keen to see the > results myself. > > Right now, if you just call > > java -jar vlastezba.jar test.txt > > with some Lojban text (legal or otherwise) in test.txt, it will return (on > stdout), one valsi per line. So "coirodo" would result in: > coi > ro > do > (you can make it go look up the definitions by passing a second parameter, > but it will just add junk to the output that I don't think you'd want) > > Right now it doesn't check grammar at all, so you can throw any random > collection of words at it (I don't intend for it to ever do this, there are > tools out there that are far better at this than I could ever hope to make > it). > > It also won't give you a classification of valsi - it doesn't "know" when > it's dealing with a cmavo (or indeed what class), or a gismu, or a lujvo. > This I DO intend to fix. > > I want to add other output formats anyway, so if you want me to do something > specific to make your comparison easier, let me know. Now would be a good > time, as I'm going away on holiday for a week, and wanted to spend at least > a little bit of time on vlastezba. > > In fact, if you are comfortable with Java, feel free to make it do what you > need, the source code is on sourceforge.net ( > http://sourceforge.net/projects/vlastezba/), and is GPL'ed :-) > > mu'o mi'e iu'an > > > > On Wed, Apr 20, 2011 at 4:29 PM, .alyn.post. > wrote: > >> Do you have an external representation for your valsi parsing >> result? If I hand you the string "coirodo" is there a print >> form of that along the lines of ("coi" "ro" "do")? >> >> I would be interested seeing the result from processing a large >> data set of words and phrases and comparing that to jbogenturfa'i. >> In order to do this I'd need some output format from your program >> that I could parse. >> >> jbogenturfa'i uses the morphology PEG grammar that xorxes developed, >> so it contains code which I think is similar (and should be >> identical in result) to what you are doing: >> >> $ echo "coirodo"|jbogenturfahi --rafske >> ((cmavo (COI "coi")) (cmavo (PA "ro")) (cmavo (KOhA "do"))) >> >> I'd be curious to know whether they are in fact producing identical >> results. >> >> -Alan >> >> On Wed, Apr 20, 2011 at 11:02:28AM +0200, Johan Pretorius wrote: >> > Hi all >> > >> > You can download it from here: >> > [1] >> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download >> > >> > I have completed the cmavo cluster breakout code, and tested it as >> > far >> as >> > I was able. >> > >> > It should be easy enough to run if you have Java 1.6 installed, just >> go >> > java -jar vlastezba.jar and it will print out usage instructions. >> > >> > Please download it and test to pieces! I'd love all your feedback. >> > >> > Not that it doesn't get very smart at this stage - for instance, it >> won't >> > know what to do if you feed it a string of lojban that doesn't have >> any >> > spaces in. The only clever bit is that it's able to break apart cmavo >> > clusters if they don't have any spaces. >> > >> > Regards, >> > Johan >> > >> > -- >> > Johan Pretorius >> > Cell: 0829268327 >> > [2]pretoriusjf@gmail.com >> > >> > -- >> > You received this message because you are subscribed to the Google >> Groups >> > "Lojban Beginners" group. >> > To post to this group, send email to >> lojban-beginners@googlegroups.com. >> > To unsubscribe from this group, send email to >> > lojban-beginners+unsubscribe@googlegroups.com. >> > For more options, visit this group at >> > http://groups.google.com/group/lojban-beginners?hl=en. >> > >> > References >> > >> > Visible links >> > 1. >> http://sourceforge.net/projects/vlastezba/files/vlastezba.jar/download >> > 2. mailto:pretoriusjf@gmail.com >> >> -- >> .i ma'a lo bradi ku penmi gi'e du >> >> -- >> You received this message because you are subscribed to the Google Groups >> "Lojban Beginners" group. >> To post to this group, send email to lojban-beginners@googlegroups.com. >> To unsubscribe from this group, send email to >> lojban-beginners+unsubscribe@googlegroups.com. >> For more options, visit this group at >> http://groups.google.com/group/lojban-beginners?hl=en. >> >> > > > -- > Johan Pretorius > Cell: 0829268327 > pretoriusjf@gmail.com > > -- > You received this message because you are subscribed to the Google Groups > "Lojban Beginners" group. > To post to this group, send email to lojban-beginners@googlegroups.com. > To unsubscribe from this group, send email to > lojban-beginners+unsubscribe@googlegroups.com. > For more options, visit this group at > http://groups.google.com/group/lojban-beginners?hl=en. > > -- You received this message because you are subscribed to the Google Groups "Lojban Beginners" group. To post to this group, send email to lojban-beginners@googlegroups.com. To unsubscribe from this group, send email to lojban-beginners+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban-beginners?hl=en.