From lojban+bncCLr6ktCfBBCKrrPpBBoE-cxBiw@googlegroups.com Tue Jan 11 14:18:03 2011 Received: from mail-yw0-f61.google.com ([209.85.213.61]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1PcmXO-0005SI-0x; Tue, 11 Jan 2011 14:18:03 -0800 Received: by ywh1 with SMTP id 1sf19638314ywh.16 for ; Tue, 11 Jan 2011 14:17:51 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:received:x-beenthere:received:received:received :received:received-spf:received:received:received:date:from:to :subject:message-id:mail-followup-to:references:mime-version :in-reply-to:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:sender:list-subscribe:list-unsubscribe:content-type :content-disposition; bh=QNFv6KxDcHLZKeNKYa/ix8qkjkhw6/9wU2IbES4k9Cs=; b=lYa1CgjJzbpOP0py+rC8Iw1jYceD4NWLlACkt+NFYG4tkniXP2sk8h/UMYbNiQEH2+ 6KBLF244hZ1WeuUZINWoOcRvRN2O17TQpThujostuZLxC4Lbz6bxJmhtFAQ98c34ztKM CfY03EaSEPl+IYMvt/wb7sI+irsuGpl/Uchos= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:date:from:to:subject:message-id :mail-followup-to:references:mime-version:in-reply-to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe:content-type :content-disposition; b=wd5xLh9T4NdAdgyTgtDkbcbpLdqG3faw7XlNZicTVfRoRFlkuPD243D8N62rXojvd/ Ah4IgItNn0fwhVMfUJ2A4h9snIoDcr2u8bfbyErR5ymmJsx8GPS10Vsg8e5YtVwYFQcC AzSlhR8YtKK2JEbw6zT35vrnRzf5iSmDuMjYM= Received: by 10.100.95.6 with SMTP id s6mr6749anb.54.1294784266701; Tue, 11 Jan 2011 14:17:46 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.100.156.5 with SMTP id d5ls10606ane.1.p; Tue, 11 Jan 2011 14:17:46 -0800 (PST) Received: by 10.100.34.9 with SMTP id h9mr48647anh.37.1294784266116; Tue, 11 Jan 2011 14:17:46 -0800 (PST) Received: by 10.100.34.9 with SMTP id h9mr48646anh.37.1294784266102; Tue, 11 Jan 2011 14:17:46 -0800 (PST) Received: from mail-gx0-f178.google.com (mail-gx0-f178.google.com [209.85.161.178]) by gmr-mx.google.com with ESMTP id q24si5692ybk.9.2011.01.11.14.17.45; Tue, 11 Jan 2011 14:17:45 -0800 (PST) Received-SPF: neutral (google.com: 209.85.161.178 is neither permitted nor denied by best guess record for domain of alanpost@sunflowerriver.org) client-ip=209.85.161.178; Received: by gxk25 with SMTP id 25so6116212gxk.9 for ; Tue, 11 Jan 2011 14:17:45 -0800 (PST) Received: by 10.236.109.11 with SMTP id r11mr290440yhg.95.1294784264116; Tue, 11 Jan 2011 14:17:44 -0800 (PST) Received: from sunflowerriver.org (173-10-243-253-Albuquerque.hfc.comcastbusiness.net [173.10.243.253]) by mx.google.com with ESMTPS id n67sm18284280yha.26.2011.01.11.14.17.42 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 11 Jan 2011 14:17:43 -0800 (PST) Date: Tue, 11 Jan 2011 15:17:39 -0700 From: ".alyn.post." To: lojban@googlegroups.com Subject: Re: [lojban] compound cmavo classification in cmavo.txt Message-ID: <20110111221738.GD38541@alice.local> Mail-Followup-To: lojban@googlegroups.com References: <20110111164936.GC38541@alice.local> <4D2CD38E.6030708@lojban.org> Mime-Version: 1.0 In-Reply-To: <4D2CD38E.6030708@lojban.org> X-Original-Sender: alyn.post@lodockikumazvati.org X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: 209.85.161.178 is neither permitted nor denied by best guess record for domain of alanpost@sunflowerriver.org) smtp.mail=alanpost@sunflowerriver.org Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 Content-Disposition: inline On Tue, Jan 11, 2011 at 05:02:54PM -0500, Bob LeChevalier wrote: > .alyn.post. wrote: > >The following file: > > > > http://www.lojban.org/publications/wordlists/cmavo.txt > > > >Is a list of cmavo. I believe it is the canonical list, please > >correct me if that is a misunderstanding. > > > >This file includes compound cmavo, like "le go'i", but only > >includes a single selma'o class, even when the compound cmavo > >consists of cmavo in more than one selma'o. > > The use of the * indicates that it is NOT a member of that selma'o, but > is being grouped together (with others having the same *) for some > pedagogical reason. The list was originally designed as a teaching > tool, but became a reference text in lieu of an actual dictionary. > > >I've loaded all of the entries in cmavo.txt into the parser > > which parser? > My work-in-progress parser, jbogenturfa'i: http://wiki.call-cc.org/eggref/4/jbogenturfahi I've got the morphology file working and tested, and the grammar parses what I've thrown at it so far. I'm working now on cleaning up the resulting parse tree to be useable in other applications and adding test cases to more rigorously test the grammar parser. It uses the PEG grammar developed by camgusmis and xorxes. The parser is written in Scheme, and is to my knowledge the first time someone has built tools for working with Lojban in Scheme. I've certainly written the best PEG parser available for Scheme, because I tried the available one before writing my own. ;-) My near-term goal is to have a camxes-level feature set and to maintain a second PEG parser alongside camxes, while sharing as near the same PEG grammar between them as possible. So far, this work has resulted in a satisfying level of cleanup to the PEG grammar, which I would like to see become the official grammar for Lojban. In service to that I've been collecting the available test data and will be extending my test suite to include them. I'm currently at 6176 tests to cover the morphology. -Alan -- .i ko djuno fi le do sevzi -- You received this message because you are subscribed to the Google Groups "lojban" group. To post to this group, send email to lojban@googlegroups.com. To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban?hl=en.