From lojban+bncCLr6ktCfBBCWyo3pBBoEMk3K-g@googlegroups.com Tue Jan 04 10:21:26 2011 Received: from mail-gy0-f189.google.com ([209.85.160.189]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1PaBVY-0001gC-DY; Tue, 04 Jan 2011 10:21:25 -0800 Received: by gyb11 with SMTP id 11sf13679850gyb.16 for ; Tue, 04 Jan 2011 10:21:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:received:x-beenthere:received:received:received :received:received-spf:received:received:received:date:from:to :subject:message-id:mail-followup-to:mime-version:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type:content-disposition; bh=Vi4CWxb5EMs3VPduhxw2Aa9YyvxkiGdEn3yJv2GrC0s=; b=wzMrkdmJuz909BD6qoNVpUY8PrDXzJvqoIzEdk2rffOK2a+Pal7RdUhy4AdAWk9t34 IOwtflG6LfpoVG2r0ctUwJXfrROS3dy0bVxIsPLcWMcYAi4kioBM2KGNhKzX4GsWvnxO eBLuwTKpXRzlCA4Zc7/TrGD9ERZ8OBsrM8YbI= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:date:from:to:subject:message-id :mail-followup-to:mime-version:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type:content-disposition; b=ODgt7EAyjSIKRP6psiCFnBBJYAtmCQAKQxSIcQEydgjWOWFpBEfq36tBgmuGHJ795x a/GaMlOWj5lJN6h81YmfsZJQW3Xl2+aYcNzBFb4LZmvwX/edw/Uem2krKTBC2PNspWxm m52+BSEVJcE0aDc2A6Yq2jhio5OL7jS8sN0Cw= Received: by 10.150.69.3 with SMTP id r3mr1447112yba.50.1294165270313; Tue, 04 Jan 2011 10:21:10 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.231.76.225 with SMTP id d33ls11829831ibk.2.p; Tue, 04 Jan 2011 10:21:09 -0800 (PST) Received: by 10.231.172.146 with SMTP id l18mr8083623ibz.2.1294165269437; Tue, 04 Jan 2011 10:21:09 -0800 (PST) Received: by 10.231.172.146 with SMTP id l18mr8083622ibz.2.1294165269395; Tue, 04 Jan 2011 10:21:09 -0800 (PST) Received: from mail-iy0-f177.google.com (mail-iy0-f177.google.com [209.85.210.177]) by gmr-mx.google.com with ESMTP id d9si4941019ibq.7.2011.01.04.10.21.09; Tue, 04 Jan 2011 10:21:09 -0800 (PST) Received-SPF: neutral (google.com: 209.85.210.177 is neither permitted nor denied by best guess record for domain of alanpost@sunflowerriver.org) client-ip=209.85.210.177; Received: by mail-iy0-f177.google.com with SMTP id 21so14322788iyj.36 for ; Tue, 04 Jan 2011 10:21:09 -0800 (PST) Received: by 10.231.36.5 with SMTP id r5mr22205919ibd.134.1294165269121; Tue, 04 Jan 2011 10:21:09 -0800 (PST) Received: from sunflowerriver.org (173-10-243-253-Albuquerque.hfc.comcastbusiness.net [173.10.243.253]) by mx.google.com with ESMTPS id d21sm20057628ibg.3.2011.01.04.10.21.06 (version=TLSv1/SSLv3 cipher=RC4-MD5); Tue, 04 Jan 2011 10:21:07 -0800 (PST) Date: Tue, 4 Jan 2011 11:21:03 -0700 From: ".alyn.post." To: Lojban List Subject: [lojban] fu'ivla and lujvo wordlists? Message-ID: <20110104182103.GB22902@alice.local> Mail-Followup-To: Lojban List Mime-Version: 1.0 X-Original-Sender: alyn.post@lodockikumazvati.org X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: 209.85.210.177 is neither permitted nor denied by best guess record for domain of alanpost@sunflowerriver.org) smtp.mail=alanpost@sunflowerriver.org Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 Content-Disposition: inline I've just added test cases to test for proper morphological classification of all cmavo and gismu to my Lojban parser, jbogenturfa'i[1]. I'm using cmavo.txt[2] and gismu.txt[3] from lojban.org's publications. I'd like to add test cases for proper classification of lujvo and fu'ivla as well. These word classes aren't closed in the same way that cmavo and gismu are. I'm hoping someone has available one of the following: 1) The minimal set of fu'ivla and lujvo to cover all of the productions used in the PEG grammar to classify these word types. i.e., all of the different kinds of fu'ivla and lujvo. 2) A representative set of fu'ivla and lujvo that I can use much like I'm using cmavo.txt and gismu.txt: a big pile of already-classified words that I can run through the parser and verify that the parser agrees that each word belongs in it's respective class. Does anyone have morphological test data available? -Alan 1: http://wiki.call-cc.org/eggref/4/jbogenturfahi 2: http://www.lojban.org/publications/wordlists/cmavo.txt 3: http://www.lojban.org/publications/wordlists/gismu.txt -- .i ko djuno fi le do sevzi -- You received this message because you are subscribed to the Google Groups "lojban" group. To post to this group, send email to lojban@googlegroups.com. To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban?hl=en.