From lojbab@lojban.org Wed Apr 30 18:04:39 2003
Return-Path: <lojban-out@lojban.org>
X-Sender: lojban-out@lojban.org
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-8_2_6_6); 1 May 2003 01:04:38 -0000
Received: (qmail 6797 invoked from network); 1 May 2003 01:04:37 -0000
Received: from unknown (66.218.66.216)
  by m13.grp.scd.yahoo.com with QMQP; 1 May 2003 01:04:37 -0000
Received: from unknown (HELO digitalkingdom.org) (204.152.186.175)
  by mta1.grp.scd.yahoo.com with SMTP; 1 May 2003 01:04:36 -0000
Received: from lojban-out by digitalkingdom.org with local (Exim 4.12)
  id 19B2V2-0007H2-00
  for lojban@yahoogroups.com; Wed, 30 Apr 2003 18:04:36 -0700
Received: from digitalkingdom.org ([204.152.186.175] helo=chain)
  by digitalkingdom.org with esmtp (Exim 4.12)
  id 19B2Us-0007Gd-00; Wed, 30 Apr 2003 18:04:26 -0700
Received: with ECARTIS (v1.0.0; list lojban-list); Wed, 30 Apr 2003 18:04:24 -0700 (PDT)
Received: from lakemtao04.cox.net ([68.1.17.241])
  by digitalkingdom.org with esmtp (Exim 4.12)
  id 19B2Uk-0007GG-00
  for lojban-list@lojban.org; Wed, 30 Apr 2003 18:04:18 -0700
Received: from lojban.lojban.org ([68.100.92.1]) by lakemtao04.cox.net
  (InterMail vM.5.01.04.05 201-253-122-122-105-20011231) with ESMTP
  id <20030501010346.OXUR13930.lakemtao04.cox.net@lojban.lojban.org>
  for <lojban-list@lojban.org>; Wed, 30 Apr 2003 21:03:46 -0400
Message-Id: <5.2.0.9.0.20030430205803.03785e80@pop.east.cox.net>
X-Sender: rlechevalier@pop.east.cox.net
X-Mailer: QUALCOMM Windows Eudora Version 5.2.0.9
Date: Wed, 30 Apr 2003 21:03:13 -0400
To: lojban-list@lojban.org
Subject: [lojban] Re: Lujvo frequency list
In-Reply-To: <20030430164152.GU20953@digitalkingdom.org>
References: <20030430123136.84738.qmail@web20514.mail.yahoo.com>
  <20030430024227.GA13634@mit.edu>
  <20030430123136.84738.qmail@web20514.mail.yahoo.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; format=flowed
X-archive-position: 5078
X-ecartis-version: Ecartis v1.0.0
Sender: lojban-list-bounce@lojban.org
Errors-to: lojban-list-bounce@lojban.org
X-original-sender: lojbab@lojban.org
Precedence: bulk
X-list: lojban-list
From: Robert LeChevalier <lojbab@lojban.org>
Reply-To: lojbab@lojban.org
X-Yahoo-Group-Post: member; u=1120595
X-Yahoo-Profile: lojbab

At 09:41 AM 4/30/03 -0700, Robin Lee Powell wrote:
>On Wed, Apr 30, 2003 at 05:31:36AM -0700, Jorge Llamb?as wrote:
> > > The lujvo frequency list is now up too. It's linked in the same
> > > places, and it's at http://takeneggs.com/lojban/lujvo_freq.txt .
> > [...]
> > > Does anyone care enough for a fu'ivla list?
> >
> > ki'e doi rab, these lists are useful and interesting. If it is not
> > a lot of trouble, it would be nice to have a fu'ivla list, and
> > even a cmene list.
>
>A cmene list is not do-able, I expect, as the stuff he is drawing
>from contains at least some English.

Note that there ARE lists of fu'ivla and I think cmene in the dictionary 
work files on lojban.org. Only type 3 fu'ivla, I believe; there was a lot 
of hand work involved, and there will be some English and other language 
words that crept in.

You can weed out a lot of English by looking for invalid consonants and 
vowel or consonant clusters, and handweed a lot more fairly quickly by 
looking for common English letter clumps like "ing", "nce".

lojbab

-- 
lojbab lojbab@lojban.org
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273
Artificial language Loglan/Lojban: http://www.lojban.org






