From: Rob Speer <rspeer@MIT.EDU>
Date: Mon, 28 Apr 2003 23:34:10 -0400
To: lojban-list@lojban.org
Subject: [lojban] Updated word frequency lists

I've updated my searchable corpus of Lojban text to include the jbosnu archives, plus many things published only on the Wiki.
I now have updated word frequency lists for gismu, cmavo, and cmavo compounds (so you can see how {ka'enai} is doing) at http://takeneggs.com/lojban/ or linked on the Wiki at http://www.lojban.org/wiki/index.php/Word%20frequency%20lists .

There are no longer any real surprises in the zero-uses section of the cmavo list. For instance, the only UI there now is {ta'u}. There are still all of those embarrassing single-syllable ones, though:

koi sau foi lau tau zai dau jau rei vai pai tei

--
mu'o mi'e rab.spir