From lojbanlists@wonderclown.com Fri Jan 04 18:53:49 2002
Return-Path: <randy@wonderclown.com>
X-Sender: randy@wonderclown.com
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-8_0_1_3); 5 Jan 2002 02:53:49 -0000
Received: (qmail 98709 invoked from network); 5 Jan 2002 02:53:48 -0000
Received: from unknown (216.115.97.171)
  by m3.grp.snv.yahoo.com with QMQP; 5 Jan 2002 02:53:48 -0000
Received: from unknown (HELO mail5.nc.rr.com) (24.93.67.52)
  by mta3.grp.snv.yahoo.com with SMTP; 5 Jan 2002 02:53:48 -0000
Received: from aerosol ([24.162.238.42]) by mail5.nc.rr.com with Microsoft SMTPSVC(5.5.1877.687.68);
  Fri, 4 Jan 2002 21:53:47 -0500
Received: from rnortman by aerosol with local (Exim 3.33 #1 (Debian))
  id 16Mgxv-00018y-00
  for <lojban@yahoogroups.com>; Fri, 04 Jan 2002 21:53:47 -0500
Date: Fri, 4 Jan 2002 21:53:47 -0500
To: lojban@yahoogroups.com
Subject: Re: [lojban] Re: cmavo index?
Message-ID: <20020105025347.GH1650@aerosol>
References: <20020104232522.GG1650@aerosol> <Pine.GSO.4.40.0201041718190.8109-100000@ucsub.colorado.edu>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <Pine.GSO.4.40.0201041718190.8109-100000@ucsub.colorado.edu>
User-Agent: Mutt/1.3.24i
Sender: Randall Nortman <randy@wonderclown.com>
Return-Path: randy@wonderclown.com
From: "randl. nortmn." <lojbanlists@wonderclown.com>
X-Yahoo-Group-Post: member; u=91794573
X-Yahoo-Profile: uyndrklaun

On Fri, Jan 04, 2002 at 05:19:40PM -0700, Jay Kominek wrote:
[...]
> Who is going to write the examples to be reviewed, though?

They would, for the most part, be directly pulled from mailing list
posts, IRC chats, and possibly other Lojban electronic texts (e.g. the
reference grammar, the translations in www.lojban.org/files/texts,
etc.). Not all these sources are to be trusted, obviously, hence the
need for review to ensure that they are good usage examples.

I think this could be a good tool not only for people trying to learn
the langauge, but for analysis of its real-world usage. For this
purpose the errors are as relevant as the correct usages. With such a
database, we could compare IRC chat usage to mailing list usage to the
translations. We could see how often certain words/constructs are
used and misused. Over time, we could track linguistic drift.

I recognize that there is substantial complexity in this proposal.
For that matter, I didn't really intend to make it a proposal; I
simply mentioned that I had thought about it. I myself do not have
the time to put any work into it right now, but maybe down the road.

mu'omi'e randl.