From richard@rrbcurnow.freeuk.com Sun Mar 18 15:22:08 2001
Return-Path: <richard@rrbcurnow.freeuk.com>
X-Sender: richard@rrbcurnow.freeuk.com
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-7_0_4); 18 Mar 2001 23:22:07 -0000
Received: (qmail 75927 invoked from network); 18 Mar 2001 23:22:07 -0000
Received: from unknown (10.1.10.142) by m8.onelist.org with QMQP; 18 Mar 2001 23:22:07 -0000
Received: from unknown (HELO s1.uklinux.net) (212.1.130.11) by mta3 with SMTP; 19 Mar 2001 00:23:11 -0000
Received: from rrbcurnow.freeuk.com (root@ppp-1-47.cvx4.telinco.net [212.1.148.47]) by s1.uklinux.net (8.11.2/8.11.1) with ESMTP id f2INM3f11755 for <lojban@yahoogroups.com>; Sun, 18 Mar 2001 23:22:03 GMT
Envelope-To: <lojban@yahoogroups.com>
Received: from richard by rrbcurnow.freeuk.com with local (Exim 2.02 #2) id 14elre-0000CL-00 for lojban@yahoogroups.com; Sun, 18 Mar 2001 22:41:30 +0000
Date: Sun, 18 Mar 2001 22:41:30 +0000
To: lojban@yahoogroups.com
Subject: Re: [lojban] Breaking up compound cmavo
Message-ID: <20010318224130.B106@rrbcurnow.freeuk.com>
Reply-To: Richard Curnow <rpc@myself.com>
Mail-Followup-To: lojban@yahoogroups.com
References: <992t8s+mls7@eGroups.com> <E14eiFs-0001Ei-00@mercury.ccil.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
User-Agent: Mutt/1.2i-nntp
In-Reply-To: <E14eiFs-0001Ei-00@mercury.ccil.org>; from cowan@ccil.org on Sun, Mar 18, 2001 at 01:50:16PM -0500
Sender: Richard Curnow <richard@rrbcurnow.freeuk.com>
X-eGroups-From: Richard Curnow <rpc@myself.com>
From: Richard Curnow <richard@rrbcurnow.freeuk.com>

On Sun, Mar 18, 2001 at 01:50:16PM -0500, John Cowan wrote:
> seidensticker@msn.com scripsit:
> > I'm trying to figure out how to divide cmavo that have been stuck 
> > together. For example, consider co'omi'e. The approach I'd taken 
> > was to compare the word against a sorted cmavo list, increasing the 
> > size of the extracted token character by character until I found an 
> > exact match. The problem with this is that after extracting "co", 
> > I'd have found a match and then would try to make sense out 
> > of "'omi'e" -- without success. 
> 
> Break in front of each consonant, that's all. FOr this purpose ' is
> not a consonant.
> 

Also, if any periods (full-stops) occur in the word, break at those too,
as though they were a space.

-- 
----------------------------------------------------------------------
Richard P. Curnow rpc@myself.com
Weston-super-Mare
United Kingdom http://go.to/richard.curnow/


