From cowan@ccil.org Sun Mar 18 10:50:27 2001
Return-Path: <cowan@mercury.ccil.org>
X-Sender: cowan@mercury.ccil.org
X-Apparently-To: lojban@yahoogroups.com
Received: (EGP: mail-7_0_4); 18 Mar 2001 18:50:26 -0000
Received: (qmail 97568 invoked from network); 18 Mar 2001 18:49:50 -0000
Received: from unknown (10.1.10.27) by l7.egroups.com with QMQP; 18 Mar 2001 18:49:50 -0000
Received: from unknown (HELO mercury.ccil.org) (192.190.237.100) by mta2 with SMTP; 18 Mar 2001 18:49:49 -0000
Received: from cowan by mercury.ccil.org with local (Exim 3.12 #1 (Debian)) id 14eiFs-0001Ei-00; Sun, 18 Mar 2001 13:50:16 -0500
Subject: Re: [lojban] Breaking up compound cmavo
In-Reply-To: <992t8s+mls7@eGroups.com> from "seidensticker@msn.com" at "Mar 18, 2001 06:03:08 pm"
To: seidensticker@msn.com
Date: Sun, 18 Mar 2001 13:50:16 -0500 (EST)
Cc: lojban@yahoogroups.com
X-Mailer: ELM [version 2.4ME+ PL66 (25)]
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Message-Id: <E14eiFs-0001Ei-00@mercury.ccil.org>
X-eGroups-From: John Cowan <cowan@mercury.ccil.org>
From: John Cowan <cowan@ccil.org>

seidensticker@msn.com scripsit:
> I'm trying to figure out how to divide cmavo that have been stuck 
> together. For example, consider co'omi'e. The approach I'd taken 
> was to compare the word against a sorted cmavo list, increasing the 
> size of the extracted token character by character until I found an 
> exact match. The problem with this is that after extracting "co", 
> I'd have found a match and then would try to make sense out 
> of "'omi'e" -- without success. 

Break in front of each consonant, that's all. FOr this purpose ' is
not a consonant.

-- 
John Cowan cowan@ccil.org
One art/there is/no less/no more/All things/to do/with sparks/galore
--Douglas Hofstadter

