From cowan@ccil.org Sun Mar 18 10:50:27 2001 Return-Path: X-Sender: cowan@mercury.ccil.org X-Apparently-To: lojban@yahoogroups.com Received: (EGP: mail-7_0_4); 18 Mar 2001 18:50:26 -0000 Received: (qmail 97568 invoked from network); 18 Mar 2001 18:49:50 -0000 Received: from unknown (10.1.10.27) by l7.egroups.com with QMQP; 18 Mar 2001 18:49:50 -0000 Received: from unknown (HELO mercury.ccil.org) (192.190.237.100) by mta2 with SMTP; 18 Mar 2001 18:49:49 -0000 Received: from cowan by mercury.ccil.org with local (Exim 3.12 #1 (Debian)) id 14eiFs-0001Ei-00; Sun, 18 Mar 2001 13:50:16 -0500 Subject: Re: [lojban] Breaking up compound cmavo In-Reply-To: <992t8s+mls7@eGroups.com> from "seidensticker@msn.com" at "Mar 18, 2001 06:03:08 pm" To: seidensticker@msn.com Date: Sun, 18 Mar 2001 13:50:16 -0500 (EST) Cc: lojban@yahoogroups.com X-Mailer: ELM [version 2.4ME+ PL66 (25)] MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Message-Id: X-eGroups-From: John Cowan From: John Cowan X-Yahoo-Message-Num: 5905 seidensticker@msn.com scripsit: > I'm trying to figure out how to divide cmavo that have been stuck > together. For example, consider co'omi'e. The approach I'd taken > was to compare the word against a sorted cmavo list, increasing the > size of the extracted token character by character until I found an > exact match. The problem with this is that after extracting "co", > I'd have found a match and then would try to make sense out > of "'omi'e" -- without success. Break in front of each consonant, that's all. FOr this purpose ' is not a consonant. -- John Cowan cowan@ccil.org One art/there is/no less/no more/All things/to do/with sparks/galore --Douglas Hofstadter