Return-Path:
Received: from kantti.helsinki.fi by xiron.pc.helsinki.fi with smtp (Linux Smail3.1.28.1 #1) id m0rK7s1-00007DC; Tue, 20 Dec 94 18:57 EET
Received: from fiport.funet.fi (fiport.funet.fi [128.214.109.150]) by kantti.helsinki.fi (8.6.9/8.6.5) with ESMTP id SAA04110 for ; Tue, 20 Dec 1994 18:57:03 +0200
Received: from SEARN.SUNET.SE (MAILER@SEARN) by FIPORT.FUNET.FI (PMDF V4.3-13 #2494) id <01HKVGR25C280000JE@FIPORT.FUNET.FI>; Tue, 20 Dec 1994 16:56:07 +0200 (EET)
Received: from SEARN.SUNET.SE (NJE origin LISTSERV@SEARN) by SEARN.SUNET.SE (LMail V1.2a/1.8a) with BSMTP id 1468; Tue, 20 Dec 1994 17:53:40 +0100
Date: Tue, 20 Dec 1994 11:54:56 -0500
From: Logical Language Group
Subject: Re: fyfyfyfy
In-reply-to: <199412200920.AA14766@access1.digex.net> from "Logical Language Group" at Dec 20, 94 04:20:55 am
Sender: Lojban list
Reply-to: Logical Language Group
Message-id: <01HKVGR27OTU0000JE@FIPORT.FUNET.FI>
X-Envelope-to: veion@XIRON.PC.HELSINKI.FI
Content-transfer-encoding: 7BIT
To: Lojban List
MIME-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Content-Length: 1405
Lines: 30

la lojbab. cusku di'e

> I tried this on the parser, and suspect a problem.
> It does not break up fyfyfyfy before the selbri, nor implies that ti takes
> it as a sumti. When followed by a separate fy "fyfyfyfy fy klama", it blows
> up. But "fy fy fy fy klama" parses fine, and with a single sumti, unless
> split up by BOI.

There are two problems here: one is with the current machine parser's
feeble morphology algorithm, one with what I said.

The parser distinguishes brivla from compound cmavo by looking for consonant
clusters, defined as two successive consonants optionally separated by "y".
By that standard, the word "fyfyfyfy" appears to have a y-hyphenated consonant
cluster in the first 3 letters. Thus it is lexed as a brivla, with resulting
problems.
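
To make the heuristic above concrete, here is a minimal Python sketch of the
cluster test as described in this message. It is not the parser's actual code:
the function name looks_like_brivla and the regular expression are assumptions
made for illustration, and the consonant set is simply the standard Lojban
consonant inventory.

```python
import re

# Lojban consonants; vowels, "y" and the apostrophe are excluded.
CONSONANTS = "bcdfgjklmnprstvxz"

# Hypothetical rendering of the rule described above: a "consonant cluster"
# is two successive consonants, optionally separated by a single "y".
CLUSTER = re.compile(f"[{CONSONANTS}]y?[{CONSONANTS}]")


def looks_like_brivla(word: str) -> bool:
    """Return True if the naive cluster test would class the word as a brivla."""
    return CLUSTER.search(word.lower()) is not None


if __name__ == "__main__":
    for word in ["klama", "fy", "fyfyfyfy", "secmene", "ti"]:
        print(word, looks_like_brivla(word))
```

Under this sketch "fyfyfyfy" matches on its first three letters ("f", "y",
"f"), just as "klama" matches on "kl", while a lone "fy" does not match at all.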

IMAO, this is no worse than the treatment of "secmene" as a brivla rather
than as "se cmene", or the fact that the lexer breaks up stuff within "zoi"
quotes as if it were Lojban. Eventually, there will be a proper morphological
preprocessor that handles all cases.

However, I was wrong to say that "fy fy fy fy" was four sumti; it is a single
sumti, because a lerfu-word pro-sumti actually consists of a string of lerfu
words. To get four instances of the "f" pro-sumti, we need
"fyboi fyboi fyboi fy[boi]".

--
John Cowan		sharing account for now
		e'osai ko sarji la lojban.