From: Jorge Llambías <jjllambias@gmail.com>
To: lojban-list@lojban.org
Date: Mon, 4 Jan 2010 14:56:18 -0300
Subject: Re: [lojban] Re: Initial impression

On Mon, Jan 4, 2010 at 1:34 PM, Adam D. Lopresto wrote:
>
> Actually, I think ZOhOI ({la'oi} and {zo'oi}) can be parsed just fine
> (though it requires pauses/stops/those hated periods before and after
> the non-lojban word).
> It just requires a change to the grammar.  I have yet to see any
> actual problems.

It can be done, but it requires some rethinking of the morphology
algorithm. What the PEG algorithm currently does is first break a
string of phonemes into words, so for example "la'oi lopresto" will be
read as three words, "la'oi", "lo" and "presto", before doing any
syntactic parsing of the words. It doesn't care whether there is a
pause/space between "lo" and "presto" or not, or whether "la'oi" is a
defined cmavo or not. Selmaho ZOhOI would require the algorithm to pay
attention to pauses/spaces at the syntactic level, which it currently
does not.

mu'o mi'e xorxes
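
The difference between the two approaches can be sketched roughly in
Python. This is not the actual PEG morphology: the two word shapes in
WORD are a drastically simplified stand-in that happens to handle this
one example, and the names split_words, split_words_pause_aware and
ZOHOI are made up for illustration only.

```python
import re

C = "[bcdfgjklmnprstvxz]"   # consonants
V = "[aeiou]"               # vowels

# ASSUMPTION: toy word shapes, tried in this order at each position.
#   brivla-like: starts with a consonant pair, e.g. "presto"
#   cmavo-like:  optional consonant plus vowels, e.g. "lo", "la'oi"
# The real morphology is far more involved than these two patterns.
WORD = re.compile(rf"{C}{C}{V}+(?:{C}+{V}+)*|{C}?{V}+(?:'{V}+)*")


def split_words(phonemes):
    """Word-splitting as described above: pauses/spaces are thrown
    away first, then words are carved out by shape alone."""
    stream = re.sub(r"[\s.]+", "", phonemes)
    words, pos = [], 0
    while pos < len(stream):
        m = WORD.match(stream, pos)
        if m is None:
            raise ValueError(f"toy rules cannot split at {stream[pos:]!r}")
        words.append(m.group())
        pos = m.end()
    return words


# ASSUMPTION: hypothetical set of ZOhOI cmavo, taken from the quoted message.
ZOHOI = {"la'oi", "zo'oi"}


def split_words_pause_aware(text):
    """Hypothetical variant: pauses/spaces are kept as chunk boundaries,
    and the chunk right after a ZOhOI cmavo is passed through verbatim."""
    chunks = [c for c in re.split(r"[\s.]+", text) if c]
    words, i = [], 0
    while i < len(chunks):
        chunk_words = split_words(chunks[i])
        words.extend(chunk_words)
        i += 1
        if chunk_words[-1] in ZOHOI and i < len(chunks):
            words.append(chunks[i])  # foreign word, not split further
            i += 1
    return words


print(split_words("la'oi lopresto"))
print(split_words_pause_aware("la'oi .lopresto."))
```

In the first function the pause information is gone before any word is
recognized, so nothing later can ask where "lopresto" began or ended.
The second keeps the pauses around long enough for a ZOhOI-style rule
to use them, which is the kind of rethinking meant above.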