From nobody@digitalkingdom.org Mon Jan 04 09:56:35 2010 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 04 Jan 2010 09:56:35 -0800 (PST) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69) (envelope-from ) id 1NRrAQ-0007nO-39 for lojban-list-real@lojban.org; Mon, 04 Jan 2010 09:56:34 -0800 Received: from mail-gx0-f224.google.com ([209.85.217.224]) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1NRrAH-0007kK-D4 for lojban-list@lojban.org; Mon, 04 Jan 2010 09:56:28 -0800 Received: by gxk24 with SMTP id 24so15127948gxk.6 for ; Mon, 04 Jan 2010 09:56:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:in-reply-to:references :date:message-id:subject:from:to:content-type :content-transfer-encoding; bh=NFVhG08FySGG633ch0tsjmtoSY6QhtkDyRjPOW48tzA=; b=xWexaqv5RLGK8nSjSNBMzP6OL5wl1p0RLKHu9k4gQxEK7OB+i5AX/ZJolRmCMxreOo ZxSW78/eyow6a4w0uWNqL4RwT4KYNSSloDDNqWKGzY+iy1H2FBVan2P0DXYVtsDyS8ET Lnq5Ab6DM0XjAb89kd60AhT3zTF6BP+BE4x6w= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type:content-transfer-encoding; b=LJ0P6Fy7HaL28ATkS5z8u56Uz+BT4SfSeQSmNlnwMrkOzPZAK1MEEQRm9FVaHGAI5K n6ov2xdfHTBPc/Cc07zs0sF/Pw3I4ec7XKaW5/qqoIVpHqYSRJDUuf0qtGv//8R7UEGH TivHvUUO85l3QHPpG9kEWjn7UahNKKW/vro1c= MIME-Version: 1.0 Received: by 10.90.13.6 with SMTP id 6mr11324404agm.109.1262627778982; Mon, 04 Jan 2010 09:56:18 -0800 (PST) In-Reply-To: References: <425e4ac21001031952t22834298oa24977c0eef72d35@mail.gmail.com> <425e4ac21001032050h48991b70rdf63974aab3da6a9@mail.gmail.com> Date: Mon, 4 Jan 2010 14:56:18 -0300 Message-ID: <925d17561001040956n33e7c7edn30558cc45710a3e6@mail.gmail.com> Subject: [lojban] Re: Initial impression From: =?ISO-8859-1?Q?Jorge_Llamb=EDas?= To: lojban-list@lojban.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by Ecartis X-archive-position: 16790 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: jjllambias@gmail.com Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list On Mon, Jan 4, 2010 at 1:34 PM, Adam D. Lopresto wrote: > > Actually, I think ZOhOI ({la'oi} and {zo'oi}) can be parsed just fine > (though > it requires pauses/stops/those hated periods before and after the non-lojban > word).  It just requires a change to the grammar.  I have yet to see any > actual problems. It can be done, but it requires some rethinking of the morphology algorithm. What the PEG algorithm currently does is first break a string of phonemes into words, so for example "la'oi lopresto" will be read as three words, "la'oi", "lo" and "presto", before doing any syntactic parsing of the words. It doesn't care whether there is a pause/space between "lo" and "presto" or not, or whether "la'oi" is a defined cmavo or not. Selmaho ZOhOI would require the algorithm to pay attention to pause/spaces at the syntactic level, which it currently does not. mu'o mi'e xorxes To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.