From adam@pubcrawler.org Mon Jan 04 10:54:41 2010 Received: from express.cec.wustl.edu ([128.252.21.16] helo=mail.cec.wustl.edu) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1NRs4a-00033k-Di for lojban-list@lojban.org; Mon, 04 Jan 2010 10:54:40 -0800 Received: from grid.cec.wustl.edu (grid.cec.wustl.edu [128.252.20.97]) by mail.cec.wustl.edu (Postfix) with ESMTP id 837AA1E8028; Mon, 4 Jan 2010 12:54:30 -0600 (CST) Received: by grid.cec.wustl.edu (Postfix, from userid 29287) id C95FF1F7876; Mon, 4 Jan 2010 12:54:29 -0600 (CST) Received: from localhost (localhost [127.0.0.1]) by grid.cec.wustl.edu (Postfix) with ESMTP id BBF2A1F7875; Mon, 4 Jan 2010 12:54:29 -0600 (CST) Date: Mon, 4 Jan 2010 12:54:29 -0600 (CST) From: "Adam D. Lopresto" To: lojban-list@lojban.org Subject: Re: [lojban] Re: Initial impression In-Reply-To: <925d17561001040956n33e7c7edn30558cc45710a3e6@mail.gmail.com> Message-ID: References: <425e4ac21001031952t22834298oa24977c0eef72d35@mail.gmail.com> <425e4ac21001032050h48991b70rdf63974aab3da6a9@mail.gmail.com> <925d17561001040956n33e7c7edn30558cc45710a3e6@mail.gmail.com> User-Agent: Alpine 2.00 (LRH 1167 2008-08-23) MIME-Version: 1.0 Content-Type: MULTIPART/MIXED; BOUNDARY="-58695404-178737731-1262631269=:8573" This message is in MIME format. The first part should be readable text, while the remaining parts are likely unreadable without MIME-aware tools. ---58695404-178737731-1262631269=:8573 Content-Type: TEXT/PLAIN; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable On Mon, 4 Jan 2010, Jorge Llamb=EDas wrote: > On Mon, Jan 4, 2010 at 1:34 PM, Adam D. Lopresto = wrote: >> >> Actually, I think ZOhOI ({la'oi} and {zo'oi}) can be parsed just fine >> (though >> it requires pauses/stops/those hated periods before and after the non-= lojban >> word). =A0It just requires a change to the grammar. =A0I have yet to s= ee any >> actual problems. > > It can be done, but it requires some rethinking of the morphology > algorithm. What the PEG algorithm currently does is first break a > string of phonemes into words, so for example "la'oi lopresto" will be > read as three words, "la'oi", "lo" and "presto", before doing any > syntactic parsing of the words. It doesn't care whether there is a > pause/space between "lo" and "presto" or not, or whether "la'oi" is a > defined cmavo or not. Selmaho ZOhOI would require the algorithm to pay > attention to pause/spaces at the syntactic level, which it currently > does not. Interesting. How does it deal with ZOI? --=20 Adam Lopresto http://cec.wustl.edu/~adam/ Just because I have a short attention span doesn't mean I ---58695404-178737731-1262631269=:8573--