From: Jorge Llambías <jjllambias@gmail.com>
To: lojban-list@lojban.org
Date: Mon, 4 Jan 2010 16:14:30 -0300
Subject: Re: [lojban] Re: Initial impression

On Mon, Jan 4, 2010 at 3:54 PM, Adam D.
Lopresto wrote:
> On Mon, 4 Jan 2010, Jorge Llambías wrote:
>> Selmaho ZOhOI would require the algorithm to pay
>> attention to pause/spaces at the syntactic level, which it currently
>> does not.
>
> Interesting.  How does it deal with ZOI?

Once the phoneme string has been split into words, when the parser
runs into an active ZOI (i.e. a ZOI that has not been deactivated by a
preceding magic word) it looks at the following word and keeps it in
memory, then it absorbs all words that don't match that word. When it
runs into a matching word, it closes the ZOI quote. This "keeping in
memory" step is the only part of the grammar that is not really a
true PEG.

The text inside the ZOI quote might be completely butchered from the
point of view of the foreign language, which could have different
morphological rules than Lojban. From the point of view of Lojban it
is just a string of Lojban words and non-words, already processed
into "word"-chunks.

This wouldn't work for ZOhOI, because there it matters whether these
pseudo-words are separated by pause/spaces or not.

mu'o mi'e xorxes
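
The ZOI handling described above can be sketched roughly as follows. This is only an illustration in Python, not the actual grammar: the function name parse_zoi and the ZOI_WORDS set are made up for the example, the input is assumed to be already split into word-chunks (a plain whitespace split stands in for the real morphology), and magic-word deactivation is ignored.

```python
# Hypothetical sketch: not the real parser, just the "remember the
# delimiter" step applied to a list of already-split word-chunks.

ZOI_WORDS = {"zoi", "la'o"}  # members of selma'o ZOI


def parse_zoi(words, start):
    """Parse a ZOI quote whose ZOI word sits at words[start].

    Returns (delimiter, quoted_words, next_index), where next_index
    is the position just after the closing delimiter.
    """
    if words[start] not in ZOI_WORDS:
        raise ValueError("expected a ZOI word at position %d" % start)
    if start + 1 >= len(words):
        raise ValueError("ZOI word without a delimiter")

    # Keep the following word in memory. This is the step that a
    # pure PEG cannot express.
    delimiter = words[start + 1]

    quoted = []
    i = start + 2
    while i < len(words):
        if words[i] == delimiter:
            # A matching word closes the quote.
            return delimiter, quoted, i + 1
        # Any non-matching chunk is absorbed, whatever it is.
        quoted.append(words[i])
        i += 1
    raise ValueError("unterminated ZOI quote")


if __name__ == "__main__":
    chunks = "mi cusku zoi gy hello world gy do".split()
    print(parse_zoi(chunks, 2))
```

Note that the function only ever sees a list of chunks, so any information about where the pauses/spaces were is already gone by this point. That is the reason the same approach would not carry over to ZOhOI.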