[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[lojban] My parser and ZOI

To: lojban@yahoogroups.com
Subject: [lojban] My parser and ZOI
From: Robin Lee Powell <lojban-out@lojban.org>
Date: Fri, 30 Apr 2004 16:39:19 -0700
Mail-followup-to: lojban-list@lojban.org
Reply-to: rlpowell@digitalkingdom.org
Sender: lojban-list-bounce@lojban.org
User-agent: Mutt/1.5.5.1+cvs20040105i

I've decided that having a pre-processor just to handle ZOI wasn't
compromising my principles too badly.  The entire pre-processor is as
follows:


- cut -

#!/bin/sh

cd /home/rlpowell/www/hobbies/lojban/grammar/rats

cat - | \
perl -p -e "s/(zoi[\s.]+|la'o[\s.]+)([a-zA-Z']+)([\s.]+.*[\s.]+)(\2)([\s.]+|\z)/\1QZOIMARKER\2\3\4\5QZOIMARKER/" | \
/usr/local/java/bin/java xtc/parser/PParser /dev/stdin

- cut -

The perl isn't nearly as bad as it looks; there's nothing there that
egrep can't do, except for the replacement and \z.

What this does is replace[1] the zoi boundary word (i.e. gy in "zoi gy
whee! gy") with the string QZOIMARKER.  That string was chosen to be
descriptive.  The Q is there to make sure it's not valid Lojban, so that
the pre-processor's efforts can never be mis-construed as valid for some
reason other than processing ZOI correctly.

-Robin

[1]:  Technically, it prepends it in front of the first boundary word
and postpends it after the second; this is so that the entire original
text is recoverable.

-- 
http://www.digitalkingdom.org/~rlpowell/  ***  I'm a *male* Robin.
"Many philosophical problems are caused by such things as the simple
inability to shut up." -- David Stove, liberally paraphrased.
http://www.lojban.org/  ***  loi pimlu na srana .i ti rokci morsi

Follow-Ups:
- [lojban] Re: My parser and ZOI
  - From: Jorge "Llambías" <jjllambias2000@yahoo.com.ar>
- [lojban] Re: My parser and ZOI
  - From: Robin Lee Powell <rlpowell@digitalkingdom.org>
- [lojban] Re: My parser and ZOI
  - From: Robin Lee Powell <lojban-out@lojban.org>

Prev by Date: [lojban] Re: Why capital letters standing in for letterals is a *bad* idea.
Next by Date: [lojban] Re: Why capital letters standing in for letterals is a *bad* idea.
Previous by thread: Re: [lojban] Re: Why capital letters standing in for letterals is a *bad* idea.
Next by thread: [lojban] Re: My parser and ZOI
Index(es):
- Date
- Thread