[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[lojban] Re: Lexing text with {fa'o} and {zoi} in it
On Monday 03 February 2003 15:46, Robin Lee Powell wrote:
> On Sun, Feb 02, 2003 at 12:59:24AM -0500, Pierre Abbat wrote:
li'o
> > E. If the piece contains a consonant followed two letters later,
> > not...
>
> Followed by what two letters later?
By a vowel. If a piece that got this far contains a consonant followed two
letters later by a vowel, and there is no 'y' after that, either it contains
what looks like a brivla, or someone erroneously pronounced a word of selma'o
BY followed by a CVV cmavo without inserting the required pause (such as
/kybusO'e/).
> I assume all the ellipses are resolved elsewhere?
The tarball, which includes the lexing algorithm (which I have some slight
changes to make to) and the program, is at
http://phma.hn.org/Language/valfendi-0.1.0.tar.gz.
> > 6. If the piece is a cmavo of selma'o ZOI and the ZO flag is clear,
> > resolve the next piece as the starting delimiter and set the ZOI flag
> > to a copy of it.
> >
> > 7. If the piece is a cmavo of selma'o FAhO and the ZO flag is clear,
> > set the FAhO flag.
> >
> > 8. If the piece is a cmavo of selma'o ZO and the ZO flag is clear, set
> > the ZO flag; otherwise clear the ZO flag.
>
> Seems like it would be simpler if Step 3 was:
>
> "If the ZO flag is set, lex the piece and clear the ZO flag", or
> something like that.
>
> Then you'd only have to say it once.
I think that wouldn't work, because step 5 doesn't always produce a fully
resolved piece at the left; e.g. /zo.AisecUskuladjAn/ becomes /zo Aisecusku
la djAn/ after two steps.
> > 9. If there are any more pieces unresolved, return to step 3.
> >
> > According to 19:10, if {le'u} appears in a {zoi} quotation which is
> > inside a {lo'u} quotation, the {lo'u} quotation is prematurely
> > terminated. If the {le'u} is not surrounded by pauses, I don't see how
> > that can work. For instance:
> > /lo'ugA'ocUskuzoixy.vaiic.alE'umal`asOt.xy.axacuEroc.le'u/ The {le'u}
> > inside the {zoi} quotation is embedded in the Hebrew word
> > "wayish'alehu", which, being inside a {zoi} quotation, is not
> > attempted to lex (and if it did, it would choke on the `ayin of "mah
> > l`asot").
>
> Wow. Yeah, there's no way that can work.
>
> > Chapter 21, step 2, says that the word following {zoi} *should* be
> > delimited with a pause, not that it *must*. If it is not delimited
> > with a pause, this can cause text to be lexed that is part of the
> > quotation. For instance: /zoiladjAn.bagram.ladjAn/ is lexed as {zoi la
> > djan. bagram. la djan.}.
>
> I'm sory; what is the delimiter in that case? la?
Yes. It's ungrammatical, but the lexer doesn't know that.
> > Conversely, two words could be taken as the delimiter, as in
> > /zoi.flAlukAvbu.../ that I mentioned before. Should the word be
> > changed to "must"?
>
> Seems like it to me.
Likewise, must {fa'o} be followed by a pause?
phma