[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [lojban] Re: valfendi algorithm
At 08:19 AM 1/24/03 -0500, Pierre Abbat wrote:
>On Friday 24 January 2003 07:56, Robert LeChevalier wrote:
> > This does not sound like it is a proper defining algorithm for the Lojban
> > morphology as you've described it, and as a first glance at the text
> > indicates. It may parse all well-formed Lojban words, but it also may
> > successfully lex some not-well-formed Lojban (your algorithm seems to allow
> > fu'ivla with embedded text strings that would invalidate the fu'ivla if it
> > is a proper gismu or rafsi, but allows the fu'ivla if it is not). This is
> > merely another stage in our long running dispute as to whether type IV
> > fu'ivla are to be constrained to specific forms positively defined, or can
> > consist of anything lexable word that could be a brivla that isn't a gismu
> > or lujvo.
>
>I am planning further versions which will check all words for
>well-formedness.
>Currently it accepts anything ending in a consonant and not containing a
>cmegadri as a cmene, including such unpronounceable messes as {mzantcesg}.
>What do you mean by "fu'ivla with embedded text strings that would invalidate
>the fu'ivla if it is a proper gismu or rafsi, but allows the fu'ivla if it is
>not"? Can you give an example?
My understanding of:
>A slinku'i, as far as word breaking is concerned, is anything that matches
>the following regex:
>^C[raf3]*([gim]?$|[raf4]?y)
>where
>C matches any consonant
>[raf3] matches any 3-letter rafsi
>[raf4] matches any 4-letter rafsi
>[gim] matches any gismu.
A correct algorithm would use the structures CVC/CVV/CCV for raf3,
CVCC/CCVC for raf4 and CVCCV/CCVCV for gim. It doesn't matter whether the
values are in fact actually used. Post-freeze it seems logical that it
would and should be easier to add and subtract from the gismu/rafsi lists
than to change the entire morphology, so the morphology is defined at a
higher level than the specific list of words.
(In addition "ala'um" is not an "option"; there should be no options in an
official algorithm. It is either valid or invalid according to the rules.)
lojbab
--
lojbab lojbab@lojban.org
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273
Artificial language Loglan/Lojban: http://www.lojban.org
To unsubscribe, send mail to lojban-unsubscribe@onelist.com
Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/