[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[lojban-beginners] Re: welcome and question about brivla recognizing



Jorge Llambías pisze:
On Sun, Jul 20, 2008 at 4:54 AM, Mateusz Grotek
<unoduetre@poczta.onet.pl> wrote:
Hello.
I'm new to lojban, recently started reading CLL, but have some questions
about brivla recognition from speech stream.
What is exact algorithm for doing it? I tried to create one, but it looks
like i have to count letters before stress, what i don't wanna do. Is it
really needed? (Because of something what is called "tosmabru failure" in
book). And point 5b) in draft look for me somehow wrong, but  maybe it's my
fault. Could you explain it to me please?

You can find the (an) algorithm here:
< http://www.lojban.org/tiki/tiki-index.php?page=BPFK%20Section%3A%20PEG%20Morphology%20Algorithm>

Basically it works as follows:

(1) If the speech stream that you are considering (from the start up
to the first pause)
contains some non-Lojban phoneme or an impermissibe cluster then it is
a non-Lojban
word.

(2) Otherwise, if it ends with a consonant, it is a cmevla.

(3) Otherwise, if it starts with something that could be a cmavo, and
what remains
is a possible word or words, that first part is indeed a cmavo.
(That's for example
what happens with "tosmabru", "to" is a cmavo because "smabru" is a possible
word.) But there is one exception here: if it starts with CVCy and it
is a lujvo, then
CV cannot be a cmavo (so "tosymabru" is not "to sy mabru").

(4) Otherwise, unless it is a "slinku'i", it is a brivla. A "slinku'i"
consists of a
consonant followed by a string of rafsi. A slinku'i is also a non-Lojban word.

Unfortunately, it is not possible to characterize brivla without
recourse to rafsi.
Both the slinku'i rule, and the tosymabru rule make use of rafsi strings.

mu'o mi'e xorxes




Thanks. As I said, i'm new to lojban. I had thought that lojban is something polished etc. but now I see it's more like work in progress. Am i correct? I suppose algorithm for recognizing words in lojban should be simple. If it is not, why use such fancy features like limitation of allowable consonant pair, using consonant pair only in brivla, not in cmavo etc. if it still doesn't help in recognizing what is brivla, and what isn't. It would be completely equal to learning all rafsi and cmavo from dictionary, if i understand what you said. So it'll be the similar problem which other languages have. Please, could you correct me if i'm wrong. As i said i'm new to lojban, and still don't know much of it.
Thank you for your help
Mateusz Grotek