[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] Named multiples

To: lojban@googlegroups.com
Subject: Re: [lojban] Named multiples
From: Jorge Llambías <jjllambias@gmail.com>
Date: Wed, 19 May 2010 19:13:47 -0300
Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:received:x-beenthere:received:received:received :received:received-spf:received:mime-version:received:received :in-reply-to:references:date:message-id:subject:from:to :x-original-authentication-results:x-original-sender:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe:content-type :content-transfer-encoding; bh=9AwPk2A3knUMbMxOoe0TXlcV/uYPo++Q/4N9aMO1f3I=; b=dyTvacl7RTL/KFk9nJfxsxfoWMoDqvCQNao7L7FwCEZOaeeVc56E+MyyWzDWDMdv46 +iuIYClrUgujUZCA0w1GTxzjQRM20hVwk0sP86bW6s+2GD597CrIyv8PRVooSoyxVyeq TXHsbzp1v2/mjD/+roqIHiHZY+NFCZJIOmsuw=
Domainkey-signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:mime-version:in-reply-to:references:date :message-id:subject:from:to:x-original-authentication-results :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type:content-transfer-encoding; b=HxKDJx0zLviqb9N5Ns0RharABmZtBebrCtcKzCtAHQNXmMLVQmvBJQMxFdudvCyagM fL3gmnscEMywLqEU+nBaKhvUHJIM5We+DT19J+4D+f6Eo90Moar2B2tiHRtzac/BYu1h ZTEUG5OFtLXjW6WiPCvFeVga6gbwKPl+kb/so=
In-reply-to: <AANLkTilMz5dY_bo6j1Gi-nRG-Zu1LyjiWn0dN1v_VFcq@mail.gmail.com>
List-archive: <http://groups.google.com/group/lojban?hl=en_US>
List-help: <http://groups.google.com/support/?hl=en_US>, <mailto:lojban+help@googlegroups.com>
List-id: <lojban.googlegroups.com>
List-post: <http://groups.google.com/group/lojban/post?hl=en_US>, <mailto:lojban@googlegroups.com>
List-subscribe: <http://groups.google.com/group/lojban/subscribe?hl=en_US>, <mailto:lojban+subscribe@googlegroups.com>
List-unsubscribe: <http://groups.google.com/group/lojban/subscribe?hl=en_US>, <mailto:lojban+unsubscribe@googlegroups.com>
Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
References: <x2na36d16c81005070403he9fc3715j2ba204bbb352b3e7@mail.gmail.com> <AANLkTikRzl3oCjgencB1pMtCVVM0DXb1Tqq5eYBhm0D7@mail.gmail.com> <AANLkTiktSBYwThJ14M5RWTuTwOE9GT2-xgSQsad5WSk_@mail.gmail.com> <AANLkTilJGdhWa6j8qSw6LSUwvcCY7bwFNsqWZCvz0rbU@mail.gmail.com> <20100518160715.GA6617@sdf.lonestar.org> <AANLkTikawYaExese00dPXEIyh25zBfn7-PEKNU73ispV@mail.gmail.com> <AANLkTimBgGc9yTMrS2yrvq1pU2aqVz3B2KMy3H62WZSE@mail.gmail.com> <AANLkTinwyXwLuFw-J7xo8rlPd8ngzkkeuwRpaUd9Brqb@mail.gmail.com> <AANLkTikKgAgi9hP25cPn57F6bsS2gI81IBZVUnoi8KXm@mail.gmail.com> <AANLkTilMz5dY_bo6j1Gi-nRG-Zu1LyjiWn0dN1v_VFcq@mail.gmail.com>
Reply-to: lojban@googlegroups.com
Sender: lojban@googlegroups.com

On Wed, May 19, 2010 at 5:12 PM, Oleksii Melnyk <lamelnyk@gmail.com> wrote:
> 2010/5/19 Jorge Llambías <jjllambias@gmail.com>
>
>> Can you give an example? ... Are you talking about human parsing?
>
> Yes. Let's take an example from NORALUJV.txt. I've spotted "backemselrErkru"
> (: it is long enough to either got out of air in the middle of it, or just
> need to look into the dictionary to recall the next rafsi, whatever :).

A human parser will either recognize the word, in which case any minor
errors in pronunciation will probably be ignored, or (most likely)
they won't recognize it, in which case they will say "ki'a" whether
there were any errors in pronouncing that monster-word or not.

> Any
> pause after the vowel and the stressed "lrE" leaves us with "rkru" as the
> last rafsi, so here was some error. Any pause after the consonant will alert
> us about "no LA before the cmene"(not morphology level, but easy/close
> enough for humans).

There are many contexts in which cmevla can occur, not just after LA.
You could be talking about the word "zo backemselrerkru" for example.

> Pause after the "CVV" rafsi (not in an example), will be
> noticed as "no stress in previous word". Only in "la backemselrErkru" the
> pause between the rafsi's will go unnoticed. In the cmevla, after the
> required LA, we do expect the sequence of names, so breaking one into
> several will cause almost no harm, we'll just glue them together up to the
> last consonant ending word.

I'm not sure I follow what you are saying. LA can be followed by more
than one cmevla, so if you say for example: "la pip backemselrerkru",
and you pause somewhere after a consonant of the lujvo, it will be
taken (at least by a mechanical parser) as a cmevla.

> So, we are almost immune to the extra pauses inside the long words. Now,
> without the required LA, we'll get:
>
> ba.ckemselrErkru - can fail
> bac.kemselrErkru - OK
> back.emselrErkru - can fail

That's a weird place for an unplanned pause.

> backe.mselrErkru - can fail

Another weird place to stop.

> backem.selrErkru - OK
> backems.elrErkru - can fail

Another weird place to stop.

> backemse.lrErkru - can fail

Another weird place to stop.

> backemsel.rErkru - OK
> backemselr.Erkru - can fail

Another weird place to stop.

> backemselrE.rkru - can fail

Another weird place to stop.

> backemselrEr.kru - OK, can eat the next word
> backemselrErk.ru - OK

Another weird place to stop.

> backemselrErkr.u - OK

Another weird place to stop.

> Note, that all the "fails" only "_can_ be"; if there are the otherwise
> _allowed_ pause after the entire word, it will parse, giving us 2 wrong
> meaningful chunks.

I still don't get what the point of this is.

The only realistic unintended pauses are:

   ba.ckemselrerkru
   bac.kemselrerkru
   backem.selrerkru
   backemsel.rerkru
   backemselrer.kru

The last one the least realistic, because of the stress.

All the others are not reasonable places for unintended stops.

The human hearer will have to decide whether or not to take any such
pauses seriously based on the resulting meaning.

>> The exact same problem exists with or without the change. The
>> change has no relevance to word parsing.
>
> So, the change affects an error detection. If the humans all were the
> reliable electronic devices with the error detection codes in the
> communication channel, that would be irrelevant. However they are not, and
> the language was meant to be "error detecting communication channel" for
> them.

Are you saying that with the current grammar the human will say: "lo
backem.selrerkru" is ungrammatical, therefore I will attempt a
possible correction to "lo backemselrerkru, which fixes the problem",
whereas with the change the human will say "lo backem.selrerkru" is
grammatical, therefore I will not attempt a correction, even though
what I'm hearing makes very little sense". Is that the point?

>> Why would a text be full of cmevla? As you say, cmevla are a
>> cumbersome type of word, so they don't blend well with normal Lojban
>> words. Simplifying their syntax would not make them morphologically
>> prettier, it would only make the syntax simpler.
>
> I do like the idea. I just looking for the bad consequences. The result of
> the change would be the wider usage of cmevla. Otherwise, there are no
> reason to make it simpler.

That's not the motivation though. If we don't want cmevla, we should
remove them from the language, not complicate their grammar
needlessly. (Not that their grammar is too complicated, just more
complicated than what it needs to be.) The motivation is not to use
more cmevla, but to allow things like "la cmalu djan" for a name like
"Little John". There is no reason why the change should encourage more
cmevla, since anything that can  be said with the change can already
be said in some other way without the change.

> Trying to read them aloud would push "the live
> language" towards toki pona (as the best case).

I don't understand what you mean by that. Trying to read what aloud
would push the language towards toki pona?

> "." is OK in writing. Not so in speech. We do need to think/breath/rest/etc.
> sometimes. That sounds just as ".......". So, using "." as the syntax marker
> looks bad for a human-spoken audio-video isomorphic language.

Long words are bad for Lojban, I agree, whether they are lujvo,
fu'ivla or cmevla. But the proposed change doesn't really affect that.

mu'o mi'e xorxes

-- 
You received this message because you are subscribed to the Google Groups "lojban" group.
To post to this group, send email to lojban@googlegroups.com.
To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban?hl=en.

Follow-Ups:
- Re: [lojban] Named multiples
  - From: Oleksii Melnyk <lamelnyk@gmail.com>

References:
- [lojban] Named multiples
  - From: Daniel Brockman <daniel@brockman.se>
- Re: [lojban] Named multiples
  - From: Luke Bergen <lukeabergen@gmail.com>
- Re: [lojban] Named multiples
  - From: Daniel Brockman <daniel@brockman.se>
- Re: [lojban] Named multiples
  - From: Luke Bergen <lukeabergen@gmail.com>
- Re: [lojban] Named multiples
  - From: Minimiscience <minimiscience@gmail.com>
- Re: [lojban] Named multiples
  - From: Oleksii Melnyk <lamelnyk@gmail.com>
- Re: [lojban] Named multiples
  - From: Jorge Llambías <jjllambias@gmail.com>
- Re: [lojban] Named multiples
  - From: Oleksii Melnyk <lamelnyk@gmail.com>
- Re: [lojban] Named multiples
  - From: Jorge Llambías <jjllambias@gmail.com>
- Re: [lojban] Named multiples
  - From: Oleksii Melnyk <lamelnyk@gmail.com>

Prev by Date: Re: [lojban] Named multiples
Next by Date: Re: [lojban] Named multiples
Previous by thread: Re: [lojban] Named multiples
Next by thread: Re: [lojban] Named multiples
Index(es):
- Date
- Thread