Received: from nobody by stodi.digitalkingdom.org with local (Exim 4.80.1) (envelope-from ) id 1W5KYy-0002eN-M7 for lojban-newreal@lojban.org; Mon, 20 Jan 2014 11:31:12 -0800 Received: from mail-qa0-f43.google.com ([209.85.216.43]:65513) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1W5KYp-0002de-R1 for lojban@lojban.org; Mon, 20 Jan 2014 11:31:12 -0800 Received: by mail-qa0-f43.google.com with SMTP id o15so5845216qap.16 for ; Mon, 20 Jan 2014 11:30:57 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:date:message-id:subject:from:to:content-type; bh=kH3YjpjG1msn0ebdMxqTFXmzgTved3K/rkJOj3BLs7U=; b=diD19TthWA/zgJ/AMw9Ek/ISmXyutCoNkLCIA1oO+U7vSbY6kROZcP/wY95cPYotNA K3kQjMlTQC0vsdweaaCRYqbCQgtmI+lrWQedBaHSL/mltjcq0ldkFvqyFA4QbfiP28gj d76AA/04F8hc7aXthmHxbpLp9iNO6mpeMPlIvOcTk/JRWOZsByNsPGcaGfh9f0XMuhCq WuWgzrodbRIWHt1oqlS58Jji4hYfnoqSzbsYucl6iq4gzCiRFJb5oHcfhz8RBbJZVqa9 NDRwGcjSNFlfr7Jb+8OPQWRcmefZFAcmQu0yJKlL/MoXuGBf8Zv1xLPVgGnvwOBF7B2I w3mA== MIME-Version: 1.0 X-Received: by 10.224.127.131 with SMTP id g3mr30792723qas.98.1390246257418; Mon, 20 Jan 2014 11:30:57 -0800 (PST) Received: by 10.96.0.6 with HTTP; Mon, 20 Jan 2014 11:30:57 -0800 (PST) Date: Mon, 20 Jan 2014 14:30:57 -0500 Message-ID: Subject: simpler language enjoying lojban non-ambiguity properties? From: Warren D Smith To: lojban@lojban.org Content-Type: text/plain; charset=ISO-8859-1 X-Spam-Score: -0.1 (/) X-Spam_score: -0.1 X-Spam_score_int: 0 X-Spam_bar: / I'm interested in the idea of an artificial language enabling human-computer (H-C; also H-H and C-C) communication. The unambiguity properties (every sentence has exactly 1 parse, the sound-sequence uniquely specifies the word sequence) of lojban would be very helpful. More generally, every artificial language so far (500-1000 tries), has failed. Why try again for sure failure? Well, there is this: H-C communication would offer a big economic motivation, and to individuals, not just society wide. That is the first time ever in human history that this can be said. ---- However, it seems to me lojban is too complicated and at least some of that is intentional. Linguist Arika Okrent said constructing a valid lojban sentence is "like doing long division in your head," and I observe that a large fraction of the lojban sentences on your lojban.org web site are incorrect (proof: run your jboski program on them - I already did), ditto sentences on online lojban fora -- i.e. it is so difficult that even *you* often cannot do it even with considerably longer than real time to work on it. If incorrect lojban were common that would defeat the purpose (for me) of enabling easy H-C communication taking advantage of unambiguity theorems. Okrent also pointed out the full specification of lojban grammar was 600 pages, a hellish amount. As an example of "intentional" complexity: the "emotion tags" feature, while maybe desirable for poetry & stuff, is completely counterproductive for communicating with computers. Another criticism of lojban is the "culture neutral" feature. The result is that learning words is basically maximally difficult for everybody. The amount of (say) English in it has been diluted so much that English speakers have essentially no start-off advantage. In contrast Esperanto and Interlingua intentionally tried to give euro-speakers a start-off advantage by making the words and rules highly euro-like. That may not have been very useful for (say) Arabs trying to learn Interlingua, but it was a big win for euro-speakers. For H-C purposes, most computer users already are familiar with euro-languages (although maybe this is less true than it used to be) so the culture-neutral thing is a disadvantage. Also, as a matter of marketing, culture neutral seems bad -- you want to get a large core group of speakers fast, and being culture-biased will enable that. Esperanto and Interlingua are the most successful attempts ever and they went the biased route. I saw it was claimed at least one fluent lojban speaker existed (Nick Nicholas) with no further details (how many are there?). That suggests these objections are overcomable, EXCEPT that I saw, zero, I repeat zero, data on what fraction of Nick Nicholas's high speed utterances actually were valid lojban that passes jboski. If, say, only 20% do, then the claim he is a fluent lojban speaker, is kind of debatable and for H-C purposes this would be nearly useless. OK, this brings me to my QUESTION. Suppose it were desired to produce a simpler language (perhaps related to lojban the way "basic english" is related to English) still enjoying all un-ambiguity theorems, but with grammar describable in only 100 not 600 pages. Do you think this would be possible? I mean, it might be that the lojban creators did a pretty good job, and it is just not possible to do it that much simpler. Your guesses solicited. -- Warren D. Smith http://RangeVoting.org <-- add your endorsement (by clicking "endorse" as 1st step)