Received-SPF: pass (google.com: domain of gleki.is.my.name@gmail.com designates 2a00:1450:400c:c00::22b as permitted sender) client-ip=2a00:1450:400c:c00::22b;
MIME-Version: 1.0
In-Reply-To: <f30c2627-3654-459a-be58-a02df1c15b19@googlegroups.com>
References: <CAO7bV+gR3H8+M4m2yVkgZbf66j8v9S0OAoTpduLCbNudgm=tbQ@mail.gmail.com>
	<df43dc5e-60f5-4171-b75e-57faa4fcdc21@googlegroups.com>
	<CAO7bV+g=4BwP3JQDE913cBmXJjAdrPCr8zxjf9RKcUDfgE8Xcg@mail.gmail.com>
	<f30c2627-3654-459a-be58-a02df1c15b19@googlegroups.com>
Date: Mon, 10 Nov 2014 10:07:58 +0300
Message-ID: <CAO7bV+gsFgoiW6s+zrqqwGRwDUUFMp7jxHgaKm=7H8EgKYEaqw@mail.gmail.com>
Subject: Re: [lojban] Re: se klani be lo kafkylerfu
From: Gleki Arxokuna <gleki.is.my.name@gmail.com>
To: "lojban@googlegroups.com" <lojban@googlegroups.com>
Reply-To: lojban@googlegroups.com
Precedence: list
Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
Sender: lojban@googlegroups.com
Content-Type: multipart/alternative; boundary=047d7b874afe8dbe1c05077bd2d7
X-Spam_score: -1.9
X-Spam_score_int: -18
X-Spam_bar: -

--047d7b874afe8dbe1c05077bd2d7
Content-Type: text/plain; charset=UTF-8

2014-11-10 0:42 GMT+03:00 TR NS <transfire@gmail.com>:

> On Saturday, November 8, 2014 9:31:49 AM UTC-5, la gleki wrote:
>>
>>
>>
>> 2014-11-08 17:15 GMT+03:00 TR NS <tran...@gmail.com>:
>>
>>>
>>>
>>> On Sunday, November 2, 2014 12:26:12 PM UTC-5, la gleki wrote:
>>>>
>>>> just for the record. some stats.
>>>> we take irc logs, only sentences in lojban.
>>>> we count the number of words with a given letter multiplied by their
>>>> frequency divided by the number*frequency of all words. If the same letter
>>>> occurs more than once in a word we count it as a singular occurrence. We
>>>> limit ourselves only to the first 3000 most frequent words.
>>>>
>>>> we get:
>>>> [x]  - found in 2.7% of all spoken in IRC logs words
>>>> ['] - 16.8%
>>>> [c] - 13.9%
>>>> [cx'] - 31.75% (at least one of those letters in each word)
>>>> [x'] - 19.47% (at least one of those letters in each word)
>>>> [cx] followed by a consonant - 2.11%
>>>>
>>>> to'u one of three words contains at least one of the three letters: [']
>>>> or [x] or [c].
>>>>
>>>
>>> So what can be done about it?  I think it's clear that there are too
>>> many "static noise" sounds in the language. And as a logical language there
>>> is no reason that it has to rank so low in sound quality (one need only
>>> look at online polls to see that Chinese, which has similar qualities,
>>> never ranks well).
>>>
>>
>> What? Impossible. Mandarin has two levels of fighting statis noise: tones
>> and the rest part of the sound system that to some degree overlap.
>>
>>
> Hmm... I don't mean static as in "hard to understand" I just mean the
> nature of the sound which doesn't rank high as an aesthetically pleasant
> sound. Chinese consistently ranks in the top of worst sounding language
> polls.
>

Okay, I thought you were talking about signal to noise ratio.
As for Chinese being aesthetically not pleasing  then this was certainly a
biased poll. Then why would >1 billion of speakers of various dialects
still use it? Why won't they start speaking let's say English instead? :D
And what if I tell you that I find it aesthetically pleasing?


But back to Lojban the solution can be to make /^CiV/ and /^CuV/ cmavo a
new alternative sounding preserving the existing sounding. Also if a /V'V/
dipthong is forbidden then it's allowed to pronounce it as /VV/. This will
lead to the following options in pronouncing words:

{ku'i} => {kui}/{ku'i} (choose the pronunciation that you like).
{o'e} => {oe}/{o'e} (choose the pronunciation that you like).
However,
{i'e} => {i'e} (since /ie/ is an allowed dipthong

I tried counting how many ' can be removed this way. Now I get 7.8% of all
words. Thus 16.8 - 7.8 = 9. However, I could miss some words.
I don't know if 9% of words with ' would be fine to you.
Another advantage  of such approach is that 7.8% of words can now be
pronounced shorter.
Also many ' are found in lujvo. If '-less rafsi are dispreferred then the
number of ' will be decreased even more. There is nothing wrong in saying
{selprami} instead of {selpa'i} unless someone uses the latter as a
nickname.


As for [x] it covers only 2.7% of all words. This letter can probably be
eliminated from gismu by replacing with {k}, short rafsi with {x} can be
eliminated at all and the corpus can be corrected since it can have
mistakes of another kind anyway. E.g. {xrula} can get an alternative
pronunciation of e.g. {flora}.

If such alternative make people happier then why not use it?
However, notice that 0.9% of all words is {xu}. The word {xamgu} is No. 86
in the frequency list.

i xu la'edi'u xamgu da'i do

-- 
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--047d7b874afe8dbe1c05077bd2d7
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><br><div class=3D"gmail_extra"><br><div class=3D"gmail_quo=
te">2014-11-10 0:42 GMT+03:00 TR NS <span dir=3D"ltr">&lt;<a href=3D"mailto=
:transfire@gmail.com" target=3D"_blank">transfire@gmail.com</a>&gt;</span>:=
<br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;bor=
der-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:sol=
id;padding-left:1ex"><div dir=3D"ltr">On Saturday, November 8, 2014 9:31:49=
 AM UTC-5, la gleki wrote:<blockquote class=3D"gmail_quote" style=3D"margin=
:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204)=
;border-left-style:solid;padding-left:1ex"><div dir=3D"ltr"><br><div><br><d=
iv class=3D"gmail_quote">2014-11-08 17:15 GMT+03:00 TR NS <span dir=3D"ltr"=
>&lt;<a>tran...@gmail.com</a>&gt;</span>:<span class=3D""><br><blockquote c=
lass=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left-width:1p=
x;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1=
ex"><div dir=3D"ltr"><div><div><br><br>On Sunday, November 2, 2014 12:26:12=
 PM UTC-5, la gleki wrote:<blockquote class=3D"gmail_quote" style=3D"margin=
:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204)=
;border-left-style:solid;padding-left:1ex"><div dir=3D"ltr">just for the re=
cord. some stats.<div>we take irc logs, only sentences in lojban.</div><div=
>we count the number of words with a given letter multiplied by their frequ=
ency divided by the number*frequency of all words. If the same letter occur=
s more than once in a word we count it as a singular occurrence. We limit o=
urselves only to the first 3000 most frequent words.</div><div><br></div><d=
iv>we get:</div><div>[x] =C2=A0- found in 2.7% of all spoken in IRC logs wo=
rds</div><div>[&#39;] - 16.8%</div><div>[c] - 13.9%</div><div>[cx&#39;] - 3=
1.75% (at least one of those letters in each word)</div><div>[x&#39;] - 19.=
47% (at least one of those letters in each word)<br>[cx] followed by a cons=
onant - 2.11%</div><div><br></div><div>to&#39;u one of three words contains=
 at least one of the three letters: [&#39;] or [x] or [c].</div></div></blo=
ckquote><div><br></div></div></div><div>So what can be done about it?=C2=A0=
 I think it&#39;s clear that there are too many &quot;static noise&quot; so=
unds in the language. And as a logical language there is no reason that it =
has to rank so low in sound quality (one need only look at online polls to =
see that Chinese, which has similar qualities, never ranks well).</div></di=
v></blockquote><div><br></div><div>What? Impossible. Mandarin has two level=
s of fighting statis noise: tones and the rest part of the sound system tha=
t to some degree overlap.</div><div><br></div></span></div></div></div></bl=
ockquote><div><br></div><div>Hmm... I don&#39;t mean static as in &quot;har=
d to understand&quot; I just mean the nature of the sound which doesn&#39;t=
 rank high as an aesthetically pleasant sound. Chinese consistently ranks i=
n the top of worst sounding language polls.</div></div></blockquote><div><b=
r></div><div>Okay, I thought you were talking about signal to noise ratio.<=
/div><div>As for Chinese being aesthetically not pleasing =C2=A0then this w=
as certainly a biased poll. Then why would &gt;1 billion of speakers of var=
ious dialects still use it? Why won&#39;t they start speaking let&#39;s say=
 English instead? :D</div><div>And what if I tell you that I find it aesthe=
tically pleasing?</div><div><br></div><div><br></div><div>But back to Lojba=
n the solution can be to make /^CiV/ and /^CuV/ cmavo a new alternative sou=
nding preserving the existing sounding. Also if a /V&#39;V/ dipthong is for=
bidden then it&#39;s allowed to pronounce it as /VV/. This will lead to the=
 following options in pronouncing words:</div><div><br></div><div>{ku&#39;i=
} =3D&gt; {kui}/{ku&#39;i} (choose the pronunciation that you like).</div><=
div>{o&#39;e} =3D&gt; {oe}/{o&#39;e}=C2=A0(choose the pronunciation that yo=
u like).</div><div>However,</div><div>{i&#39;e} =3D&gt; {i&#39;e} (since /i=
e/ is an allowed dipthong</div><div><br></div><div>I tried counting how man=
y &#39; can be removed this way. Now I get 7.8% of all words. Thus 16.8 - 7=
.8 =3D 9. However, I could miss some words.</div><div>I don&#39;t know if 9=
% of words with &#39; would be fine to you.</div><div>Another advantage =C2=
=A0of such approach is that 7.8% of words can now be pronounced shorter.</d=
iv><div>Also many &#39; are found in lujvo. If &#39;-less rafsi are dispref=
erred then the number of &#39; will be decreased even more. There is nothin=
g wrong in saying {selprami} instead of {selpa&#39;i} unless someone uses t=
he latter as a nickname.</div><div><br></div><div><br></div><div>As for [x]=
 it covers only 2.7% of all words. This letter can probably be eliminated f=
rom gismu by replacing with {k}, short rafsi with {x} can be eliminated at =
all and the corpus can be corrected since it can have mistakes of another k=
ind anyway. E.g. {xrula} can get an alternative pronunciation of e.g. {flor=
a}.</div><div><br></div><div>If such alternative make people happier then w=
hy not use it?</div><div>However, notice that 0.9% of all words is {xu}. Th=
e word {xamgu} is No. 86 in the frequency list.</div><div><br></div><div>i =
xu la&#39;edi&#39;u xamgu da&#39;i do</div></div></div></div>

<p></p>

-- <br />
You received this message because you are subscribed to the Google Groups &=
quot;lojban&quot; group.<br />
To unsubscribe from this group and stop receiving emails from it, send an e=
mail to <a href=3D"mailto:lojban+unsubscribe@googlegroups.com">lojban+unsub=
scribe@googlegroups.com</a>.<br />
To post to this group, send email to <a href=3D"mailto:lojban@googlegroups.=
com">lojban@googlegroups.com</a>.<br />
Visit this group at <a href=3D"http://groups.google.com/group/lojban">http:=
//groups.google.com/group/lojban</a>.<br />
For more options, visit <a href=3D"https://groups.google.com/d/optout">http=
s://groups.google.com/d/optout</a>.<br />

--047d7b874afe8dbe1c05077bd2d7--