MIME-Version: 1.0
References: <8D08DAC0705BEED-E34-41DDD@webmail-d263.sysops.aol.com> <48cd77a8-350c-472c-b0f7-e1f527500707@googlegroups.com> <390cce56-e2e2-480e-8287-d58023e9ae6a@googlegroups.com> <2997de16-c428-4e61-ab16-0e593b58adfa@googlegroups.com>
Date: Mon, 25 Aug 2014 19:23:36 +0400
Subject: Re: [lojban] Re: Letter Frequency in lojban
From: Gleki Arxokuna
To: lojban@googlegroups.com
X-Original-Sender: gleki.is.my.name@gmail.com
Reply-To: lojban@googlegroups.com
Precedence: list
Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
X-Google-Group-Id: 1004133512417
Content-Type: multipart/alternative; boundary=bcaec53f35eb4a0160050175c502

--bcaec53f35eb4a0160050175c502
Content-Type: text/plain; charset=UTF-8

2014-08-25 18:58 GMT+04:00 TR NS <transfire@gmail.com>:

> Ever since I started learning Lojban, there was something about the sound
> of it that felt very unnatural. At first I thought it was just me not being
> familiar with it. As I spent more time with it and listened to usages of
> the language on YouTube, it became clear to me that it was more than this.
> In particular the letter `c` really stood out. Then last night I looked at
> letter frequency comparisons.
>
> I've seen a couple of different lists for Lojban and may do my own, but I
> think this list
>
>     iaouel'ncmsdkrtpbjzgvfxy
>
> (Note the list on the wiki isn't too far off from this, but includes
> various example lists in its corpus, which is not really a good sample of
> usage.)
>
> Now compare this to a wide swath of natural languages, which you can view
> here: http://simia.net/letters/.
>

This list doesn't contain Mandarin pinyin, which is important.

> (Note that the Russian alphabet can be a bit misleading, so refer to
> http://www.russianlessons.net/lessons/lesson1_main.php)
>
> While I do not think the placement of vowels is so significant, it is
> interesting to note that no natural language appears to have more than four
> vowels at the top of its list, and even that is fairly rare. `u` is almost
> always much further down the list. Also, `i` is very rarely the number one
> letter; `e` and `a` dominate. Of course, `i` topping the Lojban list is
> almost certainly due to the use of `.i` to start sentences. Regardless,
> Lojban is clearly vowel-heavy and a lot rides on clearly distinguishing all
> five of the primary vowel sounds.
>
> The more significant difference is in the consonants, where almost
> invariably the letters `n` `r` `s` `t` are near the top of every natural
> language's list. Following them are frequently `l` and `d`. `m`, `k` or `g`
> tend to be in the middle but sometimes creep further up. Compare that to
> Lojban with `l` `'` `n` `c` and `m` (l h n sh m) at the top and it really
> stands out. Only `n` is in a position we could deem natural.
>
> Some may dismiss this out of hand, but I think it is important and an
> inescapable reality: Lojban cannot become a widely spoken language for the
> simple reason that, in this regard, it is running contrary to many
> millennia of natural language evolution.
>
> --
> You received this message because you are subscribed to the Google Groups
> "lojban" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to lojban+unsubscribe@googlegroups.com.
> To post to this group, send email to lojban@googlegroups.com.
> Visit this group at http://groups.google.com/group/lojban.
> For more options, visit https://groups.google.com/d/optout.
>

--bcaec53f35eb4a0160050175c502--