Received: from mail-wi0-f187.google.com ([209.85.212.187]:35849) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.76) (envelope-from ) id 1UlcRx-0006iV-DH for lojban-list-archive@lojban.org; Sun, 09 Jun 2013 03:02:21 -0700 Received: by mail-wi0-f187.google.com with SMTP id hn3sf51227wib.14 for ; Sun, 09 Jun 2013 03:02:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=x-beenthere:date:from:to:subject:message-id:in-reply-to:references :x-mailer:mime-version:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type :content-transfer-encoding; bh=4Ha/4a4CPZz9waw1Yo3qEBUfIRs4vqBSCQaJ8h615ls=; b=PHItPXl0FUgSbVyn5oMpKJvuSQLnHu9nc3OG5W6IBDyhR5dvZMbXqQ77ttqh7imkvd 5xoMh3lWOJGQtvv3U2b01lwvXG4giu9O4dtGCDQ11zpjoPHYgI1xf5Vjh4szKBAD7vi7 d0nZJxKCf7rjjDDhRqIpOxaZtZcbvRhKMYHpj87Pgto7+pQSj2U6z5BNaiBbJK03Y7Yj ZSE0JDfX+hMr59A2Xm6WdMVvcq869XLeZlWZCsTV09H97uhe6lywpJrfSHuWEGPI5Yrr +8EQDAvtTNsqUaxoJZR2vPL+Aq7DwnSY5tsjSc3yeUCJJs5YG598fIMy+rqLwuUQNop2 n5zg== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=x-beenthere:date:from:to:subject:message-id:in-reply-to:references :x-mailer:mime-version:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type :content-transfer-encoding; bh=4Ha/4a4CPZz9waw1Yo3qEBUfIRs4vqBSCQaJ8h615ls=; b=GdZtlFqeaCD1QOq9XwsohNLldA+ZTJ9AUNsUsFZEAUSbrpnxxT9BvRHy5dFx9vBfTL 2imyvVDo9QFwwonLuziicPNQewlgsHv9VybdvJsaECh2PQPTNTfKTovvp7o/58KJq9vb yP/Qv3f30ued5Vx1A0zAJSYBAdCsB4fn4vRkVxtsEGvBswljmWBlIxsVgxp9cfCZYNUY +j3bZIhpBLcq/3Yip6m+eJPh/7QTsHq1xgMEvExkgXUG+mL/KEnJGiYd/AAtGwhNp6uL 5vmVRvJux+P93uZKSXYrsU9XSDbByydqgTAnTbWl9QB7dYp0vdqPmdNxYfxSlvHFRQrE IfmA== X-Received: by 10.180.10.234 with SMTP id l10mr182332wib.0.1370772125960; Sun, 09 Jun 2013 03:02:05 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.180.109.136 with SMTP id hs8ls568956wib.37.gmail; Sun, 09 Jun 2013 03:02:05 -0700 (PDT) X-Received: by 10.180.76.76 with SMTP id i12mr1991618wiw.6.1370772125227; Sun, 09 Jun 2013 03:02:05 -0700 (PDT) Received: by 10.194.6.194 with SMTP id d2mswja; Sat, 8 Jun 2013 23:53:35 -0700 (PDT) X-Received: by 10.180.160.212 with SMTP id xm20mr1912920wib.0.1370760815033; Sat, 08 Jun 2013 23:53:35 -0700 (PDT) Received: from mail-we0-x22f.google.com (mail-we0-x22f.google.com [2a00:1450:400c:c03::22f]) by gmr-mx.google.com with ESMTPS id e9si176182wib.3.2013.06.08.23.53.35 for (version=TLSv1 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Sat, 08 Jun 2013 23:53:35 -0700 (PDT) Received-SPF: pass (google.com: domain of daeldir@gmail.com designates 2a00:1450:400c:c03::22f as permitted sender) client-ip=2a00:1450:400c:c03::22f; Received: by mail-we0-f175.google.com with SMTP id t59so3953022wes.6 for ; Sat, 08 Jun 2013 23:53:35 -0700 (PDT) X-Received: by 10.194.63.229 with SMTP id j5mr2733043wjs.79.1370760814943; Sat, 08 Jun 2013 23:53:34 -0700 (PDT) Received: from nezumi ([2a01:e35:2ef1:f370:76e5:43ff:fe7e:86f9]) by mx.google.com with ESMTPSA id b11sm4966986wiv.10.2013.06.08.23.53.32 for (version=TLSv1.2 cipher=ECDHE-RSA-RC4-SHA bits=128/128); Sat, 08 Jun 2013 23:53:33 -0700 (PDT) Date: Sun, 9 Jun 2013 08:53:47 +0200 From: Daeldir To: lojban@googlegroups.com Subject: Re: [lojban] Number of unique symbols needed to write lojban Message-Id: <20130609085347.30689d8aded469d57d5a7967@gmail.com> In-Reply-To: References: X-Mailer: Sylpheed 3.3.0 (GTK+ 2.24.18; i686-pc-linux-gnu) Mime-Version: 1.0 X-Original-Sender: daeldir@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of daeldir@gmail.com designates 2a00:1450:400c:c03::22f as permitted sender) smtp.mail=daeldir@gmail.com; dkim=pass header.i=@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-Spam-Score: -0.1 (/) X-Spam_score: -0.1 X-Spam_score_int: 0 X-Spam_bar: / On Sat, 8 Jun 2013 13:36:41 -0500 Terry wrote: > How many unique symbols would be needed for all sounds and punctuation in= written lojban? Do you plan to do some stuff with lojban alphabet? I did some work on lojban symbols a while ago to pass the time (when I had = some=85). An HTML page to compare all proposed alphabets for lojban. Curren= tly, there is latin alphabet, cyrillic alphabet, tengwar mode, and I added = braille, just to see what it gives. I wanted to add hiragana, but the conversion algorithm I found was=85 A lit= tle too complex/undefined for me to work on it. I also thought about a hang= ul system, as hangul has some interesting linguistic properties (so says wi= kipedia), but=85 I don't know hangul, so I didn't spend time on it neither. I also tried to design an alphabet from scratch, with sound features in min= d but=85 Well, Tolkien did a better job, mine was useless compared to tengw= ars (even if tengwars are more complicated that needed for lojban, using a = subset works fine). For now, stressed characters are in uppercase (it doesn't recognize accents= ). Unknown characters are replaced with =93..=94 and numbers are replaced w= ith their name (1 is =93pa=94, 45 is =93vomu=94=85). While that doesn't answer your question, I can say that for that work, I us= ed 27 lowercase characters (including punctuation and *space*). Uppercase = was checked only for latin letters, not the apostrophe (so, I don't recogni= ze the "h" as a letter in my program), which gives 23 more symbols. So, 50 = symbols are used (counting space, no =93h=94). However, at least in the tengwar mode, there are more (or less) letters: 12= vowels (=93full=94 vowels, which are a symbol, and =93small=94 vowels, whi= ch are like accents), 18 consonants, one punctuation (=93'=94 is counted as= a letter, and with tengwar, the comma is useless). Because each full lette= r (six vowels, 18 consonnants) can have an =93accent=94, that is, can be fo= llowed by a small vowel, we have 24=D77=3D168 letters, (we count 7 =93vowel= s=94, one being =93no vowel=94), that, time two, for stressed syllables: 33= 6 symbols are used in the tengwar mode. However, it is really 32 symbols, s= ince each of the 336 final symbols are from one to three superposed symbols= (24 full letters + 6 small vowels + stress + space). So, that is less than= the 50 symbols used in the latin alphabet. Also, we could think about the = way tengwar are designed, and decompose them in even less symbols (each one= being a feature of the represented sound). Here is the code I used to =93classify=94 each latin letters: > maps.latin =3D { > letters: "abcdefgijklmnoprstuvxyz.,' ", > vowels: "aeiouy", > consonants: "bcdfgjklmnprstvxz", > punctuations: ".,' " // (=93'=94 should be a consonant and not a punctu= ation, but, he!=85)=20 > }; What matters for =93counting=94 the symbol is only the =93letters=94 field.= And adding stress (uppercase characters). You can give it a try here: http://daeldir.ninm.net/lojban-alphabets/alphabets.html (needs javascript t= o work, and a recent navigator, at least for displaying tengwar) I wanted to make the code more readable before releasing it, but as I don't= have time, and will spend the small amount I have on translations rather t= han on this code, I guess I won't wait more=85 I have another page with my attempt at a new alphabet, but it is not on the= net, and not very interesting. That may not answer totally your question, but it shows that the number of = symbols needed to write lojban can vary from 32 to 336, depending on how yo= u analyze the language and the alphabet (are =93a=94, =93e=94, =93i=94, =93= =E4=94, =93=EB=94 and =93=EF=94, six symbols, or four? Is the dot really ne= eded, as it is said (at least in the wave lessons, I don't remember for the= CLL) that it is always optionnal?). Also, keep in mind that I'm a beginner=85 But you already have better answe= rs from more experienced people. Mine is a=85 Complement. mu'o mi'e la .daeldir.lurni'acadz. --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban?hl=3Den. For more options, visit https://groups.google.com/groups/opt_out.