
Top 12 languages in speakers - Loglan/Lojban 1995 list



The 12 most spoken languages - 1995 Loglan/Lojban Project baseline values

Following is data derived from the 1995 Encyclopedia Britannica Book of
the Year regarding language populations for the top 12 languages, which
are the baseline set for the Loglan/Lojban project (only the top 6 are
used for Lojban gismu making).  For comparison, summary numbers from
1994 are also shown, along with the amount of change.  I think that these
numbers serve as a fairly authoritative estimate of the number of
speakers of the 12 languages, and unlike other published estimates, my
methodology in generating the numbers is open to inspection, along with
source data for individual countries.  (See the signature for the Lojban
ftp site; the file "LANGSTAT.95" has the supporting data.)

The number of 2nd language speakers is determined by taking actual
counts of such 2nd language (or creole) speakers generated by official
sources and reported in the Britannica.  An increment is added to
reflect 2nd language literacy in the official language of a country,
presuming that all official languages of the country are taught in the
schools, based on official-source literacy figures.  Finally, for
Arabic/Moslem countries, the status of Arabic as a religious language is
used to generate an additional increment.  This is most significant for
Iran where the religion is heavily state supported even though the
official language is not Arabic, and there are few native speakers of
that language.

From these numbers, the Lojban gismu-making weight for each language is
determined by summing the number of native speakers and 1/2 the total
from all 3 methods of estimating 2nd language speakers (the 3 estimates
can be added together since the methods include an elimination of
overlap in the calculation), and then normalizing across the 6 languages
so that the weights total 1.0.  The total of 1st and all 2nd language
speakers is not used in the Lojban algorithm.
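
As a minimal sketch of the arithmetic just described (the function name
gismu_score is hypothetical, chosen only for illustration, and is not
taken from the LANGSTAT.95 file), the unnormalized value for one language
is the native count plus half of the summed 2nd language estimates.  The
figures plugged in are the 1995 Chinese and Arabic values from the
summary that follows.

```python
def gismu_score(native, second_language_parts):
    """Native speakers plus half of all 2nd language speakers (millions).

    second_language_parts holds the separate estimates
    (2nd/creole counts, literacy increment, religion increment).
    """
    return native + 0.5 * sum(second_language_parts)


# Chinese, 1995: native 801.552; 2nd/creole 314.039; literacy 25.225
chinese = gismu_score(801.552, [314.039, 25.225])

# Arabic, 1995: native 205.272; 2nd/creole 0; literacy 19.705; religion 46.991
arabic = gismu_score(205.272, [0, 19.705, 46.991])

print(f"Chinese {chinese:.3f}")
print(f"Arabic  {arabic:.3f}")
```

These two results agree with the native+1/2*2nd values listed for
Chinese and Arabic in the 1995 summary.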

The 1995 numbers are summarized as follows (in millions):

             native          2nd/creole+literacy+religion   total speakers
             native+1/2*2nd
             normalized weight (6 languages total 1.0)   (change from 1994)

Chinese      801.552         314.039+25.225                 1140.816
             971.184
             .347   (-.001)

Hindi        413.231         66.39+206.000                   685.621
             549.426
             .196   (+.002)

English      334.786         187.907+59.895                  582.588
             448.343
             .160   (-.003)

Spanish      330.999         12.644+11.531                   355.174
             343.086
             .123   ( 0)

Russian      210.948         0+77.965                        288.913
             249.930
             .089   (+.001)

Arabic       205.272         0+19.705+46.991                 271.968
             238.620
             .085   (+.001)


Bengali      183.860         0+.927                          184.787
             184.323
Portuguese   166.662         6.294+10.028                    182.984
             174.823
Japanese     125.086         0                               125.086
             125.086
French       74.529          41.198+29.477                   145.204
             109.866
Malay-Indon. 37.752          137.526                         175.278
             106.515
German       94.768          1.714+8.511                     104.993
             99.880
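
The normalized weights for the top 6 languages can be reproduced from
the native+1/2*2nd values exactly as listed above: each value is divided
by the sum of the six.  This is only a sketch of that normalization step;
the variable names are illustrative, and the inputs are the listed values
rather than anything recomputed from the country-level data.

```python
# native+1/2*2nd values for the 6 gismu-making languages, as listed (millions)
scores_1995 = {
    "Chinese": 971.184,
    "Hindi": 549.426,
    "English": 448.343,
    "Spanish": 343.086,
    "Russian": 249.930,
    "Arabic": 238.620,
}

total = sum(scores_1995.values())

# Each weight is the language's share of the six-language total.
weights_1995 = {lang: round(score / total, 3)
                for lang, score in scores_1995.items()}

for lang, weight in weights_1995.items():
    print(f"{lang:<8} {weight:.3f}")
```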

These were the 1994 numbers.  (The Russian 2nd language numbers were of
lower quality because most of the ex-CIS states had no reported literacy
figures in the Britannica, and I used a floor value that turned out to
be significantly low.)

             native          2nd/creole+literacy+religion
             native+1/2*2nd
             normalized weight for 6 languages based on 1.0 total.

Chinese      792.183         310.584+21.957
             958.454
             .348

Hindi        405.745         56.6+201.225
             534.658
             .194

English      329.906         163.662+73.214
             448.343
             .163

Spanish      325.856         7.069+15.723
             337.252
             .123

Russian      210.772         0+62.456
             242.000
             .088

Arabic       198.468         0+19.264+48.323
             232.262
             .084


Bengali      180.290         0+.904
             180.742
Portuguese   164.124         6.199+9.238
             171.843
Japanese     124.6059        0
             124.6059
French       72.589          41.384+23.010
             104.786
Malay-Indon. 36.656          .04+133.053
             103.203
German       91.616          1.716+7.625
             96.287

For comparison, here are the total speakers from the 1987 World Almanac
and the numbers used in the original 1987 Lojban gismu-remaking effort,
which were based on the 1985 Britannica BotY.  Note that Hindi passed
English in about 1989 due to rapidly increasing numbers of native
speakers along with a major increase in literacy, which is continuing.
The drop in native English, French, German, and Indonesian speakers is
due to the switching of creole speakers and some estimates of non-native
official language speakers (especially in Africa) from native to 2nd
language totals.

                1987               1987 gismu-remaking                  1995
             World Almanac      native  2nd     n+1/2s  norm. weight    weight
Chinese         788             752.1   319.1   911.7   .360            .347
English         420             366.5   322.4   527.7   .208            .160
Hindi           382             294     200.3   394.2   .156            .196
Spanish         296             264.7   58.2    293.8   .116            .123
Russian         285             164.3   109.7   219.12  .087            .089
Arabic          177             155.9   57.7    184.8   .073            .085

Bengali         171             87      80.8    127.4
Portuguese      164             110.4   45.5    133.2
Malay/Indon.    128             121.1   39.5    140.9
Japanese        122             120.1   0.6     120.4
German          118             105.4   18.3    114.6
French          114             81.1    75.5    118.9
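
The shift in the six gismu-making weights between the 1987 effort and
1995 can be read off the two weight columns above.  A minimal sketch of
that comparison follows; the variable names are illustrative only, and
the inputs are just the weights as tabulated.

```python
# Normalized weights copied from the two weight columns of the table above
weights_1987 = {"Chinese": 0.360, "English": 0.208, "Hindi": 0.156,
                "Spanish": 0.116, "Russian": 0.087, "Arabic": 0.073}
weights_1995 = {"Chinese": 0.347, "English": 0.160, "Hindi": 0.196,
                "Spanish": 0.123, "Russian": 0.089, "Arabic": 0.085}

# Change in weight for each language, 1995 minus 1987
change = {lang: round(weights_1995[lang] - weights_1987[lang], 3)
          for lang in weights_1987}

# Languages ordered by weight, largest first, for each year
ranked_1987 = sorted(weights_1987, key=weights_1987.get, reverse=True)
ranked_1995 = sorted(weights_1995, key=weights_1995.get, reverse=True)

for lang in ranked_1995:
    print(f"{lang:<8} {weights_1995[lang]:.3f}  ({change[lang]:+.3f})")
```

The ordering by weight shows Hindi moving from third place in the 1987
weights to second place in 1995, ahead of English.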

----
lojbab = Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA  703-385-0273  lojbab@access.digex.net
Ask me about the artificial language Loglan/Lojban, or see the Lojban WWW Server
               http://xiron.pc.helsinki.fi/lojban/
We also have material available via ftp (ftp.cs.yale.edu, directory pub/lojban)
and an email mailing list (listserv@ and lojban@cuvmb.cc.columbia.edu).  The
   LLG is funded solely by contributions, which are needed in order to
               support electronic and paper distribution.