[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[lojban] Re: Two new gismu, "stomach" and "back of body". And a proposal for an updated method of generating gismu.



coi la gleki

Just commenting on one point, the German source words.

pe'i If German "Magen" and "Rücken" were to be considered, they should be transcribed "magyn" and "rikyn", respectively. IPA is, more or less, ['maːgən]/[maːgn̩] and ['ʁʏkən]/[ʁʏkn̩]. "ü" is customarily transcribed as "ю" in, eg, Russian, which would be lojbanised as "iu", but it's a simple front vowel in German, the nearest Lojban equivalent of which is probably "i". Also, although "Rücken" is historically "rück" + "en" (I guess), the "en" in "Rücken" is pe'i not a productive suffix, so probably it shouldn't be removed before Lojbanisation. I'm not sure about that last point, though.

Also, I have of course not the slightest idea if this changes the outcome of your gismu generation.

-iesk

Le mardi 21 août 2012 17:07:36 UTC+2, la gleki a écrit :
Robin wanted those two new gismu and so I started preparing their sounding.

At first I asked myself "Are those frequencies taken from Atlas of the world true or correct? Are there any flaws in the method of generating gismu from 6 languages?"

So here is my new method.

1.At first I opened this wikipedia article. Unfortunately Wikipedia just removed the data that we need in it's newer revisions of the page.
2.Write out the number of native (I'll call them L1) speakers of first most common languages. Determine the number of L2 speakers as L2=(total number of speakers - L1 speakers).
3. In one special case we have L3 speakers.
4. Determine the corrected frequency as =L1+L2 / 2 + L3 / 3
5. Leave first 12 languages (I'll tell you later why 12)
5. Convert them to fractions so that the sum=1.
6.Therefore we get (3-letter ISO-codes of languages in the 1st column)

  l1 l2 l3 total cor.freq. fraction
cmn 845 180   1025 935 0.246117
eng 375 375 750   812.5 0.213872
spa 329 61   390 359.5 0.09463
ara 232 220   452 342 0.090024
hin 240 165   405 322.5 0.084891
ben 181 69   250 215.5 0.056725
rus 144 106   250 197 0.051856
por 178 15   193 185.5 0.048829
jap 122 1   123 122.5 0.032245
pun 109 0   109 109 0.028692
deu 90 28   118 104 0.027376
fra 68 52   120 94 0.024743
jav 85 0   85 85  
wu 77 0   77 77  
mar 75 3     76.5  
tel 70 5   75 72.5  
vie 69 0   69 69  
             
            1
7.Now some general stuff. I stopped at French cuz it's a Romance language similar to Poruguese and Spanish so it might shift the frequencies of phonemes a bit.
8.Punjabi and Bengali are no longer considered as variations of Hindi.

9.Now let's find etymologies for our gismu.
I can recommend the following links (warning! unsorted)
for transliterating Arabic script robsmart.co.uk » Transliteration again … now complete
 punjabi dic stomach | Meaning of stomach | Punjabi Dictionary | iJunoon
 punjabi dic Punjabi(Gurmukhi,Shahmukhi) to English Dictionary:: ACTDPL Punjabi University, Patiala
 no comments Google Translate
 Hindi stomach meaning in Hindi and English - Shabdkosh.com | शब्दकोश.कॉम
Hindi English to Hindi Dictionary and Translation
Hindi Hindi English Dictionary Online
 Bengali stomach meaning in Bengali and English - Shabdkosh.com | অভিধান.कॉम
 Bengali stomach - A Bengali-English dictionary
intend translation Portuguese | English-Portuguese dictionary | Reverso Collins
 Japanese Find words - Denshi Jisho
German am going to : Dictionary / Wörterbuch (BEOLINGUS, TU Chemnitz)
audio samples forvo.com


10.Some dictionaries contain audio samples.
11.Notice that Punjabi, Bengali and Hindi (together with Urdu) are quite similar. You usually should get the same sounding.
12. So prepare the sounding using standard methods of it's adapting to lojban phonology as described in the CLL.
13.Now let's use scoreGismu perl app.

For "stomach" we get
cmn vei 0.253388
eng beli 0.22019
spa bientre 0.097425
ara muada 0.092683
hin pet 0.087398
ben pet 0.058401
rus jeludyk 0.053388
por bariga 0.050271
jap xara 0.033198
pun pet 0.028692
deu magen 0.028184
fra vontr 0.025474

With first 4 languages it outputs 
vreli velti venli velbi 0.691286 (rating)
With all 12 languages it  outputs
vetli 0.535
(rating)


I'd choose {vetli} as it's similar to Chinese vei, English belly, Indian "pet" and Romance "ventr-"


For "back of human body" we get
cmn beiji 0.253388
eng bak 0.22019
spa espalda 0.097425
ara vyxra 0.092683
hin pic 0.087398
ben pic 0.058401
rus spin 0.053388
por kostas 0.050271
jap se 0.033198
pun pic 0.028692
deu riuk 0.028184
fra dos 0.025474

Top score is
bekpi bajdi 0.50684 4 langs
bekpi 0.4697 12 langs

I'd choose {bekpi} which reminds me of Chinese bei, English back, Indian pic and Russian sPIna.

Robin approved them several hours ago in chat but I'm free to suggestions.

Notes.
Still 6 languages is enough. 4 languages are not enough. More than 12 languages is a useless waste of time (but If you can collect translations please do).

--
You received this message because you are subscribed to the Google Groups "lojban" group.
To view this discussion on the web visit https://groups.google.com/d/msg/lojban/-/DrO1ONmWqngJ.
To post to this group, send email to lojban@googlegroups.com.
To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban?hl=en.