[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[lojban] Re: Updated Letter Frequency Data
Robin Lee Powell scripsit:
> My data, sorted by number of occurences:
[snip]
> The only previous work on this I'm aware of is:
>
> http://www.lojban.org/files/papers/scrabble.unf
>
> Which, it turns out, is amazingly flawed (which is fine, because
> that was a long time ago!).
The two sets of statistics aren't comparable, because the Scrabble
data counts each distinct word only once, which is appropriate for
Scrabble. Your data (I assume) counts every letter in the running text.
--
John Cowan www.reutershealth.com www.ccil.org/~cowan jcowan@reutershealth.com
Arise, you prisoners of Windows / Arise, you slaves of Redmond, Wash,
The day and hour soon are coming / When all the IT folks say "Gosh!"
It isn't from a clever lawsuit / That Windowsland will finally fall,
But thousands writing open source code / Like mice who nibble through a wall.
--The Linux-nationale by Greg Baker