Re: Lojban Certification Program
On Fri, Sep 18, 2009 at 2:30 PM, Matt Arnold <email@example.com> wrote:
> I think the question is whether to use the most common 500 words, or
> weight it in favor of cmavo.
In favor of or against cmavo? I think it is cmavo that are
overrepresented in the initial segment. In the first 100 words there
are only 18 gismu. It's pretty hard to construct sentences which use
82 cmavo but are constrained to only 18 gismu.
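As a rough illustration of how one might count the gismu in an initial segment of a frequency list, here is a minimal Python sketch. It classifies words by shape only (five letters, CVCCV or CCVCV) and does not check against the official gismu list or the permitted consonant clusters, so it is an approximation. The names `is_gismu_shaped` and `count_gismu` and the sample list are made up for the example; they are not taken from the actual frequency files.

```python
VOWELS = set("aeiou")
CONSONANTS = set("bcdfgjklmnprstvxz")

def shape(word):
    """Map a word to its consonant/vowel pattern, e.g. 'klama' -> 'CCVCV'."""
    out = []
    for ch in word.lower():
        if ch in VOWELS:
            out.append("V")
        elif ch in CONSONANTS:
            out.append("C")
        else:
            out.append("?")  # apostrophe, period, y, etc.
    return "".join(out)

def is_gismu_shaped(word):
    """True if the word has one of the two gismu shapes (shape test only)."""
    return shape(word) in ("CVCCV", "CCVCV")

def count_gismu(words_by_frequency, n):
    """Count gismu-shaped words among the first n entries of the list."""
    return sum(1 for w in words_by_frequency[:n] if is_gismu_shaped(w))

# Hypothetical sample, ordered from most to least frequent.
sample = ["lo", ".i", "cu", "klama", "mi", "cusku", "sei"]
print(count_gismu(sample, len(sample)))
```

Run against the real frequency list with n=100, a count like this is what the "only 18 gismu" figure above refers to.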
> I still think 500 is too many. How many of you agree?
These are the top 50 cmavo from
But we probably need to do some fiddling. For example, "no" and "pa"
are the only numbers that made it to the top 50, but I think all
numbers should be tested in the first level. The only FA that made it
is "fi". It's reasonable that "fi" is the most frequent, but
fa/fe/fi/fo/fu are learned together and should be tested together, so
if "fi" is included they should all be (they might be left for the
second level). Similarly for se/te/ve/xe. Some of them I think we can
safely exclude, like "sei", which is there because of the frequent
"sei X cusku", especially in the Alice translation. Also, lu-li'u may
not need to be included. (But I would include "zo", especially if we
include "cmene". We can't use "cmene" without "zo".)
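The fiddling described above could be sketched roughly like this in Python: start from the raw top-N cmavo, drop the ones we decide to exclude, and pull in the rest of any family that has at least one member in the list. The family groupings, the function name `adjust_cmavo`, and the sample input are assumptions made for illustration only, not an agreed procedure.

```python
# Groups of cmavo that are learned together (illustrative choice).
FAMILIES = [
    ("fa", "fe", "fi", "fo", "fu"),
    ("se", "te", "ve", "xe"),
    ("no", "pa", "re", "ci", "vo", "mu", "xa", "ze", "bi", "so"),
]

def adjust_cmavo(top_cmavo, exclude=(), extra=()):
    """Return the adjusted list: exclusions removed, families completed,
    extras appended. The original frequency order is kept for the words
    that were already there."""
    selected = [w for w in top_cmavo if w not in exclude]
    seen = set(selected)

    def add(word):
        if word not in seen:
            selected.append(word)
            seen.add(word)

    for family in FAMILIES:
        # If any member made the cut, include the whole family.
        if any(w in seen for w in family):
            for w in family:
                add(w)
    for w in extra:
        add(w)
    return selected

# Hypothetical input, not the real top-50 list.
result = adjust_cmavo(["lo", "fi", "sei", "no"], exclude={"sei"}, extra=["zo"])
print(result)
```

Whether a completed family goes into the first level or is moved as a whole to the second level is a separate decision; the sketch only keeps the members together.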
Mark's proposed list also has about 50 cmavo by my count, and it has
much overlap with the above list, as expected, but also some
lo, la, cu, mi, do, ti, ta, tu, and some other KOhA, nu, ka, ni, all
of SE, ca, pu, ba, NOI, GOI, .i, A,...
ku, kei and when they're needed, and cu as mentioned above.
A small selection of UI/CAI and COI (and DOI)
Numerals no-so and base-10 construction, perhaps also ro.
I think some 50 cmavo is about right for the first level. Then there
should be some cmevla, not too many but in any case cmevla are easy as
they don't need to be memorized, just recognized, and they are one of
the first things people learn anyway, so I don't think we need to
worry about how many of them we include. And then some reasonable
number of gismu that allow us to write meaningful sentences.
These are the top 50 gismu from
Again we will probably need to make adjustments, but we won't know
which ones until we start producing the questions. We could start with
that list and then add/subtract words as needed.
I would not include fu'ivla in the first level. A few lujvo perhaps
yes, but unfortunately I can't open the lujvo frequency list to get
some idea what the most frequent are. Probably things with sel-, nun-,
-gau, and such.
mu'o mi'e xorxes