[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] to-do list (was Re: New Members, Board of Directors, other LogFest results)



At 08:19 PM 7/24/02 -0600, Jay F Kominek wrote:
>On Wed, Jul 24, 2002 at 09:28:12PM -0400, Bob LeChevalier wrote:
> > >How complicated can a dictionnary spec be?
> >
> > I dunno.  People seem to think we need something more than is being done,
> > but there have been no articulated   specifics.
>
>http://groups.yahoo.com/group/lojban/message/14204
>http://groups.yahoo.com/group/lojban/message/6906
>
>http://nuzban.wiw.org/wiki/index.php?The%20Lojban%20Dictionary
>http://nuzban.wiw.org/wiki/index.php?Great%20Dictionary%20Problem
>http://nuzban.wiw.org/wiki/index.php?jbovlaste
>
>There have been more messages, but it'd take too long to find them, and
>I just don't care at this point, beyond preventing this slandering.

Sorry for offending.  We have a couple of severe communication problems 
here, and I was quite stressed out when writing earlier (and getting 
interrupted by my kids every 2 minutes so I couldn't spend good time 
reviewing what I had written).

1. I had a VERY different idea what you meant above by a "dictionary spec". 
I come from the government software world where the magic word "spec" is a 
real document standing on its own, and not a discussion of possibilities 
with no agreement and resolution.  I especially view the wiki as think 
tanking, and have no idea what parts of what is written on any page 
represent any consensus.

2.  To the extent that there is a "spec", for example in the two list 
messages referenced, it looks like a spec for the data base tool that 
people use to enter data, etc.  I don't see anything that tells me what the 
dictionary is supposed to look like, what it is supposed to contain, etc.

3.  Most of the discussion on the wiki seems to be debating whether we 
should even be writing a dictionary at all, with all the politics of this 
cabal thing wherein I admit to not fully understanding what is wanted by 
which people.  It appears that even Nick got frustrated with the debate and 
gave up.

To all this I observe that the closest that I see anyone coming to a spec 
on the wiki is actually an example, and that is Nick's (I think) FAhA 
definition:  http://nuzban.wiw.org/wiki/index.php?fa%27a%20%28Dictionary%29
That page actually looks more like a spec for the Elephant and how it will 
be used to create a cmavo dictionary, but his definition at the top of the 
page looks something like a dictionary definition (for someone well-versed 
in technical linguistic terminology like Nick) and it is something that 
actually tells me what ONE person thinks a good definition of FAhA looks 
like, possibly enriched by the examples further down.  Whether the pro and 
con debate on that page has any place in a dictionary as opposed to a tool 
for writing one like the Elephant, I am not clear on, but printwise we 
can't print a dictionary with several-page-debates as the definition.  It 
also seems quite unclear whether we would finish a meaningful Elephant for 
all the cmavo, much less any other words, in less than a decade.

Unfortunately that page comes across as Nick's proposal, which received 
relatively little comment as a proposal for a format, but instead turned 
into a debate on the words fa'a and mo'i and the role usage will play in 
determining the dictionary definition, and even THAT was not resolved.

But I responded positively to what I saw as the proposal on the page:
>I think that what you've done is quite excellent as a start, though the 
>dictionary entry would have to be grossly abbreviated, with references to 
>the discussion/Elephant rather than the arguments themselves (the 
>arguments do not belong in the dictionary). It might indeed be possible to 
>come up with a way to cover the multiple approaches coherently, once they 
>are fully elaborated. (The only improvement I could request is to find a 
>way to more clearly link the examples to their appropriate definitional 
>segments.) --lojbab

It looks like at some point, Nick was planning to actually create what he 
though the dictionary entry would look like, as the quote on "The Lojban 
Dictionary" indicates:
>fa'a? needs to be made, using the content from fa'a (Dictionary)

But that, if people had agreed upon whatever it said, would have been a 
useful "spec" to me of a good cmavo definition.

For gismu, I've gotten the impression from many people, pc most recently in 
response to my comment on bancu a couple of days ago, that people want some 
sort of expansion of the current gismu definitions clarifying the 
places.  Again, I see no consensus on what exactly people want - the main 
thing that good dictionaries have that we do not is of course examples that 
fully explicate the meanings.  I'd love something like that, but I quickly 
suspect that between the gismu and the cmavo alone, based on the above two 
"specs", we are talking about a 200 page dictionary before we add a single 
lujvo.

I have no idea what people want in a lujvo dictionary entry either, but 
presume that it will be similar to a gismu entry with the etymology 
explicated.  The discussion of which lujvo to put in always turns into a 
debate about whether we should use new words or old words and whether my 
usage frequency data is worth anything.  Your tool discussion seems to 
suggest something Elephant-like with different place structure proposals, 
labeled by originator being voted on by reviewers.  While it sounds good in 
theory, it sounds like something that will either never be used (because 
adding new words takes precedence) or will be excessively used (because 
people are more interested in debating the conventions and meanings than in 
expanding dictionary coverage).

I have my own ideas of what I would like to put in a dictionary myself, but 
I admit that I've never written it up either, nor any examples; it always 
seemed premature when I was not in fact going to follow through and no one 
else seemed to be doing so.  I've always focused on what would be useful to 
the Lojban student for looking up the words the way we usually use a 
dictionary, whereas the Elephant version looks more like an adjunct to the 
Reference Grammar - a cmavo catalog after the fashion of the selma'o 
catalog - to which the average person would respond by throwing up his 
hands and retreating "too much information".  Rather I think that what the 
world is looking for in a Lojban dictionary is something that shows how 
Lojban words map to the semantic space of other languages, something that 
requires LOTS of word entries that don't need to be particularly thoroughly 
defined (and indeed if not for the criticality of place structures when 
people actually go to use the dictionary, even THEY are of secondary 
importance.  What people really want is "What is the Lojban word for X, or 
what possible words can I use for X, and how do I use them.)

At any rate, to the extent that Nick's fa'a example was a spec for the 
Elephant, I can only presume that Cowan has been working on that or 
something like it.  It isn't done, and you know at least as well as I do 
what the status is.  If I come across as critical of "uncompleted work", 
then the Elephant, and your dictionary tools come to mind, and it is NOT 
that I really am "critical", but that in the absence of something there, I 
see nothing for me to either manage or use (assuming there is a role for me 
in either effort at all, and my understanding from Logfest last year was 
that I was NOT going to have any especial role in the Elephant), and I 
don't know what you think I should be doing that I am not.

>I've been on this for quite awhile, trying to get feedback as to what
>other people think is important in the system.

And I get the impression that no one has any real opinions, or will have 
any, until you actually create the system.  It is hard to get the average 
person excited about a software spec, especially when they are interested 
in the spec for the book.

> > You have to understand that many people don't want to install ANY software
> > downloaded from the net.
>
>I have, at no point, ever suggested doing this as anything except a web
>based system. Anything beyond that would be entirely optional on the user's
>part.
>
>I have some vague idea how to design software appropriate for a given
>target audience. I've been doing it for quite awhile, now. I'm able to
>eat and putz about on the Internet because I'm paid to develop
>software.  I'm told I'm fairly good at it. So could we maybe at least
>stop with the concerns that all the evil, horrible, mean UNIX-using
>Lojbanists are incapable of making something usable by plebes? Its
>tiresome, and insulting.

How is the above a rant against UNIX?

It is a rant against the focus on the tools to make the book and not on 
making the book itself.  It is a rant against working on-line or 
downloading and uploading software and files from on line, some variation 
of which is what I understand CVS to be, figure the Elephant will be and 
presume your tools will also be.  I don't particularly see myself as having 
any special role in creating either files or entries in either your program 
or the Elephant, so my concerns are 1) are you producing something that 
people will actually use to enter data, 2) will they actually enter most or 
all of the data you are designing the data base to hold, or will we end up 
with a data base filled with oodles of gaps such that any automated 
dictionary produced from the data base will look like garbage because no 
two entries will have the same level of completeness of definition, and 
there will be no consistency in style, and 3)  What are the actual 
dictionary entries supposed to look like, and what will I or someone else 
acting as "dictionary editor" actually need to do with the result in order 
to produce the book.

> > But for me also it is simply learning curve.  Time spent learning news
> > software cuts into the time I can spend doing what I do now, and I don't
> > have enough time for THAT.
>
>Most people can use their web browser and fill out forms.

That means little to me, since I don't use web browsers to fill out 
forms.  To the extent I do, they seem quite user unfriendly, either trying 
to automatically guess what I want to enter and filling it in based on 
matches with what I've entered before, or making me type the whole thing 
out, and having no particular editing tools other than that associated with 
point/click/copy/cut/paste.  For someone used to editing with a  full 
screen text editor with powerful commands and macro capability, this sounds 
like torture, but if you think people will fill in such forms, then let's 
see the forms and lets see them do it.

I guess I'm saying that the flat files are out there somewhere on the web 
site, and no one is doing anything.  If you've got an idea for a tool 
whereby people can and will do something with it that gets us closer to a 
dictionary, then more power to you and go ahead - I'm not interested in 
debating the pros and cons, but I am interested in the results and the 
demonstrated usage of the tool to advance the work.  Please forgive my 
skepticism that what you propose will get any more words defined than have 
already been defined - I would LOVE to be proven wrong.

>(And, since you'll likely not read the URLs I've posted above,

I read them, and I read them the last time we debated this wherein I think 
you posed the same or similar URLs.

>I'll simply
>readdress the concern you had last time, wherein you said that you normally
>did a few hundred words at a time, and wouldn't want to deal with a form
>each time. (How many times have you done a block of a few hundred words, by
>the way?

Recently, not at all of course %^).  When I last worked on the 
word-frequency lujvo file I indeed worked on the entire list, jumping 
around and annotating and correcting word forms for around 500 words in a 
sitting.  When I last worked on the KWIC files that are used to generate 
the text of the form ENGDICT.GIS, I again did a lot of jumping around, but 
modified around 200 entries and deleted even more.  Each of these sessions 
was several hours a days for a couple of days - I don't start dictionary 
work unless I'm going to spend a lot of hours on it.

In developing the gismu list itself, the final editing pass that I did in 
1994 consisted again in jumping around and comparing ALL 1300 odd words at 
once, while stepping through them alphabetically.  I spent probably over 
100 hours over the course of a month or so on that final pass, after a pre 
final pass of a comparable number of hours spread over a longer time 
(around 3 months if I recall).

When I say that I can't think of things that people can do in less than 40 
hours, it is because I seldom tackle tasks that are anywhere near that 
small myself.  I don't break work down into chunks that small, and I think 
of the big picture, and not the little pieces.  This is perhaps a failing 
of mine if I am to be seen as a "manager" rather than as a "leader" and 
especially a "leader by example" which is what I traditionally thought 
myself - though I admit my example has been pretty poor of late.

>There are only 1350 some odd gismu. And what are you doing with
>them?) I can easily set something up so that people who want to make a
>bulk contribution can write themselves up a specially formatted text file
>and upload that to the web site.

Jay, I've been on the net for years, and yet I have NO IDEA how to upload a 
file to the Web except using ftp commands after Telneting into a shell 
account.  That is the ONLY way I've ever done it or seen it done, though 
I'm sure you know many others in use.  I presume that CVS has something 
FTP-like to do the same function, but I've never seen how it works.  And 
I've never been prone to figuring things out by fiddling with them, so I'm 
unlikely to even try until shown how personally some LogFest.

If I'm that way, and won't know what you are talking about until I actually 
see it or read a good users manual of the sort I doubt would ever be 
written, what about the "AOL-mentality" type you have implicitly criticized 
that are the majority of those connected to the net?

But whether you have an answer for that question or not, my point is that I 
can only make comments in ignorance until I can actually see what you have 
in mind, with data in it, and being shown how that data will produce the 
book.  I've been a software developer oh so long ago, but on this effort I 
am a (l)user who is ONLY focused on how >I< will use the tool to get what I 
want, and hoping that the wise software developer will be giving me 
something I can use, and also giving everyone else something they can use.

(I should add, BTW, that now that I am connected via cable modem, SOME of 
my bias against on-line lookups is dwindling.  I still hate to wait for the 
lag times, and I hate the clumsy net search functions that don't so far as 
I know handle even simple regular expressions.  I still download and use 
offline copies of anything I use more than once or twice if I have the 
option of doing so.  But at least I have some vision of what people who 
live always connected to the net might be able to do if they weren't as 
stuck on their old ways of doing things as I tend to be; I still prefer my 
ways, and I think that those who live always-connected have a different 
perspective and mindset than those who use dialup and 56K or slower 
modems.  I now see for example the possible virtue of hypertext, which was 
always too slow and cumbersome at low data rates, and rarely usable off 
line.  But I am also still one of those old fashioned types who sometimes 
actually reads a dictionary as a book, skimming through a couple pages of 
entries for interesting words.)

>  Why, hypothetically, it could even read stupid fixed column width ASCII 
> files.)

%^)

> > I have the impression that several people have done that already.  Since I
> > can't imagine using the computer to look up individual words, I've never
> > seen much use in it.
>
>Looking a word up as quickly as you can type it is in no way appealing to
>you?

I avoid typing when I can. I skim files.  When I search a textfile with my 
full screen editor, I prefer to have it select and highlight all 
occurrences of the line containing the text with context at once, and then 
skim those.  I use what you describe on the web and CAN use the bloody 
"find"/"find next" popup that in Windows passes for a search function, but 
it is painfully inefficient for me to step through the find text one item 
at a time to find the spot on a page that I am looking for (if it is there 
- else I have to load another page and another until I find it.)


-- 
lojbab                                             lojbab@lojban.org
Bob LeChevalier, President, The Logical Language Group, Inc.
2904 Beau Lane, Fairfax VA 22031-1303 USA                    703-385-0273
Artificial language Loglan/Lojban:                 http://www.lojban.org



------------------------ Yahoo! Groups Sponsor ---------------------~-->
Free $5 Love Reading
Risk Free!
http://us.click.yahoo.com/NsdPZD/PfREAA/Ey.GAA/GSaulB/TM
---------------------------------------------------------------------~->

To unsubscribe, send mail to lojban-unsubscribe@onelist.com 

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/