
[lojban-beginners] Re: latest lujvo.txt file?



jkominek@miranda.org wrote:

On Fri, Aug 12, 2005 at 10:00:20AM -0300, Jorge Llambías wrote:
On 8/11/05, Hal Fulton <hal9000@hypermetrics.com> wrote:
Well, I want to get at the data myself so as to write one or more
tools of my own. So if I can't access the jbovlaste data easily,
I may just use an old noralujv.txt file.
I think the data in jbovlaste is meant to be freely accessible.
The reason it isn't is not so much that anyone is trying to hoard it
but just that nobody has had the time to do the necessary work for
putting it in a suitable format.
But you may want to talk with Jay Kominek or Robin Powell
about that, I'm just a user. noralujv.txt is really outdated
compared with jbovlaste.

The real problem is, a decent format hasn't been devised. Nobody in
their right mind *wants* to use the noralujv.txt format for anything.

If someone would come up with an XML DTD/Schema/whatever for Lojban
definitional data, I'll write the exporter, and you'll have your data.

So far, nobody has been sufficiently interested.

The SQL schema is at: http://jbovlaste.lojban.org/design/jbovlaste.sql

Use TEI - it's an XML format with a module specifically for dictionaries. I wrote a converter in Python that output TEI ages ago, but it was a one-pager and I think I threw it away. I was going to add that functionality to jbovlaste, but it never happened. I'm too lazy :-)
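
To give a rough idea of what a TEI exporter might emit, here is a minimal sketch in Python. It is not the lost converter and not anything jbovlaste does today. The function name lujvo_to_tei_entry and the sample gloss are made up for illustration; the element names (entry, form, orth, sense, def) come from the TEI dictionary module, but a real export would also need a TEI header and probably more structure per entry.

```python
import xml.etree.ElementTree as ET


def lujvo_to_tei_entry(word, definition):
    """Build one TEI-style <entry> element for a single word.

    Hypothetical helper: the layout is a simplified guess at a useful
    minimum, not an agreed-upon schema for Lojban data.
    """
    entry = ET.Element("entry")
    form = ET.SubElement(entry, "form")
    ET.SubElement(form, "orth").text = word          # the headword
    sense = ET.SubElement(entry, "sense")
    ET.SubElement(sense, "def").text = definition    # its definition
    return entry


# Illustrative data only; the gloss is not copied from jbovlaste.
entry = lujvo_to_tei_entry("jbovlaste", "Lojban word list (illustrative gloss)")
xml_text = ET.tostring(entry, encoding="unicode")
print(xml_text)
```

An exporter would loop over the rows of the SQL database and append one such entry per word to the body of a TEI document.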

Refs:
http://www.freedict.de/
http://www.tei-c.org/

There's also the dict protocol, which I use for looking up words - http://www.lojban.org/tiki/tiki-index.php?page=Word+Lists&bl
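
For anyone writing their own tool against a dict server, the exchange is simple enough to sketch. The Python below only builds a DEFINE command and parses a reply as described in RFC 2229; it does not open a network connection. The function names, the database name "lojban" and the canned reply are all assumptions for illustration, not output from a real server.

```python
def build_define_command(word, database="*"):
    """Return a DICT DEFINE command line; "*" asks for all databases."""
    return f'DEFINE {database} "{word}"\r\n'


def parse_define_response(raw):
    """Extract definition texts from a raw DEFINE reply.

    Each definition starts after a "151" status line and ends at a line
    containing a single ".". A leading ".." is un-stuffed to ".".
    """
    definitions = []
    current = None                      # None means "not inside a text block"
    for line in raw.split("\r\n"):
        if current is None:
            if line.startswith("151 "):
                current = []
        elif line == ".":
            definitions.append("\n".join(current))
            current = None
        else:
            current.append(line[1:] if line.startswith("..") else line)
    return definitions


# Canned, made-up reply; a real server's wording will differ.
sample_reply = (
    "150 1 definitions retrieved\r\n"
    '151 "klama" lojban "hypothetical database description"\r\n'
    "klama\r\n"
    "  (sample definition text)\r\n"
    ".\r\n"
    "250 ok\r\n"
)
command = build_define_command("klama", database="lojban")
definitions = parse_define_response(sample_reply)
print(command, definitions)
```

A real client would send the command over a TCP connection to the server (port 2628 by default) and feed whatever it reads back into the parser.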

The beauty of TEI is that I've seen dictionary programs use it as an import format - most notably dictd, the dictionary server that comes with most Linux distributions.

Mikevdg.