From nobody@digitalkingdom.org Tue Aug 23 01:34:41 2005 Received: with ECARTIS (v1.0.0; list lojban-beginners); Tue, 23 Aug 2005 08:38:34 -0700 (PDT) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.52) id 1E7UEz-0004Uc-4x for lojban-beginners-real@lojban.org; Tue, 23 Aug 2005 01:34:41 -0700 Received: from mxsrv2.tranzpeer.net ([202.180.66.215]) by chain.digitalkingdom.org with esmtps (TLS-1.0:DHE_RSA_3DES_EDE_CBC_SHA:24) (Exim 4.52) id 1E7UEs-0004UV-Lg for lojban-beginners@chain.digitalkingdom.org; Tue, 23 Aug 2005 01:34:41 -0700 Received: from [203.184.13.201] (helo=gulik.co.nz) by mxsrv2.tranzpeer.net with ESMTP (Exim 4.34) id 1E7UEn-0004Of-Gk for lojban-beginners@chain.digitalkingdom.org; Tue, 23 Aug 2005 20:34:30 +1200 Message-ID: <430ADF93.2030006@gulik.co.nz> Date: Tue, 23 Aug 2005 20:34:27 +1200 From: Michael van der Gulik User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-GB; rv:1.6) Gecko/20040216 Debian/1.6.x.1-10 X-Accept-Language: en MIME-Version: 1.0 To: lojban-beginners@chain.digitalkingdom.org Subject: [lojban-beginners] Re: latest lujvo.txt file? References: <42FABC77.6020100@hypermetrics.com> <925d1756050811072360742c39@mail.gmail.com> <42FBF813.9050504@hypermetrics.com> <925d175605081206004c5815ca@mail.gmail.com> <20050812142221.GL29546@miranda.org> In-Reply-To: <20050812142221.GL29546@miranda.org> Content-Type: text/plain; charset=us-ascii; format=flowed X-Spam-Score: -2.6 (--) X-archive-position: 1840 X-Approved-By: jkominek@miranda.org X-ecartis-version: Ecartis v1.0.0 Sender: lojban-beginners-bounce@chain.digitalkingdom.org Errors-to: lojban-beginners-bounce@chain.digitalkingdom.org X-original-sender: mikevdg@gulik.co.nz Precedence: bulk Reply-to: lojban-beginners@chain.digitalkingdom.org X-list: lojban-beginners jkominek@miranda.org wrote: >On Fri, Aug 12, 2005 at 10:00:20AM -0300, Jorge Llamb?as wrote: > > >>On 8/11/05, Hal Fulton wrote: >> >> >>>Well, I want to get at the data myself so as to write one or more >>>tools of my own. So if I can't access the jbovlaste data easily, >>>I may just use an old noralujv.txt file. >>> >>> >>I think the data in jbovlaste is meant to be freely accessible. >>The reason it isn't is not so much that anyone is trying >>to hoard it but just that nobody has had the time >>to do the necessary work for putting it in a suitable format. >>But you may want to talk with Jay Kominek or Robin Powell >>about that, I'm just a user. noralujv.txt is really outdated >>compared with jbovlaste. >> >> > >The real problem is, a decent format hasn't been devised. Nobody in >their right mind *wants* to use the noralujv.txt format for anything. > >If someone would come up with an XML DTD/Schema/whatever for Lojban >definitional data, I'll write the exporter, and you'll have your data. > >So far, nobody has been sufficiently interested. > >The SQL schema is at: http://jbovlaste.lojban.org/design/jbovlaste.sql > > > Use TEI - it's an XML format specifically for dictionaries. I have written a converter in Python to output to TEI ages ago, but it was a one-pager and I think I threw it away. I was going to write functionality for jbovlaste, but it never happened. I'm too lazy :-) Refs: http://www.freedict.de/ http://www.tei-c.org/ There's also the dict protocol, which I use for looking up words - http://www.lojban.org/tiki/tiki-index.php?page=Word+Lists&bl The beauty of TEI is that I've seen dictionary programs use it as an import format - most notibly, dictd which is the dictionary server that comes with most Linuxes. Mikevdg.