Received: from mail-pw0-f61.google.com ([209.85.160.61]:50476) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.76) (envelope-from ) id 1S5EM9-0000ev-I9; Wed, 07 Mar 2012 02:44:36 -0800 Received: by pbcwz17 with SMTP id wz17sf515937pbc.16 for ; Wed, 07 Mar 2012 02:44:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=x-beenthere:date:from:to:message-id:in-reply-to:references:subject :mime-version:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:x-google-group-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=b60Ag83HBzUwK0SgN/l2SO4+oarPEis89kSMosQSjtw=; b=xJZSomqtSJpp2RMXjJX15MMbmwIK/hRwYZ4r8kElL6WNBZ4RXT2wETVOnWTFG6FTU9 Evq/qMKNrU9fChtW7PMl/MHGSfNMlhXKLV+UrZx1y7PON1JbTO2vPMlmH5VKQIzHaPMe Uw7wq2PIye9F6CgjfaWzgLgiQu/mpbkml8m4A= Received: by 10.236.201.161 with SMTP id b21mr108503yho.4.1331117058749; Wed, 07 Mar 2012 02:44:18 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.101.201.10 with SMTP id d10ls1380692anq.1.gmail; Wed, 07 Mar 2012 02:44:17 -0800 (PST) Received: by 10.236.181.71 with SMTP id k47mr110806yhm.19.1331117057497; Wed, 07 Mar 2012 02:44:17 -0800 (PST) Date: Wed, 7 Mar 2012 02:44:16 -0800 (PST) From: gleki To: lojban@googlegroups.com Message-ID: <20567224.17.1331117056640.JavaMail.geo-discussion-forums@ynic10> In-Reply-To: <8f2d80fb-7cda-4645-854d-4f119e0d5726@l14g2000vbe.googlegroups.com> References: <29741151.5374.1331043579316.JavaMail.geo-discussion-forums@vbkc1> <8f2d80fb-7cda-4645-854d-4f119e0d5726@l14g2000vbe.googlegroups.com> Subject: [lojban] Re: How to export tatoeba in simple format MIME-Version: 1.0 X-Original-Sender: gleki.is.my.name@gmail.com X-Original-Authentication-Results: ls.google.com; spf=pass (google.com: domain of gleki.is.my.name@gmail.com designates internal as permitted sender) smtp.mail=gleki.is.my.name@gmail.com; dkim=pass header.i=@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary="----=_Part_16_26460251.1331117056636" X-Spam-Score: 0.0 (/) X-Spam_score: 0.0 X-Spam_score_int: 0 X-Spam_bar: / ------=_Part_16_26460251.1331117056636 Content-Type: text/plain; charset=ISO-8859-1 I'm interested. And actually in periodically doing it myself. Not by request. Because the database is live and is being updated by us. Of course I know about those three files. For now, I'd prefer such export for several directions at one (a multilingual spreadsheet). I want all sentences for which we have lojban translations. i.e. first column lojban 2 column english then i need japanese chinese russian arabic spanish polish french german I'll repeat once again. An automated script for doing so would be awesome. On Wednesday, March 7, 2012 2:47:17 AM UTC+4, ianek wrote: > > I've created the list for you, but it was an ugly hack in bash. A > better way would be to create a database and import sentences.csv and > links.csv to it, and then write a very simple program instead of > hacking around with grep etc. But it would be more work of course. And > maybe not faster, considering that import would take time. > > Here you go: http://dl.dropbox.com/u/17805197/jbo-eng.csv > It's tab-seperated list, any spreadsheet program should read it. > > As a by-product, I am able to produce such a list for any other > language available in tatoeba instantly, if anyone's interested. > > mu'o mi'e ianek > > On 6 Mar, 22:17, ianek wrote: > > > http://tatoeba.org/pol/download_tatoeba_example_sentenceshttp://tatoeba.org/files/downloads/sentences.csv > > > > There are actually three columns: id, language, sentence, but with > > some database-fu or script-fu or maybe even spreadsheet-fu you can get > > what you want. Or maybe I'll hack it together in a while. > > > > mu'o mi'e ianek > > > > On 6 Mar, 15:19, gleki wrote: > > > > > > > > > > > > > > > > > I wanna export tatoeba databse into a simple spreadsheet with two > columns. > > > One for English and another one for Lojban > > > > > Does anyone know how to do that ? -- You received this message because you are subscribed to the Google Groups "lojban" group. To view this discussion on the web visit https://groups.google.com/d/msg/lojban/-/e2-SqQ9btL4J. To post to this group, send email to lojban@googlegroups.com. To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban?hl=en. ------=_Part_16_26460251.1331117056636 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable I'm interested. And actually in periodically doing it myself.  Not by request. 
Because the database is live and is being updated= by us.

Of course I know about those three files.
For now, I'd prefer such export for several directions at one = (a multilingual spreadsheet).
I want all sentences for which we h= ave lojban translations.
i.e. 
first column  =  lojban
2 column   english
then i need
=
japanese
chinese
russian
arabic
spanish
polish
french
german

=
I'll repeat once again. An automated script for doing so  w= ould be awesome.

On Wednesday, March 7, 2012 2:47:17 AM UTC+4, ianek= wrote:
I've created the list f= or you, but it was an ugly hack in bash. A
better way would be to create a database and import sentences.csv and
links.csv to it, and then write a very simple program instead of
hacking around with grep etc. But it would be more work of course. And
maybe not faster, considering that import would take time.

Here you go: http://dl.dropbox.com/u/17805197/jbo-eng.csv
It's tab-seperated list, any spreadsheet program should read it.

As a by-product, I am able to produce such a list for any other
language available in tatoeba instantly, if anyone's interested.

mu'o mi'e ianek

On 6 Mar, 22:17, ianek <jane...@gmail.com> wrote:
> http:= //tatoeba.org/pol/download_tatoeba_example_sentenceshttp://tatoeb= a.org/files/downloads/sentences.csv
>
> There are actually three columns: id, language, sentence, but with
> some database-fu or script-fu or maybe even spreadsheet-fu you can= get
> what you want. Or maybe I'll hack it together in a while.
>
> mu'o mi'e ianek
>
> On 6 Mar, 15:19, gleki <gleki.is.my.n...@gmail.com> w= rote:
>
>
>
>
>
>
>
> > I wanna export tatoeba databse into a simple spreadsheet with= two columns.
> > One for English and another one for Lojban
>
> > Does anyone know how to do that ?

--
You received this message because you are subscribed to the Google Groups "= lojban" group.
To view this discussion on the web visit https://groups.google.com/d/msg/lojban/-/e2= -SqQ9btL4J.
=20 To post to this group, send email to lojban@googlegroups.com.
To unsubscribe from this group, send email to lojban+unsubscribe@googlegrou= ps.com.
For more options, visit this group at http://groups.google.com/group/lojban= ?hl=3Den.
------=_Part_16_26460251.1331117056636--