From: Michael Turniansky
To: lojban@googlegroups.com
Date: Fri, 9 Mar 2012 09:17:07 -0500
Subject: Re: [lojban] Re: lojbo jufsisku - Lojban sentence search

Thanks for putting me first ;-) Although now that I've looked at your tool
for the first time, I realize there was a (minor) mistake in my original
English rendering of the story. At one point, I said "the Terry" instead of
"Terry" (marked as "teris:15" in your app). I corrected it on the wiki, but
can you change it in jufsisku, please?

--gejyspa (Mike Turniansky)

On Sat, Mar 3, 2012 at 12:31 PM, ALbaros wrote:
> coi rodo
>
> I've been doing exactly this, for learning everything in action
> (grammar, vocabulary, usage). I've already finished:
> - Turniansky - la teris. po'u lo tirxu cu vitke zi'o le barda tcadu
> - Carver, Raymond - tu'a loi cmalu
> - Kafka, Franz - lo nu binxo
> - Stevenson - le saske smani (with listed and explained vocabulary)
> - Lovecraft, H.P. - tu'a le cukta (listed and explained vocabulary on
>   the way).
>
> and am halfway through:
> - Gibran, Kahlil - The Prophet
> - la xorxes. - lo cipra le jufra loi zbasu bangu
> - Moser, David - dei tcita ledei lisri
> - Saint Exupery, Antoine - le cmalu noltru
> - Nicholas, Nick (translator) - Some fables by Aesop
> - Nicholas, Nick - Ckafybarja Piece 1
>
> I was planning, once finished, to submit these to the Lojban community
> and to contact Masato once again to do a massive upload to his
> excellent tool.
>
> I will upload a few of these after I get back from a trip and have
> decent internet access.
>
> mu'o mi'e la .albaros
>
>
> On Feb 29, 3:27 am, ianek wrote:
>> coi ro do
>>
>> What do you think about taking the works translated into Lojban (like
>> cmalu noltru or alis) to divide them into sets of parallel
>> Lojban-English sentences ready to be put onto jufsisku? I think I could
>> try to work on it. There has to be some tool for dividing English text
>> into sentences (which is not trivial only if you have many
>> abbreviations, which is not the case of The Little Prince or Alice).
>> Dividing Lojban text is trivial of course. So the main question is,
>> are you happy with the idea of adding many sentences taken out of
>> context? A few times I was looking into Alice or The Little Prince for
>> a word when I needed a translation to Lojban when jbovlaste had nothing,
>> to see how xorxes did it.
>>
>> mu'o mi'e ianek
>>
>> On 15 Feb, 05:36, Masato Hagiwara wrote:
>>
>> > > stevo
>> > > How hard would it be to add synonym lookup, i.e. all variants of a
>> > > lujvo? E.g., I looked up {ga'orgau} and got lots of results. Looking
>> > > up {ganlygansu} got nothing. I'd also like to see the expansion of a
>> > > lujvo into its gismu, since I don't know all the rafsi yet.
>>
>> > Thanks, actually, synonym lookup has the highest priority in my to-do
>> > list for jufsisku :)
>>
>> > Since lojbo jufsisku's search back-end is Apache Solr
>> > (http://lucene.apache.org/solr/), it's relatively easy to implement the
>> > functionality, provided that we have a complete dictionary of
>> > cmavo/rafsi/lujvo. I'm planning to implement a Lojban parser plug-in
>> > for Apache Solr first so that it also indexes morphemes, not just
>> > space-separated words.
>>
>> > coi .iesk. ki'e pinka
>>
>> > > .i ji'a xu ka'e te bangu fi so'i da baku .i mu'a la jbovlaste zu'u .e
>> > > la tatoebas. zu'unai cu sorbau tutci .i ba'a da'i mutce plixau fa lo
>> > > nu fanva fo la lojban. fi so'o bangu
>>
>> > > .i lo'e ve cilre be fi la lojban. cu glico .i lo ni mi kakne lo nu se
>> > > bangu lo bangenugu cu banzu ju'oru'e .i ku'i lo nu mi jbopli zo'u mi
>> > > milxe se fanza .oiro'eru'e lo nu za'a mi pensi bau lo bangenugu
>> > > vau .uuse'inaipei
>>
>> > Please forgive me for writing back in English, since this is too
>> > complicated and time-consuming to be explained in Lojban for me :)
>>
>> > It is totally possible to extend to two or more languages other than
>> > English, but doing so makes the difference between jufsisku and
>> > Tatoeba pretty much blurred. My ultimate goal here is not to construct
>> > a comprehensive database of multilingual translations of lojban, but
>> > to create a really useful tool for anybody who writes in lojban.
>>
>> > Of course, these two goals overlap considerably, and we could consider
>> > implementing it if there's a large group of jbopre's whose native
>> > language is not English.
>>
>> > A minor issue here, which is related to the previous question as well,
>> > is that in order to make it more useful, language-dependent features
>> > such as synonym expansion, morphological analysis, etc. should be done
>> > on the target language (not lojban but natlangs, I mean) as well.
>> > I feel comfortable speaking in Japanese, Chinese, and English, but not
>> > in other languages. I'm not very sure if I can provide good search
>> > user experience for Arabic-speaking users, for example.
>>
>> > Anyway thank you very much for your encouraging remarks! I'll
>> > definitely improve it pretty soon.
>>
>> > mu'o mi'e masatos.
>>
>> > 2012/2/14 paul_faehrbrorn:
>>
>> > > coi masatos. xagi,uaras. nelci .uisai ki'esai
>>
>> > > .i ji'a xu ka'e te bangu fi so'i da baku .i mu'a la jbovlaste zu'u .e
>> > > la tatoebas. zu'unai cu sorbau tutci .i ba'a da'i mutce plixau fa lo
>> > > nu fanva fo la lojban. fi so'o bangu
>>
>> > > .i lo'e ve cilre be fi la lojban. cu glico .i lo ni mi kakne lo nu se
>> > > bangu lo bangenugu cu banzu ju'oru'e .i ku'i lo nu mi jbopli zo'u mi
>> > > milxe se fanza .oiro'eru'e lo nu za'a mi pensi bau lo bangenugu
>> > > vau .uuse'inaipei
>>
>> > > ki'e co'o mi'e .iesk.
>>
>> > > (I hope that was halfway comprehensible. Corrections are welcome!
>> > > Intended meaning:
>>
>> > > Thank you very much! I really like it! Do you think it may be
>> > > possible, in the future, to make your project multilingual? I think it
>> > > may be useful to get translations into several languages from the
>> > > Lojban.
>>
>> > > Lojban learning materials tend to be in English. I do well enough in
>> > > English (I think), but I realise that I think *in English* when I use
>> > > Lojban, and that is somewhat distracting.
>> > > I don't know about you all, maybe you know what I mean …)
>>
>> > > On Feb 13, 2:21 am, Masato Hagiwara wrote:
>> > >> coi ro do
>>
>> > >> I've just finished the initial version of "lojbo jufsisku" (Lojban
>> > >> sentence search), which is a bilingual (Lojban-English) translation
>> > >> search environment in order to facilitate dictionary and translation
>> > >> look-up for anyone who writes in Lojban:
>>
>> > >> http://lojban.lilyx.net/jufsisku/
>>
>> > >> I've added sentences from Tatoeba, some part of CLL (Chapter 1 to 4),
>> > >> lu la teris. po'u le tirxu li'u, and jbovlaste.
>> > >> I'm planning to add more sentences myself, but it'd be great if
>> > >> jbopre's in the community help make it a better tool by adding more
>> > >> (hopefully high-quality) sentences.
>>
>> > >> Quality and quantity of sentences are critical to its usefulness.
>>
>> > >> Any feedback is appreciated!
>>
>> > >> Thanks,
>> > >> Masato
>>
>> > >> --
>> > >> Masato HAGIWARA http://lilyx.net/
>>
>> > > --
>> > > You received this message because you are subscribed to the Google Groups "lojban" group.
>> > > To post to this group, send email to lojban@googlegroups.com.
>> > > To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com.
>> > > For more options, visit this group at http://groups.google.com/group/lojban?hl=en.
>>
>> > --
>> > Masato HAGIWARA http://lilyx.net/