Received: from mail-yw0-f61.google.com ([209.85.213.61]:56342) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.76) (envelope-from ) id 1RxWbZ-0006Sm-AX; Tue, 14 Feb 2012 20:36:40 -0800 Received: by yhgg71 with SMTP id g71sf583667yhg.16 for ; Tue, 14 Feb 2012 20:36:23 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:mime-version:in-reply-to:references:date :message-id:subject:from:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type :content-transfer-encoding; bh=9/M4u60LoUNV55etfX+X2eDCKGA7BrUw3FlhBEPkwxA=; b=BysS1xzg1Y+v9c+s8tg6aUYDimzlZzM2ur1kFODe0AeRcUWtMBHFK4ISJFWLQ/OADj v++uV+FN2ovcTpv0hhI5extZqUX9jRUxsnEKhOHsQofKngmJIWLhLqDapkEPzE9FvYQv zIMa3zsSaz7RjuPYwNSO+pDXIkKLzF2xnafL4= Received: by 10.236.123.12 with SMTP id u12mr3222373yhh.19.1329280581072; Tue, 14 Feb 2012 20:36:21 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.101.196.22 with SMTP id y22ls88610anp.2.gmail; Tue, 14 Feb 2012 20:36:19 -0800 (PST) Received: by 10.236.116.101 with SMTP id f65mr45329447yhh.0.1329280579618; Tue, 14 Feb 2012 20:36:19 -0800 (PST) Received: by 10.236.116.101 with SMTP id f65mr45329444yhh.0.1329280579609; Tue, 14 Feb 2012 20:36:19 -0800 (PST) Received: from mail-yx0-f180.google.com (mail-yx0-f180.google.com [209.85.213.180]) by gmr-mx.google.com with ESMTPS id z63si883143yhg.5.2012.02.14.20.36.19 (version=TLSv1/SSLv3 cipher=OTHER); Tue, 14 Feb 2012 20:36:19 -0800 (PST) Received-SPF: pass (google.com: domain of hagisan@gmail.com designates 209.85.213.180 as permitted sender) client-ip=209.85.213.180; Received: by yenr11 with SMTP id r11so481973yen.25 for ; Tue, 14 Feb 2012 20:36:19 -0800 (PST) MIME-Version: 1.0 Received: by 10.236.154.165 with SMTP id h25mr40952yhk.17.1329280578592; Tue, 14 Feb 2012 20:36:18 -0800 (PST) Received: by 10.146.144.39 with HTTP; Tue, 14 Feb 2012 20:36:18 -0800 (PST) In-Reply-To: References: Date: Tue, 14 Feb 2012 23:36:18 -0500 Message-ID: Subject: Re: [lojban] Re: lojbo jufsisku - Lojban sentence search From: Masato Hagiwara To: lojban@googlegroups.com X-Original-Sender: hagisan@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; spf=pass (google.com: domain of hagisan@gmail.com designates 209.85.213.180 as permitted sender) smtp.mail=hagisan@gmail.com; dkim=pass header.i=@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-Spam-Score: 0.0 (/) X-Spam_score: 0.0 X-Spam_score_int: 0 X-Spam_bar: / > stevo > How hard would it be to add synonym lookup, i.e. all variants of a lujvo.= E.g., I looked up {ga'orgau} and got lots of results. Looking up {ganlygan= su} got nothing. I'd also like to see the expansion of a lujvo into its gis= mu, since I don't know all the rafsi yet. Thanks, actually, synonym lookup has the highest priority in my to-do list for jufsisku :) Since lojbo jufsisku's search back-end is Apache solr http://lucene.apache.org/solr/, it's relatively easy to implement the functionality, provided that we have a complete dictionary of cmavo/rafsi/lujvo. I'm planning to implement a Lojban parser plug-in for Apache solr first so that it also indexes morphemes, not just space-separated words. coi .iesk. ki'e pinka > .i ji'a xu ka'e te bangu fi so'i da baku .i mu'a la jbovlaste zu'u .e > la tatoebas. zu'unai cu sorbau tutci .i ba'a da'i mutce plixau fa lo > nu fanva fo la lojban. fi so'o bangu > > .i lo'e ve cilre be fi la lojban. cu glico .i lo ni mi kakne lo nu se > bangu lo bangenugu cu banzu ju'oru'e .i ku'i lo nu mi jbopli zo'u mi > milxe se fanza .oiro'eru'e lo nu za'a mi pensi bau lo bangenugu > vau .uuse'inaipei Please forgive me for writing back in English, since this is too complicated and time-consuming to be explained in Lojban for me :) It is totally possible to extend to two or more languages other than English, but doing so makes the difference between jufsisku and Tatoeba pretty much blurred. My ultimate goal here is not to construct a comprehensive database of multilingual translations of lojban, but to create a really useful tool for anybody who writes in lojban. Of course, there two goals overlap considerably, and we could consider implementing it if there's a large group of jbopre's whose native language is not English. A minor issue here, which is related to the previous question as well, is that in order to make it more useful, language-dependent features such as synonym expansion, morphological analysis, etc. should be done on the target language (not lojban but natlangs, I mean) as well. I feel comfortable speaking in Japanese, Chinese, and English, but not in other languages. I'm not very sure if I can provide good search user experience for Arabic-speaking users, for example. Anyway thank you very much for your encouraging remarks! I'll definitely improve it pretty soon. mu'o mi'e masatos. 2012/2/14 paul_faehrbrorn : > coi masatos. xagi,uaras. nelci .uisai ki'esai > > .i ji'a xu ka'e te bangu fi so'i da baku .i mu'a la jbovlaste zu'u .e > la tatoebas. zu'unai cu sorbau tutci .i ba'a da'i mutce plixau fa lo > nu fanva fo la lojban. fi so'o bangu > > .i lo'e ve cilre be fi la lojban. cu glico .i lo ni mi kakne lo nu se > bangu lo bangenugu cu banzu ju'oru'e .i ku'i lo nu mi jbopli zo'u mi > milxe se fanza .oiro'eru'e lo nu za'a mi pensi bau lo bangenugu > vau .uuse'inaipei > > ki'e co'o mi'e .iesk. > > (I hope that was halfway comprehensible. Corrections are welcome! > Intended meaning: > > Thank you very much! I really like it! Do you think it may be > possible, in the future, to make your project multilingual? I think it > may be useful to get translations into several languages from the > Lojban. > > Lojban learning materials tend to be in English. I do well enough in > English (I think), but I realise that I think *in English* when I use > Lojban, and that is somewhat distracting. I don't know about you all, > maybe you know what I mean =85) > > On Feb 13, 2:21=A0am, Masato Hagiwara wrote: >> coi ro do >> >> I've just finished the initial version of "lojbo jufsisku" (Lojban >> sentence search), which is a bilingual (Lojban-English) translation >> search environment in order to facilitate dictionary and translation >> look-up for anyone who writes in Lojban: >> >> http://lojban.lilyx.net/jufsisku/ >> >> I've added sentences from Tatoeba, some part of CLL (Chapter 1 to 4), >> lu la teris. po'u le tirxu li'u, and jbovlaste. >> I'm planning to add more sentences myself, but it'd be great if >> jbopre's in the community help make it a better tool by adding more >> (hopefully high-quality) sentences. >> >> Quality and quantity of sentences are critical to its usefulness. >> >> Any feedback is appreciated! >> >> Thanks, >> Masato >> >> -- >> Masato HAGIWARAhttp://lilyx.net/ > > -- > You received this message because you are subscribed to the Google Groups= "lojban" group. > To post to this group, send email to lojban@googlegroups.com. > To unsubscribe from this group, send email to lojban+unsubscribe@googlegr= oups.com. > For more options, visit this group at http://groups.google.com/group/lojb= an?hl=3Den. > --=20 Masato HAGIWARA http://lilyx.net/ --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To post to this group, send email to lojban@googlegroups.com. To unsubscribe from this group, send email to lojban+unsubscribe@googlegrou= ps.com. For more options, visit this group at http://groups.google.com/group/lojban= ?hl=3Den.