From: MorphemeAddict
Date: Sat, 3 Mar 2012 13:17:41 -0500
Subject: Re: [lojban] Re: lojbo jufsisku - Lojban sentence search
To: lojban@googlegroups.com

I think this is excellent! I've often wanted some sort of KWIC search, whether sorted by keyword or not.

stevo
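
For readers who have not met the term, a KWIC (keyword-in-context) listing shows every hit of a word together with a few words on either side. The sketch below is only an illustration of that idea, not how jufsisku works; the function name `kwic`, the `width` parameter, and the sample sentences are all made up for the example, and words are assumed to be separated by whitespace.

```python
def kwic(sentences, keyword, width=3):
    """Return (left context, keyword, right context) for every hit.

    Hypothetical helper: tokens are whitespace-separated and matching
    is exact, which is a simplification.
    """
    hits = []
    for sentence in sentences:
        words = sentence.split()
        for i, word in enumerate(words):
            if word == keyword:
                left = " ".join(words[max(0, i - width):i])
                right = " ".join(words[i + 1:i + 1 + width])
                hits.append((left, word, right))
    return hits


if __name__ == "__main__":
    sample = ["mi klama le zarci", "do klama"]
    for left, word, right in kwic(sample, "klama", width=1):
        print(f"{left:>10} [{word}] {right}")
```

Sorting the returned triples by the right-hand context (for example `sorted(hits, key=lambda h: h[2])`) would give the "sorted" variant; leaving them in corpus order gives the unsorted one.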

On Sat, Mar 3, 2012 at 12:31 PM, ALbaros <avallejo@carbondecisions.com> wrote:
coi rodo

I've been doing exactly this, for learning everything in action
(grammar, vocabulary, usage). I've already finished:
- Turniansky - la teris. po'u lo tirxu cu vitke zi'o le barda tcadu
- Carver, Raymond - tu'a loi cmalu
- Kafka, Franz - lo nu binxo
- Stevenson - le saske smani (with listed and explained vocabulary)
- Lovecraft, H.P. - tu'a le cukta (listed and explained vocabulary on the way).

and am halfway through:
- Gibran, Kahlil - The Prophet
- la xorxes. - lo cipra le jufra loi zbasu bangu
- Moser, David - dei tcita ledei lisri
- Saint Exupery, Antoine - le cmalu noltru
- Nicholas, Nick (translator) - Some fables by Aesop
- Nicholas, Nick - Ckafybarja Piece 1

I was planning, once finished, to submit these to the Lojban community
and to contact Masato once again to do a massive upload to his
excellent tool.

I will upload a few of these after I get back from a trip and have
decent internet access.

mu'o mi'e la .albaros


On Feb 29, 3:27 am, ianek <jane...@gmail.com> wrote:
> coi ro do
>
> What do you think about taking the works translated into Lojban (like
> cmalu noltru or alis) and dividing them into sets of parallel
> Lojban-English sentences ready to be put onto jufsisku? I think I could
> try to work on it. There has to be some tool for dividing English text
> into sentences (which is non-trivial only if you have many
> abbreviations, which is not the case for The Little Prince or Alice).
> Dividing Lojban text is trivial of course. So the main question is,
> are you happy with the idea of adding many sentences taken out of
> context? A few times I looked into Alice or The Little Prince for a
> word when I needed a translation into Lojban and jbovlaste had nothing,
> to see how xorxes did it.
>
> mu'o mi'e ianek
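
As a rough illustration of the splitting step ianek describes, here is a minimal sketch. It assumes English sentences end in `.`, `!` or `?` followed by whitespace and a capital letter or quote (so abbreviations such as "Mr." would be split wrongly, which is the caveat raised above), and that Lojban sentences are separated by a standalone ".i" written with the leading dot. Compound separators such as ".ije" and paragraph markers such as "ni'o" are not handled. The function names are hypothetical.

```python
import re


def split_english(text):
    """Naive English sentence splitter (assumption: no abbreviations)."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z\"'])", text.strip())
    return [p for p in parts if p]


def split_lojban(text):
    """Split Lojban text on each standalone '.i' separator.

    Assumption: the separator is written '.i' and followed by
    whitespace, so names like '.iesk.' are left alone.
    """
    parts = re.split(r"(?:^|\s)\.i(?=\s)", text)
    return [p.strip() for p in parts if p.strip()]


if __name__ == "__main__":
    print(split_english("Hello there. How are you? Fine!"))
    print(split_lojban(".i mi klama le zarci .i do klama"))
```

Pairing the two outputs into parallel sentences still needs manual checking, since a translator may merge or split sentences relative to the original.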
>
> On 15 Feb, 05:36, Masato Hagiwara <hagi...@gmail.com> wrote:
>
> > > stevo
> > > How hard would it be to add synonym lookup, i.e. all variants of a lujvo? E.g., I looked up {ga'orgau} and got lots of results. Looking up {ganlygansu} got nothing. I'd also like to see the expansion of a lujvo into its gismu, since I don't know all the rafsi yet.
>
> > Thanks, actually, synonym lookup has the highest priority in my to-do
> > list for jufsisku :)
>
> > Since lojbo jufsisku's search back-end is Apache Solr
> > (http://lucene.apache.org/solr/), it's relatively easy to implement the
> > functionality, provided that we have a complete dictionary of
> > cmavo/rafsi/lujvo. I'm planning to implement a Lojban parser plug-in
> > for Apache Solr first so that it also indexes morphemes, not just
> > space-separated words.
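
To make the synonym idea concrete: if every lujvo is reduced to the sequence of gismu it is built from, different spellings of the same compound share one index key. The sketch below is an assumption-laden toy, not the planned Solr plug-in. It uses a hand-written two-gismu table (a real one would come from the full rafsi list), greedy longest-match, and treats `y`, `r` and `n` between rafsi as hyphens. It also assumes the long form of {ga'orgau} is spelled {ganlygasnu}.

```python
# Toy table (assumption): maps each rafsi form to its gismu.
RAFSI_TO_GISMU = {
    "ga'o": "ganlo", "ganl": "ganlo", "ganlo": "ganlo",
    "gau": "gasnu", "gasn": "gasnu", "gasnu": "gasnu",
}
HYPHENS = ("y", "r", "n")
# Try longer rafsi first so 'ganlo' wins over 'ganl'.
_BY_LENGTH = sorted(RAFSI_TO_GISMU, key=len, reverse=True)


def decompose(lujvo):
    """Return the tuple of gismu behind a lujvo, or None if unknown."""
    parts = []
    i = 0
    while i < len(lujvo):
        for rafsi in _BY_LENGTH:
            if lujvo.startswith(rafsi, i):
                parts.append(RAFSI_TO_GISMU[rafsi])
                i += len(rafsi)
                break
        else:
            # No rafsi matched: accept a hyphen letter between rafsi.
            if parts and lujvo[i] in HYPHENS:
                i += 1
            else:
                return None
    return tuple(parts)


if __name__ == "__main__":
    print(decompose("ga'orgau"))
    print(decompose("ganlygasnu"))
```

Indexing the tuple returned by `decompose` alongside the surface form would let a query for one variant match sentences containing the other, and the same tuple doubles as the "expansion into gismu" requested above.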
>
> > coi .iesk. ki'e pinka
>
> > > .i ji'a xu ka'e te bangu fi so'i da baku .i mu'a la jbovlaste zu'u .e
> > > la tatoebas. zu'unai cu sorbau tutci .i ba'a da'i mutce plixau fa lo
> > > nu fanva fo la lojban. fi so'o bangu
>
> > > .i lo'e ve cilre be fi la lojban. cu glico .i lo ni mi kakne lo nu se
> > > bangu lo bangenugu cu banzu ju'oru'e .i ku'i lo nu mi jbopli zo'u mi
> > > milxe se fanza .oiro'eru'e lo nu za'a mi pensi bau lo bangenugu
> > > vau .uuse'inaipei
>
> > Please forgive me for writing back in English, since this is too
> > complicated and time-consuming to be explained in Lojban for me :)
>
> > It is totally possible to extend to two or more languages other than
> > English, but doing so makes the difference between jufsisku and
> > Tatoeba pretty much blurred. My ultimate goal here is not to construct
> > a comprehensive database of multilingual translations of lojban, but
> > to create a really useful tool for anybody who writes in lojban.
>
> > Of course, these two goals overlap considerably, and we could consider
> > implementing it if there's a large group of jbopre's whose native
> > language is not English.
>
> > A minor issue here, which is related to the previous question as well,
> > is that in order to make it more useful, language-dependent features
> > such as synonym expansion, morphological analysis, etc. should be done
> > on the target language (not lojban but natlangs, I mean) as well. I
> > feel comfortable speaking in Japanese, Chinese, and English, but not
> > in other languages. I'm not very sure if I can provide good search
> > user experience for Arabic-speaking users, for example.
>
> > Anyway thank you very much for your encouraging remarks! I'll
> > definitely improve it pretty soon.
>
> > mu'o mi'e masatos.
>
> > 2012/2/14 paul_faehrbrorn <pa....@gmx.de>:
>
> > > coi masatos. xagi,uaras. nelci .uisai ki'esai
>
> > > .i ji'a xu ka'e te bangu fi so'i da baku .i mu'a la jbovlaste zu'u .e
> > > la tatoebas. zu'unai cu sorbau tutci .i ba'a da'i mutce plixau fa lo
> > > nu fanva fo la lojban. fi so'o bangu
>
> > > .i lo'e ve cilre be fi la lojban. cu glico .i lo ni mi kakne lo nu se
> > > bangu lo bangenugu cu banzu ju'oru'e .i ku'i lo nu mi jbopli zo'u mi
> > > milxe se fanza .oiro'eru'e lo nu za'a mi pensi bau lo bangenugu
> > > vau .uuse'inaipei
>
> > > ki'e co'o mi'e .iesk.
>
> > > (I hope that was halfway comprehensible. Corrections are welcome!
> > > Intended meaning:
>
> > > Thank you very much! I really like it! Do you think it may be
> > > possible, in the future, to make your project multilingual? I think it
> > > may be useful to get translations into several languages from the
> > > Lojban.
>
> > > Lojban learning materials tend to be in English. I do well enough in
> > > English (I think), but I realise that I think *in English* when I use
> > > Lojban, and that is somewhat distracting. I don't know about you all,
> > > maybe you know what I mean …)
>
> > > On Feb 13, 2:21 am, Masato Hagiwara <hagi...@gmail.com> wrote:
> > >> coi ro do
>
> > >> I've just finished the initial version of "lojbo jufsisku" (Lojban
> > >> sentence search), which is a bilingual (Lojban-English) translation
> > >> search environment intended to facilitate dictionary and translation
> > >> look-up for anyone who writes in Lojban:
>
> > >> http://lojban.lilyx.net/jufsisku/
>
> > >> I've added sentences from Tatoeba, parts of CLL (Chapters 1 to 4),
> > >> lu la teris. po'u le tirxu li'u, and jbovlaste.
> > >> I'm planning to add more sentences myself, but it'd be great if
> > >> jbopre's in the community helped make it a better tool by adding more
> > >> (hopefully high-quality) sentences.
>
> > >> Quality and quantity of sentences are critical to its usefulness.
>
> > >> Any feedback is appreciated!
>
> > >> Thanks,
> > >> Masato
>
> > >> --
> > >> Masato HAGIWARA http://lilyx.net/
>
> > > --
> > > You received this message because you are subscribed to the Google Groups "lojban" group.
> > > To post to this group, send email to lojban@googlegroups.com.
> > > To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com.
> > > For more options, visit this group at http://groups.google.com/group/lojban?hl=en.
>
> > --
> > Masato HAGIWARA http://lilyx.net/
