From lojban+bncCKqU0bWBHxCD08_tBBoEeGnQHw@googlegroups.com Sun Apr 24 02:12:51 2011 Received: from mail-qy0-f189.google.com ([209.85.216.189]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1QDvN2-0008HL-2R; Sun, 24 Apr 2011 02:12:51 -0700 Received: by qyk36 with SMTP id 36sf5631468qyk.16 for ; Sun, 24 Apr 2011 02:12:42 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:x-beenthere:mime-version:date:in-reply-to:x-ip :references:user-agent:x-http-useragent:message-id:subject:from:to :x-original-sender:reply-to:precedence:mailing-list:list-id :x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type :content-transfer-encoding; bh=5sFBh4OueSF447ALaqupTNKghXEIwAcdJQbka4AoiwE=; b=wjrp3Zs6zksQyA9Iq318z0E0f/aBT7vGQHLBB+K8pvsaX/tVd0zYBXfjHmeUv02kW0 iZkTV+zABT2iMi78HwB++swCqKpAu2mL6gh7NWaObvEQmIlLbcwWf43zxQaq9AqELfKs gCtZO3qZzHGrztvCRULCIvnTDdWjuw0lwzY2Y= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:mime-version:date:in-reply-to:x-ip:references :user-agent:x-http-useragent:message-id:subject:from:to :x-original-sender:reply-to:precedence:mailing-list:list-id :x-google-group-id:list-post:list-help:list-archive:sender :list-subscribe:list-unsubscribe:content-type :content-transfer-encoding; b=bkUzSmeBOCatiXVAo+EYwtmuSIbxES8RQ0AS54CuFtyzj7wPPn3e97kbrdOf2HwQz9 4z7muDIG5h33dWkO+mhD7LdXz58JnzLCdIpuP3nJ9eUC63w+kYdLVQb5fEn/9CxKDtwq FNJATWaB3pmxH92j+UOdSmTAuwU6yboyLYaL0= Received: by 10.229.88.147 with SMTP id a19mr315838qcm.26.1303636355971; Sun, 24 Apr 2011 02:12:35 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.224.179.137 with SMTP id bq9ls771119qab.6.gmail; Sun, 24 Apr 2011 02:12:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.224.33.131 with SMTP id h3mr315080qad.3.1303636355408; Sun, 24 Apr 2011 02:12:35 -0700 (PDT) Received: by dr5g2000vbb.googlegroups.com with HTTP; Sun, 24 Apr 2011 02:12:35 -0700 (PDT) Date: Sun, 24 Apr 2011 02:12:35 -0700 (PDT) In-Reply-To: X-IP: 92.113.160.163 References: <5e7bdd85-5c9f-4c47-9ddc-3a5de443ee80@hd10g2000vbb.googlegroups.com> <20110422203058.GC80702@alice.local> <46db26a7-603f-4443-a4c7-8d423e9811ed@cu4g2000vbb.googlegroups.com> User-Agent: G2/1.0 X-HTTP-UserAgent: Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/534.24 (KHTML, like Gecko) Chrome/11.0.696.50 Safari/534.24,gzip(gfe) Message-ID: <68bf269b-2093-4437-a520-089b0eb915b0@dr5g2000vbb.googlegroups.com> Subject: [lojban] Re: fosay translator From: "zoltan.kochan" To: lojban X-Original-Sender: zoltankochan@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable On this level I'm just giving all the alternative translations. When this part of the translator will be finished then I will think about the disambiguation. But I assume that I will be able to achieve a good result because it is a deep translation method and I have all the information about the translated text. What I really wanted to do at this point was to provide the possibility of editing and adding languages in the translator. With this possibility it will be a good tool for linguistic experiments and for translations from/to not widely used languages. On 24 =D0=9A=D0=B2=D1=96, 05:02, Ivo Doko wrote: > On 22 April 2011 22:49, Z-CORE wrote: > > > > > > > > > > > Sorry, but I did implement it only for declerative sentences yet. And > > it translates > > between English, Hungarian and Ukrainian. Lojban is used in the > > interlingua. E.g. > > the interlingua of "Zolt=C3=A1n Kocs=C3=A1n l=C3=A1tott egy h=C3=A1zat.= " will be: > > sentence: > > { > > =C2=A0 =C2=A0 =C2=A0clause: > > =C2=A0 =C2=A0 =C2=A0{ > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 accusative: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 { > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0noun-phrase: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0{ > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 q= uantity: none > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 d= ifinity: undifinite > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 n= oun: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 { > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0order-number: 3 > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0form: formal > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0real-number: singular > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0lemma: zdani > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0noun-type: common > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 } > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 p= ersone: third > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0} > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 } > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 verb-phrase: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 { > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0truth: 1.0 > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0verb: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0{ > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 l= emma: viska > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 o= rder-number: 2 > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0} > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0tense: past > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0mood: declarativ= e > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0aspect: indefini= te > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 } > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 subject: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 { > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0noun-phrase: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0{ > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 q= uantity: none > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 d= ifinity: difinite > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 n= oun: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 { > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0personal-name: first-name > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0order-number: 0 > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0form: formal > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0tags: ukrainian > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0real-number: singular > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0lemma: zoltan. > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0noun-type: proper > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 } > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 n= oun: > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 { > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0personal-name: surname > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0order-number: 1 > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0form: formal > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0real-number: singular > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0lemma: kotcan. > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0noun-type: proper > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 } > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 p= ersone: third > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0} > > =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 } > > =C2=A0 =C2=A0 =C2=A0} > > } > > As you can see, Lojban words are used as lemmas. I'm planning to use > > Lojban when > > describing the knowladge base so I will implement the translation for > > it too. > > The translation of the "fuzzy" Lojban phrases to the target languages > > will be a > > set of alternatives. My Lojban is poor yet but I know that there's a > > possibility > > to say a sentence with an undifined tense. Thus, it will give three > > translations > > for such sentences: > > "A girl saw a house." > > "A girl is seeing a house." > > "A girl will see a house." > > Something like that. > > > On 22 =D0=9A=D0=B2=D1=96, 23:30, ".alyn.post." > > wrote: > >> On Mon, Apr 18, 2011 at 01:20:28PM -0700, Z-CORE wrote: > >> > Hello, the Lojban community. > >> > My name is Zoltan Kochan. I'm working on an Interlingual Machine > >> > translator. I've started it for 3 natural languages: English, > >> > Hungarian and Ukrainian. > > >> > But I'm a big fan of constructed languages and I've realized that > >> > Lojban can be used as a good pivot-language during machine > >> > translation. That's why I've chosen it's dictionary for representing > >> > meaning during the translation process. > > >> > It's not a super-translator... hopefully yet. But it can translate > >> > many types of simple and complex sentences in any! direction between > >> > the added languages. And it has the ability of extending itself with > >> > other (custom) languages. I've "invented" ATNL and CWS for describin= g > >> > the grammar and the dictionary of any other language. > > >> > Anyway, it's an open-source project (https://code.google.com/p/fosay= /) > >> > and anybody can download it, try it, have fun with it:) > > >> > And I'll be very grateful for any kind of help! Somebody can help me > >> > with filling the dictionaries. Others can support the development > >> > process (It's written in Python 3). > > >> Cool! > > >> Will you provide some examples of what your program does? =C2=A0How do= es > >> it translate the following? =C2=A0(note that these translations are no= t > >> precisely identical, I'm not assuming your program flips between them, > >> I'm curious what it looks like to go from Lojban =3D> English and from > >> the English to Lojban.) > > >> .i ta mo / what is that? > >> .i ta rokci / that is a rock. > >> .i ta me lo do xu rokci / is that your rock? > >> .i ta me lo mi rokci / that is my rock. > >> .i xu do ponse lo do rokci / do you have your rock? > >> .i mi ponse lo mi rokci / I do have my rock. > >> .i mi djica lo za'i ponse lo do rokci / I want your rock. > > >> -Alan > >> -- > >> .i ma'a lo bradi ku penmi gi'e du > > > -- > > You received this message because you are subscribed to the Google Grou= ps "lojban" group. > > To post to this group, send email to lojban@googlegroups.com. > > To unsubscribe from this group, send email to lojban+unsubscribe@google= groups.com. > > For more options, visit this group athttp://groups.google.com/group/loj= ban?hl=3Den. > > That's a pretty neat idea. Since natural languages are ambiguous the > best thing you can do except for teaching a computer how to recognise > context (which you probably couldn't accomplish successfully in a > reasonable timespan) is to provide several translations, while > "grading" each one with how probable it is. > > Is this accomplished through neural network or some other means of > programming artificial intelligence? > > -- > mu'o mi'e .ivan. --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To post to this group, send email to lojban@googlegroups.com. To unsubscribe from this group, send email to lojban+unsubscribe@googlegrou= ps.com. For more options, visit this group at http://groups.google.com/group/lojban= ?hl=3Den.