From lojban+bncCNTPpI2KGxCg3rvkBBoExTI2qw@googlegroups.com Mon Sep 13 20:42:40 2010 Received: from mail-pz0-f61.google.com ([209.85.210.61]) by chain.digitalkingdom.org with esmtp (Exim 4.72) (envelope-from ) id 1OvMPk-0008Np-N5; Mon, 13 Sep 2010 20:42:40 -0700 Received: by pzk7 with SMTP id 7sf525482pzk.16 for ; Mon, 13 Sep 2010 20:42:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=beta; h=domainkey-signature:received:x-beenthere:received:received:received :received:received-spf:x-authority-analysis:x-cloudmark-score :x-originating-ip:received:received:from:to:subject:date:user-agent :references:in-reply-to:mime-version:message-id:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type:content-disposition; bh=f99fO7qX6ANzWMWpJmukhW2tqEhoNM0/GgMAVeotNlk=; b=Nk7KeoAvoZQvbZjStu01ZUKOXktr1Ft3WwZtM2EaY1jgQ5bVWzgS5xc6zPNrzvdMcz YOkPO+cbYQIXkDsj5TjrZC0K7UdZx+dN/ysGR2yarkCIAv4g/y5Lyd/nmLEv3w7zlRSe pU1v5yYK3S4s2mx/b5HoQ7JaGaV5L45KnsGUk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlegroups.com; s=beta; h=x-beenthere:received-spf:x-authority-analysis:x-cloudmark-score :x-originating-ip:from:to:subject:date:user-agent:references :in-reply-to:mime-version:message-id:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type:content-disposition; b=R2Ugdi4Ip1NrhphsjXBqr9cQ0XyUgPDkc+R3nPlgfXkIy0pJdpBdMOT790UYDIR6Qh 6xoWyNESlq+a/IY3ALgCylmNjA7qK5CHuKpDlHZb2tL+V5Axn8z6YuZUtiSM+msQMijk xgmPffzX7Ayg0wSSfjwxVOQlIc//XupGDXwv0= Received: by 10.115.101.15 with SMTP id d15mr320921wam.15.1284435744976; Mon, 13 Sep 2010 20:42:24 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.115.66.21 with SMTP id t21ls1129397wak.2.p; Mon, 13 Sep 2010 20:42:24 -0700 (PDT) Received: by 10.114.53.9 with SMTP id b9mr1221106waa.21.1284435744155; Mon, 13 Sep 2010 20:42:24 -0700 (PDT) Received: by 10.114.53.9 with SMTP id b9mr1221105waa.21.1284435744119; Mon, 13 Sep 2010 20:42:24 -0700 (PDT) Received: from cdptpa-omtalb.mail.rr.com (cdptpa-omtalb.mail.rr.com [75.180.132.122]) by gmr-mx.google.com with ESMTP id j18si8433263wan.5.2010.09.13.20.42.23; Mon, 13 Sep 2010 20:42:23 -0700 (PDT) Received-SPF: neutral (google.com: 75.180.132.122 is neither permitted nor denied by best guess record for domain of phma@phma.optus.nu) client-ip=75.180.132.122; X-Authority-Analysis: v=1.1 cv=qYL/ltCbfNJUfGmhDGumtphIH30dseuhEnbjja0E7b4= c=1 sm=0 a=2VlS1xfKK4kA:10 a=wPDyFdB5xvgA:10 a=8nJEP1OIZ-IA:10 a=9o99xeNKNPYSmM5t9x5+TQ==:17 a=8Ph_vcHEAAAA:20 a=Q7o3BMFtjvbaEiEAX1sA:9 a=t4frAqGsnUejhuaUZd-a9nf577oA:4 a=wPNLvfGTeEIA:10 a=9o99xeNKNPYSmM5t9x5+TQ==:117 X-Cloudmark-Score: 0 X-Originating-IP: 75.176.118.168 Received: from [75.176.118.168] ([75.176.118.168:38436] helo=chausie) by cdptpa-oedge04.mail.rr.com (envelope-from ) (ecelerity 2.2.2.39 r()) with ESMTP id 86/0F-23867-F1FEE8C4; Tue, 14 Sep 2010 03:42:23 +0000 Received: from localhost (localhost [127.0.0.1]) by chausie (Postfix) with ESMTP id 7EB1C1B1B for ; Mon, 13 Sep 2010 23:42:22 -0400 (EDT) From: Pierre Abbat To: lojban@googlegroups.com Subject: Re: [lojban] gismu creating program Date: Mon, 13 Sep 2010 23:42:05 -0400 User-Agent: KMail/1.9.6 (enterprise 0.20070907.709405) References: <201009131432.18890.phma@phma.optus.nu> <20100914011736.GA2157@sdf.lonestar.org> In-Reply-To: <20100914011736.GA2157@sdf.lonestar.org> MIME-Version: 1.0 Message-Id: <201009132342.07339.phma@phma.optus.nu> X-Original-Sender: phma@phma.optus.nu X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: 75.180.132.122 is neither permitted nor denied by best guess record for domain of phma@phma.optus.nu) smtp.mail=phma@phma.optus.nu Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 Content-Disposition: inline On Monday 13 September 2010 21:17:38 Minimiscience wrote: > You mean the {gismu} scoring program? > 7/61e555d419c5723b> should answer both questions. Thanks. I got it to run on the example and say "mlino". I'm now trying to run it on the following data, which I haven't finished expanding: # zh en hi es ru ar # zh: cie (crab); cia (shrimp) # en: krab, krastecn, crimp, labstr # hi: kekd (crab) # es: kangrex, (crab); krustase (crustacean); kamaron (shrimp); langost (lobster) # ru: krab (crab); krevetk (shrimp); omar, lobster (lobster) # ar: saratan (crab); rubian, jambari (shrimp); karkand (lobster) cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krab 0.09 saratan 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krab 0.09 rubian 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krab 0.09 jambari 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krab 0.09 karkand 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krevetk 0.09 saratan 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krevetk 0.09 rubian 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krevetk 0.09 jambari 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 krevetk 0.09 karkand 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 omar 0.09 saratan 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 omar 0.09 rubian 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 omar 0.09 jambari 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 omar 0.09 karkand 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 lobster 0.09 saratan 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 lobster 0.09 rubian 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 lobster 0.09 jambari 0.07 cie 0.36 krab 0.21 kekd 0.16 kangrex 0.11 lobster 0.09 karkand 0.07 It tries to make a gismu out of the comments. What's the correct comment character for this program? Is there a way to tell it to use one set of weights for all lines? How can I tell it "try all combinations of these words for Russian, those words for Arabic, etc. and tell me the ten best-scoring words that come up"? Pierre -- The Black Garden on the Mountain is not on the Black Mountain. -- You received this message because you are subscribed to the Google Groups "lojban" group. To post to this group, send email to lojban@googlegroups.com. To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com. For more options, visit this group at http://groups.google.com/group/lojban?hl=en.