Received: from mail-ia0-f187.google.com ([209.85.210.187]:47538) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.76) (envelope-from ) id 1UKTjb-0003De-HA; Tue, 26 Mar 2013 06:16:31 -0700 Received: by mail-ia0-f187.google.com with SMTP id l25sf2550773iad.24 for ; Tue, 26 Mar 2013 06:16:09 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=x-received:x-beenthere:x-received:received-spf:x-authority-analysis :x-cloudmark-score:x-authenticated-user:x-originating-ip:from:to :subject:date:message-id:user-agent:in-reply-to:references :mime-version:x-spam-score:x-spam_score:x-spam_score_int:x-spam_bar :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:x-google-group-id:list-post :list-help:list-archive:sender:list-subscribe:list-unsubscribe :content-type; bh=zcXJnE2vnl2eD0gSOW7QKO2iSGI/poaGEAcXXbCtj/o=; b=d8d+NITPRSReEp/aBW3jFnHXHDpws2riGWZEJE1b6p6pTkQO9COyZIa7h4gq70B8T3 H9VeAwWJXioCKmKiqxS5iXIwEwwOAoem6Ivts6cfgnfJ47hYgC8QUucP5TjonqJYCmnT 00sS60fF8zBBVNnwl5jX2N5Dxp5FZVDvDzUmxNouejcG3qxr9eEz25KIp+FXljVLQTwp xo9vjR4V28jmAvAaEoEALhgSxVcNmrNx8RJzrctzPst8m0NWCMkSzxTXl6eqnJyWPvFs r3vUZJM08MebNuNyTTdG9oNQMrGUQ+hAjLf5xf3yMACZcCul1yEQh55AKTwp2ZIuWJC1 fjqA== X-Received: by 10.50.5.197 with SMTP id u5mr262685igu.0.1364303768962; Tue, 26 Mar 2013 06:16:08 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.50.158.231 with SMTP id wx7ls4493444igb.5.gmail; Tue, 26 Mar 2013 06:16:08 -0700 (PDT) X-Received: by 10.66.12.161 with SMTP id z1mr2287770pab.11.1364303767888; Tue, 26 Mar 2013 06:16:07 -0700 (PDT) Received: from stodi.digitalkingdom.org (mail.digitalkingdom.org. [173.13.139.236]) by gmr-mx.google.com with ESMTPS id cu1si2792291pbc.1.2013.03.26.06.16.07 (version=TLSv1 cipher=RC4-SHA bits=128/128); Tue, 26 Mar 2013 06:16:07 -0700 (PDT) Received-SPF: neutral (google.com: 173.13.139.236 is neither permitted nor denied by best guess record for domain of phma@bezitopo.org) client-ip=173.13.139.236; Received: from nobody by stodi.digitalkingdom.org with local (Exim 4.76) (envelope-from ) id 1UKTjS-0003DZ-Sv for lojban@googlegroups.com; Tue, 26 Mar 2013 06:16:07 -0700 Received: from cdptpa-omtalb.mail.rr.com ([75.180.132.120]:49295) by stodi.digitalkingdom.org with esmtp (Exim 4.76) (envelope-from ) id 1UKTjN-0003D9-Rn for lojban-list@lojban.org; Tue, 26 Mar 2013 06:16:06 -0700 X-Authority-Analysis: v=2.0 cv=TO3HuiZa c=1 sm=0 a=sqYj6zCFaLTNQ95sbP+eeQ==:17 a=W3V8cEvdJLQA:10 a=1wfwBGJGynAA:10 a=SRq_hl3J41MA:10 a=Yr05hP5UdKEA:10 a=kj9zAlcOel0A:10 a=xqWC_Br6kY4A:10 a=JNFw9bs7AAAA:8 a=kwqKA2INyzMA:10 a=zZ5gfuN7w-MYsvT8KywA:9 a=CjuIK1q_8ugA:10 a=sqYj6zCFaLTNQ95sbP+eeQ==:117 X-Cloudmark-Score: 0 X-Authenticated-User: X-Originating-IP: 69.132.98.107 Received: from [69.132.98.107] ([69.132.98.107:48120] helo=leopard.ixazon.lan) by cdptpa-oedge01.mail.rr.com (envelope-from ) (ecelerity 2.2.3.46 r()) with ESMTP id 8A/71-06772-B8F91515; Tue, 26 Mar 2013 13:15:55 +0000 Received: from caracal.localnet (unknown [IPv6:2001:470:8:42:c509:fddf:e704:7600]) by leopard.ixazon.lan (Postfix) with ESMTPS id EFECB15A7 for ; Tue, 26 Mar 2013 09:15:54 -0400 (EDT) From: Pierre Abbat To: lojban-list@lojban.org Subject: Re: [lojban] Need some jbovlaste programming help. Date: Tue, 26 Mar 2013 09:15:47 -0400 Message-ID: <3520631.PfPziTd2d6@caracal> User-Agent: KMail/4.8.5 (Linux/3.2.0-38-generic; KDE/4.8.5; x86_64; ; ) In-Reply-To: <20130325180820.GU6328@stodi.digitalkingdom.org> References: <20130325180820.GU6328@stodi.digitalkingdom.org> MIME-Version: 1.0 X-Spam-Score: 0.0 (/) X-Spam_score: 0.0 X-Spam_score_int: 0 X-Spam_bar: / X-Original-Sender: phma@bezitopo.org X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: 173.13.139.236 is neither permitted nor denied by best guess record for domain of phma@bezitopo.org) smtp.mail=phma@bezitopo.org Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: text/plain; charset=ISO-8859-1 X-Spam-Score: 0.0 (/) X-Spam_score: 0.0 X-Spam_score_int: 0 X-Spam_bar: / On Monday, March 25, 2013 11:08:21 Robin Lee Powell wrote: > Some bad data has snuck its way in to jbovlaste (a good chunk from > an import script I screwed up that we can't just re-run, but some of > it isn't from that, so not sure what's going on) and it needs > cleaning. > > I have neither the time nor inclination. > > I don't much care what it's written in as long as it's UTF-8 safe > (i.e. bash isn't going to cut it), but we need something that does > the following: > > For every natlang word: > > if a duplicate (same word, meaning, and langid) exists, > consolidate them. This means deleting the duplicate, combining > the "notes" field for the two of them, and updating all instances > of the id you just deleted to point to the one that still exists > in the tables threads, keywordmapping, natlangwordbestguesses, and > natlangwordvotes. natlangwordbestguesses has to be handled > specially there, as it shouldn't end up with two identical rows > (identical across all 3 fields); that shouldn't be possible given > that manipulation, but check anyway. Is PHP okay? All the database programming I've done is in PHP, except a tiny bit that's in bash. Can I (or anyone else who's going to do this) have a copy of the database to practice on? I don't want to delete from the live database a word by mistake. Pierre -- sei do'anai mi'a djuno puze'e noroi nalselganse srera -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban?hl=en. For more options, visit https://groups.google.com/groups/opt_out.