Date: Tue, 7 Sep 2010 22:12:12 -0600
From: Alan Post
To: lojban-list@lojban.org
Subject: Re: [lojban] CLL diffs

On Tue, Sep 07, 2010 at 09:08:43PM -0700, Robin Lee Powell wrote:
> On Tue, Sep 07, 2010 at 09:59:51PM -0600, Alan Post wrote:
> > My favorite change so far is the following:
> >
> > [-forbidden.-] {+forbilien .+}
> >
> > Someone changed forbidden to forbilien, twice no less.
> >
> > My largest challenge in this project is the fact that I did not
> > get consistent conversion of non-ASCII characters, so the wdiff
> > patch is very noisy: any time a non-ASCII character, or an ASCII
> > character with a non-ASCII representation (e.g., single and double
> > quotes), appears, it shows up as a diff. I've managed to remove
> > certain classes of these, and am still finding patterns as I go.
>
> There's a Unix command called "recode" which can almost certainly
> fix those problems, just so you know.
>

This particular problem happened when I converted the .doc to .rtf.
I'm aware of iconv's support for squashing non-ASCII characters; I
did not know about recode. I'll review that part of my pipeline
and see if I can catch the program doing it.

It *looks* like Word just made all the non-ASCII characters '?', but
I could be seeing an effect of not having full UTF support in my
terminal/editor too.

-Alan
--
ko djuno fi le do sevzi
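For anyone following the thread, here is a minimal sketch of the kind of "squash non-ASCII to ASCII" pass mentioned above, applied to both texts before running wdiff so that curly quotes and similar characters stop showing up as differences. It is a Python stand-in, not what iconv or recode actually do, and the names `PUNCT_MAP` and `squash_to_ascii` are hypothetical. It assumes the input has already been decoded to Unicode text correctly; if the .doc to .rtf step already replaced characters with '?', nothing here can recover them.

```python
import unicodedata

# Hypothetical mapping of typographic punctuation to ASCII look-alikes.
# Extend it as new noisy patterns turn up in the diff.
PUNCT_MAP = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "--",   # em dash
    "\u2026": "...",  # horizontal ellipsis
    "\u00a0": " ",    # no-break space
}
_TABLE = str.maketrans(PUNCT_MAP)


def squash_to_ascii(text: str, placeholder: str = "?") -> str:
    """Return an ASCII-only approximation of text."""
    # First replace the punctuation we know about.
    text = text.translate(_TABLE)
    # NFKD splits accented letters into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    out = []
    for ch in decomposed:
        if ord(ch) < 128:
            out.append(ch)            # plain ASCII passes through
        elif unicodedata.combining(ch):
            continue                  # drop the combining accent
        else:
            out.append(placeholder)   # anything else becomes the placeholder
    return "".join(out)
```

Running both the old and new text through the same function before diffing means only real wording changes, such as forbidden versus forbilien, should remain in the output.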