Message-ID: <540C7784.7050303@gmail.com>
Date: Sun, 07 Sep 2014 16:19:32 +0100
From: And Rosta
To: lojban@googlegroups.com
Subject: Re: [lojban] Why left-grouping of tanru?
References: <54090077.7050803@lojban.org> <1409940819.12603.YahooMailNeo@web181104.mail.ne1.yahoo.com>

djeikon, On 07/09/2014 16:00:
> Right-branching noun compounds do tend to be more common than left-branching in English corpora.
> The conversion of CCGbank (a corpus of CCG-annotated Wall Street Journal text; see http://groups.inf.ed.ac.uk/ccg/ccgbank.html) from the Penn Treebank required imposing binary branching on all noun compounds, and they opted for universal right-branching, given that recovering the correct bracketing was considered too difficult / out of scope, and right-branching at least guarantees correctness >50% of the time.

Wow, that was a quick reply! I'd like to see the evidence for the claim that right-branching compounds are more common. For instance, the count of right-branching might be erroneously inflated by misanalysing attributive nouns as compound-initial (e.g. "London fight club").

--And.