Received: from mail-yh0-f62.google.com ([209.85.213.62]:43304) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1XQdwr-00031U-4R for lojban-list-archive@lojban.org; Sun, 07 Sep 2014 08:00:14 -0700 Received: by mail-yh0-f62.google.com with SMTP id b6sf2913850yha.7 for ; Sun, 07 Sep 2014 08:00:06 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=JS/leQ+zMCOGb+AI6CHelUTzE5jmbHsHj/D6JGwoORc=; b=pQRV5ZCLcK7VjZHQfT3SUvT4oa0q2I5ITIBjswA/hgIGKCtK7PG26BHEQfQre1RffN 5qNlo7HJNnGDx5foAGIM8QTZDFtP1xSRlwcSWoTY4qajwEMkIx71gi4xDUQFXIoB3keM QaIIqauFGioKtAc/IX1nPDyEpMxsclNPp1122+/y/W2Ve88njD1czB4JcSo6osrBXiUM Va6urdu4/cPBI51oBAlLHQPqUQA0Ym1eSNL8UX4/3N/8CZAACpaGdK5W/EKKE4AtOH1K 5WuK0iarCWOAD3QEhmSODjDqoMKLPlenZrbPTTsgB4Tk7Zih5b+jv+OhjQSeShHE06nF mzng== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=JS/leQ+zMCOGb+AI6CHelUTzE5jmbHsHj/D6JGwoORc=; b=wZ5sLljOLxZr0df5hohIH1LpbDFjKvnAjNuQ5xfwDE1sGfyc3GFlVqk8LJVCTDZ1uy jABehHsY7ESJgciE4sWcXanVc3xWp4DSXgs13wRZb7Szba8B0ntKYk4MEbR0CX0EcY8o I2OXyxNKzSUDwWD6jJXq5hXaEfo1hppqMTuMeK3iNwUlY9Ap/R/rkCEaFWINwu6Isa2g nTcp52CIU1CKHti7wDAjrBkTlRdNT0iFL6gvUCWm+19eAo9Lsdh3bDCfKpGU/EO/HFWt Y/CNgrjzdQMREqWpJUnYkFQ4bDgYIJ5fyebE/mNx8w5DqisDF2Q4CkucJ8CEo2xEmgrL +RAA== X-Received: by 10.50.142.104 with SMTP id rv8mr194479igb.11.1410102006395; Sun, 07 Sep 2014 08:00:06 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.50.12.73 with SMTP id w9ls572410igb.29.gmail; Sun, 07 Sep 2014 08:00:06 -0700 (PDT) X-Received: by 10.50.79.201 with SMTP id l9mr197427igx.11.1410102006137; Sun, 07 Sep 2014 08:00:06 -0700 (PDT) Date: Sun, 7 Sep 2014 08:00:04 -0700 (PDT) From: djeikon To: lojban@googlegroups.com Message-Id: In-Reply-To: References: <54090077.7050803@lojban.org> <1409940819.12603.YahooMailNeo@web181104.mail.ne1.yahoo.com> Subject: Re: [lojban] Why left-grouping of tanru? MIME-Version: 1.0 X-Original-Sender: jwdconstable@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary="----=_Part_10425_734929335.1410102005064" X-Spam-Note: SpamAssassin invocation failed ------=_Part_10425_734929335.1410102005064 Content-Type: text/plain; charset=UTF-8 Right-branching noun compounds do tend to be more common than left-branching in English corpora. The conversion of CCGbank (a corpus of CCG-annotated Wall Street Journal text; see http://groups.inf.ed.ac.uk/ccg/ccgbank.html) from the Penn Treebank required imposing binary branching on all noun compounds, and they opted for universal right-branching, given that recovering the correct bracketing was considered too difficult / out of scope, and right-branching at least guarantees correctness >50% of the time. -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. ------=_Part_10425_734929335.1410102005064 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Right-branching noun compounds do tend to be more common t= han left-branching in English corpora. The conversion of CCGbank (a corpus = of CCG-annotated Wall Street Journal text; see http://groups.inf.ed.ac.uk/ccg/ccgbank.html= ) from the Penn Treebank required imposing binary branching on all noun com= pounds, and they opted for universal right-branching, given that recovering= the correct bracketing was considered too difficult / out of scope, and ri= ght-branching at least guarantees correctness >50% of the time.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_10425_734929335.1410102005064--