[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] Why left-grouping of tanru?



Right-branching noun compounds do tend to be more common than left-branching in English corpora. The conversion of CCGbank (a corpus of CCG-annotated Wall Street Journal text; see http://groups.inf.ed.ac.uk/ccg/ccgbank.html) from the Penn Treebank required imposing binary branching on all noun compounds, and they opted for universal right-branching, given that recovering the correct bracketing was considered too difficult / out of scope, and right-branching at least guarantees correctness >50% of the time.

--
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.