Jorge Llambías, On 21/01/2015 12:33:On Tue, Jan 20, 2015 at 8:35 PM, And Rosta <and.rosta@gmail.com <mailto:and.rosta@gmail.com>> wrote:
Step (3') yields something like Tersmu output, probably augmented by some purely syntactic (i.e. without logical import) structure. I think that can and should be done without reference to the formal grammars.
But Tersmu output is basically FOPL, which has its own formal grammar
(on which Lojban's formal grammar is based). I still don't see what
problems formal grammars create.
(3') must certainly involve a grammar, and I can't think of any sense in which a grammar could meaningfully be called 'informal', so I'm happy to call that grammar 'formal'. But it differs from the CS (or at least the Lojban) notion primarily in not having phonological objects as any of its nodes and secondarily in not necessarily being simply a labelled bracketing of a string.
To the extent that Lojban is a language, (3) doesn't really constitute any part of Lojban (despite the mistaken belief of many Lojbanists to the contrary). Also, to the extent that Lojban is a language, there exists an implicit version of (3'), albeit not necessarily one that is coherent or unambiguous. So I would recommend removing the current Formal Grammars from the definition of Lojban, and replacing them by one -- an explicit (3') -- that more credibly represents actual human language (but is unambiguous etc.).
Also questionable is the extent to which a nonterminal node can have properties/labels not simply derived from the label of the head daughter: the range of views among syntacticians is too hard to summarize in one sentence here, but certainly one does not come across syntactic trees for natlang sentences with a pattern of labellings resembling Lojban's, i.e. where the relationship between labels on the mother and the daughters is unconstrained.
Unary branches don't do
anything useful, but are they harmful other than in cluttering the
tree with superfluous nodes?
They're harmless clutter if there's no contrast with a version of the tree where mother and singleton daughter merge into the same node. You need to consider the branching issue together with the labelling issue. If mother and head-daughter have the same label, then the redundancy of unary branching is plain.
Ok, but in Lojban there's almost a one-to-one match between
phonological and syntactic words.
That remains to be seen, because there isn't yet an explicit real syntax for Lojban. However, it's perfectly possible that in Lojban, phonology--syntax mismatches are rare.
I'm not sure if choosing a simple Lojban example is going to reveal why you can't have beliefs about binary branching in natlangs.
Syntax is a set of rules for combining the combinatorial units of syntax in ways that are combinatorially licit and that combine the units' phonological forms and their meanings. I suspect (but excuse me if I'm mistaken) that for you every set of rules that defines the correct set of sentences is equally valid, so that so long as the rules match the right sentence sounds to the right sentence meanings, it doesn't matter what the intermediate structure is like; if the syntactician has a job, it is to work out *a* set of rules, but there is no reason to think there is only one correct set of rules. In contrast, pretty much all linguisticians think (but not always for the same reasons) that of the sets of rules that generate the same, correct, set of sentences, some of those sets are right and some are wrong or at least some are righter and some are wronger
. In my case I think the rules matter because (i) to understand the system you need to understand its internal mechanics, and (ii) a speaker knows a certain set of rules. and it's known-rules that are my object of study.