[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] The efficacy of Lojban's grammar.



On Apr 10, 2010, at 9:51, And Rosta wrote:

I wager that syntactic structures that would be assigned to Lojban sentences by (1) syntacticians and (2) Lojban speakers would differ very substantially from the syntactic structures assigned by the formal grammar.

[Disclaimer: I Am Not A Linguist.]

In my experience developing software which works with the results of parsing using formal grammar (well, the PEG version), the trees produced by the formal grammar are not like how I internally think of Lojban grammar, but insofar as they are, *they aren't what I would design as a formal AST for Lojban either*.

In particular, the trees have a huge number of nodes which pertain only to the implementation structure of the grammar and are both redundant and unrelated to the semantics of Lojban. This falls out from the fact that the parser produces one tree node per nonterminal, named according to that nonterminal, unconditionally: no appropriate specialized actions/transformations have been defined.

There are two major problems with the usefulness of the produced parse trees:

1. There are many nodes with exactly one child which reflect rules that exist only due to the factoring of the grammar, or nested cases in order to produce the proper parse tree for various optional clauses which
     usually don't exist.

2. The nodes are named according to the nonterminal, not according to the matched rule. This means that the names reflect the syntactic role, the slot it fills, rather than what the slot was filled with. The result of this is that an interpretation of a given subtree has be inferred from the number and kind of child nodes rather than an actual symbol in the
     tree.

Particularly, note that the second problem is because the information *simply does not exist* in a formal system. The formal grammar(s) we have are simply defined to accept/reject sentences; the information about "what are these particular alternatives called" exists only informally in the CLL and other semantic-description documents.

So:

I agree that the formal grammar produces weird structures.

However, I believe it would be possible to create a parser, or transform the output of the current parser(s), such that the structure *is* similar to what a syntactician, or a Lojban speaker who is familiar with parsers and formal grammars (such as for programming languages), would assign.

Furthermore, I believe this particular project *should* be done, as it would (a) aid the development of computer software which interprets Lojban, and (b) be a useful tool in discussing “what does this Lojban sentence mean” and making sure that the semantics of Lojban are fully defined.

--
Kevin Reid                                  <http://switchb.org/kpreid/>




--
You received this message because you are subscribed to the Google Groups "lojban" group.
To post to this group, send email to lojban@googlegroups.com.
To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban?hl=en.