From lojbab@xxxxxx.xxxx Sat Jan 1 11:23:32 2000 X-Digest-Num: 326 Message-ID: <44114.326.1773.959273825@eGroups.com> Date: Sat, 01 Jan 2000 14:23:32 -0500 From: "Bob LeChevalier (lojbab)" Here's a combined reply to Lojbab and Brook. > > > It can be spoken and written (unambiguously), and it is languagy enough > > that someone can learn it as a language (I doubt that many can learn > > predicate logic as a language). > >There's nothing to say but that we disagree, in that I do think that >people could learn it at least at the intellectual level that people >learn programming languages. They mightn't be able to learn it as >a natlang, but on the other hand, for a limited purpose like legislative >texts this might not be a bad thing, and anyway, Lojban too, taken as >a whole, might be insuperably unnatural. > > > By this argument, the efforts to develop an interlanguage for machine > > translation would need only use predicate calculus. But instead the > > closest that anyone has come to successfully using an > > interlanguage was the DLT project that used Esperanto. > >I don't know why this was. Maybe they wanted to translate aspects of >form as well as meaning. And very probably it was easier to work out >the mapping from natlangs to Esperanto than from natlangs to something >as stark as predicate logic form. I would contend that form sometimes is part of meaning. We know for example that sumti/object ordering can put convey emphasis. But it is precisely the problem of working out mappings from natlangs to predicate logic vs. Lojban that makes the difference. To map to predicate logic, you have to explicitly think of things that are subliminal in Lojban, like prenex/quantification issues. There is also brevity: I believe pc once said that quantifiers other than the standard logical quantifiers in pure predicate calculus form get extremely verbose. That is why learning something as a natlang or quasi-natlang is critical to these sorts of applications. It is a matter of speed and confidence that you've done it correctly. Programming languages come close but fall a little short of the mark of knowing you've done it right just by writing it out, as well as of fluency in writing. And the fact that not all programmers are good lawyers and vice versa suggests that there is a distinction in skills between the kinds of accuracy in logic that are needed. > > > In a Polish/Reverse Polish predicate > > >logic notation ... > > >Likewise, the entire syntax could be formulated in a single > > >sentence. > > > > But could human beings use it to describe a patent? > >I don't see why not. I said human beings. And maybe I should have included "read" a patent too. Human beings wrote Principia, but very few human beings can read it with any understanding. > > And what happens when you need to translate an indirect question? After > > all, haven't you just found that it is a fairly intractible problem for > > predicate logic? > >Hopefully it is tractable. But I'd use a special WH quantifier if the >language had to be done today. >From what pc has said, different schools believe different things, and that in itself means that the problem is intractable at a logical level. At a language level it is tractable because language is more tolerant of ad hoc conventions than predicate calculus. Legal language is of course nothing but a full raft of conventions that have been established based on natlang understandings. The closer a "coding language" can come to mimicking the natlang conventions, the easier it is to get judges to agree that the conventions are understood the way that the lawyer intends. That translating legal language into a new language in itself runs a risk of destroying conventional precedents that is as hard an argument to overcome as any linguistic one; our main advantage in this case is that the patents have to be translated anyway. > > >Of course the predicate words' senses have to be defined. But in > Lojban the > > >predicate words' senses are not defined -- this task has been left to > > >'usage' to achieve. > > > > And patent translation would be a large amount of usage. > >Exactly. I don't know anything about patents, but a great deal of law >involves decreeing definitions of terms. Yes. And you can define Lojban predicate words, so that is not a problem. Law would not rely on "usage" in Lojban any more than it does in English. Important terms will be carefully defined. > > > If so, I guess that they problem with predicate calculus is > > >that there's no fudgeability with it, which nonfudgeability is exactly why > > >one wants a logical language. > > > > But fudgeability is fine for patent translation (maybe even desirable to > > the lawyers), so long as the fudguing does not create an ambiguity > > comparable to those of natlangs. > >By fudgeability, I mean the possibility of being ambiguous when you >can't be bothered to disambiguate, or when disambiguation is more >trouble than it's worth. Well then fudgeability is IMPORTANT for legal translation, because there are places where lawyers WANT to leave meaning a little ambiguous. Ambiguity can allow two sides in disagreement to come to terms, leaving the ambiguous areas to be worked out later if necessary. You need to be arbitrarily precise in some areas, but Lojban's allowing of ambiguity when precision is too much trouble (or agreement is impossible) is an advantage. > > You learn the forms of predicate logic, more or less (but how many people > > actually use the full set of Lojban logical connectives, for > > example?). But you do not learn to reason according to the rules of > > inference along the lines of the predicate calculus. > >True, but that's also true even if you learn pred calc notation. I think though that you have to learn to reason with predicate calculus in order to accurately write in it. You also need to be able to do so with Lojban. In looking at the indirect question discussion that you recently conducted, I can imagine identifying and discussing the logical problems with everyone's formulations entirely in Lojban. I cannot imagine doing so entirely confining ourselves to predicate notation. Maybe some logicians can, but not me. > > If you did learn this > > inherently while learning Lojban without having to study the subject, I > > daresay that the original SWH concept for Loglan will have been proven. > > > > I also think that a lot of that "extra stuff" is exactly the sort > > of thing needed for patents, technical writing, formal specification, etc. > >Which stuff? Brook answered this better than I did. > Again, > > the attempts I know of anguages for these arenas have tended to have some > > logical construct to them, but have always had to fall back on a natlang > > like form. If not enough like a natlang, they haven't been learnable; if > > too much like a natlang, they haven't been sufficiently unambiguous. > >Given that pred logic notation differs from natlangs in its extreme >simplicity, it would be interesting to find if some system can be so >simple it is unlearnable. Not unlearnable - just too difficult to use fluently. The computer language APL was loved by some people for its terseness, but the terseness also prevented some people from being able to use it effectively. They couldn't grok things that were so terse. I have similar problems with C. I understand what "i++" means (increment the variable i), but when it is used in a C program written by skilled C programmers, too often my understanding hits a wall when I hit that usage. >Maybe an advantage of Lojban here is that it is public domain and >culturally-neutral in the sense of not, say, being the progeny of >a single commercial organization. The Linux of logical languages, >as it were. > >But if I wanted a logical lang that was only going to be used >in-house, then numerous modifications to Lojban could be made to >make the language easier to learn than Lojban, and yet as >effective. Basically, the main modifications I'd make would >involve discarding tons of stuff and changing the gismu forms to >lightly modified English. "Discarding tons of stuff" means using a language subset. I wouldn't expect usage of Lojban for patents to do anything other than use language subsets. As for the gismu forms, as I said: you are reproposing "Anglan" and the advantages people see in the idea just don't pan out in actual trial. If you change the morphology, of course, you lose the self-segregating aspects and thus spoken/written interchangeability. > > The point is that any top-down application of an artificial language big > > enough to point the way to large scale usage will inherently require that > > the language be easy to learn, with sufficient language learning materials > > that many can learn it by self-study and most anyone with more than minimal > > verbal ability can learn it with a teacher. The latter will of course take > > sufficient skilled Lojbanists to serve as teachers. In short it is the > > "rapid bootstrap" problem that anything really new tends to have a steep > > and time-expensive learning curve. We have to reduce that steepness to > > make Lojban successful. > > > > > I agree that there are these > > >obstacles to the adoption of Lojban. And as I've said, I think Lojban > > >and Esperanto would be poor choices for a patent language, or for a > > >European IAL. > > > > As a patent language it has to go beyond pan-European. Our German > > proponent of the Lojban patent effort cited the difficulty of > > translating Japanese patents as a strong reason in Lojban's favor. > >Two different things are getting confused here. A patent language (which >I am actually thinking of a more broadly a legal language), and an EU >IAL. I think I have been sticking solely to the legal/specification language issue. Legal language is still a natlang, albeit an extremely conventionalized one. (Indeed legal English can be said to be a subset of English just as legal Lojban would be a subset of Lojban). > > I see it as my job as President of LLG to work towards the goals of all > > Lojbanists, including some that you may feel are less practical. I got > > into this project out of a sense of duty, which later expanded to > > become a sense of mission. There are people who want to seriously work > > on finding applications for Lojban. I need to make sure that LLG > > provides the resources needed to make such efforts realizable, > > regardless of whether the goals that the efforts are aiming at will be > > achieved. The best thing I can do is to make it so that when people > > try to promote Lojban for a top-down goal, that there is enough > > substance backing them that they do not seem foolish just for trying. > > And if I do that, real money might "happen", since venture capitalists > > these days are betting on a lot of things with > > even slimmer odds for success than Lojban. > >Okay, now I do see why you should care so much. If I didn't care, I never would have gotten started. I never had the "conlang bug" and wasn't all that interested in language or linguistics before I was forced to do so by my role leading the Lojban effort. > > > And the original idea that a loglan-speaking > > >community would test sapirwhorf, I've always regarded as a bit of blarney > > >baloney by JC Brown who really wanted to invent a language but > > was trying to > > >(a) gain respectability for an ill-respected activity, (b) differentiate > > >the product from others, (c) attract adherents. > > > > I know the history enough to be sure that it was not blarney at the > > time. Remember that in the mid-50s, testing SWH was a big deal. > > Remember also that JCB came from the Campbell school of science fiction > > which I think had a certain amount of SWH built into it. He does seem to > > have conceived of Loglan before the SWH became big, but I think that he > > seriously wanted to make the language a research tool. > >You know the history better than me. The main reasons why I came to the above >conclusions are firstly that it seems improbable that anyone would seriously >propose Loglan as a psycholinguistic experiment, since the experiment is >so uncontrolled and it takes so much effort to set up, I think that JCB himself has a fundamental lack of understanding of the problem of controls, as well as the nature of scientific testing. If he had understood, he might never have started %^). Look at Chapter 7 of the now on-line L1, and you can see what he envisions as a SWH test. But the problems are tractable, IMO - they just need a LOT more thinking out. There were similar criticisms of JCBs scientific procedures during the redesign of 1979-84. JCB conducted "taste tests" with Loglan users to choose among various options for affix formation, but his experiments were uncontrolled (not to mention only performed by English native speakers), and his statistical conclusions were apparently, umm, skewed to get the answers that he wanted. McIvor wrote a devastating criticism of the methodology which was a key trigger starting of the political split; McIvor later reconciled with JCB of course. I won't say any more, since he can speak for himself if he wishes (I think he is subscribed at the moment - RAM?) > and secondly that >thereafter JCB showed himself much more interested in all the designing >and tinkering and so forth that conlangers are so familiar with, and also >introduced additional design goals. If he'd been serious, he could have >genuinely invented a speakable predicate logic in a short span of time, >and then, say, raised funds to pay people to learn and use it or to >participate in experiments that oblige or incentivate them to learn and >use it. Well here is where the history comes in. JCB did try to do these things. His original Loglan, the 1956 version, "rattled around in people's heads" (Cowan quoted this more accurately than I did), (I've never seen a copy of the 1956 'book', of which only a couple hundred mimeo copies were made), and people couldn't learn or use it effectively. So he started adding flesh to the language, developed his peculiar learnability schemes (the word making algorithm is something he actually claims to have tested on groups of students, unlike most other aspects of the language, though again we have no idea what typo of controls he used). With his first wife, JCB apparently did a fair amount of serious language work more aimed at teaching rather than fiddling in the early 60s. In many ways I think that we he went through then was something akin to what we went through in the few years of writing the Book - questions came up which he made decisions on, but he did not have the advantage we had of an Internet community to poke holes in his occasional 'brain farts'. He published the first L1 in microfilm in the mid 60s, which was the first chance more than a couple people had to actually look at what he was working on, but few changes were made in the language until after 1974. The book publishing in 1974-5 shows that he probably thought that the language as "done" then. He DID stop fiddling, and started research grant proposal writing to get funding for SWH experiments and such. Put politely, his 1976 proposal was mabla. Meanwhile the computer people showed him that the language wasn't as unambiguous as he had thought, the attempts of the earliest to try to write in the language showed copious flaws, and then the fact emerged that he had screwed up the "pretty little girls school" analysis and omitted needed cases. Meanwhile NSF told him the proposal was inadequate, and JCB reacted by producing two more proposals that were even worse (not having worked in academia for 15 years, he apparently did not know the standards for proposal writing), and then declared war on the NSF evaluators, using its formal protest procedure to the utmost while having no valid issues. That was the end of the possibility for JCB/TLI ever conducting funded research. I think that JCB turned to language fiddling only because the research aspect was closed to him and the fiddling was where the action was in the community that was left. I think he has always wanted to see people using his language, as the necessary step to wherever, and the fiddling was in his mind probably kept to the minimum necessary to gain usage (as well as to keep control once we started outdoing him). TLI's attitude towards change was MUCH more conservative than LLG's until we baselined. I think JCB honestly thought the language was done in 1974 and then again in 1984, but in both cases his community showed him he was wrong; now he cannot commit to saying it is done, because at that point he will lose control (actually, he HAS lost control, because we exist and thrive relatively speaking, but TLI has nothing left once JCB ceases to be the hub of the wheel of change). >I disagree on this. It depends on the nature of the more focused problem, >but for the patent/legal problem, I claim that the lg could be done >quickly, possibly be using parts of Lojban. And I have no qualms against using a language subset. It is changes and not subsetting that would hurt the project. Use of a language subset in a top down application will cause people to look at the rest of the language for other, less constrained applications, including those where a full range of communication is more desireable. (International diplomacy for example is a superset of international law, but requires all of language including a thorough set of attitudinals in order to be effective. What is the attitudinal for Khrushchev's shoe banging, I wonder?) > > Culturally neutral is a biggy. Nonconstraining and flexible are probably > > important, because patent writeups in various natlangs to be translated > > into Lojban will have their own natlang style and idiosyncrasies. And > > patent writeups tend to use very complicated language structures. > >I'm probably displaying my ignorance of patent write-ups then. I was >assuming that all that counts is their pure content, and that issues >of style and the structures used in the source language are irrelevant. They might come to be. But at first the patent application will be as an interlanguage for translating patents that were originally written in other languages. Once Lojban patent legal conventions were established and people wrote patents directly in Lojban (which would inevitably happen if indeed Lojban was adopted as the "real" language for patents), it will be Lojban conventions that dictate the meaning not the translated stylistics of a language multitude. > > Finally, looking at predicate calculus (or *any* calculus, for > > that matter) is misleading because it ignores the question of *vocabulary* > > (a point lojbab makes below). Try programming in Prolog with no libraries. > > Sure, you can write quick-sort in a couple of lines, but what are you >going > > to *do* with it? And how do you map it to the real world? > > > > Lojban has that in the form of gismu - predicate calculus does not. > >It's precisely because this is an area of noncomparability that this issue >is a red herring. Pred logic notation is proposed as an alternative to >Lojban syntax and cmavo. As I originally said, you need predicates. I >don't think Lojban gismu are adequate, because they're not yet defined, >but if you think they're adequate, then the Lojban gismu could be used >as the predicates in the language I'm mooting. If Lojban predicates are not sufficiently defined, I fail to see how any alternative approach would be better. You mentioned using English-like gismu earlier, but that just reduces to the problem of translating into legal English. English has ambiguous grossly polysemous words. Legal English has conventions to deal with English grammatical ambiguities. Legal Japanese has different and incompatible conventions in both words and grammar. To write something intelligible to both requires choosing something that tackles both sorts of ambiguity. In addition, for the European world, English-like means un-sellable to the French. > > > In other > > > words, setting aside how variables are handled, you could have > > > a language with only 3 cmavo! I'll admit that that number might > > > be expanded a bit, e.g. to include numbers, but even an expanded > > > cmavo inventory would be only a tiny proportion of Lojban's. > > > > A perfect example of going too far towards minimalism. Sure, > > AND, OR, and NOT are all you need to make any other truth function > > but you better believe people that build ICs for a living don't > > recreate a one-bit adder from AND, OR, and NOT every time they need > > one - they don't even use one-bit adders - they pull more useful > > things from a library - 32-bit add, multiply, etc. and a whole lot > > of other stuff. > >I can't think of which Lojban cmavo represent huge savings in >convenience over a combination of predicates and basic connectives >and quantifiers. The quantifier 5/mu is much briefer than the predicate logic method of saying "5". Lojban's set selection operators and massifiers are much more flexible than logical objects. (pc established long ago that logical connectives are ineffective for truth-tables of 4 or more elements and aren't that great even for 3). And then there is tense. I've looked at pc's book on tense logic. I will never understand it. > > What? Okay, lojbab is hard at work on a dictionary (in his, I'm > > sure, copious spare time) but gismu are as well defined as any word > > you care to name in any language - better, in fact, because they only > > ever have *one* definition (try looking up "run" or "fork"). > >Either you have in mind some notion of "defined" that I cannot apprehend, >or what you say is wrong. Lojban Central correctly declares that gismu >aren't defined and that definition will be left to usage. In other >words, in its gismu definitions Lojban will operate like a natural >language rather than an invented language. The place structures are baselined. But this I think is incidental since the gismu alone are not adequate to a legal language. So there will be need to coin new words and define new meanings in any event, and that process will mimic the natlang way. > In natural languages, usage has already defined the meanings of words, For any given language. But the translation problem makes these polysemous definitions less than ideal. I also think that natlang words are often less well-defined than some Lojban gismu (in that you often don't even know the implied place structure without contextual interpretation). > and on them there is >intersubjective agreement, such that they are a fit subject for >rational debate and quasiempirical research (in the subfield of linguistics >usually called 'lexical semantics'). Which field is understood and followed by lawyers, right? I wonder if any percentage of lawyers have ever heard of Montague. > > (but then, I believe in Sapir-Worff)). But learning > > predicate calculus is not the same as learning a language. Kids soak up > > language without having to be taught. No one does that with > > predicate calculus. > >It has never been put to the test whether kids could soak up a language >with the grammar of predicate logic notation. Or, on another view, many >linguists believe that this is effectively what all children do: Deep >Structure in the Generative Semantics version of Transformational >Grammar, and Logical Form in the Government-Binding version (and its >equivalent in the Minimalist version) are close approximations to >certain versions of predicate logic notation. > > > Now, unless you believe that "subject", "verb", and "object" are > > hardwired into brains, I'd submit that a young child exposed to a > > fluent lojban speaker could pick it up easily enough (I'm not fluent, > > but my three-year-old daughter seems to get the hang of lojban easily > > enough). > >I don't understand. Since we agree that a subset of Lojban is a close >approximation of predicate logic notation, surely if a child picks >up Lojban, they have picked up predicate logic notation. No. They have picked up a linguistic manipulation of predicate logic notation. I doubt that his daughter could read the notation (if she can read at all). We may be getting into an issue over the primacy of speech vs. writing here. > > Um, but you may also recognize that "Better is the enemy of good > > enough," and see the reduced costs of using something in existence, even > > if it isn't quite what you were looking for. > >I do. I realize and appreciate that. At one time, you were our pet iconoclast. Now you write Lojban. But I think anyone who'd been on the list for a few years would know >it, and would know that there's a hallowed tradition of me being impolite >to Lojbab. But I promise you I'll be giving him a big hug when I meet >him, assuming he's the sort of chap who acquiesces in being given big >hugs by people who've spent years being impolite to him... I keep waiting for the day. But you'll likely come over here before I have a chance to get there, alas. > > Flexibility isn't necessary for describing *inventions*? > >I wouldn't have thought so. By flexibility, I was thinking of "having >many ways to say the same thing". Not "being able to say lots of >things". Flexibility in being able to say things at an arbitrary (but variable) level of specificity depending on the relative importance of precision vs. ambiguity. For example, a patent on exercise equipment (say a treadmill) may need to distinguish precisely distinguish between "run" and "walk", but a patent on a procedure that involves a person travelling on foot does not need to. > > And cultural neutrality seems like a very desirable trait for patent > > description in the *European Union*! Even more so for global patent uses. > >For global, yes. But the EU is very eurocentric and there is no tradition >whatever of cultural neutrality; rather, europeanicity is celebrated at >every opportunity. But the EU has to deal with patents written in non-European languages, which is where this thing started. Hartmut Pilch mentioned the difficulty of translating Japanese patents into European languages, none of which was particularly compatible with some of the problems in Japanese language use. Lojban is MUCH better than any European language in representing ARBITRARY language features unambiguously. > > If you pared Lojban down to the smallest adequate portion you'd still > > > be left with unnecessary stuff (e.g. zo'u, terminators) and > > what remained > > > > Your particular examples of zo'u and terminators seem again, perhaps not > > strictly *necessary* but so useful to merit inclusion. > >Again, I don't see why, but I guess that would be more appropriate for >discussion on some generic loglang list. There isn't any such thing, and I think expanding this could be informative (outside the context of multihundred line posts on an unrelated thread %^) > > Em, no, I'd disagree - look to the incredibly broad applications > > of patents in software that are being put in place, in part because of > > ambiguity of definitions (see "run" and "fork" again, only talk about it > > to a computer programmer). > >I don't understand what you mean. I was making a point about the etymology >or mnemonicality of lexis and your response seems not to bear on this. >Perhaps you are saying that for wholly new ideas we need wholly new words? Well it certainly would be nice if people wouldn't reuse old ones. Lojban allows useful blending of reuse and coinage both with fu'ivla and lujvo-making. > > > Second, if it is possible to do better than Lojban, with the same > > > set of goals, this is largely because it is possible to learn > > > from Lojban's 'mistakes', i.e. it is by standing on Lojban's > > > shoulders that Lojban can be bettered. > > > > Please, do so! :-) > >I'm not clear why you keep responding in this way. If you had a genuine >idea to explore the way in which we think Lojban might be bettered, I think such a topic is germane to Lojban List (so long as we don't actually try to change it). Knowing limitations and flaws is a useful thing, especially since Lojban will see no end to criticism whether it catches on or not. > > > Also, in a certain sense, it has been proved that it is easy to do better > > > than Lojban, because over the years people have often proposed valid > > > improvements that were not adopted (on the grounds that completion was > > > a more important goal than improvement). > > > > A cost/benefit tradeoff - did the change improve things enough to > > destabilize the design? I wasn't there, but I'll bet it didn't. > >Whether it did or didn't, that wasn't an issue. However much it did improve >things, it would only be permitted to destabilize the design if the >design were shown to be failing to meet the language's explicit goals >(e.g. if a hitherto unnoticed syntactic ambiguity were discovered). The >rationale for this was as I stated before. In practice, if you look at the set of changes that were incorporated during the grammar baseline, you will see that we did add things that were not "broken". In addition to destabilization, "relearning" was an issue that affected tradeoffs. earlier in the development, backwards compatibility with TLI Loglan was very high in priority because we believed rapprochement was inevitable. (I still believe it is inevitable, but probably not while JCB is still alive to keep his core together. Afterwards, similarity between the language versions will be useful in gaining their support, but it is hardly as important to us as it was 10 years ago. Still, eventually reuniting the Loglan community behind Lojban would greatly enhance our credibility in the auxlang world - there is a first for everything!) lojbab ---- lojbab ***NOTE NEW ADDRESS*** lojbab@lojban.org Bob LeChevalier, President, The Logical Language Group, Inc. 2904 Beau Lane, Fairfax VA 22031-1303 USA 703-385-0273 Artificial language Loglan/Lojban: see Lojban WWW Server: href=" http://xiron.pc.helsinki.fi/lojban/ " Order _The Complete Lojban Language_ - see our Web pages or ask me.