From nobody@digitalkingdom.org Sat Aug 13 22:13:10 2005 Received: with ECARTIS (v1.0.0; list lojban-list); Sat, 13 Aug 2005 22:13:10 -0700 (PDT) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.52) id 1E4Ant-0006vh-GS for lojban-list-real@lojban.org; Sat, 13 Aug 2005 22:13:01 -0700 Received: from phma.hn.org ([216.189.113.165] helo=ixazon.dynip.com) by chain.digitalkingdom.org with esmtp (Exim 4.52) id 1E4Anq-0006vU-EK for lojban-list@lojban.org; Sat, 13 Aug 2005 22:13:01 -0700 Received: from [192.168.25.135] (margay.ixazon.lan [192.168.25.135]) by ixazon.dynip.com (Postfix) with ESMTP id 89D0CCBFA9 for ; Sun, 14 Aug 2005 01:12:51 -0400 (EDT) Message-ID: <42FED29B.7040908@phma.hn.org> Date: Sun, 14 Aug 2005 01:11:55 -0400 From: Pierre Abbat User-Agent: Mozilla Thunderbird 1.0.5 (Windows/20050711) X-Accept-Language: en-us, en MIME-Version: 1.0 To: lojban-list@lojban.org Subject: [lojban] Re: Loglish: A Modest Proposal References: <2EA51D67-CF97-4B71-89C5-439FA19CED1A@neosynapse.net> In-Reply-To: <2EA51D67-CF97-4B71-89C5-439FA19CED1A@neosynapse.net> Content-Type: text/plain; charset=ISO-8859-1; format=flowed X-Spam-Score: -2.4 (--) X-archive-position: 10337 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: phma@phma.hn.org Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list Steven Arnold wrote: > Wordnet is a system that attempts to take a set of "core meanings" and > associate those meanings with words from different languages. It is > accessible over the Internet. I invented a language by writing a > program in Python that fetched the list of core meanings and assigned > words to them from a list. It was a very fast route to a 26,000+ word > dictionary. Granted, the dictionary needed a little data grooming -- > there were a number of words that, to me, didn't deserve a separate > term. There were also words that I wanted to make sure got shorter > words, since I expected them to be used more often. But I think the > data grooming was by far the minor portion of the task, and by using > Wordnet, I saved probably hundreds of hours of word development > compared to doing it all by hand. > > That, combined with using Markov chains for word generation, created an > excellent base language in a very short time. I'd be happy to share > the source code of these tools with anyone who is interested; email me > privately for that. That is at odds with the way we add new words to Lojban. We make compounds called "lujvo" from gismu, or we borrow words from other languages, usually one of the Big Six or biological Latin, though we have a handful of Tupi words (mandioka, markuja) and one from an Algonquian language (ckankua). The only exceptions that come to mind are {tsaparatsa'i}, which is my attempt to imitate the rhythm called "ratamacue", and {vonpaso}, which has fu'ivla form but is made from Lojban words. Most of the original gismu were made by putting together bits and pieces of words from the Big Six using a weighting algorithm. AFAIK no Lojban word was made by a Markov chain. phma To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.