From nobody@digitalkingdom.org Mon Nov 10 23:57:29 2008 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 10 Nov 2008 23:57:29 -0800 (PST) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69) (envelope-from ) id 1Kzo7t-0003wv-LD for lojban-list-real@lojban.org; Mon, 10 Nov 2008 23:57:29 -0800 Received: from sabre-wulf.nvg.ntnu.no ([129.241.210.67]) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1Kzo7p-0003wf-QQ for lojban-list@lojban.org; Mon, 10 Nov 2008 23:57:29 -0800 Received: from hagbart.nvg.ntnu.no (unknown [IPv6:2001:700:300:2000:2a0:c9ff:feab:76e2]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by sabre-wulf.nvg.ntnu.no (Postfix) with ESMTP id 15AF194786 for ; Tue, 11 Nov 2008 08:57:10 +0100 (CET) Received: from hagbart.nvg.ntnu.no (localhost.localdomain [127.0.0.1]) by hagbart.nvg.ntnu.no (8.13.8/8.12.8) with ESMTP id mAB7vAv5019995 for ; Tue, 11 Nov 2008 08:57:10 +0100 Received: (from arj@localhost) by hagbart.nvg.ntnu.no (8.13.8/8.13.1/Submit) id mAB7vAnO019994 for lojban-list@lojban.org; Tue, 11 Nov 2008 08:57:10 +0100 Date: Tue, 11 Nov 2008 08:57:09 +0100 From: Arnt Richard Johansen To: lojban-list@lojban.org Subject: [lojban] Re: Lojban Sentence Templates Message-ID: <20081111075709.GC9019@nvg.org> References: Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.1i X-NVG-MailScanner-Information: Please contact the ISP for more information X-NVG-MailScanner: Found to be clean X-MailScanner-From: arj@nvg.ntnu.no Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from quoted-printable to 8bit by Ecartis X-Spam-Score: -0.0 X-Spam-Score-Int: 0 X-Spam-Bar: / X-archive-position: 14992 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: arj@nvg.org Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list On Mon, Nov 10, 2008 at 01:37:22PM -0500, Matt Arnold wrote: > The reason I bring this up now is that I would like to find out which > templates are the most common in the searchable corpus of Lojban > utterances, such as IRC logs. This would suggest very useful templates > for the home-game I'm building with dice, paper, and ceramics. I am > considering recording audio learning courses in which I would group > Lojban utterances (sensable ones) by template, to provide a variety of > examples of simple valid sentence structures, and relate selma'o > through substitution. Perhaps someone could intuit the most common and > useful templates, if not search it with textfile processing. With the help of #lojban, I've cobbled together a small collection of scripts that runs through the IRC logs and outputs the sequence of selma'o/word classes that are used. It's taking very long to run -- I don't expect it to complete in days -- but here are the ten most frequent templates as of now (about 10% complete): 3191 COI 2071 COI cmene 1675 UI 1290 COI PA KOhA 564 gismu 415 UI CAI 363 KOhA gismu 337 PA 332 COI gismu 324 GOhA No big surprises here. -- Arnt Richard Johansen http://arj.nvg.org/ Etter revolusjonen har jeg ordnet meg slik at jeg får meg statue. Har avtalt dette med nøkkelpersoner på venstresiden. Som takk for min innsats. Det blir en 150m høy statue i havnebassenget. skal du ha restaurant i hodet? To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.