From lojban-out@lojban.org Mon Mar 31 04:38:54 2003 Return-Path: X-Sender: lojban-out@lojban.org X-Apparently-To: lojban@yahoogroups.com Received: (EGP: mail-8_2_6_5); 31 Mar 2003 12:38:54 -0000 Received: (qmail 28108 invoked from network); 31 Mar 2003 12:38:54 -0000 Received: from unknown (66.218.66.216) by m15.grp.scd.yahoo.com with QMQP; 31 Mar 2003 12:38:54 -0000 Received: from unknown (HELO digitalkingdom.org) (204.152.186.175) by mta1.grp.scd.yahoo.com with SMTP; 31 Mar 2003 12:38:54 -0000 Received: from lojban-out by digitalkingdom.org with local (Exim 4.12) id 18zyYv-0005Si-00 for lojban@yahoogroups.com; Mon, 31 Mar 2003 04:38:53 -0800 Received: from digitalkingdom.org ([204.152.186.175] helo=chain) by digitalkingdom.org with esmtp (Exim 4.12) id 18zyYm-0005SN-00; Mon, 31 Mar 2003 04:38:44 -0800 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 31 Mar 2003 04:38:42 -0800 (PST) Received: from manyas.bcc.bilkent.edu.tr ([139.179.30.24]) by digitalkingdom.org with esmtp (Exim 4.12) id 18zyYe-0005Rz-00 for lojban-list@lojban.org; Mon, 31 Mar 2003 04:38:36 -0800 Received: from localhost (localhost [127.0.0.1]) by manyas.bcc.bilkent.edu.tr (Postfix) with ESMTP id D4B3D32279 for ; Mon, 31 Mar 2003 15:38:01 +0300 (EEST) Received: from bilkent.edu.tr (neo.fen.bilkent.edu.tr [139.179.97.69]) by manyas.bcc.bilkent.edu.tr (Postfix) with ESMTP id AC22D3213D for ; Mon, 31 Mar 2003 15:38:00 +0300 (EEST) Message-ID: <3E8837D4.5090606@bilkent.edu.tr> Date: Mon, 31 Mar 2003 15:43:00 +0300 User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.3) Gecko/20030313 X-Accept-Language: en-us, en MIME-Version: 1.0 To: lojban-list@lojban.org Subject: [lojban] Re: Concordance References: <200303281308.52133.phma@webjockey.net> In-Reply-To: <200303281308.52133.phma@webjockey.net> Content-Type: text/plain; charset=us-ascii; format=flowed X-Virus-Scanned: by AMaViS snapshot-20020531 X-archive-position: 4697 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: robin@bilkent.edu.tr Precedence: bulk X-list: lojban-list X-eGroups-From: robin From: robin Reply-To: robin@bilkent.edu.tr X-Yahoo-Group-Post: member; u=116389790 X-Yahoo-Profile: lojban_out X-Yahoo-Message-Num: 19177 Pierre Abbat wrote: >There are a large IRC log and a few small files in the corpus section of the >TWiki. Do we have software that can find all occurrences of a particular >cmavo, or all fu'ivla? > >phma > > I wrote a simple Perl-CGI script that does concordances and word frequency counts - you can see it action at http://lists.bilkent.edu.tr/~robin/cgibin/concord.cgi robin.tr -- "A Perl script is "correct" if it gets the job done before your boss fires you." - Larry Wall Robin Turner IDMYO Bilkent Univeritesi Ankara 06533 Turkey www.bilkent.edu.tr/~robin