From robin@bilkent.edu.tr Mon Mar 31 04:38:42 2003 Received: with ECARTIS (v1.0.0; list lojban-list); Mon, 31 Mar 2003 04:38:42 -0800 (PST) Received: from manyas.bcc.bilkent.edu.tr ([139.179.30.24]) by digitalkingdom.org with esmtp (Exim 4.12) id 18zyYe-0005Rz-00 for lojban-list@lojban.org; Mon, 31 Mar 2003 04:38:36 -0800 Received: from localhost (localhost [127.0.0.1]) by manyas.bcc.bilkent.edu.tr (Postfix) with ESMTP id D4B3D32279 for ; Mon, 31 Mar 2003 15:38:01 +0300 (EEST) Received: from bilkent.edu.tr (neo.fen.bilkent.edu.tr [139.179.97.69]) by manyas.bcc.bilkent.edu.tr (Postfix) with ESMTP id AC22D3213D for ; Mon, 31 Mar 2003 15:38:00 +0300 (EEST) Message-ID: <3E8837D4.5090606@bilkent.edu.tr> Date: Mon, 31 Mar 2003 15:43:00 +0300 From: robin User-Agent: Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.3) Gecko/20030313 X-Accept-Language: en-us, en MIME-Version: 1.0 To: lojban-list@lojban.org Subject: [lojban] Re: Concordance References: <200303281308.52133.phma@webjockey.net> In-Reply-To: <200303281308.52133.phma@webjockey.net> Content-Type: text/plain; charset=us-ascii; format=flowed X-Virus-Scanned: by AMaViS snapshot-20020531 X-archive-position: 4697 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: robin@bilkent.edu.tr Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list Pierre Abbat wrote: >There are a large IRC log and a few small files in the corpus section of the >TWiki. Do we have software that can find all occurrences of a particular >cmavo, or all fu'ivla? > >phma > > I wrote a simple Perl-CGI script that does concordances and word frequency counts - you can see it action at http://lists.bilkent.edu.tr/~robin/cgibin/concord.cgi robin.tr -- "A Perl script is "correct" if it gets the job done before your boss fires you." - Larry Wall Robin Turner IDMYO Bilkent Univeritesi Ankara 06533 Turkey www.bilkent.edu.tr/~robin