From: Philip Newton <philip.newton@gmail.com>
To: lojban-list@lojban.org
Date: Sun, 1 Aug 2004 15:57:13 +0200
Subject: [lojban] Re: jimpe: Language Processing in Lojban
Message-ID: <537d06d0040801065766539b23@mail.gmail.com>
In-Reply-To: <20040730160437.GB3982@mit.edu>

On Fri, 30 Jul 2004 12:04:37 -0400, Rob Speer wrote:
> But if anyone has suggestions on how to determine which terminators are
> elidable (for example, how does Nora's random sentence generator do it?) that
> would be really helpful for the project.

You could have a look at jbofihe; its source code is available, and it
has a mode called "show elidable terminators" which will tell you which
terminators in a given text are elidable - if several combinations are
possible, it will tell you all of them.

For example:

$ jbofihe.exe -se
broda le nu brode le brodi ku kei ku le brodo vau
^Z
The words 'ku kei ku' (from line 1 col 28 to line 1 col 35) could be safely reduced to any of these minimal patterns:
  Pattern 1 : kei
  Pattern 2 : ku ku
You could omit the word 'vau' (at line 1 col 47)
(0[broda {<(1le [nu {brode } kei] ku)1 (1le brodo)1> vau}])0

mu'o mi'e .filip.
-- 
Philip Newton
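
If you want to use that report from a program rather than read it by eye, a rough Python sketch might look like the following. It assumes only the message wording shown in the transcript above, with one message per line; other jbofihe versions may phrase things differently. The name parse_elidable_report is made up for illustration and is not part of jbofihe.

```python
import re

# These patterns are based only on the wording of the sample report in this
# message. That wording is an assumption; adjust them if your jbofihe differs.
REDUCE_RE = re.compile(
    r"The words '(?P<words>.+?)' \(from line (?P<l1>\d+) col (?P<c1>\d+) "
    r"to line (?P<l2>\d+) col (?P<c2>\d+)\)"
)
PATTERN_RE = re.compile(r"Pattern\s+\d+\s*:\s*(?P<pattern>.+)")
OMIT_RE = re.compile(
    r"You could omit the word '(?P<word>.+?)' "
    r"\(at line (?P<line>\d+) col (?P<col>\d+)\)"
)

SAMPLE_REPORT = """\
The words 'ku kei ku' (from line 1 col 28 to line 1 col 35) could be safely reduced to any of these minimal patterns:
  Pattern 1 : kei
  Pattern 2 : ku ku
You could omit the word 'vau' (at line 1 col 47)
"""


def parse_elidable_report(text):
    """Hypothetical helper: split a report into reducible runs and omittable words.

    Assumes each message sits on its own line, as in SAMPLE_REPORT.
    """
    reducible = []  # runs of terminators that can be cut down to a pattern
    omittable = []  # single terminators that can simply be dropped
    for line in text.splitlines():
        line = line.strip()
        m = REDUCE_RE.search(line)
        if m:
            reducible.append({
                "words": m.group("words").split(),
                "start": (int(m.group("l1")), int(m.group("c1"))),
                "end": (int(m.group("l2")), int(m.group("c2"))),
                "patterns": [],
            })
            continue
        m = PATTERN_RE.match(line)
        if m and reducible:
            # A "Pattern N : ..." line belongs to the most recent run.
            reducible[-1]["patterns"].append(m.group("pattern").split())
            continue
        m = OMIT_RE.search(line)
        if m:
            omittable.append({
                "word": m.group("word"),
                "pos": (int(m.group("line")), int(m.group("col"))),
            })
    return reducible, omittable


if __name__ == "__main__":
    runs, drops = parse_elidable_report(SAMPLE_REPORT)
    print(runs)
    print(drops)
```

The regular expressions match the quoted words lazily instead of stopping at the first apostrophe, since Lojban words such as ku'o contain one themselves.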