From lojban-out@lojban.org Fri Apr 09 13:11:22 2004 Return-Path: X-Sender: lojban-out@lojban.org X-Apparently-To: lojban@yahoogroups.com Received: (qmail 8625 invoked from network); 9 Apr 2004 20:11:22 -0000 Received: from unknown (66.218.66.217) by m8.grp.scd.yahoo.com with QMQP; 9 Apr 2004 20:11:22 -0000 Received: from unknown (HELO chain.digitalkingdom.org) (64.81.49.134) by mta2.grp.scd.yahoo.com with SMTP; 9 Apr 2004 20:11:22 -0000 Received: from lojban-out by chain.digitalkingdom.org with local (Exim 4.30) id 1BC2L4-0002Ik-BM for lojban@yahoogroups.com; Fri, 09 Apr 2004 13:10:58 -0700 Received: from dsl081-049-134.sfo1.dsl.speakeasy.net ([64.81.49.134] helo=chain.digitalkingdom.org) by chain.digitalkingdom.org with esmtp (Exim 4.30) id 1BC2Jp-0002H0-9U; Fri, 09 Apr 2004 13:09:41 -0700 Received: with ECARTIS (v1.0.0; list lojban-list); Fri, 09 Apr 2004 13:09:38 -0700 (PDT) Received: from rlpowell by chain.digitalkingdom.org with local (Exim 4.30) id 1BC2Jg-0002Gk-MN for lojban-list@lojban.org; Fri, 09 Apr 2004 13:09:32 -0700 Date: Fri, 9 Apr 2004 13:09:32 -0700 Message-ID: <20040409200932.GR14789@digitalkingdom.org> Mail-Followup-To: lojban-list@lojban.org References: <20040409002858.GH14789@digitalkingdom.org> <20040409015132.GA28275@mit.edu> <200404082251.57764.phma@webjockey.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200404082251.57764.phma@webjockey.net> User-Agent: Mutt/1.5.5.1+cvs20040105i X-archive-position: 7517 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: rlpowell@digitalkingdom.org X-list: lojban-list To: lojban@yahoogroups.com X-eGroups-Remote-IP: 64.81.49.134 X-eGroups-From: Robin Lee Powell From: Robin Lee Powell Reply-To: rlpowell@digitalkingdom.org Subject: [lojban] Re: Beta Release of PEG-based Lojban parser. X-Yahoo-Group-Post: member; u=116389790 X-Yahoo-Profile: lojban_out X-Yahoo-Message-Num: 21991 On Thu, Apr 08, 2004 at 10:51:57PM -0400, Pierre Abbat wrote: > On Thu, Apr 08, 2004 at 05:28:58PM -0700, Robin Lee Powell wrote: > > I am considering an extension to allow 'si' or 'sa' at the beginning > > of text (presumably to erase stuff from the proceeding utterance). > > The morphology needs massive amounts of work, and ideally I'd like > > to get Nora and Pierre's full algorithm encoded. > > Do you mean the algorithm for breaking a speech stream into words, or > the algorithm for telling whether the words are valid? The former is > done, except for the option that always requires a pause before a > cmevla and being tested by someone else; but the latter I'm not > finished encoding myself. Both. The point would be to have a single program that can take any mixture of space-seperated and stress-added character streams and output a parse tree (assuming a successful parse is possible), including identifying what word type each word is. Doing all of the above entirely in a properly formalized language is my ultimate goal with this project[1]. > I am working, when I can find time for it, on the complete brivla > validity test. I think I got the tosmabru test working right, but it > is claiming that {lekybumlatu} and {stanybrulspa} are valid. (The > former is {le kybu mlatu} run together; I had to do Special Things for BU to work properly; it couldn't just be treated as a normal cmavo. -Robin [1]: Well, actually, my ultimate goal is to create a program that will accept mekso and output LaTeX, but I seem to have become a tad bit sidetracked. :-) -- http://www.digitalkingdom.org/~rlpowell/ *** I'm a *male* Robin. "Many philosophical problems are caused by such things as the simple inability to shut up." -- David Stove, liberally paraphrased. http://www.lojban.org/ *** loi pimlu na srana .i ti rocki morsi