From: adam@wustl.edu
To: lojban-list@lojban.org
Date: Fri, 20 Jun 2008 09:28:17 -0500 (CDT)
Subject: [lojban] Re: lojgloss extraneous characters
In-Reply-To: <737b61f30806180402g18ece640m43e0d18642c2280e@mail.gmail.com>

On Wed, 18 Jun 2008, Chris Capel wrote:

> How should I handle non-lojban characters? I can strip out some that
> should be superfluous, and others that I can't make any sense out of,
> but should I translate some characters into cmavo? I'm thinking of
> parentheses, braces, and brackets here.
> I could translate parentheses
> into to-toi, and square brackets into sei-se'u. Any other ideas?
>
> Should I just ignore any random non-lojban characters?

I would probably translate 0..9 to qw(no pa re ci vo mu xa ze bi so), and
probably a . between digits to pi. Others get trickier. A ; in numbers is
usually pi'e. / and - in dates are pi'e, but - is also ni'u, va'a, and
vu'u, while + is ma'u and su'i.

Most of the time when I've seen punctuation other than that, it appears
alongside the Lojban word in question, like "(to ... toi)" and quotes
along with lu/li'u. I think second-guessing those would probably be more
trouble than it's worth.

-- 
Adam Lopresto
System Administrator
Engineering IT, Washington University
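
For illustration only, here is a minimal Python sketch of the digit rule suggested in the reply: each of 0..9 becomes its cmavo, and a "." counts as pi only when it sits between digits. The names DIGIT_CMAVO, NUMBER_RE, and transliterate_numbers are hypothetical and are not part of lojgloss; the ambiguous characters (;, /, -, +) are deliberately left untouched.

```python
import re

# Digit-to-cmavo table from the message: 0..9 -> no pa re ci vo mu xa ze bi so.
DIGIT_CMAVO = ["no", "pa", "re", "ci", "vo", "mu", "xa", "ze", "bi", "so"]

# A run of ASCII digits, optionally containing "." separators, where every
# "." has a digit on both sides. A trailing or leading "." is not matched.
NUMBER_RE = re.compile(r"[0-9]+(?:\.[0-9]+)*")


def number_to_cmavo(match):
    """Spell out one matched number as space-separated cmavo."""
    words = []
    for ch in match.group(0):
        # Inside a match, "." is always between digits, so it maps to pi.
        words.append("pi" if ch == "." else DIGIT_CMAVO[int(ch)])
    return " ".join(words)


def transliterate_numbers(text):
    """Replace digit runs in text with cmavo; leave everything else as-is."""
    return NUMBER_RE.sub(number_to_cmavo, text)
```

With this sketch, transliterate_numbers("li 3.14") returns "li ci pi pa vo", while the final "." in "li 5." is left alone because no digit follows it.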