From nobody@digitalkingdom.org Fri Jun 20 06:18:52 2008 Received: with ECARTIS (v1.0.0; list lojban-list); Fri, 20 Jun 2008 06:18:52 -0700 (PDT) Received: from nobody by chain.digitalkingdom.org with local (Exim 4.69) (envelope-from ) id 1K9gVw-0003xa-7s for lojban-list-real@lojban.org; Fri, 20 Jun 2008 06:18:52 -0700 Received: from wx-out-0506.google.com ([66.249.82.237]) by chain.digitalkingdom.org with esmtp (Exim 4.69) (envelope-from ) id 1K9gVs-0003xK-4N for lojban-list@lojban.org; Fri, 20 Jun 2008 06:18:52 -0700 Received: by wx-out-0506.google.com with SMTP id r21so393741wxc.25 for ; Fri, 20 Jun 2008 06:18:47 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:received:received:message-id:date:from:to :subject:in-reply-to:mime-version:content-type:references; bh=efgeyKkWqZheRLk5sGbURzJK5w88szPFLcCppyUZRjs=; b=h1hVK1PRWDXT7NH7POKcofra6+IuOu87d1EsFQI0KY8ebbmKh3eKVvpYEHWFU+mXb/ rij9FOMeGZMysG05kqxd3yYRrljlICzbUlTmAVUB6euADIMDG/itIvhQIC+0jZANhBqG /YMuEXSlrEEk7pdVQ9AyCr5dBMpEMcBlTbmb8= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=message-id:date:from:to:subject:in-reply-to:mime-version :content-type:references; b=EDOVVoRxHCCD+y4DgAdMMH5PDVv2/ZzO3Xgh//HF0xJCA9O9rf0MmiXk3wEsuEqg8N cfMPYDNylwKpJyHnGOei8pIOgGHUYw37vKHr8D/v566XfTPiMSnV5NpRqIkKJMyGCJXj N4JWLdW/hOuSYldge1ZMs0YpO+/Jvd2mPIVv4= Received: by 10.142.133.8 with SMTP id g8mr1178119wfd.312.1213967925382; Fri, 20 Jun 2008 06:18:45 -0700 (PDT) Received: by 10.143.41.17 with HTTP; Fri, 20 Jun 2008 06:18:45 -0700 (PDT) Message-ID: Date: Fri, 20 Jun 2008 08:18:45 -0500 From: "Marjorie Scherf" To: lojban-list@lojban.org Subject: [lojban] Re: lojgloss and linebreaks In-Reply-To: <737b61f30806192246u28840568pfb92f4a0728c123e@mail.gmail.com> MIME-Version: 1.0 Content-Type: multipart/alternative; boundary="----=_Part_12066_152196.1213967925382" References: <737b61f30806192246u28840568pfb92f4a0728c123e@mail.gmail.com> X-Spam-Score: 0.0 X-Spam-Score-Int: 0 X-Spam-Bar: / X-archive-position: 14525 X-ecartis-version: Ecartis v1.0.0 Sender: lojban-list-bounce@lojban.org Errors-to: lojban-list-bounce@lojban.org X-original-sender: skaryzgik@gmail.com Precedence: bulk Reply-to: lojban-list@lojban.org X-list: lojban-list ------=_Part_12066_152196.1213967925382 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline On Fri, Jun 20, 2008 at 12:46 AM, Chris Capel wrote: > I'm using alice as a source of stuff to translate to test my > parser/glosser. When alice starts a new HTML paragraph, the official > parser doesn't say that it's starting a new lojban paragraph. For > that, NIhO is required. Is a NIhO implied by an HTML paragraph break? > "la nicte cadzu", on the other hand, mostly starts new text paragraphs > with {ni'o}, but there are numerous exceptions which might have some > meaning. (Half a {ni'o}?) > > Conventionally, casually, I think a paragraph break in text (or a > double-linebreak in plain text) does imply a new paragraph, and I'll > probably treat it as such whether or not it's technically accurate. > But I was wondering if the convention had any formal backing. > > Chris Capel Sometimes people write like this. If someone had typed that way, for whatever pretty (or not) typesetting reasons they had, and then copy/pasted it into the parser/glosser, and that glosser puts in a {ni'o} at every double line break, then it would all get treated like separate sentences, even if, as in my example, it really is intended only to be one. It should be easy to tell from hearing something to know how to write it; and from seeing it written, how to say it; but as far as I know no one has said that a given pronunciation should have only one typesetting. People might, then, have reason to put words into what might look like paragraphs by shape, when actually they are in the same sentence. As an example, anyway. There are probably other, less extreme things that might be more reasonable and yet still would cause problems for a parser/glosser that inserted {ni'o} at double line breaks or at "paragraph breaks". .mu'omi'e .skaryzgik. -- .i ko tcesi'a la .diskord. http://skaryzgik.blogspot.com .i mi'e la poi jitro be lo jdaca'i ku'o .skaryzgik. poi raibalralju selsi'afanva ------=_Part_12066_152196.1213967925382 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Content-Disposition: inline

On Fri, Jun 20, 2008 at 12:46 AM, Chris = Capel <pdf23ds@gmail.com> wr= ote:
I'm using alice as a source of stuff to translate to test my
parser/glosser. When alice starts a new HTML paragraph, the official
parser doesn't say that it's starting a new lojban paragraph. For that, NIhO is required. Is a NIhO implied by an HTML paragraph break?
"la nicte cadzu", on the other hand, mostly starts new text parag= raphs
with {ni'o}, but there are numerous exceptions which might have some meaning. (Half a {ni'o}?)

Conventionally, casually, I think a paragraph break in text (or a
double-linebreak in plain text) does imply a new paragraph, and I'll probably treat it as such whether or not it's technically accurate.
But I was wondering if the convention had any formal backing.

Chris Capel

Sometimes

people

write
like

this.

If someone had typed that way, for whatever pret= ty (or not) typesetting reasons they had, and then copy/pasted it into the = parser/glosser, and that glosser puts in a {ni'o} at every double line = break, then it would all get treated like separate sentences, even if, as i= n my example, it really is intended only to be one. It should be easy to te= ll from hearing something to know how to write it; and from seeing it writt= en, how to say it; but as far as I know no one has said that a given pronun= ciation should have only one typesetting. People might, then, have reason t= o put words into what might look like paragraphs by shape, when actually th= ey are in the same sentence. As an example, anyway. There are probably othe= r, less extreme things that might be more reasonable and yet still would ca= use problems for a parser/glosser that inserted {ni'o} at double line b= reaks or at "paragraph breaks".

.mu'omi'e .skaryzgik.


--
.i ko tcesi= 'a la .diskord.
http://ska= ryzgik.blogspot.com
.i mi'e la poi jitro be lo jdaca'i ku= 9;o .skaryzgik. poi raibalralju selsi'afanva ------=_Part_12066_152196.1213967925382-- To unsubscribe from this list, send mail to lojban-list-request@lojban.org with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if you're really stuck, send mail to secretary@lojban.org for help.