[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [lojban] proposed grammar definition for ZOhOI



Around here I call "transforming the shorthand to the words they
represent" parsing.  In other places it is called having a hammer,
you take your pick.  ;-)

-Alan

On Wed, Jan 26, 2011 at 10:18:52PM -0500, Alex Rozenshteyn wrote:
>    Regarding the response to the third point, couldn't you just have an
>    exhaustive list of the shorthand transformations (wouldn't you need one
>    anyway) and preprocess the text, transforming the shorthand to the words
>    they represent?
> 
>    I feel like the biggest problem is not that of parsing.
> 
>    On Wed, Jan 26, 2011 at 9:36 PM, .alyn.post.
>    <[1]alyn.post@lodockikumazvati.org> wrote:
> 
>      On Wed, Jan 26, 2011 at 08:18:02PM -0500, Alex Rozenshteyn wrote:
>      > pe'i there should be 3 ways of writing lojban:
>      >
> 
>      After a brief brainstorm, I could support these three modes in my
>      parser should that be desireable.
> 
>      > 1. Strict: the only characters allowed (barring alphabet shifts) are
>      > lojban characters.
> 
>      The PEG grammar currently allows digits and some punctuation. I'd
>      need to add an immediate rule when these productions are matched to
>      reject those productions if strict mode was enable and forbidden
>      characters appear.
> 
>      > 2. Visually mnemonic: characters such as quotation marks and
>      parentheses
>      > etc. are allowed to make skimming the text easier; there is no need to
>      > standardize (although suggestions might be welcome) what means what
>      > because the characters will be ignored (treated as whitespace) by the
>      > parser, and so every spoken syllable will still need to be spelled
>      > out.
> 
>      This is how the PEG grammar works now. I believe my parser allows
>      more punctuation than camxes, which is a trivial fix should that be
>      a problem.
> 
>      > 3. Visual shorthand: It will develop anyway, so it's best to
>      standardize
>      > it. e.g. {xu} can be *replaced* by a question mark, {to} and {to'o}
>      > might be *replaced* by left and right parentheses, etc. It would make
>      > sense to speak of {xubu}, the grapheme representing the cmavo {xu}
>      >
> 
>      This would require defining what this visual shorthand was and
>      modifying any rule affected. It would also require not permitting
>      the defined shorthand punctuation to be whitespace.
>      -Alan
>      --
>      .i ko djuno fi le do sevzi
> 
>      --
>      You received this message because you are subscribed to the Google
>      Groups "lojban" group.
>      To post to this group, send email to [2]lojban@googlegroups.com.
>      To unsubscribe from this group, send email to
>      [3]lojban+unsubscribe@googlegroups.com.
>      For more options, visit this group at
>      [4]http://groups.google.com/group/lojban?hl=en.
> 
>    --
>    Alex R
> 
>    --
>    You received this message because you are subscribed to the Google Groups
>    "lojban" group.
>    To post to this group, send email to lojban@googlegroups.com.
>    To unsubscribe from this group, send email to
>    lojban+unsubscribe@googlegroups.com.
>    For more options, visit this group at
>    http://groups.google.com/group/lojban?hl=en.
> 
> References
> 
>    Visible links
>    1. mailto:alyn.post@lodockikumazvati.org
>    2. mailto:lojban@googlegroups.com
>    3. mailto:lojban%2Bunsubscribe@googlegroups.com
>    4. http://groups.google.com/group/lojban?hl=en

-- 
.i ko djuno fi le do sevzi

-- 
You received this message because you are subscribed to the Google Groups "lojban" group.
To post to this group, send email to lojban@googlegroups.com.
To unsubscribe from this group, send email to lojban+unsubscribe@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/lojban?hl=en.