[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[lojban] Re: Grammar checking wikipedia bot.

To: lojban@yahoogroups.com
Subject: [lojban] Re: Grammar checking wikipedia bot.
From: Pierre Abbat <lojban-out@lojban.org>
Date: Tue, 29 Aug 2006 19:32:32 -0400
In-reply-to: <44F0F0AD.4090901@bommelibom.com>
References: <44F0F0AD.4090901@bommelibom.com>
Reply-to: phma@phma.optus.nu
User-agent: KMail/1.9.1

On Saturday 26 August 2006 21:09, Einar Faanes wrote:
> coi ro do
>
> I mentioned this on the irc-channel earlier today, but I think I should
> post it here as well. I have an idea which I think is possible and which
> I think should be set alive. When lojban is parseable (and we have a
> parser) we should take advantage of that by automatically check the
> lojban wikipedia for spelling errors by channeling the text through
> jbofi'e. I think that this can be done by using the wikimedia bot
> framework, which is among other things used to update and add
> interlanguage links.
>
> It should be possible to make the bot download a page, strip it of
> non-lojban elements (wikimarkup etc.), check the text and post an
> errormessage on either the articles discussion page or a centralized
> reference page. The bot is written in python. I'm no programmer, but
> know others here which are and which may find this interesting.

Sounds good, though there are a couple of things that would cause false 
errors:
1. Some words are valid fu'ivla, but jbofi'e doesn't recognize them, such as 
{srutio} (a discarded form for {strutione}, ostrich) and {largectremia} 
(crape myrtle). Also the PEG accepts some lujvo made with fu'ivla that 
jbofi'e doesn't.
2. Some tables don't parse if you just remove the markup. The prefix chart in 
[[treci'e]] is set up so that you put the cells in the row in the blanks in 
the sentence formed by the header row. If you removed the markup in a table 
in the English Wikipedia, the result wouldn't parse in English either.

phma

To unsubscribe from this list, send mail to lojban-list-request@lojban.org
with the subject unsubscribe, or go to http://www.lojban.org/lsg2/, or if
you're really stuck, send mail to secretary@lojban.org for help.

References:
- [lojban] Grammar checking wikipedia bot.
  - From: Einar Faanes <lojban-out@lojban.org>

Prev by Date: THIS LIST HAS BEEN MOVED\!
Next by Date: [lojban] jbofi'e and fu'ivla
Previous by thread: [lojban] Grammar checking wikipedia bot.
Next by thread: [lojban] Lojban card game
Index(es):
- Date
- Thread