On Thu, Jan 06, 2011 at 07:09:25PM -0500, Brian D. Eubanks wrote:
This is related to what I've wanted to do with the jorne.org project.
I am definitely interested in being a part of the discussion. No time
to go to Penguicon, but I am willing to chat with the group.
Reports of Semantic Web's death have been greatly exaggerated, and I
can assure you that it is quite alive and well. Early adopters include
Best Buy, US Library of Congress, Reuters, US Census, US DoD, EU
governments, Google (Rich Snippets), and biomedical researchers, to
mention only a few who are using and publishing Linked Data and are
willing to put serious time and money into it. Perhaps Robin meant
something different by his use of "Semantic Web", but a large part of
Tim Berners-Lee's original vision is being implemented now in a big way.
Lojban is unique among languages in that the entire Lojban corpus
could become a part of the Linked Data "cloud" (see linkeddata.org).
To make this happen, you would:
1. parse text into predicates (moderately hard but doable)
2. convert predicates to RDF (easy enough once you have the
predicates, mostly this involves defining a standard URI/ID for each
primitive)
3. publish RDF/OWL correlations between each Lojban primitive and
dbpedia, wordnet, and other linked data sets (this is a manual process
done once for each Lojban primitive).
I posted something about this before, but I do have a very rudimentary
web service that takes Lojban text and returns the parse tree as XML
(using Robin's PEG parser). I haven't had time to work much on it, but
I started to look at using Jena (a Java RDF API) to produce RDF as
output instead. I would need help with converting the tree into
predicates, since my Lojban ability is limited.
See
http://groups.google.com/group/lojban/browse_thread/thread/b39f94b183cf344f
and
http://groups.google.com/group/lojban/browse_thread/thread/7e48f11c43f62b63/68095f3ae30b02dd
The web service is at http://jorne.org/form.jsp
Try pasting in some short text examples and let me know what you think.
You know, I remember now seeing this project the first time you
posted about it, I'm not sure why I didn't say anything at the time.
I've added a link to your project on the sidebar to my website. See
roughly in the middle of "Lojban Resources":
http://lodockikumazvati.org/
One of our summer researchers recently gave a presentation on the
semantic web, focusing on DBpedia[1], which extracts formatted
information from wikipedia (like sidebar tables) and allows
structured access to that data (using SPARQL and I'm sure other
query methods). [Brian of course knows about DBpedia and the
related technology already.]
That was a pretty fascinating talk, In that "We could use two or
three of these features right now to make our product better" kind
of way.
I note, reading your referenced threads, that you said:
Anyone with any background in RDF or Lojban is welcome to join the
project. I particularly need help with mapping the Lojban gismu to
synsets in English and other Wordnets. That would bring immediate
benefit in being able to generate simple lojban sentences from
dbpedia and other linked data content. This part could done without
any knowledge in RDF. Drop me a line if you are interested in mapping
some gismu.
I'd love to be wrong, but I'd wager you didn't get any responses to
this request. I think it is too large, uninteresting, or abstract.
I wonder what would happen if you said "I'd love it if one of you
could take all the gismu referring to farm animals and give me the
link X and Y from dataset Z for them." Substituting farm animals
for whatever topic seems reasonably interesting and repeating as
necessary. The idea being to make a focused, specific request that
someone could just go do. There are definitely subjects for which
I'm a close enough domain expert to do this work, while for others
I'd just rather someone else with more motivation work on them.
Are you interested in experimenting with this kind of crowdsourcing
to build this data? Is that your next step? Oren showed a heck of
a lot of interest in your project, it took a non-zero amount of time
to give such a thoughtful reply!
How was the Semantic Web conference in June? Do we get to see the
source code?
-Alan
1: http://dbpedia.org/
--
.i ko djuno fi le do sevzi
--
You received this message because you are subscribed to the Google
Groups "lojban" group.
To post to this group, send email to lojban@googlegroups.com.
To unsubscribe from this group, send email to
lojban+unsubscribe@googlegroups.com.
For more options, visit this group at
http://groups.google.com/group/lojban?hl=en.