Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

Andy Seaborne Mon, 08 Nov 2010 15:15:23 -0800


On 08/11/10 18:32, Jeremy Carroll wrote:

To make the commons discussion more concrete I would suggest the
following items for the commons:

- an IRI library
- some code to do with vocabularies.
- connecting to a URL and doing semweb aware content negotiation (this
is typically done badly)

(Actually the IRI code should probably be wider, Jena initially used the
xerces URI code but then the needs exceeded what they supported)

Jeremy

Good idea. The IRI code is independent of the rest of Jena and isvaluable in it's own right.

ARP (Jena RDF/XML parser) is also independent of the Jena code structureand once was (is it still possible to get just ARP?). It's just thefinal step of generation that turns the output of parsing intoJena-specific objects. Might be worth splitting out if it would be useful.

The lowest level of RIOT parsing, which defines the tokens for creatingany of the Turtle family of langauges, is not Jena dependent. Theactual RIOT parsers themselves are as they directly generateJena-specific objects to avoid the copy overhead. It's a performancetrade-off.

[RIOT is a set of faster parsers for non-XML serializations of RDF,currently part of ARQ, but should migrate to Jena core when fullystable. - original need was parsers for formats capable of delivering tothe TDB database at full loading speed without heavy CPU load.]

But the command line tools based on RIOT which parse or validate oneformat are reusable - they use Jena internally, but the input and outputare completely standard.


The RDF validator Eyeball is also a useful tool in its own right.

        Andy


---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

Reply via email to