16.10.3. How Does It Work?

Prev	Up	Next
16.10.2. Why is it Important?	Home	16.10.4. Installation Steps

¶

16.10.3. How Does It Work?

When an RDF aware client requests data from a network accessible resource via the Sponger the following events occur:

A requests in made for data in RDF form, and if RDF is returned nothing further happens
If RDF isn't returned, then the Sponger passes the data through a Metadata Extraction Pipeline process (using Metadata Extractors)
The extracted data is transformed to RDF via a Mapping Pipeline process (RDF is extracted by way of Ontology matching and mapping) that results in RDF Entities (instance data) generation
RDF Entities are returned to the client

The imported data forms a local cache and its invalidation rules conform to those of traditional HTTP clients (Web Browsers). Thus, expiration time is determined based on subsequent data fetches of the same resource (note: the first data load will record the 'expires' header) with current time compared to expiration time stored in the local cache. If HTTP 'expires' header data isn't returned by the source data server, then the "Sponger" will derive it's own invalidation time frame by evaluating the 'date' header and 'last-modified' HTTP headers. Irrespective of path taken, local cache invalidation is driven by an assessment of current time relative to recorded expiration time.

To manage the cache expiration, set the MinExpiration parameter in your Virtuoso.ini file.

Read full description of the parameter in the [SPARQL] ini section .

Designed with a pluggable architecture, the Sponger's core functionality is provided by Cartridges. Each cartridge includes Data Extractors which extract data from one or more data sources, and Ontology Mappers which map the extracted data to one or more ontologies/schemas, and route to producing RDF Linked Data.

The Schema Mappers are typically XSLT (e.g. GRDDL and other OpenLink Mapping Schemas) or Virtuoso PL based. The Metadata Extractors may be developed in Virtuoso PL, C/C++, Java, or any other language that can be integrated into the Virtuoso via it's server extensions APIs.

The Sponger also includes a pluggable name resolution mechanism that enables the development of Custom Resolvers for naming schemes (e.g. URNs) associated with protocols beyond HTTP. Examples of custom resolvers include:

LSID
DOI

Prev	Up	Next
16.10.2. Why is it Important?	Home	16.10.4. Installation Steps

Prefix	Namespace IRI
n3	http://docs.openlinksw.com/virtuoso/virtuosospongerworkpr/
schema	http://schema.org/
n4	http://creativecommons.org/licenses/by/4.0/deed.
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n5	http://www.openlinksw.com/#
xsdh	http://www.w3.org/2001/XMLSchema#

Namespace Prefix	Namespace URI
xmlns:n3	http://docs.openlinksw.com/virtuoso/virtuosospongerworkpr/
xmlns:schema	http://schema.org/
xmlns:n4	http://creativecommons.org/licenses/by/4.0/deed.
xmlns:rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
xmlns:n5	http://www.openlinksw.com/#
xmlns:xsdh	http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n3:	rdf:type	schema:TechArticle
n3:	rdf:type	schema:APIReference
n3:	schema:name	16.10.3.ÃÂ How Does It Work?
n3:	schema:copyrightHolder	_:vb82452
n3:	schema:datePublished	2016-09-09 16:16:54
n3:	schema:headline	16.10.3.ÃÂ How Does It Work?
n3:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data,SPARQL
n3:	schema:license	n4:en_US
n3:	schema:publisher	_:vb82451
n3:	schema:url	n3:
_:vb82451	rdf:type	schema:Organization
_:vb82451	schema:name	OpenLink Software
_:vb82451	schema:url	n5:this
_:vb82452	rdf:type	schema:Organization
_:vb82452	schema:name	OpenLink Software
_:vb82452	schema:url	n5:this