1.4. Virtuoso FAQ

We have received various inquiries about high-end metadata stores. This FAQ goes through the most salient questions. The requested features include:

Scaling to trillions of triples
Running on clusters of commodity servers
Running in federated environments, possibly over wide-area networks
Built-in inference
Transactions
Security
Support for extra triple level metadata, such as security attributes

Questions:

1.4.1. What is the storage cost per triple?

This depends on the index scheme. With 2 indices, assuming that the graph is always specified in queries, the cost is 31 bytes per triple.

With 4 indices, which support queries where the graph is left unspecified (i.e., triples from any graph are considered in query evaluation), the cost is 39 bytes per triple. These numbers were measured with the LUBM validation data set of 121K triples, with no full-text index on literals.
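
As a rough illustration of the arithmetic, the per-triple figures above can be scaled to a larger data set. This is only a back-of-the-envelope sketch: the function name is hypothetical, and the assumption that the per-triple cost stays constant as the data grows is ours, not a measured result.

```python
# Per-triple storage figures quoted above (LUBM, no full-text index).
BYTES_PER_TRIPLE = {2: 31, 4: 39}  # number of indices -> bytes per triple


def estimate_storage_bytes(triples: int, indices: int) -> int:
    """Scale the measured per-triple cost linearly (an assumption)."""
    return triples * BYTES_PER_TRIPLE[indices]


if __name__ == "__main__":
    for indices in (2, 4):
        total = estimate_storage_bytes(1_000_000_000, indices)
        print(f"{indices} indices: about {total / 1e9:.0f} GB per billion triples")
```

Data sets with long literals or a full-text index will need considerably more space per triple than this estimate suggests.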

With 4 indices and a full-text index on all literals, the Billion Triples Challenge data set (1115M triples) occupies about 120 GB of database pages. The database file itself is larger because of reserved space and other factors. 120 GB is the number to use when assessing the RAM-to-disk ratio, i.e., how much RAM the system ought to have in order to provide good response times. This data set is a heterogeneous collection including social network data, conversations harvested from the Web, DBpedia, Freebase, etc., with relatively numerous and long text literals.
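
To put the Billion Triples Challenge figure on the same scale as the per-triple numbers, one can divide the page volume by the triple count. The sketch below assumes that "GB" means 10^9 bytes, which the text does not state; the helper name is hypothetical.

```python
def bytes_per_triple(total_bytes: float, triples: int) -> float:
    """Average storage per triple for a whole data set."""
    return total_bytes / triples


if __name__ == "__main__":
    # 120 GB of database pages for 1115M triples, taking 1 GB = 10**9 bytes.
    btc = bytes_per_triple(120e9, 1_115_000_000)
    print(f"about {btc:.0f} bytes per triple including the full-text index")
```

The result is well above the 39 bytes measured without a full-text index, which is consistent with the long and numerous text literals in this data set.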

These numbers do not involve any database page stream compression such as gzip. Such compression does not save RAM, because cached pages must be kept uncompressed, but it cuts disk usage to about half.
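
The effect of stream compression on sizing can be sketched as follows. The function and parameter names are hypothetical, and the 0.5 ratio is simply the "about half" figure from the text, not a guaranteed value.

```python
def footprint_gb(pages_gb: float, compressed: bool, ratio: float = 0.5):
    """Return (disk_gb, ram_reference_gb) for a given volume of database pages.

    Disk usage shrinks by `ratio` when pages are stream-compressed; the RAM
    reference figure does not change, since cached pages stay uncompressed.
    """
    disk_gb = pages_gb * ratio if compressed else pages_gb
    return disk_gb, pages_gb


if __name__ == "__main__":
    for compressed in (False, True):
        disk, ram_ref = footprint_gb(120, compressed)
        print(f"compressed={compressed}: disk {disk:.0f} GB, RAM reference {ram_ref:.0f} GB")
```

In other words, compression is a way to save disk space, not a way to fit a larger working set in memory.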

Prefix	IRI
n2	http://docs.openlinksw.com/virtuoso/virtuosofaq/
schema	http://schema.org/
n5	http://creativecommons.org/licenses/by/4.0/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n4	http://www.openlinksw.com/#
xsdh	http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n2:	rdf:type	schema:TechArticle
n2:	rdf:type	schema:APIReference
n2:	schema:name	1.4. Virtuoso FAQ
n2:	schema:copyrightHolder	_:vb82400
n2:	schema:datePublished	2016-09-09 16:16:54
n2:	schema:headline	1.4. Virtuoso FAQ
n2:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data
n2:	schema:license	n5:deed.en_US
n2:	schema:publisher	_:vb82399
n2:	schema:url	n2:
_:vb82399	rdf:type	schema:Organization
_:vb82399	schema:name	OpenLink Software
_:vb82399	schema:url	n4:this
_:vb82400	rdf:type	schema:Organization
_:vb82400	schema:name	OpenLink Software
_:vb82400	schema:url	n4:this
