1.4.11. How many triples can a single server handle?

With free-form data and text indexing enabled, 500M triples per 16 GB of RAM is a reasonable ballpark guideline. If the triples are very short and repetitive, as in the LUBM test data, then one billion triples per 16 GB is possible. Much depends on the expected query load. If queries are simple lookups, less memory per billion triples is needed. If queries are complex (analytics, join sequences, and aggregations over the whole data set), relatively more RAM is necessary for good performance.
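The guideline figures above can be turned into a rough sizing calculation. The following is a minimal sketch, not an official sizing tool: the function name `estimate_ram_gb`, the profile names, and the assumption that memory scales linearly with triple count are all illustrative. It ignores query load, which the text notes can shift the requirement in either direction.

```python
# Ballpark figures taken from the text above; treat them as rough guidelines only.
TRIPLES_PER_16GB = {
    "free_form_text_indexed": 500_000_000,  # free-form data, text indexing enabled
    "short_repetitive": 1_000_000_000,      # very short, repetitive triples (LUBM-like)
}


def estimate_ram_gb(triple_count: int, profile: str = "free_form_text_indexed") -> float:
    """Scale the 16 GB guideline linearly to triple_count (linearity is an assumption)."""
    if triple_count < 0:
        raise ValueError("triple_count must be non-negative")
    per_16gb = TRIPLES_PER_16GB[profile]
    return 16.0 * triple_count / per_16gb


if __name__ == "__main__":
    # Compare both profiles for a hypothetical two-billion-triple data set.
    for profile in TRIPLES_PER_16GB:
        print(profile, estimate_ram_gb(2_000_000_000, profile), "GB")
```

Under these assumptions, two billion free-form triples would call for about 64 GB, and two billion LUBM-like triples for about 32 GB.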

The count of quads has little impact on performance as long as the working set fits in memory. If the working set is in memory, there may be a 15-20% difference between a million and a billion triples. If the database must frequently go to disk, performance degrades, since one can easily do 2000 random accesses in memory in the time it takes to do one random access from disk. Working-set characteristics, however, depend entirely on the application.
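To see why the working set matters so much, the 2000-to-1 figure can be plugged into a simple weighted average. This is a sketch under stated assumptions: the function name `relative_access_cost` and the idea of a single uniform hit ratio are illustrative simplifications, and the 2000 ratio is the rough figure from the text rather than a measured value.

```python
# From the text: roughly 2000 in-memory random accesses per one random disk access.
DISK_TO_MEMORY_RATIO = 2000.0


def relative_access_cost(hit_ratio: float, disk_ratio: float = DISK_TO_MEMORY_RATIO) -> float:
    """Average cost of one random access, in units of a single in-memory access.

    hit_ratio is the fraction of accesses served from memory (0.0 to 1.0).
    """
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    return hit_ratio * 1.0 + (1.0 - hit_ratio) * disk_ratio


if __name__ == "__main__":
    # Show how quickly the average cost grows as more accesses miss memory.
    for hit_ratio in (1.0, 0.999, 0.99, 0.9):
        print(hit_ratio, round(relative_access_cost(hit_ratio), 2))
```

In this simplified model, even a 1% miss rate makes the average random access about 21 times more expensive than a purely in-memory workload.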

It makes no difference whether the quads in a store all belong to one graph or are spread over any number of graphs. There are Virtuoso instances in regular online use with hundreds of millions of triples, such as DBpedia and the Neurocommons databases.

Prefix	IRI
schema	http://schema.org/
n5	http://creativecommons.org/licenses/by/4.0/
n2	http://docs.openlinksw.com/virtuoso/virtuosofaq11/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n4	http://www.openlinksw.com/#
xsdh	http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n2:	rdf:type	schema:TechArticle
n2:	rdf:type	schema:APIReference
n2:	schema:name	1.4.11. How many triples can a single server handle?
n2:	schema:copyrightHolder	_:vb82334
n2:	schema:datePublished	2016-09-09 16:16:54
n2:	schema:headline	1.4.11. How many triples can a single server handle?
n2:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data
n2:	schema:license	n5:deed.en_US
n2:	schema:publisher	_:vb82333
n2:	schema:url	n2:
_:vb82333	rdf:type	schema:Organization
_:vb82333	schema:name	OpenLink Software
_:vb82333	schema:url	n4:this
_:vb82334	rdf:type	schema:Organization
_:vb82334	schema:name	OpenLink Software
_:vb82334	schema:url	n4:this
