6.4.6. Optimizing Schema for Fault Tolerance

Having the working set in memory is the single most important factor of database performance. When storing partitions in duplicate, one in principle also requires double the memory to keep adequate working set during write operations.

However, most web and data warehouse workloads are read-intensive. In this situation, the reading load can be balanced over the replicated copies. If this balancing were done at random or round robin, all copies would eventually maintain the same working set. In other words, 64G of RAM spread over two machines would behave like 32G. If the data volune is larger than memory, it makes sense to have the different replicas cache different parts of the partition they share.

Consider, using the example of cluster DUP mentioned above:

create table customer (c_id int primary key, c_name varchar, c_state varchar);
alter index customer on customer partition cluster DUP (c_id int (0hexffff0000));

create table orders (o_id int primary key, o_c_id  int, o_date datetime, o_value numeric);
alter index orders on orders partition cluster DUP (o_id int (0hexffff0000));
create index o_c_id on orders (o_c_id) partition cluster DUP (o_c_id (0hexffff0000));

This has the effect of saying that the 16 low bits of c_id or o_id do not participate in the partition hash. The hash is made from bits 32-16. Thus c_id 0-64K will be in one partition, 64K-128K in another, 128K-192K in a third and so on, these partitions are then spread by hash over the host groups listed in the create cluster.

Now, doing the join

select sum (o_value) from customer, orders where c_state = 'MA' and c_id = o_c_id;

will take o_c_id's 0-32K from the first copy of the first partition, id's 32K-64K from the second copy of the first partition, c_o_id's 64K-96K from the first copy of the second partition and so forth.

The load is split by applying range partition on the low bits of id's, so that a system with 64G split over two replicas behaves like 64G RAM for read committed reading but as 32G of RAM for writing. This is enabled by leaving low bits of id's outside of the partition hash by specifying a mask, as shown above.

Prefix	IRI
schema	http://schema.org/
n5	http://creativecommons.org/licenses/by/4.0/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n4	http://www.openlinksw.com/#
n2	http://docs.openlinksw.com/virtuoso/faultfaulttoleroptm/
xsdh	http://www.w3.org/2001/XMLSchema#

Prefix	URI
xmlns:schema	http://schema.org/
xmlns:n5	http://creativecommons.org/licenses/by/4.0/
xmlns:rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
xmlns:n4	http://www.openlinksw.com/#
xmlns:n2	http://docs.openlinksw.com/virtuoso/faultfaulttoleroptm/
xmlns:xsdh	http://www.w3.org/2001/XMLSchema#

Prefix

URI

xmlns:schema

http://schema.org/

xmlns:n5

http://creativecommons.org/licenses/by/4.0/

xmlns:rdf

http://www.w3.org/1999/02/22-rdf-syntax-ns#

xmlns:n4

http://www.openlinksw.com/#

xmlns:n2

http://docs.openlinksw.com/virtuoso/faultfaulttoleroptm/

xmlns:xsdh

http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n2:	rdf:type	schema:TechArticle
n2:	schema:name	6.4.6.ÃÂ Optimizing Schema for Fault Tolerance
n2:	schema:copyrightHolder	_:vb79006
n2:	schema:datePublished	2016-09-09 16:16:54
n2:	schema:headline	6.4.6.ÃÂ Optimizing Schema for Fault Tolerance
n2:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data
n2:	schema:license	n5:deed.en_US
n2:	schema:publisher	_:vb79005
n2:	schema:url	n2:
_:vb79005	rdf:type	schema:Organization
_:vb79005	schema:name	OpenLink Software
_:vb79005	schema:url	n4:this
_:vb79006	rdf:type	schema:Organization
_:vb79006	schema:name	OpenLink Software
_:vb79006	schema:url	n4:this

Subject

Predicate

Object

n2:

rdf:type

schema:TechArticle

n2:

schema:name

6.4.6.ÃÂ Optimizing Schema for Fault Tolerance

n2:

schema:copyrightHolder

_:vb79006

n2:

schema:datePublished

2016-09-09 16:16:54

n2:

schema:headline

6.4.6.ÃÂ Optimizing Schema for Fault Tolerance

n2:

schema:keywords

OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data

n2:

schema:license

n5:deed.en_US

n2:

schema:publisher

_:vb79005

n2:

schema:url

n2:

_:vb79005

rdf:type

schema:Organization

_:vb79005

schema:name

OpenLink Software

_:vb79005

schema:url

n4:this

_:vb79006

rdf:type

schema:Organization

_:vb79006

schema:name

OpenLink Software

_:vb79006

schema:url

n4:this

Prev	Up	Next
6.4.5. Managing Availability	Home	6.4.7. Interpreting Status Messages

6.4.6. Optimizing Schema for Fault Tolerance

Namespace Prefixes

Statements

Namespace Prefixes

Statements