6.3.5. Transactions

A Virtuoso cluster is fully transactional and supports the 4 isolation levels identically with a single server Virtuoso. Transactions are committed using single to two phase commit as may be appropriate and this is transparent to the application program.

Distributed deadlocks are detected and one of the deadlocking transactions is killed, just as with a single process.

Transactions are logged on the cluster nodes which perform updates pertaining to the transaction.

A transaction has a single owner connection. Each client connection has a distinct transaction. From the application program's viewpoint there is a single thread per transaction. Any parallelization of queries is transparent.

For roll forward recovery, each node is independent. If a transaction is found in the log for which a prepare was received but no final commit or rollback, the recovering node will ask the owner of the transaction whether the transaction did commit. Virtuoso server processes can provide this information during roll forward, hence a simultaneous restart of cluster nodes will not deadlock.

Performance Considerations

A lock wait in a clustered database requires an asynchronous notification to a monitor node. This is done so that a distributed deadlock can be detected. Thus the overhead of waiting is slightly larger than with a single process.

We recommend that read committed be set as the default isolation since this avoids most waiting. A read committed transaction will show the last committed state of rows that have exclusive locks and uncommitted state. This is set as DefaultIsolation = 2.

In the parameters section of each virtuoso.ini file.

Row Autocommit Mode

Virtuoso has a mode where insert/update/delete statements commit after each row. This is called row autocommit mode and is useful for bulk operations that need no transactional semantic.

The row autocommit mode is set by executing log_enable (2) or log_enable (3), for no logging and logging respectively. The setting stays in effect until set again or for the duration of the connection. Do not confuse this with the autocommit mode of SQL client connection.

In a clustered database the row autocommit mode is supported but it will commit at longer intervals in order to save on message latency. Statements are guaranteed to commit at least once, at the end of the statement.

A searched update or delete statement in row autocommit mode processes a few thousand keys between commits, all in a distributed transaction with 2PC. These are liable to deadlock. Since the transaction boundary is not precisely defined for the application, a row autocommit batch update must be such that one can distinguish between updated and non-updated if one must restart after a deadlock. This is of course not an issue if updating several times makes no difference to the application.

Naturally, since a row can be deleted only once, the problem does not occur with deletes. Both updates and deletes in row autocommit mode are guaranteed to keep row integrity, i.e. all index entries of one row will be in the same transaction.

A row autocommit insert sends all keys of the row at once and each commit independently. Hence, a checkpoint may for example cause a situation where one index of a row is in the checkpoint state and the other is not.

Thus, a row autocommit insert on a non-empty application table with transactional semantic is not recommended. This will be useful for bulk loads into empty tables and the like, though.

Prefix	IRI
n3	http://docs.openlinksw.com/virtuoso/clusteroperationtransc/
schema	http://schema.org/
n5	http://creativecommons.org/licenses/by/4.0/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n4	http://www.openlinksw.com/#
xsdh	http://www.w3.org/2001/XMLSchema#

Prefix	URI
xmlns:n3	http://docs.openlinksw.com/virtuoso/clusteroperationtransc/
xmlns:schema	http://schema.org/
xmlns:n5	http://creativecommons.org/licenses/by/4.0/
xmlns:rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
xmlns:n4	http://www.openlinksw.com/#
xmlns:xsdh	http://www.w3.org/2001/XMLSchema#

Prefix

URI

xmlns:n3

http://docs.openlinksw.com/virtuoso/clusteroperationtransc/

xmlns:schema

http://schema.org/

xmlns:n5

http://creativecommons.org/licenses/by/4.0/

xmlns:rdf

http://www.w3.org/1999/02/22-rdf-syntax-ns#

xmlns:n4

http://www.openlinksw.com/#

xmlns:xsdh

http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n3:	rdf:type	schema:TechArticle
n3:	schema:name	6.3.5.ÃÂ Transactions
n3:	schema:copyrightHolder	_:vb78710
n3:	schema:datePublished	2016-09-09 16:16:54
n3:	schema:headline	6.3.5.ÃÂ Transactions
n3:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data
n3:	schema:license	n5:deed.en_US
n3:	schema:publisher	_:vb78709
n3:	schema:url	n3:
_:vb78709	rdf:type	schema:Organization
_:vb78709	schema:name	OpenLink Software
_:vb78709	schema:url	n4:this
_:vb78710	rdf:type	schema:Organization
_:vb78710	schema:name	OpenLink Software
_:vb78710	schema:url	n4:this

Subject

Predicate

Object

n3:

rdf:type

schema:TechArticle

n3:

schema:name

6.3.5.ÃÂ Transactions

n3:

schema:copyrightHolder

_:vb78710

n3:

schema:datePublished

2016-09-09 16:16:54

n3:

schema:headline

6.3.5.ÃÂ Transactions

n3:

schema:keywords

OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data

n3:

schema:license

n5:deed.en_US

n3:

schema:publisher

_:vb78709

n3:

schema:url

n3:

_:vb78709

rdf:type

schema:Organization

_:vb78709

schema:name

OpenLink Software

_:vb78709

schema:url

n4:this

_:vb78710

rdf:type

schema:Organization

_:vb78710

schema:name

OpenLink Software

_:vb78710

schema:url

n4:this

Prev	Up	Next
6.3.4. Partitioning	Home	6.3.6. Administration