6.3.7. Cluster Network Diagnostics and Metrics

Proper cluster operation requires that each process in the cluster be capable of initiating a connection to any other process. This may be prevented by firewall settings or the like. If a connection can be initiated from host 1 to host 2, it does not follow that host 2 can initiate a connection to host 1. These situations can lead to intermittent errors. These errors can be difficult to pinpoint since operations from host 2 to host 1 can work for most of the time if there is a connection available that was already established by the other host.

To check point to point connectivity, do the following on each host in turn, with no other activity on the cluster:

SQL> cl_reset ();
SQL status ('cluster');
SQL status ('cluster');

The first status ('cluster') may show no samples if this is the first time it is called. At the second call you should see a status line that does not contain mentions of any host being down.

The cl_reset function disconnects any connections to other cluster hosts from this host. This makes sure that a fresh connection will be started for the status command.

net_meter utility

The net_meter utility is a SQL stored procedure that measures the aggregate throughput of a cluster network with different types of workload.

First load the netmeter.sql file on the master node of the cluster.

SQL> load netmeter.sql;

Then run

SQL> net_meter (1, 1000, 1000, 1);

This returns a single result row with two numbers: The count of round trips per second and the throughput in megabytes per second.

net_meter ( in n_threads int,
            in n_batches int,
            in bytes int,
            in ops_per_batch int)

This SQL procedure runs a network test procedure on every host of the cluster. The network test procedure sends a message to every other host of the cluster and waits for the replies from each host. After the last reply is received the action is repeated. This results in a symmetrical load of the network, all points acting as both clients and servers to all other points.

The parameters have the following meaning:

n_threads

- The number of network test instances started on each host. A value of 4 on a cluster of 4 hosts would result in a total of 16 network test procedures spread over 4 processes.
n_batches

- The number of message exchanges done by each network test procedure. A message exchange consists of sending one request to every other host of the cluster and of waiting for all to have replied.
bytes

- The number of bytes sent to each host in each message exchange. The reply from each host has the same number of bytes.
ops_per_batch

- This causes each message batch to contain several operations. In practice this is a multiplier on the number of bytes.

cl_ping

cl_ping ( in target_host int,
          in n_pings int,
          in bytes_per_ping int)

This built-in function measures raw point to point network throughput. Whereas net_meter includes a more complex n to n point traffic pattern and scheduling of functions on multiple threads, cl_ping does not involve anything except a process to process connection and no thread switching, transaction contexts or other overhead.

cl_io_report

This built in function prints out a summary of the cluster connections of the host on which it is run. The output goes to the server process' standard output. This lists the bytes in and out as well as the file descriptor numbers of any connections this host has with any other host.

Prefix	IRI
n3	http://docs.openlinksw.com/virtuoso/clusteroperationdiagnostics/
schema	http://schema.org/
n5	http://creativecommons.org/licenses/by/4.0/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n4	http://www.openlinksw.com/#
xsdh	http://www.w3.org/2001/XMLSchema#

Prefix	URI
xmlns:n3	http://docs.openlinksw.com/virtuoso/clusteroperationdiagnostics/
xmlns:schema	http://schema.org/
xmlns:n5	http://creativecommons.org/licenses/by/4.0/
xmlns:rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
xmlns:n4	http://www.openlinksw.com/#
xmlns:xsdh	http://www.w3.org/2001/XMLSchema#

Prefix

URI

xmlns:n3

http://docs.openlinksw.com/virtuoso/clusteroperationdiagnostics/

xmlns:schema

http://schema.org/

xmlns:n5

http://creativecommons.org/licenses/by/4.0/

xmlns:rdf

http://www.w3.org/1999/02/22-rdf-syntax-ns#

xmlns:n4

http://www.openlinksw.com/#

xmlns:xsdh

http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n3:	rdf:type	schema:TechArticle
n3:	schema:name	6.3.7.ÃÂ Cluster Network Diagnostics and Metrics
n3:	schema:copyrightHolder	_:vb78698
n3:	schema:datePublished	2016-09-09 16:16:54
n3:	schema:headline	6.3.7.ÃÂ Cluster Network Diagnostics and Metrics
n3:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data
n3:	schema:license	n5:deed.en_US
n3:	schema:publisher	_:vb78697
n3:	schema:url	n3:
_:vb78697	rdf:type	schema:Organization
_:vb78697	schema:name	OpenLink Software
_:vb78697	schema:url	n4:this
_:vb78698	rdf:type	schema:Organization
_:vb78698	schema:name	OpenLink Software
_:vb78698	schema:url	n4:this

Subject

Predicate

Object

n3:

rdf:type

schema:TechArticle

n3:

schema:name

6.3.7.ÃÂ Cluster Network Diagnostics and Metrics

n3:

schema:copyrightHolder

_:vb78698

n3:

schema:datePublished

2016-09-09 16:16:54

n3:

schema:headline

6.3.7.ÃÂ Cluster Network Diagnostics and Metrics

n3:

schema:keywords

OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data

n3:

schema:license

n5:deed.en_US

n3:

schema:publisher

_:vb78697

n3:

schema:url

n3:

_:vb78697

rdf:type

schema:Organization

_:vb78697

schema:name

OpenLink Software

_:vb78697

schema:url

n4:this

_:vb78698

rdf:type

schema:Organization

_:vb78698

schema:name

OpenLink Software

_:vb78698

schema:url

n4:this

Prev	Up	Next
6.3.6. Administration	Home	6.3.8. Elastic Cluster Operations