20.8. Internationalization & Unicode

The text being indexed and the text query expression may both be wide strings. The word boundaries used to cut the text in words in both queries and index maintenance may depend on a language declared for the text index.

The default language has white space and punctuation as word delimiters and will recognize Unicode ideographic characters as self standing. A single non-ideographic character will always be considered noise and not indexed.

Non-ASCII Unicode values are converted to UTF8 before being stored into the word table as narrow strings. Narrow 8 bit strings are stored in the words table as is.

	See Also:
	The LANGUAGE option in CREATE TEXT INDEX .

Prefix	IRI
n3	http://docs.openlinksw.com/virtuoso/ftinternationalization/
schema	http://schema.org/
n4	http://creativecommons.org/licenses/by/4.0/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
n5	http://www.openlinksw.com/#
xsdh	http://www.w3.org/2001/XMLSchema#

Prefix	URI
xmlns:n3	http://docs.openlinksw.com/virtuoso/ftinternationalization/
xmlns:schema	http://schema.org/
xmlns:n4	http://creativecommons.org/licenses/by/4.0/
xmlns:rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
xmlns:n5	http://www.openlinksw.com/#
xmlns:xsdh	http://www.w3.org/2001/XMLSchema#

Prefix

URI

xmlns:n3

http://docs.openlinksw.com/virtuoso/ftinternationalization/

xmlns:schema

http://schema.org/

xmlns:n4

http://creativecommons.org/licenses/by/4.0/

xmlns:rdf

http://www.w3.org/1999/02/22-rdf-syntax-ns#

xmlns:n5

http://www.openlinksw.com/#

xmlns:xsdh

http://www.w3.org/2001/XMLSchema#

Subject	Predicate	Object
n3:	rdf:type	schema:TechArticle
n3:	rdf:type	schema:APIReference
n3:	schema:name	20.8.ÃÂ Internationalization & Unicode
n3:	schema:copyrightHolder	_:vb80642
n3:	schema:datePublished	2016-09-09 16:16:54
n3:	schema:headline	20.8.ÃÂ Internationalization & Unicode
n3:	schema:keywords	OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data
n3:	schema:license	n4:deed.en_US
n3:	schema:publisher	_:vb80641
n3:	schema:url	n3:
_:vb80641	rdf:type	schema:Organization
_:vb80641	schema:name	OpenLink Software
_:vb80641	schema:url	n5:this
_:vb80642	rdf:type	schema:Organization
_:vb80642	schema:name	OpenLink Software
_:vb80642	schema:url	n5:this

Subject

Predicate

Object

n3:

rdf:type

schema:TechArticle

n3:

rdf:type

schema:APIReference

n3:

schema:name

20.8.ÃÂ Internationalization & Unicode

n3:

schema:copyrightHolder

_:vb80642

n3:

schema:datePublished

2016-09-09 16:16:54

n3:

schema:headline

20.8.ÃÂ Internationalization & Unicode

n3:

schema:keywords

OpenLink,Virtuoso,database,RDBMS,relational,SQL,RDF,triple store,linked data,linked open data,Big Data

n3:

schema:license

n4:deed.en_US

n3:

schema:publisher

_:vb80641

n3:

schema:url

n3:

_:vb80641

rdf:type

schema:Organization

_:vb80641

schema:name

OpenLink Software

_:vb80641

schema:url

n5:this

_:vb80642

rdf:type

schema:Organization

_:vb80642

schema:name

OpenLink Software

_:vb80642

schema:url

n5:this

Prev	Up	Next
20.7. Removing A Text Trigger	Home	20.9. Performance

20.8. Internationalization & Unicode

Namespace Prefixes

Statements

Namespace Prefixes

Statements