Virtuoso can be set up to retrieve content from external web sites and host it in its own WebDAV repository via this page.
|
Target Description lets you provide a friendly description for the target that you are defining.
Target URL is the url of the web site that you are trying to retrieve content from. Only the hostname should be provided here, along with the protocol. For example http://www.myhost.com.
Login name on target is the username for accessing the remote server, if required.
Login password on target is the password for the login name above.
Copy to Local DAV collection is the name of the collection (folder) where retrieved content will be stored in Dav.
Single page download radio button specifies whether Virtuoso will retrieved linked content also.
Local Resources Owner The DAV user that will be the owner of the content that will copied to DAV.
The Download only newer than field allows you to specify a datetime value to prevent Virtuoso downloading files that are older than that datetime.
Use the Follow Links Matching field to limit the content that is downloaded by specify pattern matching criteria.
Do NOT follow links matching allows you limit content by specifying what files not to download.
Download Images radio buttons to allow Virtuoso to pull down image type also. You may want to prevent this if you are more interested in the textual content rather than bandwidth draining images.
Use WebDAV methods can be checked if the host is known to support WebDAV methods. This would enable better copying of sites that support DAV.
Delete if Remove on Remote is Detected can be switched on so that when Virtuoso synchronizes its content with that on the remote host it will check for files that have been removed on the remote and remove them from the local copy also.
Store metadata* when checked offers to be stored respectively metadata from FOAF, RDF, RSS/RDF and GRDLL data depending on which check-box is checked.
When all details have been completed press the Add (or Update if updating) button to submit the web robot task to the queue.
The "Imported Queues" page shows you a list of web copy targets that have been enlisted with the Virtuoso Server, and a list of web robot update schedules. Several options are available for each item listed: Start, Update, Schedule, Reset, and Stop. You can configure the scheduled update interval by pressing the Schedule link and entering a value in minutes. Once that is done you can start the schedule by pressing the Start link. You make a manual update of the content by pressing the Update link. You can stop the scheduled updates taking place by pressing the Stop link. To reset the details of the web copy item press the Reset link.
|
You can view a list of the links retrieved from the web copy from this page. You are also able to remove some of the content from this page by following the Edit link.
|
You can export content from the WebDAV repository. Note that you can only export content that has been retrieved using Virtuoso's Web Robot.
When you click the "Export" link for a retrieved collection, you will be presented with a form for selecting the export target location. Choose the export method: either File System or DAV by clicking the "External WebDAV Server URL" check-box. This lets you indicate to the remote target where to store the exported content. Then type the target URL to an existing location on the server. Finally press the Export button to export. A confirmation will be supplied once the operation is complete.
|
If is not checked the "External WebDAV Server URL" check-box, i.e. you are selecting the filesystem method, then you are restricted to Virtuoso targets. However WebDAV methods can be applied to any WebDAV server. WebDAV methods assume that the target is publically available for writing.
From "System Admin"/"Access Controls" you can manage Rules and ACL respectively for HTTP, News and Proxy.
|
For each of the tabs "HTTP", "NEWS", "PROXY" the created rules will be shown in a list with Filter, Access, Destination, Object, Mode, Rate values. You can also add/delete rules, re-arrange rules order.
|
Click the link "Edit" for a rule. Then specify the filter and access values.
|
From "Web Services" / "WSDL Import/Export" you can provide a URL to a WSDL description. In return Virtuoso will automatically provide a wrapper for the services available, hence stored procedures and user-defined types that are callable within Virtuoso while the processing and mechanics of the services are actually handled at the source.
|
After Virtuoso examines the supplied URL to a WSDL you are presented with the source code for the PL wrappers and Virtuoso user-defined types to be created. You have the chance to edit the code for more specific needs and then you can either save this to a file for later work, or execute it in Virtuoso to create the procedures and types.
|
Any errors in the code will be highlighted if you try and execute it.
If you wish to save the file the appropriate file system ACLs must be in place for the destination.
|
Previous
Runtime Hosting |
Chapter Contents |
Next
WebDAV Administration |