Dataset Metadata Exporter

The Purpose

Semantic Turkey supports the editing and, in some cases, the automatic generation of metadata about the dataset associated with a project. Support for different metadata vocabularies is provided through a dedicated extension point, DatasetMetadataExporter, whose implementations may combine user input and profiling techniques to generate RDF metadata about the dataset.

The Model

A DatasetMetadataExporter (it.uniroma2.art.semanticturkey.extension.extpts.datasetmetadata.DatasetMetadataExporter) accepts a connection to a data repository (together with an indication of the actual data graph) and returns a Model containing the dataset metadata. The returned model may be based on actual data or on configurations/settings of the exporter.

Extension Point Details

The table below summarizes the details of the DatasetMetadataExporter extension point.

Name Dataset Metadata Exporter
Description A DatasetMetadataExporter supports the generation of project metadata based on specific metadata vocabularies. Specifically, it supports a hybrid process in which metadata may be based on manually entered information as well as on automatic profiling procedures over the project data.
Identifier it.uniroma2.art.semanticturkey.extension.extpts.datasetmetadata.DatasetMetadataExporter
Configuration n/a
Settings
Common settings for dataset metadata exporter (it.uniroma2.art.semanticturkey.extension.extpts.datasetmetadata.DatasetMetadataExporterSettings)
Scope: PROJECT
Empty

Interfaces

The following interface defines the contract for the DatasetMetadataExporter extension point.


/**
 * Extension point for dataset metadata exporters.
 */
public interface DatasetMetadataExporter extends Extension {

	Model produceDatasetMetadata(Project project, RepositoryConnection conn, IRI dataGraph)
			throws DatasetMetadataExporterException, STPropertyAccessException;
}
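The contract can be illustrated with a minimal sketch. To keep it runnable with only the JDK, the sketch replaces Model, RepositoryConnection and Project with hypothetical stand-ins (a `Triple` record, a list of triples and a settings map). The `SketchExporter` interface and the `CountingExporter` class are illustrative names and are not part of Semantic Turkey. The sketch shows the hybrid process: one statement comes from a user-supplied setting, another from a trivial profiling step over the data.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for an RDF statement; the real API returns a Model.
record Triple(String subject, String predicate, String object) {}

// Hypothetical, simplified analogue of DatasetMetadataExporter.
interface SketchExporter {
    List<Triple> produceDatasetMetadata(Map<String, String> settings, List<Triple> data);
}

// Combines a user-supplied title with a profiled triple count.
class CountingExporter implements SketchExporter {
    static final String DCT_TITLE = "http://purl.org/dc/terms/title";
    static final String VOID_TRIPLES = "http://rdfs.org/ns/void#triples";

    @Override
    public List<Triple> produceDatasetMetadata(Map<String, String> settings, List<Triple> data) {
        String dataset = settings.get("dataset_iri");
        if (dataset == null || dataset.isBlank()) {
            throw new IllegalArgumentException("Missing required setting: dataset_iri");
        }
        List<Triple> metadata = new ArrayList<>();
        // Manually entered information, read from the settings.
        String title = settings.get("dataset_title");
        if (title != null) {
            metadata.add(new Triple(dataset, DCT_TITLE, title));
        }
        // Automatic profiling: here, simply the number of statements in the data.
        metadata.add(new Triple(dataset, VOID_TRIPLES, String.valueOf(data.size())));
        return metadata;
    }
}
```

A real implementation would instead query the RepositoryConnection, restricted to the given data graph, and add the resulting statements to the returned Model.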

Choosing and Configuring a Dataset Metadata Exporter

It is assumed that a DatasetMetadataExporter can be instantiated on demand by the system without specific configuration. However, it can access project-level settings, which serve as a mechanism for supplying human-entered information (e.g. general descriptive metadata such as the title, creator and license of the dataset). These settings are stored according to the standard configuration/settings mechanism.
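Because an exporter relies on human-supplied settings, it is useful to check that the properties marked as Required are present before generating metadata. The following is a minimal sketch of such a check, assuming the settings have already been loaded into a plain map. `SettingsCheck` and `missingRequired` are hypothetical names; Semantic Turkey itself represents settings through dedicated settings classes rather than maps.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

class SettingsCheck {
    // Returns the required property names that are absent or blank, in the given order.
    static List<String> missingRequired(Map<String, String> settings, List<String> required) {
        List<String> missing = new ArrayList<>();
        for (String name : required) {
            String value = settings.get(name);
            if (value == null || value.isBlank()) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Two property names borrowed from the DCAT exporter settings, for illustration.
        List<String> required = List.of("dataset_iri", "dataset_title");
        Map<String, String> settings = Map.of("dataset_iri", "http://example.org/dataset");
        System.out.println(missingRequired(settings, required)); // prints [dataset_title]
    }
}
```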

Dataset Metadata Exporter Implementations

ADMS Dataset Metadata Exporter

Name ADMS Dataset Metadata Exporter
Description Exports dataset metadata using the ADMS vocabulary
Identifier it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.adms.ADMSDatasetMetadataExporter
Configuration n/a
Settings
ADMS Dataset Metadata Exporter Settings (it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.adms.ADMSDatasetMetadataExporterSettings)
Scope: PROJECT
asset_iri (asset_iri)
Asset IRI
java.lang.String
Required
asset_title (asset_title)
Asset Title Key
java.lang.String
Required
asset_description (asset_description)
Asset Description
java.lang.String
Required
asset_skos_label (asset_skos_label)
Asset SKOS AltLabel
java.lang.String
Optional
asset_issued (asset_issued)
Asset Issued
java.lang.String
Optional
asset_modified (asset_modified)
Asset Modified
java.lang.String
Optional
asset_keywords (asset_keywords)
Asset Keywords (comma separated)
java.lang.String
Optional
asset_versionInfo (asset_versionInfo)
Asset Version Info
java.lang.String
Optional
asset_versionsNotes (asset_versionsNotes)
Asset Version Notes
java.lang.String
Optional
asset_theme (asset_theme)
Asset Theme
java.lang.String
Optional
asset_spatial (asset_spatial)
Asset Spatial
java.lang.String
Optional
asset_contactPoint (asset_contactPoint)
Asset Contact Point
java.lang.String
Optional
asset_landingPage (asset_landingPage)
Asset Landing Pages
java.lang.String
Optional
asset_landingPage_title (asset_landingPage_title)
Asset Landing Pages - Title
java.lang.String
Optional
asset_describedBy (asset_describedBy)
Asset Described By
java.lang.String
Optional
asset_describedBy_title (asset_describedBy_title)
Asset Described By - Title
java.lang.String
Required
asset_page (asset_page)
Asset Page
java.lang.String
Optional
asset_page_title (asset_page_title)
Asset Page - Title
java.lang.String
Optional
asset_identifier (asset_identifier)
Asset Identifier
java.lang.String
Optional
asset_identifier_notation (asset_identifier_notation)
Asset Identifier - Notation (an rdfs:Literal, which should be typed)
java.lang.String
Optional
asset_identifie_creator (asset_identifie_creator)
Asset Identifier - creator
java.lang.String
Optional
asset_identifier_schemeAgency (asset_identifier_schemeAgency)
Asset Identifier - Scheme Agency
java.lang.String
Optional
asset_temporal (asset_temporal)
Asset Temporal
java.lang.String
Optional
asset_temporal_startDate (asset_temporal_startDate)
Asset Temporal - Start Date
java.lang.String
Optional
asset_temporal_endDate (asset_temporal_endDate)
Asset Temporal - End Date
java.lang.String
Optional
asset_language (asset_language)
Asset Language (if an ISO 639-1 (two-letter) code is defined for the language, then its corresponding IRI should be used; if no ISO 639-1 code is defined, then the IRI corresponding to the ISO 639-2 (three-letter) code should be used)
java.lang.String
Optional
asset_status (asset_status)
Asset Status
java.lang.String
Optional
asset_interoperability_level (asset_interoperability_level)
Asset Interoperability Level
java.lang.String
Optional
asset_type (asset_type)
Asset Type
java.lang.String
Optional
asset_includedAsset (asset_includedAsset)
Asset Included Asset
java.lang.String
Optional
asset_sample (asset_sample)
Asset sample
java.lang.String
Optional
asset_traslation (asset_traslation)
Asset Translation
java.lang.String
Optional
asset_prev (asset_prev)
Asset Prev
java.lang.String
Optional
asset_last (asset_last)
Asset Last
java.lang.String
Optional
asset_next (asset_next)
Asset Next
java.lang.String
Optional
asset_relation (asset_relation)
Asset Relation
java.lang.String
Optional
asset_distribution (asset_distribution)
Asset Distribution
java.lang.String
Optional
asset_publisher_name (asset_publisher_name)
Asset Publisher - Name
java.lang.String
Optional
asset_publisher_type (asset_publisher_type)
Asset Publisher - Type
java.lang.String
Optional
distribution_issued (distribution_issued)
Distribution Issued
java.lang.String
Optional
distribution_modified (distribution_modified)
Distribution Modified
java.lang.String
Optional
distribution_title (distribution_title)
Distribution Title
java.lang.String
Optional
distribution_description (distribution_description)
Distribution Description
java.lang.String
Optional
distribution_accessUurl (distribution_accessUurl)
Distribution Access URL
java.lang.String
Optional
distribution_downloadUrl (distribution_downloadUrl)
Distribution Download URL
java.lang.String
Optional
distribution_licence (distribution_licence)
Distribution Licence
java.lang.String
Optional
distribution_licence_title (distribution_licence_title)
Distribution Licence - Title
java.lang.String
Optional
distribution_licence_description (distribution_licence_description)
Distribution Licence - Description
java.lang.String
Optional
distribution_licence_type (distribution_licence_type)
Distribution Licence - type
java.lang.String
Optional
distribution_format (distribution_format)
Distribution Format
java.lang.String
Optional
distribution_media_type (distribution_media_type)
Distribution Media type
java.lang.String
Optional
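The rule stated for asset_language (prefer the IRI of the ISO 639-1 code, fall back to ISO 639-2) can be sketched as follows. This is only an illustration: the `LanguageIri` class is hypothetical, and the two base IRIs (the Library of Congress ISO 639 vocabularies) are an assumption, since the settings description does not name an IRI scheme. The lookup relies on the JDK's language table, which uses terminological three-letter codes (e.g. `deu` rather than `ger`) and may include legacy two-letter codes, so results for such inputs may differ from what a curated mapping would give.

```java
import java.util.Locale;
import java.util.MissingResourceException;

class LanguageIri {
    // Assumed base IRIs (Library of Congress ISO 639 vocabularies).
    static final String ISO_639_1 = "http://id.loc.gov/vocabulary/iso639-1/";
    static final String ISO_639_2 = "http://id.loc.gov/vocabulary/iso639-2/";

    // Maps a two- or three-letter language code to an IRI, preferring ISO 639-1.
    static String forCode(String code) {
        String c = code.trim().toLowerCase(Locale.ROOT);
        if (c.length() == 2) {
            return ISO_639_1 + c;
        }
        if (c.length() != 3) {
            throw new IllegalArgumentException("Expected a two- or three-letter code: " + code);
        }
        // Look for a two-letter code whose three-letter form matches the input.
        for (String two : Locale.getISOLanguages()) {
            try {
                if (Locale.forLanguageTag(two).getISO3Language().equals(c)) {
                    return ISO_639_1 + two;
                }
            } catch (MissingResourceException e) {
                // No three-letter form known for this code; keep looking.
            }
        }
        // No two-letter equivalent found: fall back to the three-letter IRI.
        return ISO_639_2 + c;
    }
}
```

For example, `LanguageIri.forCode("eng")` resolves to the ISO 639-1 IRI for `en`, while a code without a two-letter equivalent, such as `grc`, resolves to an ISO 639-2 IRI.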

DCAT Dataset Metadata Exporter

Name DCAT Dataset Metadata Exporter
Description Exports dataset metadata using the DCAT vocabulary
Identifier it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.dcat.DCATDatasetMetadataExporter
Configuration n/a
Settings
DCAT Dataset Metadata Exporter Settings (it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.dcat.DCATDatasetMetadataExporterSettings)
Scope: PROJECT
dataset_iri (dataset_iri)
Dataset IRI
java.lang.String
Required
dataset_title (dataset_title)
Dataset Title: A name given to the dataset
java.lang.String
Required
dataset_description (dataset_description)
Dataset Description: free-text account of the dataset
java.lang.String
Optional
dataset_issued (dataset_issued)
Dataset Issued: The date of listing the corresponding dataset in the catalog
java.lang.String
Optional
dataset_modified (dataset_modified)
Dataset Modified
java.lang.String
Optional
dataset_identifier (dataset_identifier)
Dataset Identifier: A unique identifier of the dataset
java.lang.String
Optional
dataset_keywords (dataset_keywords)
Dataset keywords (comma separated): A keyword or tag describing the dataset
java.lang.String
Optional
dataset_language (dataset_language)
Dataset Language: The language of the dataset
java.lang.String
Optional
dataset_contactPoint (dataset_contactPoint)
Dataset Contact Point: Link a dataset to relevant contact information which is provided using VCard
java.lang.String
Optional
dataset_temporal (dataset_temporal)
Dataset Temporal: The temporal period that the dataset covers
java.lang.String
Optional
dataset_temporal_startDate (dataset_temporal_startDate)
Dataset Temporal - Start Date
java.lang.String
Optional
dataset_temporal_endDate (dataset_temporal_endDate)
Dataset Temporal - End Date
java.lang.String
Optional
dataset_spatial (dataset_spatial)
Dataset Spatial: Spatial coverage of the dataset
java.lang.String
Optional
dataset_accrualPeriodicy (dataset_accrualPeriodicy)
Dataset Accrual Periodicity: The frequency at which the dataset is published
java.lang.String
Optional
dataset_landingPage (dataset_landingPage)
Dataset Landing Page
java.lang.String
Optional
dataset_theme (dataset_theme)
Dataset Theme (comma separated): The main category of the dataset. A dataset can have multiple themes
java.lang.String
Optional
dataset_distribution (dataset_distribution)
Dataset Distribution: Connects a dataset to its available distributions
java.lang.String
Required
distribution_title (distribution_title)
Distribution Title: A name given to the distribution
java.lang.String
Required
distribution_description (distribution_description)
Distribution Description: free-text account of the distribution
java.lang.String
Required
distribution_issued (distribution_issued)
Distribution Issued: Date of formal issuance (e.g., publication) of the distribution
java.lang.String
Optional
distribution_licence (distribution_licence)
Distribution Modified: Most recent date on which the distribution was changed, updated or modified
java.lang.String
Optional
distribution_licence_title (distribution_licence_title)
Distribution Licence - Title
java.lang.String
Optional
distribution_licence_description (distribution_licence_description)
Distribution Licence - Description
java.lang.String
Optional
distribution_licence_type (distribution_licence_type)
Distribution Licence - type
java.lang.String
Optional
distribution_rights (distribution_rights)
Distribution Rights: Information about rights held in and over the distribution
java.lang.String
Optional
distribution_accessUrl (distribution_accessUrl)
Distribution AccessURL: A landing page, feed, SPARQL endpoint or other type of resource that gives access to the distribution of the dataset
java.lang.String
Optional
distribution_downloadUrl (distribution_downloadUrl)
Distribution DownloadURL: A file that contains the distribution of the dataset in a given format
java.lang.String
Optional
distribution_mediaType (distribution_mediaType)
Distribution MediaType: The media type of the distribution as defined by IANA
java.lang.String
Optional
distribution_format (distribution_format)
Distribution Format: The file format of the distribution
java.lang.String
Optional
distribution_byteSize (distribution_byteSize)
Distribution Byte Size: The size of a distribution in bytes
java.lang.String
Optional
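To make the role of these settings more concrete, the sketch below renders a few of them as Turtle using the DCAT and Dublin Core terms `dcat:Dataset`, `dct:title` and `dcat:keyword`. It is not the exporter's actual code: the `DcatTurtleSketch` class is hypothetical, the settings are assumed to be available as a plain map, and the mapping from each setting to a property is an assumption based on the setting descriptions. It also shows how a comma-separated value such as dataset_keywords might be split into individual literals.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Objects;

class DcatTurtleSketch {
    // Escapes backslashes and double quotes for a single-line Turtle string literal.
    static String literal(String s) {
        return "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"") + "\"";
    }

    // Splits a comma-separated setting into trimmed, non-empty values.
    static List<String> splitValues(String raw) {
        List<String> values = new ArrayList<>();
        if (raw == null) {
            return values;
        }
        for (String part : raw.split(",")) {
            String v = part.trim();
            if (!v.isEmpty()) {
                values.add(v);
            }
        }
        return values;
    }

    // Renders the dataset IRI, title and keywords; the first two are required.
    static String render(Map<String, String> settings) {
        String iri = Objects.requireNonNull(settings.get("dataset_iri"), "dataset_iri");
        String title = Objects.requireNonNull(settings.get("dataset_title"), "dataset_title");
        StringBuilder ttl = new StringBuilder();
        ttl.append("@prefix dcat: <http://www.w3.org/ns/dcat#> .\n");
        ttl.append("@prefix dct: <http://purl.org/dc/terms/> .\n\n");
        ttl.append("<").append(iri).append("> a dcat:Dataset");
        ttl.append(" ;\n    dct:title ").append(literal(title));
        for (String kw : splitValues(settings.get("dataset_keywords"))) {
            ttl.append(" ;\n    dcat:keyword ").append(literal(kw));
        }
        ttl.append(" .\n");
        return ttl.toString();
    }
}
```

Building the output as a string keeps the sketch free of dependencies; an actual exporter returns a Model, so it would add statements to that model rather than concatenate text.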

DCAT-AP Dataset Metadata Exporter

Name DCAT-AP Dataset Metadata Exporter
Description Exports dataset metadata using the DCAT-AP vocabulary
Identifier it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.dcatap.DCATAPDatasetMetadataExporter
Configuration n/a
Settings
DCAT-AP Dataset Metadata Exporter Settings (it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.dcatap.DCATAPDatasetMetadataExporterSettings)
Scope: PROJECT
catalogue_iri (catalogue_iri)
Catalogue IRI
java.lang.String
Required
dataset_iri (dataset_iri)
Dataset IRI
java.lang.String
Required
catalogue_description (catalogue_description)
Catalogue - description
java.lang.String
Required
catalogue_publisherAgentIRI (catalogue_publisherAgentIRI)
Catalogue - publisher Agent IRI
java.lang.String
Required
catalogue_publisherAgentName (catalogue_publisherAgentName)
Catalogue - publisher Agent Name
java.lang.String
Required
catalogue_publisherAgentType (catalogue_publisherAgentType)
Catalogue - publisher Agent Type
java.lang.String
Optional
catalogue_title (catalogue_title)
Catalogue - title
java.lang.String
Required
catalogue_homepage (catalogue_homepage)
Catalogue - homepage
java.lang.String
Optional
catalogue_language (catalogue_language)
Catalogue - language
java.lang.String
Optional
catalogue_licence (catalogue_licence)
Catalogue - licence
java.lang.String
Optional
catalogue_releaseDate (catalogue_releaseDate)
Catalogue - release date
java.lang.String
Optional
catalogue_themes (catalogue_themes)
Catalogue - themes
java.lang.String
Optional
catalogue_modificationDate (catalogue_modificationDate)
Catalogue - modification date
java.lang.String
Optional
dataset_description (dataset_description)
Dataset - description
java.lang.String
Required
dataset_title (dataset_title)
Dataset - title
java.lang.String
Required
dataset_distributionIRI (dataset_distributionIRI)
Dataset - distribution IRI
java.lang.String
Required
dataset_distributionAccessUrl (dataset_distributionAccessUrl)
Dataset - Distribution access URL
java.lang.String
Required
dataset_distributionDescription (dataset_distributionDescription)
Dataset - Distribution description
java.lang.String
Optional
dataset_distributionFormat (dataset_distributionFormat)
Dataset - Distribution format
java.lang.String
Optional
dataset_distributionLicence (dataset_distributionLicence)
Dataset - Distribution licence
java.lang.String
Optional
dataset_keyword (dataset_keyword)
Dataset - keyword
java.lang.String
Optional
dataset_publisherAgentIRI (dataset_publisherAgentIRI)
Dataset - publisher Agent IRI
java.lang.String
Required
dataset_publisherAgentName (dataset_publisherAgentName)
Dataset - publisher Agent Name
java.lang.String
Required
dataset_publisherAgentType (dataset_publisherAgentType)
Dataset - publisher Agent Type
java.lang.String
Optional
dataset_theme (dataset_theme)
Dataset theme
java.lang.String
Optional
category_preferredLabel (category_preferredLabel)
Category preferred label
java.lang.String
Required
categoryScheme_title (categoryScheme_title)
Category Scheme title
java.lang.String
Required

VoID/LIME Dataset Metadata Exporter

Name VoID/LIME Dataset Metadata Exporter
Description Exports dataset metadata using the VoID and LIME vocabularies
Identifier it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.voidlime.VOIDLIMEDatasetMetadataExporter
Configuration n/a
Settings
VoID/LIME Dataset Metadata Exporter Settings (it.uniroma2.art.semanticturkey.extension.impl.datasetmetadata.voidlime.VOIDLIMEDatasetMetadataExporterSettings)
Scope: PROJECT
dataset_description_baseUri (dataset_description_baseUri)
Dataset Description base URI
java.lang.String
Required
dataset_localName (dataset_localName)
Dataset Local Name
java.lang.String
Required
dataset_title (dataset_title)
The name of the dataset
java.lang.String
Required
dataset_description (dataset_description)
A textual description of the dataset
java.lang.String
Required
dataset_homePage (dataset_homePage)
Dataset Home Page
java.lang.String
Optional
dataset_creators (dataset_creators)
An entity, such as a person, organisation, or service, that is primarily responsible for creating the dataset (comma-separated IRIs)
java.lang.String
Optional
dataset_publisher (dataset_publisher)
An entity, such as a person, organisation, or service, that is responsible for making the dataset available (it should be an IRI)
java.lang.String
Optional
dataset_contributors (dataset_contributors)
An entity, such as a person, organisation, or service, that is responsible for making contributions to the dataset (comma-separated IRIs)
java.lang.String
Optional
dataset_date (dataset_date)
A point or period of time associated with an event in the life-cycle of the resource. The value should be formatted and data-typed as an xsd:date
java.lang.String
Optional
dataset_created (dataset_created)
Date of creation of the dataset. The value should be formatted and data-typed as an xsd:date
java.lang.String
Optional
dataset_issued (dataset_issued)
Date of formal issuance (e.g., publication) of the dataset. The value should be formatted and datatyped as an xsd:date
java.lang.String
Optional
dataset_modified (dataset_modified)
Date on which the dataset was changed. The value should be formatted and datatyped as an xsd:date
java.lang.String
Optional
dataset_subjects (dataset_subjects)
Dataset Subjects (comma-separated IRIs)
java.lang.String
Optional
dataset_subjects (dataset_subjects)
Technical features of the dataset (comma-separated IRIs)
java.lang.String
Optional
dataset_dataDumps (dataset_dataDumps)
Dataset Data Dumps (comma-separated IRIs)
java.lang.String
Optional
dataset_sparqlEndpoint (dataset_sparqlEndpoint)
Dataset SPARQL Endpoint (it should be an IRI)
java.lang.String
Optional
dataset_uriLookupEndpoint (dataset_uriLookupEndpoint)
A protocol endpoint for simple URI lookups for a void:Dataset (it should be an IRI)
java.lang.String
Optional
dataset_openSearchDescription (dataset_openSearchDescription)
An OpenSearch description document for a free-text search service over a void:Dataset (it should be an IRI)
java.lang.String
Optional
dataset_uriSpace (dataset_uriSpace)
A URI that is a common string prefix of all the entity URIs in a void:Dataset
java.lang.String
Optional
dataset_exampleResources (dataset_exampleResources)
An example entity that is representative for the entities described in a void:Dataset (comma-separated IRIs)
java.lang.String
Optional
dataset_rootResources (dataset_rootResources)
A top concept or entry point for a void:Dataset that is structured in a tree-like fashion. (comma-separated IRIs)
java.lang.String
Optional
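Several of the settings above expect values in a specific lexical form: dates formatted as xsd:date and lists given as comma-separated IRIs. Since every setting is a plain java.lang.String, such values could be checked before use. The sketch below illustrates one way to do so with the JDK only. The `VoidSettingValues` class and its methods are hypothetical. `java.net.URI` is used as an approximation of IRI syntax, and only the xsd:date form without a time zone is handled.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.List;

class VoidSettingValues {
    static final String XSD_DATE = "http://www.w3.org/2001/XMLSchema#date";

    // Parses an ISO date (yyyy-MM-dd) and renders it as a typed N-Triples literal.
    static String xsdDateLiteral(String value) {
        // LocalDate.parse throws DateTimeParseException if the value is malformed.
        LocalDate date = LocalDate.parse(value.trim());
        return "\"" + date.format(DateTimeFormatter.ISO_LOCAL_DATE) + "\"^^<" + XSD_DATE + ">";
    }

    // Splits a comma-separated list, rejecting values that are not absolute IRIs.
    static List<URI> parseIris(String raw) {
        List<URI> iris = new ArrayList<>();
        for (String part : raw.split(",")) {
            String candidate = part.trim();
            if (candidate.isEmpty()) {
                continue;
            }
            try {
                URI uri = new URI(candidate);
                if (!uri.isAbsolute()) {
                    throw new IllegalArgumentException("Not an absolute IRI: " + candidate);
                }
                iris.add(uri);
            } catch (URISyntaxException e) {
                throw new IllegalArgumentException("Malformed IRI: " + candidate, e);
            }
        }
        return iris;
    }
}
```

Values that pass these checks can then be turned into typed literals or IRI resources when the metadata is built.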