SUNCAT: exploring open metadata
GoGeo Linked Data (GOLD)
Six months: February – July 2011
GOLD: Scottish Government
Conventional approaches to building Spatial Data Infrastructures (SDIs) presuppose some form of 'discovery' service that provides potential users with metadata describing available datasets and services, in order to expedite access and to minimise the potential for duplication of effort. The widely adopted Open Geospatial Consortium (OGC) Catalogue Services for the Web (CSW) specification defines a set of abstract interfaces and a protocol binding for the discovery, access, maintenance and organisation of metadata repositories of geospatial information and related resources. CSW underpins the UK Location Programme's metadata infrastructure, and examples are found in the UK academic geoportal, GoGeo, and the Scottish Government Spatial Discovery Portal.
However, CSW is arguably ill-suited to RESTful-style geo-mashup APIs and the emerging linked data publishing paradigm. Essentially, there is a tension between the 'popular', less formal style of discovery service and the more 'formal' style adopted by CSW.
The work will expose the CSW content of the GoGeo Catalogue and the Scottish Government Spatial Discovery Portal as a Linked Data triple store, with the objective of improving the 'discoverability' of metadata records and the 'openness' of the catalogues. A range of technical approaches will be reviewed (schema cross-walking, direct production from relational databases, and a UML-to-RDF approach), and open source software will be produced.
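As an illustration of the schema cross-walking approach only, the following minimal sketch maps a Dublin Core style catalogue record onto N-Triples. It is not the project's software: the `CROSSWALK` table, the `record_to_ntriples` function, the shape of the input record and the `http://example.org/` base URI are all assumptions made for the example.

```python
from urllib.parse import quote

# Hypothetical crosswalk: catalogue element name -> RDF predicate URI.
# The choice of Dublin Core Terms predicates is an assumption for this sketch.
DCT = "http://purl.org/dc/terms/"
CROSSWALK = {
    "dc:title": DCT + "title",
    "dc:subject": DCT + "subject",
    "dct:abstract": DCT + "abstract",
    "dc:identifier": DCT + "identifier",
}


def escape_literal(value):
    """Escape the characters that N-Triples requires escaping in a literal."""
    return (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def record_to_ntriples(record, base_uri):
    """Turn one metadata record (a dict) into a list of N-Triples lines.

    The record identifier is percent-encoded and appended to base_uri to
    mint the subject URI. Fields missing from CROSSWALK are ignored.
    """
    subject = base_uri + quote(record["dc:identifier"], safe="")
    triples = []
    for field, predicate in CROSSWALK.items():
        values = record.get(field, [])
        if isinstance(values, str):
            values = [values]  # allow single-valued fields
        for value in values:
            triples.append(
                f'<{subject}> <{predicate}> "{escape_literal(value)}" .'
            )
    return triples


if __name__ == "__main__":
    # Invented sample record, for demonstration only.
    sample = {
        "dc:identifier": "abc 123",
        "dc:title": "Land cover map",
        "dc:subject": ["land cover", "Scotland"],
    }
    for line in record_to_ntriples(sample, "http://example.org/id/record/"):
        print(line)
```

Keeping the mapping in a plain lookup table means the crosswalk itself can be reviewed and revised separately from the serialisation code.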
SUNCAT includes serials information from 79 contributing libraries. In addition, it contains records from the CONSER database, the ISSN Register and the Directory of Open Access Journals (DOAJ). A freely accessible Z39.50 connection supports searching of the full catalogue, but it does not allow records to be downloaded in MARC format because of licensing restrictions. A number of further Z39.50 connections, available only to SUNCAT contributing libraries, offer access to a subset of the catalogue records for download in MARC format.
This project will seek to determine what must be done to extend access to the catalogue, including holdings information, and will explore what may be done within the project timescales. This will include:
The Metadata guidelines for the UK RDTF will guide this work.