LOD Platform
The LOD Platform is a technological system for structuring bibliographic catalogues according to the BIBFRAME data model and for converting their data into Linked Open Data (LOD).
The system supports:
- the creation of structured data based on distinct records for Person and Work entities, as defined by BIBFRAME;
- data enrichment through connections to external projects;
- the conversion of bibliographic and authority data to RDF (Resource Description Framework), the standard model indicated by the W3C for LOD, using sector-specific ontologies selected from those of reference in a global context, which the platform both integrates and extends;
- the publication of the dataset as LOD on an RDF store (Triple Store);
- the creation of a portal whose navigation follows the BIBFRAME entities: Person/Work, Publications (Instances) and Items.
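The conversion step listed above can be illustrated with a minimal sketch. It is not the platform's own code: the base URI, the record layout and the function name are assumptions made for the example, and the enrichment link points to a placeholder URI rather than a real authority record. The sketch serializes one Person and one Work as N-Triples, using the BIBFRAME 2.0 classes bf:Person, bf:Work and bf:Contribution and the properties bf:contribution and bf:agent.

```python
# Illustrative sketch only: BASE, the record layout and the function names are assumptions.
BF = "http://id.loc.gov/ontologies/bibframe/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"
BASE = "http://example.org/"  # hypothetical base URI for the published dataset


def iri(value):
    """Wrap a URI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(text):
    """Quote a plain string literal, escaping backslashes, quotes and newlines."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
    return f'"{escaped}"'


def record_to_ntriples(record):
    """Turn one simplified catalogue record into a list of N-Triples lines."""
    person = iri(BASE + "person/" + record["person_id"])
    work = iri(BASE + "work/" + record["work_id"])
    contribution = "_:contribution_" + record["work_id"]  # blank node
    triples = [
        (person, iri(RDF_TYPE), iri(BF + "Person")),
        (person, iri(RDFS_LABEL), literal(record["name"])),
        (work, iri(RDF_TYPE), iri(BF + "Work")),
        (work, iri(RDFS_LABEL), literal(record["title"])),
        (work, iri(BF + "contribution"), contribution),
        (contribution, iri(RDF_TYPE), iri(BF + "Contribution")),
        (contribution, iri(BF + "agent"), person),
    ]
    # Enrichment: link the local entity to external identifiers, if any.
    for uri in record.get("same_as", []):
        triples.append((person, iri(OWL_SAME_AS), iri(uri)))
    return [f"{s} {p} {o} ." for s, p, o in triples]


sample = {
    "person_id": "p1",
    "work_id": "w1",
    "name": "Dante Alighieri",
    "title": "Divina Commedia",
    "same_as": ["http://example.org/external/p1"],  # placeholder, not a real authority URI
}
for line in record_to_ntriples(sample):
    print(line)
```

The resulting lines could be loaded into any triple store that accepts N-Triples. A production conversion would normally rely on an RDF library and on the full set of ontologies chosen for the project.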
Components of the technological architecture
- AUTHIFY, a RESTful module that provides full-text search services over external datasets (downloaded, stored and indexed in the system), primarily authority files (VIAF, Library of Congress Name Authority File, …) but extendable to other types of datasets. AUTHIFY is composed of two main parts: a Solr infrastructure for indexing the datasets and serving the related search services, and a logic layer that orchestrates these services to find a match within the clusters of defined entities (typically Names and Works)
- CLUSTER KNOWLEDGE BASE, a PostgreSQL database that holds, for each defined entity, the result of processing the catalogue data and enriching it with sources external to the bibliographic catalogue; typically: clusters of names (authorized and variant forms of the Name of a Person) and clusters of titles (authorized access points and variant forms of the Title of a Work)
- RDFizer, a Hadoop module that automates the entire process of converting the dataset to RDF and publishing it
- TRIPLE STORE, a database for storing RDF data, chosen from open-source or proprietary products according to specific needs
- PORTAL SKIN, an instance of the data publication portal
- JCRICKET Entity Editor, a new tool for collaborative linked data entity management and shared cataloguing. Following the BIBFRAME ontology, it supports entity curation (e.g. creating new entities, modifying entities, and applying entity merge and split functions)
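The notions of cluster, match, merge and split used in the component list can be made concrete with a small sketch. It does not reproduce the logic of AUTHIFY, the Cluster Knowledge Base or JCRICKET: the Cluster class, the normalization rule and the function names are assumptions made for illustration, and a real matcher would be considerably more sophisticated.

```python
# Illustrative sketch only: the data structure and matching rule are assumptions.
import unicodedata
from dataclasses import dataclass, field


def normalize(form):
    """Lowercase, strip diacritics and punctuation, and collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", form)
    no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    kept = "".join(ch if ch.isalnum() else " " for ch in no_marks.lower())
    return " ".join(kept.split())


@dataclass
class Cluster:
    """One entity: an authorized form plus its variant forms."""
    authorized: str
    variants: set = field(default_factory=set)

    def forms(self):
        return {self.authorized} | self.variants

    def matches(self, form):
        """True if the form equals any form of the cluster after normalization."""
        return normalize(form) in {normalize(f) for f in self.forms()}


def merge(a, b):
    """Merge b into a: a keeps its authorized form, all forms of b become variants."""
    return Cluster(a.authorized, (a.variants | b.forms()) - {a.authorized})


def split(cluster, moved, new_authorized):
    """Move the given variant forms out of a cluster into a new cluster."""
    moved = set(moved) & cluster.variants
    remaining = Cluster(cluster.authorized, cluster.variants - moved)
    created = Cluster(new_authorized, moved - {new_authorized})
    return remaining, created


a = Cluster("Alighieri, Dante", {"Dante"})
b = Cluster("Dante Alighieri", {"Dante, Alighieri"})
merged = merge(a, b)
print(sorted(merged.forms()))
print(merged.matches("DANTE  ALIGHIERI"))
```

Keeping merge and split as functions that return new clusters, instead of mutating the inputs, makes it easier to review or undo a curation step. This is a design choice of the sketch, not a documented property of the platform.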