SD ID: Frictionless Data Exchange Across Research Data, Software and Scientific Paper Repositories Organisations & contacts Petr Knoth, Lucas Anastasiou and Giorgio Basile, The Open University, UK Martin Klein and Herbert Van de Sompel, Los Alamos National Laboratory, USA |
OVERVIEW: The Science Demonstrator pilots a demonstrator service for fast and highly scalable exchange of data across repositories storing research datasets, manuscripts and scientific software.
SCIENTIFIC OBJECTIVES OF THE DEMONSTRATOR:
This pilot wants to showcase how scholarly communication resources can be effectively, regularly and reliably exchanged across systems, and to apply ResourceSync protocol on real-world use cases. Moreover, the objective is also to show data synchronization across a cross-disciplinary network of repositories and between repositories and global added-value services.
- Showcase how scholarly communication resources can be effectively, regularly and reliably exchanged across systems;
-
Apply ResourceSync protocol on real-world use cases;
-
Show data synchronization across a cross-disciplinary network of repositories and between repositories and global added-value services.
FAIR CHALLENGES:
-
FINDABILITY & ACCESSIBILITY: increasing resources findability and accessibility on the web;
-
INTEROPERABILITY: improving interoperability of scientific repositories;
-
REUSABILITY: increase resources reuse in big data analytics tasks involving text and data mining;
-
increasing scalability of data exchange among scholarly communication systems.
MAIN ACHIEVEMENTS:
- Benchmarkingg OAI-PMH against ResourceSync in a number of scenarios and according to a number of criteria, including:
- Enhancing ResourceSync by means of developing of a ResourceSync On Demand Dump concept and its implementation and aditpion in a real use cases.
- Supporting OpenAIRE in adopting ResourceSync-based ingestion of scientific papers from publishers.
IMPACT:
-
A clear path to go beyond the current state of the art in information exchange between EOSC data providers and services;
-
Scalable client/server implementation of the ResourceSync protocol for easy adoption by data providers;
-
Awareness of existing issues and the proposed solution.
TECHNICAL FOCUS:
-
the demonstrator will be deployed within the CORE infrastructure
-
the CORE infrastructure consists of small clusters for harvesting, indexing, processing and storing data