Frictionless Data Exchange Across Research Data, Software and Scientific Paper Repositories

SD ID: Frictionless Data Exchange Across Research Data, Software and Scientific Paper Repositories

Organisations & contacts

Petr Knoth, Lucas Anastasiou and Giorgio Basile, The Open University, UK

Martin Klein and Herbert Van de Sompel, Los Alamos National Laboratory, USA

 

OVERVIEW:  The Science Demonstrator pilots a demonstrator service for fast and highly scalable exchange of data across repositories storing research datasets, manuscripts and scientific software.

SCIENTIFIC OBJECTIVES OF THE DEMONSTRATOR:

This pilot wants to showcase how scholarly communication resources can be effectively, regularly and reliably exchanged across systems, and to apply ResourceSync protocol on real-world use cases. Moreover, the objective is also to show data synchronization across a cross-disciplinary network of repositories and between repositories and global added-value services.

  • Showcase how scholarly communication resources can be effectively, regularly and reliably exchanged across systems;
  • Apply ResourceSync protocol on real-world use cases;

  • Show data synchronization across a cross-disciplinary network of repositories and between repositories and global added-value services.

FAIR CHALLENGES:

  • FINDABILITY & ACCESSIBILITY: increasing resources findability and accessibility on the web;

  • INTEROPERABILITY: improving interoperability of scientific repositories;

  • REUSABILITY: increase resources reuse in big data analytics tasks involving text and data mining;

  • increasing scalability of data exchange among scholarly communication systems.

MAIN ACHIEVEMENTS:

  • Benchmarkingg OAI-PMH against ResourceSync in a number of scenarios and according to a number of criteria, including:
  • Enhancing ResourceSync by means of developing of a ResourceSync On Demand Dump concept and its implementation and aditpion in a real use cases.
  • Supporting OpenAIRE in adopting ResourceSync-based ingestion of scientific papers from publishers.

IMPACT:

  • A clear path to go beyond the current state of the art in information exchange between EOSC data providers and services;

  • Scalable client/server implementation of the ResourceSync protocol for easy adoption by data providers;

  • Awareness of existing issues and the proposed solution.

TECHNICAL FOCUS:

  • the demonstrator will be deployed within the CORE infrastructure

  • the CORE infrastructure consists of small clusters for harvesting, indexing, processing and storing data

 

Public Attachment: