Video embedded: 

Introduction

The agriculture, food and environment research community faces many challenges common to all: Easily find and publish data, preserve them, and facilitate their treatment and analyse through computing solutions.

Examples of needs:

“I need to integrate innovative services that allow researchers to analyse data and publish it easily. Our information system must guarantee long-term storage of experimental data” - Engineer, Phenome-Emphasis community

“I need an easy access to the data produced on the experimental platforms. I need to correlate various datasets.  I need an easy access to publish curated datasets or models.” - Research Engineer, Phenome-Emphasis community

To address these needs, this use case aims to create a flexible federated research data ecosystem for the agrifood community through four aspects:

  • long term data preservation,
  • connecting data repositories,
  • virtual research environments,
  • cloud computing.

Challenges addressed

  • Data findability and reusability
  • Data integration
  • Data processing
  • Reproducibility

Benefits through EOSC-Pillar

By using the EOSC-Pillar Federated FAIR Data Space (F2DS), data providers and repositories will be able to make their data findable, accessible, and reusable by the whole community within the context of EOSC. As a direct consequence, this task will enable and/or increase interoperability among the repositories. Furthermore, the use case will leverage EOSC data services such as B2SAFE in order to implement long-term preservation of the institutional data repositories.

With EOSC distribution, the community as a whole will gain access to a research environment on which to process, analyse and visualise data in-situ with appropriate compute infrastructure, without the need to download them first, fostering collaborations and cross-fertilisation.

Highlights

What has been achieved

  • Widening the agrifood initial scope through collaboration with partners.
  • Deployment of a Virtual Research Environment.
  • Deployment of OpenStack over INRAE and France Grilles infrastructures.
  • Provisioning of Kubernetes Clusters based on INRAE and France Grilles infrastructures.
  • Deployment of JupyterHub on INRAE infrastructures.
  • Deployment of renku on INRAE infrastructures.
  • Mapping between CINES archiving tool (VITAM) and DataINRAE metadata.
  • Creation of Data INRAE openAPI definition for help integrating Dataverse based repositories data in the Federated FAIR Data Space (F2DS).
  • Connected Galaxy to Dataverse.

 

Next steps

  • Setup a connector between CINES archiving tool (VITAM) and Data INRAE for archiving automaticaly Data INRAE’s data in VITAM.
  • Connect INRAE jupyterHub to the D4Science platform as a Jupyter notebook provider.
  • Developing a connector between Dataverse and INRAE’s Jupyterhub and renku instances.
  • Developing a connector between Dataverse and D4science VRE.
  • Setup interoperability between Fraunhofer Market Place and F2DS.
  • Setup interoperability between Fraunhofer Market Place and Dataverse based repository.