Pan-Cancer Analyses & Cloud Computing within the EOSC

SD ID: Pan-Cancer Analyses & Cloud Computing within the EOSC

ORGANISATION: Genome Biology Unit, European Molecular Biology Laboratory (EMBL)

CONTACT: Sergei Iakhnin

Email: llevar(at)


The life sciences, health, food, fishery and agriculture sectors face a multitude of challenges. In cancer research, Europe has taken a technical lead within international consortia around cloud-based pan-cancer genomic analysis. This global competitive advantage can be maintained by leveraging open science analysis models around controlled access data sets developed in collaboration with researchers elsewhere in the world. These analysis frameworks could also be re-used to analyse cardiovascular and neuro-degenerative diseases, and could stimulate the biotech and pharmaceutical industries to use public cancer genomic data in R&D.


The Pan-Cancer Analysis of Whole Genomes Project (PCAWG)

The Pan-Cancer Analysis of Whole Genomes project (PCAWG) is analysing large cohorts of cancer genomes and pursuing so-called pan-cancer studies to identify factors that may be involved in tumour formation and disease progression across multiple cancer types. PCAWG is currently analysing more than 2,800 cancer whole genomes, largely on academic and public clouds, and is also developing approaches for data integration with transcriptome and clinical data to address specific hypotheses.

The EOSCpilot science demonstrator aims to establish a portable, cloud-based, federated solution for collaborative cancer genomics and associated health data management, together with an analysis environment accessible to European scientists.


Engagement in the EOSC will enable Europe to maintain its current global competitive advantage in pan-cancer analyses and foster cloud-based solutions for genomic data analysis. Broader impact and the ability to reuse solutions in other areas (e.g. for cardiovascular and neuro-degenerative diseases) are also foreseen.
