Building the world’s largest spatial multiomics dataset in oncology

Blue

How to access the MOSAIC dataset

Option 1 - Access via the EGA

MOSAIC-Window, a subset of the MOSAIC dataset, is available via the European Genome-Phenome Archive (EGA).

Option 2 - Explore via K Pro

Explore the MOSAIC-Window dataset via the free tier of our agentic AI co-pilot K Pro Free.

The full MOSAIC dataset is available as an add on to the premium tier of K Pro. Talk to our team or book a demo to learn more about this option.

SIGN UP TO K PRO FREE

Large-scale spatial omics data is essential to fuel research in cancer

The convergence of large-scale spatial omics data and AI will power the next revolution of oncology research. There is an urgent need for the whole cancer community to have access to these data to reveal groups of patients with distinct tumour/immune biology.

The MOSAIC initiative will overcome this limitation by creating a large scale dataset of tumor spatial transcriptomes, allowing scientists to conduct research on data from cohorts larger than what is currently possible.

The MOSAIC dataset today:

10
Therapy areas
6
Data modalities
100
Clinical variables per patient
2,646
Total patients included
1,833
Spatial omics data generated
1,449
Single-cell omics generated
Blue

Data modalities

MOSAIC covers six data modalities to capture the complexity of disease biology at multiple scales.

Here is a breakdown of current number of patient samples per modality in the dataset.

2666
Digitized H&E
Digitized H&E
2082
Whole Exome Sequencing
Whole Exome Sequencing
1857
Bulk RNA-Seq
RNA-Seq
2164
Spatial transcriptomics
Spatial transcriptomics
2068
Single-cell RNA-Seq
Single-cell RNA-Seq
Clinical data for every patient
Clinical data

Cancer therapy areas

MOSAIC focuses on selected cohorts that include ten cancer indications.

Here is a breakdown of the number of patients per therapy area currently in the dataset

296
Bladder cancer
Bladder Cancer
301
Glioblastoma
GBM
384
Breast cancer
Breast cancer
189
Diffuse large B cell lymphoma
DLBCL
96
Mesothelioma
Mesothelioma
473
Ovarian cancer
Ovarian Cancer
159
HNSCC
HNSCC
521
Non-small cell lung cancer
NSCLC
144
Pancreatic cancer
Pancreatic Cancer
83
Colorectal cancer
Colorectal Cancer

Why spatial biology?

Spatial biology brings a new dimension to cancer research because it maps biological molecules to their location within a tissue, capturing the spatial context.

When combined with data from technologies like imaging and genomics, spatial omics contributes to a rich, multiscale understanding of disease mechanisms.

How will MOSAIC help us fight cancer?

Through spatial omics MOSAIC will paint a picture of the dynamics between cells, highlighting key biological networks to enable precision medicine via the discovery of:

Disease Biology Illustration

New disease biology

  • Integrated, multimodal analysis of the tumor in its native architecture
  • A new understanding of the tumor microenvironment
Patient Subtypes

New patient subtypes

  • Identification of patients who are resistant or respond differently to certain drugs
  • A better understanding of how tumor-immune cell interactions differ in those patients
Biomarkers

New biomarkers

  • Discovery of biomarkers that characterize patient subtypes
  • Improved diagnostics, treatment decision and clinical trial design
Drug Targets

New drug targets

  • Discovery of patient-specific drug targets
  • Development of more effective combination and personalized therapies
Match Treatment

Matching the right patient to the right treatment

  • Improved connection between patient biology and target discovery
  • Prescription of personalized therapies to the right patients

Partners