A1 Refereed data article in a scientific journal

A comprehensive European Colorectal Cancer Cohort dataset




AuthorsHolub, Petr; Törnwall, Outi; Garcia Alvarez, Eva; Proynova, Rumyana; Stampe, Florian; Maqsood, Saher; Ataian, Maxmilian; Schlünder, Irene; Carpén, Olli; Meijer, Gerrit; Nenutil, Rudolf; Valík, Dalibor; Parodi, Barbara; Hiemstra, Annemieke; Bierkens, Mariska; Castanos-Vélez, Esmeralda; Ückert, Frank; Alexandre, Diogo; Vojtíšek, Ondřej; Bader, Anna-Liisa; Simell, Birgit; Ahern, Caitlin; Horvat, Vitomir; Steinfelder, Erik; Gnocchi, Matteo; Moscatelli, Marco; Fürbaß, Alexander; Horák, Jiří; Frexia, Francesca; Mascia, Cecilia; Sulis, Alessandro; Delussu, Giovanni; Del Rio, Mauro; Meloni, Vittorio; Pireddu, Luca; Leo, Simone; Piras, Marco Enrico; Kačenga, Martin; Gofflot, Stéphanie; Mate, Sebastian; Prokosch, Hans-Ulrich; Romano, Paolo; Pistillo, Daniela; Hoffmeister, Michael; Brobeil, Alexander; Kugic, Amila; Huppertz, Berthold; Paleari, Valentina; Müller, Heimo; Reihs, Robert; Gemoll, Timo; Bantel, Yannick; Sjöblom, Tobias; Kyriacou, Kyriacos; di Martino, Simona; Ciliberto, Gennaro; Kock-Schoppenhauer, Ann-Kristin; Oberländer, Martina; Habermann, Jens K.; Husmann, Gabriele; Edqvist, Per-Henrik D.; Zlobec, Inti; Berger, Martin D.; Boeckmann, Lars; George, Fabienne; Southerington, Tom; Brucker, Daniel P.; Faugeras, Laurence; Vella, Joanna; Felice, Alex; Pace, Malcolm; Fallerini, Chiara; Renieri, Alessandra; Hadjisavvas, Andreas; Sargsyan, Karine; Loizidou, Maria A.; Besse-Hammer, Tatiana; Vogl, Franziska; Litton, Jan-Eric; Hummel, Michael; Zatloukal, Kurt; Lavitrano, Marialuisa

PublisherSpringer Nature

Publication year2026

Journal: Scientific Data

Article number662

Volume13

eISSN2052-4463

DOIhttps://doi.org/10.1038/s41597-026-06822-2

Publication's open availability at the time of reportingOpen Access

Publication channel's open availability Open Access publication channel

Web address https://www.nature.com/articles/s41597-026-06822-2

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/516223372

Self-archived copy's licenceCC BY NC ND

Self-archived copy's versionPublisher`s PDF


Abstract

Colorectal cancer (CRC) is a leading cause of cancer-related deaths worldwide. The Biobanking and Biomolecular Resources European Research Infrastructure (BBMRI-ERIC) established a CRC-Cohort with European coverage contributed by 26 biobanks from 12 countries. This retrospective, multi-center study contains structured and curated clinical data, supporting research on biomarkers for early detection, prognosis, and treatment. A phenotypical/clinical data model has been defined and individual-level data from 10,780 CRC patients have been collected at BBMRI-ERIC in the central data deposition service hosted as part of its services. The participating biobanks host additional data, which can be accessed on request and used to derive additional data. This mechanism has been used to extend the collected data with scans of histopathological slides to support research in artificial intelligence in digital pathology and with whole genome sequencing data to pilot a use case of the upcoming European Health Data Space (EHDS). Here we present the methodology, the quality assurance mechanisms, and the implementation of FAIR and FAIR-Health principles applied to build the CRC-Cohort.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Funding information in the publication
This work has been co/funded by ADOPT BBMRI-ERIC project supported by EU Horizon 2020, grant agreement no. 676550; EOSC-Life project, supported by EU Horizon 2020, grant agreement no. 824087, as a part of WP1 Demonstrators under APPID 1228 “Cloudification of BBMRI-ERIC CRC-Cohort and its Digital Pathology Imaging”; the HealthData@EU Pilot project, funded under the EU4Health Programme, grant agreement no. 101079839, and the XDATA Project, financed by the Sardinian Regional Government. Parts of this work have received funding from the Austrian Science Fund (FWF), Austria, Project P-32554 (Explainable Artificial Intelligence). WSI visualization infrastructructure has been partially supported by the BioMedAI TWINNING project, supported by EU Horizon Europe, grant agreement no. 101079183. In addition, this work has been supported by funding of BBMRI-ERIC National and Organizational Nodes: Czech Ministry of Education, Youth and Sports (LM2018125–BBMRI-CZ); the Austrian Federal Ministry for Education, Science and Research (BMBWF-10.470/0010-V3c/2018; BBMRI.at).


Last updated on 13/05/2026 10:52:40 AM