A1 Refereed original research article in a scientific journal

Improving performance in colorectal cancer histology decomposition using deep and ensemble machine learning




AuthorsPrezja, Fabi; Annala, Leevi; Kiiskinen, Sampsa; Lahtinen, Suvi; Ojala, Timo; Ruusuvuori, Pekka; Kuopio, Teijo

Publication year2024

JournalHeliyon

Journal name in sourceHELIYON

Article numberARTN e37561

Volume10

Issue18

eISSN2405-8440

DOIhttps://doi.org/10.1016/j.heliyon.2024.e37561

Web address https://doi.org/10.1016/j.heliyon.2024.e37561

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/458235771


Abstract

In routine colorectal cancer management, histologic samples stained with hematoxylin and eosin are commonly used. Nonetheless, their potential for defining objective biomarkers for patient stratification and treatment selection is still being explored. The current gold standard relies on expensive and time-consuming genetic tests. However, recent research highlights the potential of convolutional neural networks (CNNs) to facilitate the extraction of clinically relevant biomarkers from these readily available images. These CNN-based biomarkers can predict patient outcomes comparably to golden standards, with the added advantages of speed, automation, and minimal cost. The predictive potential of CNN-based biomarkers fundamentally relies on the ability of CNNs to accurately classify diverse tissue types from whole slide microscope images. Consequently, enhancing the accuracy of tissue class decomposition is critical to amplifying the prognostic potential of imaging-based biomarkers. This study introduces a hybrid deep transfer learning and ensemble machine learning model that improves upon previous approaches, including a transformer and neural architecture search baseline for this task. We employed a pairing of the EfficientNetV2 architecture with a random forest classification head. Our model achieved 96.74% accuracy (95% CI: 96.3%-97.1%) on the external test set and 99.89% on the internal test set. Recognizing the potential of these models in the task, we have made them publicly available.


Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.





Last updated on 2025-28-01 at 12:27