Staining normalization in histopathology: Method benchmarking using multicenter dataset
: Khan, Umair; Härkönen, Jouni; Friman, Marjukka; Hakimnejad, Hesam; Latonen, Leena; Kuopio, Teijo; Ruusuvuori, Pekka
: 2026
Scientific Reports
: 11097
: 16
: 1
: 2045-2322
DOI: https://doi.org/10.1038/s41598-026-40943-3
: https://doi.org/10.1038/s41598-026-40943-3
: https://research.utu.fi/converis/portal/detail/Publication/515761745
Hematoxylin and Eosin (H&E) has been the gold standard in tissue analysis for decades, however, tissue specimens stained in different laboratories vary, often significantly, in appearance. This variation poses a challenge for both pathologists’ and AI-based downstream analysis. Minimizing stain variation computationally is an active area of research. To further investigate this problem, we collected a unique multi-center tissue image dataset, wherein tissue samples from colon, kidney, and skin tissue blocks were distributed to 66 different labs for routine H&E staining. To isolate staining variation, other factors affecting the tissue appearance were kept constant. Further, we used this tissue image dataset to compare the performance of eight different stain normalization methods, including four traditional methods, namely, histogram matching, Macenko, Vahadane, and Reinhard normalization, and two deep learning-based methods namely CycleGAN and Pixp2pix, both with two variants each. We used both quantitative and qualitative evaluation to assess the performance of these methods. The dataset’s inter-laboratory staining variation could also guide strategies to improve model generalizability through varied training data.
:
Financial support from Research Council of Finland (PR, LL), Cancer Foundation Finland (PR, LL), Sigrid Juselius Foundation (PR), University of Turku Graduate School (UK) is gratefully acknowledged.