Elementary methods provide more replicable results in microbial differential abundance analysis




Pelto, Juho; Auranen, Kari; Kujala, Janne V.; Lahti, Leo

PublisherOxford University Press

2025

Briefings in Bioinformatics

bbaf130

26

2

1467-5463

1477-4054

DOIhttps://doi.org/10.1093/bib/bbaf130

https://doi.org/10.1093/bib/bbaf130

https://research.utu.fi/converis/portal/detail/Publication/491925599



Differential abundance analysis (DAA) is a key component of microbiome studies. Although dozens of methods exist, there is currently no consensus on the preferred methods. While the correctness of results in DAA is an ambiguous concept and cannot be fully evaluated without setting the ground truth and employing simulated data, we argue that a well-performing method should be effective in producing highly reproducible results. We compared the performance of 14 DAA methods by employing datasets from 53 taxonomic profiling studies based on 16S rRNA gene or shotgun metagenomic sequencing. For each method, we examined how the results replicated between random partitions of each dataset and between datasets from separate studies. While certain methods showed good consistency, some widely used methods were observed to produce a substantial number of conf licting findings. Overall, when considering consistency together with sensitivity, the best performance was attained by analyzing relative abundances with a nonparametric method (Wilcoxon test or ordinal regression model) or linear regression/t-test. Moreover, a comparable performance was obtained by analyzing presence/absence of taxa with logistic regression.



This work was supported by the Research Council of Finland
[330887 to J.P., L.L.]; and the European Union’s Horizon 2020
research and innovation programme [952914 to J.P., L.L.].


Last updated on 2025-21-05 at 12:22