A1 Refereed original research article in a scientific journal
Benchmarking tools for detecting longitudinal differential expression in proteomics data allows establishing a robust reproducibility optimization regression approach
Authors: Välikangas Tommi, Suomi Tomi, Chandler Courtney E., Scott Alison J., Tran Bao Q., Ernst Robert K., Goodlett David R., Elo Laura L.
Publication year: 2022
Journal: Nature Communications
Journal name in source: Nature communications
Journal acronym: Nat Commun
Volume: 13
Issue: 1
ISSN: 2041-1723
eISSN: 2041-1723
DOI: https://doi.org/10.1038/s41467-022-35564-z
Web address : https://doi.org/10.1038/s41467-022-35564-z
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/177864928
Quantitative proteomics has matured into an established tool and longitudinal proteomics experiments have begun to emerge. However, no effective, simple-to-use differential expression method for longitudinal proteomics data has been released. Typically, such data is noisy, contains missing values, and has only few time points and biological replicates. To address this need, we provide a comprehensive evaluation of several existing differential expression methods for high-throughput longitudinal omics data and introduce a Robust longitudinal Differential Expression (RolDE) approach. The methods are evaluated using over 3000 semi-simulated spike-in proteomics datasets and three large experimental datasets. In the comparisons, RolDE performs overall best; it is most tolerant to missing values, displays good reproducibility and is the top method in ranking the results in a biologically meaningful way. Furthermore, RolDE is suitable for different types of data with typically unknown patterns in longitudinal expression and can be applied by non-experienced users.
Downloadable publication This is an electronic reprint of the original article. |