A1 Refereed original research article in a scientific journal
Adaptive sequence alignment for metagenomic data analysis
Authors: Pietilä, Sami; Suomi, Tomi; Paulin, Niklas; Laiho, Asta; Sclivagnotis, Yannes S.; Elo, Laura L.
Publisher: Elsevier BV
Publication year: 2025
Journal: Computers in Biology and Medicine
Journal name in source: Computers in Biology and Medicine
Article number: 109743
Volume: 186
ISSN: 0010-4825
eISSN: 1879-0534
DOI: https://doi.org/10.1016/j.compbiomed.2025.109743
Web address : https://doi.org/10.1016/j.compbiomed.2025.109743
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/484861396
With advances in sequencing technologies, the use of high-throughput sequencing to characterize microbial communities is becoming increasingly feasible. However, metagenomic assembly poses computational challenges in reconstructing genes and organisms from complex samples. To address this issue, we introduce a new concept called Adaptive Sequence Alignment (ASA) for analyzing metagenomic DNA sequence data. By iteratively adapting a set of partial alignments of reference sequences to match the sample data, the approach can be applied in multiple scenarios, from taxonomic identification to assembly of target regions of interest. To demonstrate the benefits of ASA, we present two application scenarios and compare the results with state-of-the-art methods conventionally used for the same tasks. In the first, ASA accurately detected microorganisms from a sequenced metagenomic sample with a known composition. The second illustrated the utility of ASA in assembling target genetic regions of the microorganisms. An example implementation of the ASA concept is available at https://github.com/elolab/ASA.
Downloadable publication This is an electronic reprint of the original article. |
Funding information in the publication:
Prof. Elo reports grants from the European Union's Horizon 2020 research and innovation programme (955321), Research Council of Finland (310561, 314443, 329278, 335434, 335611 and 341342), and Sigrid Juselius Foundation, during the conduct of the study. Our research is also supported by Biocenter Finland, and ELIXIR Finland.