A1 Refereed original research article in a scientific journal

Adaptive sequence alignment for metagenomic data analysis




AuthorsPietilä, Sami; Suomi, Tomi; Paulin, Niklas; Laiho, Asta; Sclivagnotis, Yannes S.; Elo, Laura L.

PublisherElsevier BV

Publication year2025

JournalComputers in Biology and Medicine

Journal name in sourceComputers in Biology and Medicine

Article number109743

Volume186

ISSN0010-4825

eISSN1879-0534

DOIhttps://doi.org/10.1016/j.compbiomed.2025.109743

Web address https://doi.org/10.1016/j.compbiomed.2025.109743

Self-archived copy’s web addresshttps://research.utu.fi/converis/portal/detail/Publication/484861396


Abstract
With advances in sequencing technologies, the use of high-throughput sequencing to characterize microbial communities is becoming increasingly feasible. However, metagenomic assembly poses computational challenges in reconstructing genes and organisms from complex samples. To address this issue, we introduce a new concept called Adaptive Sequence Alignment (ASA) for analyzing metagenomic DNA sequence data. By iteratively adapting a set of partial alignments of reference sequences to match the sample data, the approach can be applied in multiple scenarios, from taxonomic identification to assembly of target regions of interest. To demonstrate the benefits of ASA, we present two application scenarios and compare the results with state-of-the-art methods conventionally used for the same tasks. In the first, ASA accurately detected microorganisms from a sequenced metagenomic sample with a known composition. The second illustrated the utility of ASA in assembling target genetic regions of the microorganisms. An example implementation of the ASA concept is available at https://github.com/elolab/ASA.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Funding information in the publication
Prof. Elo reports grants from the European Union's Horizon 2020 research and innovation programme (955321), Research Council of Finland (310561, 314443, 329278, 335434, 335611 and 341342), and Sigrid Juselius Foundation, during the conduct of the study. Our research is also supported by Biocenter Finland, and ELIXIR Finland.


Last updated on 2025-07-03 at 12:23