X-Mapper: fast and accurate sequence alignment via gapped x-mers
Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aimin...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2025
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/182567 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-182567 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1825672025-02-10T06:36:18Z X-Mapper: fast and accurate sequence alignment via gapped x-mers Gaston, Jeffry M. Alm, Eric J. Zhang, An-Ni School of Biological Sciences Singapore Centre for Environmental Life Sciences and Engineering Medicine, Health and Life Sciences Bioinformatics Sequence alignment algorithms Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aiming to offer high speed and accuracy via dynamic-length seeds containing gaps, called gapped x-mers. We observe 11–24-fold fewer suboptimal alignments analyzing a human reference and 3–579-fold lower inconsistency across bacterial references than other aligners, improving on 53% and 30% of reads aligned to non-target strains and species, respectively. Other seed-based analysis algorithms might benefit from gapped x-mers too. Nanyang Technological University Published version Open Access funding provided by the MIT Libraries’ This research topic is supported by Gut Microbiome/Biomarker Dynamics 6934500 for the research associate fellowship of A.N.Z and Nanyang Technological University SUG 024164– 00001 for the senior research scientist fellowship of J.M.G. 2025-02-10T06:33:48Z 2025-02-10T06:33:48Z 2025 Journal Article Jeffry M. Gaston, Eric J. Alm & An-Ni Zhang (2025). X-Mapper: fast and accurate sequence alignment via gapped x-mers. Genome Biology, 26(1), 15-. https://dx.doi.org/10.1186/s13059-024-03473-7 1474-760X https://hdl.handle.net/10356/182567 10.1186/s13059-024-03473-7 39844205 1 26 15 en SUG 024164– 00001 Genome Biology © 2025 The Author(s). This article is licensed under a Creative Commons Attribution 4.0 Interna‑ tional License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Medicine, Health and Life Sciences Bioinformatics Sequence alignment algorithms |
spellingShingle |
Medicine, Health and Life Sciences Bioinformatics Sequence alignment algorithms Gaston, Jeffry M. Alm, Eric J. Zhang, An-Ni X-Mapper: fast and accurate sequence alignment via gapped x-mers |
description |
Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aiming to offer high speed and accuracy via dynamic-length seeds containing gaps, called gapped x-mers. We observe 11–24-fold fewer suboptimal alignments analyzing a human reference and 3–579-fold lower inconsistency across bacterial references than other aligners, improving on 53% and 30% of reads aligned to non-target strains and species, respectively. Other seed-based analysis algorithms might benefit from gapped x-mers too. |
author2 |
School of Biological Sciences |
author_facet |
School of Biological Sciences Gaston, Jeffry M. Alm, Eric J. Zhang, An-Ni |
format |
Article |
author |
Gaston, Jeffry M. Alm, Eric J. Zhang, An-Ni |
author_sort |
Gaston, Jeffry M. |
title |
X-Mapper: fast and accurate sequence alignment via gapped x-mers |
title_short |
X-Mapper: fast and accurate sequence alignment via gapped x-mers |
title_full |
X-Mapper: fast and accurate sequence alignment via gapped x-mers |
title_fullStr |
X-Mapper: fast and accurate sequence alignment via gapped x-mers |
title_full_unstemmed |
X-Mapper: fast and accurate sequence alignment via gapped x-mers |
title_sort |
x-mapper: fast and accurate sequence alignment via gapped x-mers |
publishDate |
2025 |
url |
https://hdl.handle.net/10356/182567 |
_version_ |
1823807400781545472 |