X-Mapper: fast and accurate sequence alignment via gapped x-mers

Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aimin...

Full description

Saved in:
Bibliographic Details
Main Authors: Gaston, Jeffry M., Alm, Eric J., Zhang, An-Ni
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2025
Subjects:
Online Access:https://hdl.handle.net/10356/182567
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-182567
record_format dspace
spelling sg-ntu-dr.10356-1825672025-02-10T06:36:18Z X-Mapper: fast and accurate sequence alignment via gapped x-mers Gaston, Jeffry M. Alm, Eric J. Zhang, An-Ni School of Biological Sciences Singapore Centre for Environmental Life Sciences and Engineering Medicine, Health and Life Sciences Bioinformatics Sequence alignment algorithms Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aiming to offer high speed and accuracy via dynamic-length seeds containing gaps, called gapped x-mers. We observe 11–24-fold fewer suboptimal alignments analyzing a human reference and 3–579-fold lower inconsistency across bacterial references than other aligners, improving on 53% and 30% of reads aligned to non-target strains and species, respectively. Other seed-based analysis algorithms might benefit from gapped x-mers too. Nanyang Technological University Published version Open Access funding provided by the MIT Libraries’ This research topic is supported by Gut Microbiome/Biomarker Dynamics 6934500 for the research associate fellowship of A.N.Z and Nanyang Technological University SUG 024164– 00001 for the senior research scientist fellowship of J.M.G. 2025-02-10T06:33:48Z 2025-02-10T06:33:48Z 2025 Journal Article Jeffry M. Gaston, Eric J. Alm & An-Ni Zhang (2025). X-Mapper: fast and accurate sequence alignment via gapped x-mers. Genome Biology, 26(1), 15-. https://dx.doi.org/10.1186/s13059-024-03473-7 1474-760X https://hdl.handle.net/10356/182567 10.1186/s13059-024-03473-7 39844205 1 26 15 en SUG 024164– 00001 Genome Biology © 2025 The Author(s). This article is licensed under a Creative Commons Attribution 4.0 Interna‑ tional License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Medicine, Health and Life Sciences
Bioinformatics
Sequence alignment algorithms
spellingShingle Medicine, Health and Life Sciences
Bioinformatics
Sequence alignment algorithms
Gaston, Jeffry M.
Alm, Eric J.
Zhang, An-Ni
X-Mapper: fast and accurate sequence alignment via gapped x-mers
description Sequence alignment is foundational to many bioinformatic analyses. Many aligners start by splitting sequences into contiguous, fixed-length seeds, called k-mers. Alignment is faster with longer, unique seeds, but more accurate with shorter seeds avoiding mutations. Here, we introduce X-Mapper, aiming to offer high speed and accuracy via dynamic-length seeds containing gaps, called gapped x-mers. We observe 11–24-fold fewer suboptimal alignments analyzing a human reference and 3–579-fold lower inconsistency across bacterial references than other aligners, improving on 53% and 30% of reads aligned to non-target strains and species, respectively. Other seed-based analysis algorithms might benefit from gapped x-mers too.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Gaston, Jeffry M.
Alm, Eric J.
Zhang, An-Ni
format Article
author Gaston, Jeffry M.
Alm, Eric J.
Zhang, An-Ni
author_sort Gaston, Jeffry M.
title X-Mapper: fast and accurate sequence alignment via gapped x-mers
title_short X-Mapper: fast and accurate sequence alignment via gapped x-mers
title_full X-Mapper: fast and accurate sequence alignment via gapped x-mers
title_fullStr X-Mapper: fast and accurate sequence alignment via gapped x-mers
title_full_unstemmed X-Mapper: fast and accurate sequence alignment via gapped x-mers
title_sort x-mapper: fast and accurate sequence alignment via gapped x-mers
publishDate 2025
url https://hdl.handle.net/10356/182567
_version_ 1823807400781545472