A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@gmail.com&api_key=61f08fa0b96a73de8c900d749fcb997acc09&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3122
Function: getPubMedXML

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

Building a pangenome alignment index via recursive prefix-free parsing. | LitMetric

Building a pangenome alignment index via recursive prefix-free parsing.

iScience

Department of Computer and Information Science and Engineering, Herbert-Wertheim College of Engineering, University of Florida, Gainesville, FL 32607, USA.

Published: October 2024

AI Article Synopsis

  • Pangenomics alignment helps reduce bias in biomedical research by allowing multiple genomes to be indexed, unlike traditional methods that focus on a single reference genome.
  • New aligners like VG, Giraffe, and Moni use advanced techniques to manage the complexities of indexing more genomes while tackling challenges related to data size.
  • The proposed method for scaling Moni improves efficiency by optimizing the construction of key data structures (RLBWT, suffix array, and LCP) through recursive prefix-free parsing, which is crucial for handling large pangenomes like those from the Human Pangenome Reference Consortium.

Article Abstract

Pangenomics alignment offers a solution to reduce bias in biomedical research. Traditionally, short-read aligners like Bowtie and BWA indexed a single reference genome to find approximate alignments. These methods, limited by linear-memory requirements, can only index a few genomes. Emerging pangenome aligners, such as VG, Giraffe, and Moni, address this by indexing more genomes. VG and Giraffe use a variation graph, while Moni indexes sequences accounting for repetition using prefix-free parsing to build a dictionary and parse. The main challenge is the parse's size, which becomes significantly larger than the dictionary. To scale Moni, we propose removing the parse from the construction of the run-length encoded BWT (RLBWT), suffix array, and Longest Common Prefix (LCP) by applying prefix-free parsing recursively. This approach improves construction time and memory requirements, enabling efficient construction of RLBWT, suffix array, and LCP for large pangenomes, such as those from the Human Pangenome Reference Consortium.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11465122PMC
http://dx.doi.org/10.1016/j.isci.2024.110933DOI Listing

Publication Analysis

Top Keywords

prefix-free parsing
12
rlbwt suffix
8
suffix array
8
building pangenome
4
pangenome alignment
4
alignment recursive
4
recursive prefix-free
4
parsing pangenomics
4
pangenomics alignment
4
alignment offers
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!