Motivation: As genome graphs are powerful data structures for representing the genetic diversity within populations, they can help identify genomic variations that traditional linear references miss, but their complexity and size makes the analysis of genome graphs challenging. We sought to develop a genome graph analysis tool that helps these analyses to become more accessible by addressing the limitations of existing tools. Specifically, we improve scalability and user-friendliness, and we provide many new statistics tailored to variation graphs for graph evaluation, including sample-specific features.
Results: We developed an efficient, comprehensive, and integrated tool, gretl, to analyze genome graphs and gain insights into their structure and composition by providing a wide range of statistics. gretl can be utilized to evaluate different graphs, compare the output of graph construction pipelines with different parameters, as well as perform an in-depth analysis of individual graphs, including sample-specific analysis. With the assistance of gretl, novel patterns of genetic variation and potential regions of interest can be identified, for later, more detailed inspection. We demonstrate that gretl outperforms other tools in terms of speed, particularly for larger genome graphs.
Availability And Implementation: Commented Rust source code and documentation is available under MIT license at https://github.com/MoinSebi/gretl together with Python scripts and step-by-step usage examples. The package is available at Bioconda for easy installation.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11729725 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btae755 | DOI Listing |
Int J Mol Sci
January 2025
School of Mathematics and Computer Science, Gannan Normal University, Ganzhou 341000, China.
Due to advances in big data technology, deep learning, and knowledge engineering, biological sequence visualization has been extensively explored. In the post-genome era, biological sequence visualization enables the visual representation of both structured and unstructured biological sequence data. However, a universal visualization method for all types of sequences has not been reported.
View Article and Find Full Text PDFSci Rep
January 2025
Sanofi R&D - Translational Medicine & Early Development - Translational Precision Medicine, 13 Quai Jules Guesde, 94400, Vitry-sur-Seine, France.
Precision medicine is defined by the U.S. Food & Drug Administration as "an innovative approach to tailoring disease prevention and treatment that considers differences in people's genes, environments, and lifestyles".
View Article and Find Full Text PDFInt J Biol Macromol
January 2025
Umeå Plant Science Centre, Department Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-90183 Umeå, Sweden.
Resin biosynthesis in conifer is a complex process, controlled by multiple quantitative trait loci (QTLs). Quantifying resin components is traditionally expensive and labor-intensive. In this study, we employed near infrared (NIR) spectroscopy to quantify resin components in Slash pine using 240 genotypes.
View Article and Find Full Text PDFProc Natl Acad Sci U S A
January 2025
Department of Chemistry, New York University, New York, NY 10003.
Frameshifting is an essential mechanism employed by many viruses including coronaviruses to produce viral proteins from a compact RNA genome. It is facilitated by specific RNA folds in the frameshift element (FSE), which has emerged as an important therapeutic target. For SARS-CoV-2, a specific 3-stem pseudoknot has been identified to stimulate frameshifting.
View Article and Find Full Text PDFMol Hortic
January 2025
State Key Laboratory of Plant Diversity and Specialty Crops, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, 430074, Hubei, China.
Kiwifruit is an economically and nutritionally important horticultural fruit crop worldwide. The genomic data of several kiwifruit species have been released, providing an unprecedented opportunity for pan-genome analysis to comprehensively investigate the inter- and intra-species genetic diversity and facilitate utilization for kiwifruit breeding. Here, we generated a kiwifruit super pan-genome using 15 high-quality assemblies of eight Actinidia species.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!