Publications by authors named "Ben Busby"

Article Synopsis
  • * The authors introduce "stratifications," or specific BED files, that outline different genomic contexts for GRCh37/38 and the new T2T-CHM13 reference, which includes previously challenging regions to sequence.
  • * They also compare the performance of sequencing benchmarks across these references, showing how difficult regions in CHM13 impact the overall performance, and provide a snakemake pipeline for generating stratifications to aid in optimizing sequencing platforms.
View Article and Find Full Text PDF
Article Synopsis
  • The study investigates how clonal hematopoiesis (CH) may lead to myeloid neoplasms (MN) and explores the potential of the plasma proteome in predicting MN risk.
  • Researchers analyzed the plasma proteomic profiles of over 46,000 individuals and identified 115 proteins linked to MN risk, with 34 of these also associated with CH.
  • The findings suggest that immune system regulation plays a crucial role in the progression from CH to MN, and the use of plasma proteomics can enhance MN risk prediction beyond traditional clinical factors.
View Article and Find Full Text PDF

Science, technology, engineering, mathematics, and medicine (STEMM) fields change rapidly and are increasingly interdisciplinary. Commonly, STEMM practitioners use short-format training (SFT) such as workshops and short courses for upskilling and reskilling, but unaddressed challenges limit SFT's effectiveness and inclusiveness. Education researchers, students in SFT courses, and organizations have called for research and strategies that can strengthen SFT in terms of effectiveness, inclusiveness, and accessibility across multiple dimensions.

View Article and Find Full Text PDF

For a number of neurological diseases, such as Alzheimer's disease, amyotrophic lateral sclerosis, and many others, certain genes are known to be involved in the disease mechanism. A common question is whether a structural variant in any such gene may be related to drug response in clinical trials and how this relationship can contribute to the lifecycle of drug development. To this end, we introduce VariantSurvival, a tool that identifies changes in survival relative to structural variants within target genes.

View Article and Find Full Text PDF

In October 2021, 59 scientists from 14 countries and 13 U.S. states collaborated virtually in the Third Annual Baylor College of Medicine & DNANexus Structural Variation hackathon.

View Article and Find Full Text PDF

Serum tryptase is a biomarker used to aid in the identification of certain myeloid neoplasms, most notably systemic mastocytosis, where basal serum tryptase (BST) levels >20 ng/mL are a minor criterion for diagnosis. Although clonal myeloid neoplasms are rare, the common cause for elevated BST levels is the genetic trait hereditary α-tryptasemia (HαT) caused by increased germline TPSAB1 copy number. To date, the precise structural variation and mechanism(s) underlying elevated BST in HαT and the general clinical utility of tryptase genotyping, remain undefined.

View Article and Find Full Text PDF

iCn3D was initially developed as a web-based 3D molecular viewer. It then evolved from visualization into a full-featured interactive structural analysis software. It became a collaborative research instrument through the sharing of permanent, shortened URLs that encapsulate not only annotated visual molecular scenes, but also all underlying data and analysis scripts in a FAIR manner.

View Article and Find Full Text PDF

Introduction: Pancreatitis is a complex syndrome that results from many etiologies. Large well-characterized cohorts are needed to further understand disease risk and prognosis.

Methods: A pancreatitis cohort of more than 4,200 patients and 24,000 controls were identified in the UK BioBank (UKBB) consortium.

View Article and Find Full Text PDF
Article Synopsis
  • * The event aimed to assess the current status of research, highlight ongoing challenges, and explore how to leverage various strengths to enhance scientific progress.
  • * Over four days, eight groups developed new open-source methods to improve species variation analysis and created a resource for the research community, with daily summaries and methods available on GitHub.
View Article and Find Full Text PDF

Characterizing the gut microbiota in terms of their capacity to interfere with drug metabolism is necessary to achieve drug efficacy and safety. Although examples of drug-microbiome interactions are well-documented, little has been reported about a computational pipeline for systematically identifying and characterizing bacterial enzymes that process particular classes of drugs. The goal of our study is to develop a computational approach that compiles drugs whose metabolism may be influenced by a particular class of microbial enzymes and that quantifies the variability in the collective level of those enzymes among individuals.

View Article and Find Full Text PDF

Viruses represent important test cases for data federation due to their genome size and the rapid increase in sequence data in publicly available databases. However, some consequences of previously decentralized (unfederated) data are lack of consensus or comparisons between feature annotations. Unifying or displaying alternative annotations should be a priority both for communities with robust entry representation and for nascent communities with burgeoning data sources.

View Article and Find Full Text PDF

The Sequence Read Archive (SRA) is a large public repository that stores raw next-generation sequencing data from thousands of diverse scientific investigations.  Despite its promise, reuse and re-analysis of SRA data has been challenged by the heterogeneity and poor quality of the metadata that describe its biological samples. Recently, the MetaSRA project standardized these metadata by annotating each sample with terms from biomedical ontologies.

View Article and Find Full Text PDF

Purpose: The modern researcher is confronted with hundreds of published methods to interpret genetic variants. There are databases of genes and variants, phenotype-genotype relationships, algorithms that score and rank genes, and in silico variant effect prediction tools. Because variant prioritization is a multifactorial problem, a welcome development in the field has been the emergence of decision support frameworks, which make it easier to integrate multiple resources in an interactive environment.

View Article and Find Full Text PDF

: Basic and clinical scientific research at the University of South Florida (USF) have intersected to support a multi-faceted approach around a common focus on rare iron-related diseases. We proposed a modified version of the National Center for Biotechnology Information's (NCBI) Hackathon-model to take full advantage of local expertise in building "Iron Hack", a rare disease-focused hackathon. As the collaborative, problem-solving nature of hackathons tends to attract participants of highly-diverse backgrounds, organizers facilitated a symposium on rare iron-related diseases, specifically porphyrias and Friedreich's ataxia, pitched at general audiences.

View Article and Find Full Text PDF

In March 2019, 45 scientists and software engineers from around the world converged at the University of California, Santa Cruz for the first pangenomics codeathon. The purpose of the meeting was to propose technical specifications and standards for a usable human pangenome as well as to build relevant tools for genome graph infrastructures. During the meeting, the group held several intense and productive discussions covering a diverse set of topics, including advantages of graph genomes over a linear reference representation, design of new methods that can leverage graph-based data structures, and novel visualization and annotation approaches for pangenomes.

View Article and Find Full Text PDF

A wealth of viral data sits untapped in publicly available metagenomic data sets when it might be extracted to create a usable index for the virological research community. We hypothesized that work of this complexity and scale could be done in a hackathon setting. Ten teams comprised of over 40 participants from six countries, assembled to create a crowd-sourced set of analysis and processing pipelines for a complex biological data set in a three-day event on the San Diego State University campus starting 9 January 2019.

View Article and Find Full Text PDF
Article Synopsis
  • Magic-BLAST is a new RNA aligner designed to accurately map RNA sequences to the genome, with a focus on discovering introns from long reads, addressing limitations of existing tools.
  • It employs advanced techniques like optimized spliced alignment scores and selective masking, proving effective in analyzing RNA-seq data from various sequencing platforms (like PacBio and Illumina).
  • In tests, Magic-BLAST outperformed other aligners in intron discovery and mapping of long reads, demonstrating versatility, robustness against mismatches, and efficiency in handling different input formats.
View Article and Find Full Text PDF

Background: During the last decade, plant biotechnological laboratories have sparked a monumental revolution with the rapid development of next sequencing technologies at affordable prices. Soon, these sequencing technologies and assembling of whole genomes will extend beyond the plant computational biologists and become commonplace within the plant biology disciplines. The current availability of large-scale genomic resources for non-traditional plant model systems (the so-called 'orphan crops') is enabling the construction of high-density integrated physical and genetic linkage maps with potential applications in plant breeding.

View Article and Find Full Text PDF

Background: Anthozoa, Endocnidozoa, and Medusozoa are the 3 major clades of Cnidaria. Medusozoa is further divided into 4 clades, Hydrozoa, Staurozoa, Cubozoa, and Scyphozoa-the latter 3 lineages make up the clade Acraspeda. Acraspeda encompasses extraordinary diversity in terms of life history, numerous nuisance species, taxa with complex eyes rivaling other animals, and some of the most venomous organisms on the planet.

View Article and Find Full Text PDF

Genome graphs are emerging as an important novel approach to the analysis of high-throughput human sequencing data. By explicitly representing genetic variants and alternative haplotypes in a mappable data structure, they can enable the improved analysis of structurally variable and hyperpolymorphic regions of the genome. In most existing approaches, graphs are constructed from variant call sets derived from short-read sequencing.

View Article and Find Full Text PDF

Quantification of gene expression and characterization of gene transcript structures are central problems in molecular biology. RNA sequencing (RNA-Seq) and chromatin immunoprecipitation sequencing (ChIP-Seq) are important methods, but can be cumbersome and difficult for beginners to learn. To teach interested students and scientists how to analyze RNA-Seq and ChIP-Seq data, we present a start-to-finish tutorial for analyzing RNA-Seq and ChIP-Seq data: SeqAcademy ( https://github.

View Article and Find Full Text PDF

Ever return from a meeting feeling elated by all those exciting talks, yet unsure how all those presented glamorous and/or exciting tools can be useful in your research?  Or do you have a great piece of software you want to share, yet only a handful of people visited your poster? We have all been there, and that is why we organized the Matchmaking for Computational and Experimental Biologists Session at the latest ISCB/GLBIO'2017 meeting in Chicago (May 15-17, 2017). The session exemplifies a novel approach, mimicking "matchmaking", to encouraging communication, making connections and fostering collaborations between computational and non-computational biologists. More specifically, the session facilitates face-to-face communication between researchers with similar or differing research interests, which we feel are critical for promoting productive discussions and collaborations.

View Article and Find Full Text PDF

Biomedical text mining promises to assist biologists in quickly navigating the combined knowledge in their domain. This would allow improved understanding of the complex interactions within biological systems and faster hypothesis generation. New biomedical research articles are published daily and text mining tools are only as good as the corpus from which they work.

View Article and Find Full Text PDF

A PHP Error was encountered

Severity: Warning

Message: fopen(/var/lib/php/sessions/ci_sessiononschf4g0bsun0snb1cdbspp5fgheena): Failed to open stream: No space left on device

Filename: drivers/Session_files_driver.php

Line Number: 177

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once

A PHP Error was encountered

Severity: Warning

Message: session_start(): Failed to read session data: user (path: /var/lib/php/sessions)

Filename: Session/Session.php

Line Number: 137

Backtrace:

File: /var/www/html/index.php
Line: 316
Function: require_once