Publications by authors named "Battle A"

Motivation: Genome-wide association studies (GWAS) have identified genetic variants, usually single-nucleotide polymorphisms (SNPs), associated with human traits, including disease and disease risk. These variants (or causal variants in linkage disequilibrium with them) usually affect the regulation or function of a nearby gene. A GWAS locus can span many genes, however, and prioritizing which gene or genes in a locus are most likely to be causal remains a challenge.

View Article and Find Full Text PDF

Suboptimal gestational weight gain (GWG) is associated with pregnancy complications and postpartum weight retention (PPWR). Little data exists about GWG and PPWR attitudes and beliefs in low-and-middle-income countries (LMICs) to inform interventions. We examined GWG and PPWR attitudes, beliefs, and intentions among pregnant people, with and without HIV, in Cape Town, South Africa.

View Article and Find Full Text PDF
Article Synopsis
  • Genetic variation linked to complex traits is highly pleiotropic, meaning it affects multiple traits, which can be better understood through multi-phenotype analyses to identify shared and specific genetic factors.
  • Traditional matrix factorization (MF) methods struggle with issues like sample-sharing confounding and often yield factors too broad to map onto biological pathways, prompting a need for improvement.
  • The newly introduced method GLEANR effectively addresses these challenges by detecting sparse genetic factors from GWAS summary statistics, improves the replication of genetic factors across different studies, and offers clearer interpretations aligned with diseases and biological processes, as demonstrated through its evaluation of the UK Biobank.
View Article and Find Full Text PDF

Identifying the molecular effects of human genetic variation across cellular contexts is crucial for understanding the mechanisms underlying disease-associated loci, yet many cell types and developmental stages remain underexplored. Here, we harnessed the potential of heterogeneous differentiating cultures (HDCs), an in vitro system in which pluripotent cells asynchronously differentiate into a broad spectrum of cell types. We generated HDCs for 53 human donors and collected single-cell RNA sequencing data from over 900,000 cells.

View Article and Find Full Text PDF

The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation has been challenging due to small effects, but experimental perturbations offer a complementary approach.

View Article and Find Full Text PDF

Background: Understanding the genetic causes for variability in chromatin accessibility can shed light on the molecular mechanisms through which genetic variants may affect complex traits. Thousands of ATAC-seq samples have been collected that hold information about chromatin accessibility across diverse cell types and contexts, but most of these are not paired with genetic information and come from diverse distinct projects and laboratories.

Results: We report here joint genotyping, chromatin accessibility peak calling, and discovery of quantitative trait loci which influence chromatin accessibility (caQTLs), demonstrating the capability of performing caQTL analysis on a large scale in a diverse sample set without pre-existing genotype information.

View Article and Find Full Text PDF
Article Synopsis
  • Clonal hematopoiesis (CH) occurs when genetically identical blood cells expand, often influenced by genetic mutations linked to blood cancers; however, many cases happen without known driver mutations.
  • Researchers analyzed 51,399 genomes to study a specific type of CH (CH-LPMneg) without detectable leukemia-related mutations, developing a new method (GEM rate) to estimate mutation burden without paired samples.
  • Through their study, they identified seven genes linked to CH-LPMneg and found that alterations in hematopoietic stem cell (HSC) behavior may drive this mutation burden, while a broader analysis revealed relationships between GEM and the expression of 404 genes.
View Article and Find Full Text PDF

Inference of causal transcriptional regulatory networks (TRNs) from transcriptomic data suffers notoriously from false positives. Approaches to control the false discovery rate (FDR), for example, via permutation, bootstrapping, or multivariate Gaussian distributions, suffer from several complications: difficulty in distinguishing direct from indirect regulation, nonlinear effects, and causal structure inference requiring "causal sufficiency," meaning experiments that are free of any unmeasured, confounding variables. Here, we use a recently developed statistical framework, model-X knockoffs, to control the FDR while accounting for indirect effects, nonlinear dose-response, and user-provided covariates.

View Article and Find Full Text PDF

Genome-wide association studies (GWAS) have found widespread evidence of pleiotropy, but characterization of global patterns of pleiotropy remain highly incomplete due to insufficient power of current approaches. We develop fastASSET, a method that allows efficient detection of variant-level pleiotropic association across many traits. We analyze GWAS summary statistics of 116 complex traits of diverse types collected from the GRASP repository and large GWAS Consortia.

View Article and Find Full Text PDF

Allele-specific expression (ASE) is a powerful signal that can be used to investigate multiple molecular mechanisms, such as cis-regulatory effects and imprinting. Single-cell RNA-sequencing (scRNA-seq) enables ASE characterization at the resolution of individual cells. In this review, we highlight the computational methods for processing and analyzing single-cell ASE data.

View Article and Find Full Text PDF

Genetic variation that influences gene expression and splicing is a key source of phenotypic diversity. Although invaluable, studies investigating these links in humans have been strongly biased towards participants of European ancestries, which constrains generalizability and hinders evolutionary research. Here to address these limitations, we developed MAGE, an open-access RNA sequencing dataset of lymphoblastoid cell lines from 731 individuals from the 1000 Genomes Project, spread across 5 continental groups and 26 populations.

View Article and Find Full Text PDF

Large Language Models (LLMs) stand on the brink of reshaping the field of aging and dementia care, challenging the one-size-fits-all paradigm with their capacity for precision medicine and individualized treatment strategies. The "Large Pre-Trained Models with a Focus on AD/ADRD and Healthy Aging" symposium, organized by the National Institute on Aging and the Johns Hopkins Artificial Intelligence & Technology Collaboratory for Aging Research, served as a platform for exploring this potential. The symposium brought together diverse experts to discuss the integration of LLMs in aging and dementia care.

View Article and Find Full Text PDF
Article Synopsis
  • Genome-wide association studies (GWAS) have successfully identified genes linked to telomere length, but previous research hadn't validated these findings until now.
  • In a large analysis involving over 211,000 people, the study discovered five new signals linked to telomere length and highlighted the importance of blood/immune cells in this area.
  • The researchers confirmed that the genes KBTBD6 and POP5 truly affect telomere length by demonstrating that manipulating these genes can lengthen telomeres and that their regulation is crucial for understanding telomere biology.
View Article and Find Full Text PDF
Article Synopsis
  • The study focuses on the molecular organization of the human neocortex, particularly the dorsolateral prefrontal cortex, using advanced spatial transcriptomic technologies.
  • Researchers created a detailed neuroanatomical atlas that highlights different spatial domains based on gene expression patterns, moving beyond traditional histological layers.
  • By integrating data from various sources, the team identified specific cell types and interactions linked to neuropsychiatric disorders, showing how these relate to spatial domains in the brain.
View Article and Find Full Text PDF

Identifying the molecular effects of human genetic variation across cellular contexts is crucial for understanding the mechanisms underlying disease-associated loci, yet many cell-types and developmental stages remain underexplored. Here we harnessed the potential of heterogeneous differentiating cultures ( ), an system in which pluripotent cells asynchronously differentiate into a broad spectrum of cell-types. We generated HDCs for 53 human donors and collected single-cell RNA-sequencing data from over 900,000 cells.

View Article and Find Full Text PDF

Background: Survivors of adolescent and young adult (AYA) cancer experience significant psychological distress and encounter barriers to accessing mental health care. Few studies have investigated racial/ethnic disparities in psychological health outcomes among AYA survivors, and none have compared outcomes within a racially minoritized population.

Methods: National Health Interview Survey data (2010-2018) were analyzed that identified non-Hispanic Black (hereafter, Black) survivors of AYA cancer and age- and sex-matched Black noncancer controls.

View Article and Find Full Text PDF

The Stingray sensor system is a 15-camera optical array dedicated to the nightly astrometric and photometric survey of the geosynchronous Earth orbit (GEO) belt visible above Tucson, Arizona. The primary scientific goal is to characterize GEO and near-GEO satellites based on their observable properties. This system is completely autonomous in both data acquisition and processing, with human oversight reserved for data quality assurance and system maintenance.

View Article and Find Full Text PDF

Rare structural variants (SVs) - insertions, deletions, and complex rearrangements - can cause Mendelian disease, yet they remain difficult to accurately detect and interpret. We sequenced and analyzed Oxford Nanopore long-read genomes of 68 individuals from the Undiagnosed Disease Network (UDN) with no previously identified diagnostic mutations from short-read sequencing. Using our optimized SV detection pipelines and 571 control long-read genomes, we detected 716 long-read rare (MAF < 0.

View Article and Find Full Text PDF

Multi-view datasets are becoming increasingly prevalent. These datasets consist of different modalities that provide complementary characterizations of the same underlying system. They can include heterogeneous types of information with complex relationships, varying degrees of missingness, and assorted sample sizes, as is often the case in multi-omic biological studies.

View Article and Find Full Text PDF

Background: Gene co-expression networks (GCNs) describe relationships among expressed genes key to maintaining cellular identity and homeostasis. However, the small sample size of typical RNA-seq experiments which is several orders of magnitude fewer than the number of genes is too low to infer GCNs reliably. , a publicly available dataset comprised of 316,443 uniformly processed human RNA-seq samples, provides an opportunity to improve power for accurate network reconstruction and obtain biological insight from the resulting networks.

View Article and Find Full Text PDF

This perspective outlines the Artificial Intelligence and Technology Collaboratories (AITC) at Johns Hopkins University, University of Pennsylvania, and University of Massachusetts, highlighting their roles in developing AI-based technologies for older adult care, particularly targeting Alzheimer's disease (AD). These National Institute on Aging (NIA) centers foster collaboration among clinicians, gerontologists, ethicists, business professionals, and engineers to create AI solutions. Key activities include identifying technology needs, stakeholder engagement, training, mentoring, data integration, and navigating ethical challenges.

View Article and Find Full Text PDF
Article Synopsis
  • Genetic regulation of gene expression varies based on different cell types and environmental contexts, making it a complex process.* -
  • SURGE is a new method developed to discover context-specific expression quantitative trait loci (eQTLs) from single-cell transcriptomic data without needing prior information.* -
  • When applied to peripheral blood data, SURGE effectively identifies specific cell types and their relationships, demonstrating its relevance to diseases through advanced analysis methods.*
View Article and Find Full Text PDF

Genetic variation influencing gene expression and splicing is a key source of phenotypic diversity. Though invaluable, studies investigating these links in humans have been strongly biased toward participants of European ancestries, diminishing generalizability and hindering evolutionary research. To address these limitations, we developed MAGE, an open-access RNA-seq data set of lymphoblastoid cell lines from 731 individuals from the 1000 Genomes Project spread across 5 continental groups and 26 populations.

View Article and Find Full Text PDF