Publications by authors named "Ruhollah Shemirani"

Article Synopsis
  • Biobank-scale studies often overlook Hispanic/Latino(a) and African American populations, which hinders the understanding of disease-related genetic factors in these groups.
  • The analysis of data from a New York City biobank reveals that phenome-wide admixture mapping (AM) identified 77 significant genetic signals, 48 of which were missed by genome-wide association studies (GWAS), indicating that the two methods complement each other.
  • The study shows that AM can uncover novel genetic associations in underrepresented populations, particularly for variants that traditional GWAS might miss.
View Article and Find Full Text PDF
Article Synopsis
  • Peripheral artery disease (PAD) affects about 8 million Americans and shows notable racial and ethnic disparities, with higher prevalence in African Americans and varying rates among Hispanic/Latino groups compared to European Americans.
  • In a study of diverse adults in New York City, researchers found PAD rates of 8.5% in African Americans and 9.4% in Hispanic/Latinos, with Puerto Rican and Dominican populations showing even higher rates.
  • Genetic analysis indicated a specific Native American ancestry tract linked to increased PAD risk, although attempts to confirm these findings in other Hispanic groups were not successful.
View Article and Find Full Text PDF

An individual's disease risk is affected by the populations that they belong to, due to shared genetics and environmental factors. The study of fine-scale populations in clinical care is important for identifying and reducing health disparities and for developing personalized interventions. To assess patterns of clinical diagnoses and healthcare utilization by fine-scale populations, we leveraged genetic data and electronic medical records from 35,968 patients as part of the UCLA ATLAS Community Health Initiative.

View Article and Find Full Text PDF
Article Synopsis
  • - Peripheral artery disease (PAD) affects about 8 million Americans and shows significant racial and ethnic disparities, particularly higher prevalence in African Americans compared to non-Hispanic Europeans.
  • - A study involving diverse participants from the Bio biobank in New York City found PAD prevalence rates of 8.5% in African Americans and 9.4% in Hispanic/Latino individuals, with Puerto Rican and Dominican sub-groups showing even higher rates.
  • - Genetic analysis revealed a specific ancestry tract linked to PAD risk among Dominicans, indicating a potential genetic component that could explain their higher prevalence, especially related to a region on chromosome 2q35 associated with blood vessel health and function.
View Article and Find Full Text PDF

Groups of distantly related individuals who share a short segment of their genome identical-by-descent (IBD) can provide insights about rare traits and diseases in massive biobanks using IBD mapping. Clustering algorithms play an important role in finding these groups accurately and at scale. We set out to analyze the fitness of commonly used, fast and scalable clustering algorithms for IBD mapping applications.

View Article and Find Full Text PDF

The ability to identify segments of genomes identical-by-descent (IBD) is a part of standard workflows in both statistical and population genetics. However, traditional methods for finding local IBD across all pairs of individuals scale poorly leading to a lack of adoption in very large-scale datasets. Here, we present iLASH, an algorithm based on similarity detection techniques that shows equal or improved accuracy in simulations compared to current leading methods and speeds up analysis by several orders of magnitude on genomic datasets, making IBD estimation tractable for millions of individuals.

View Article and Find Full Text PDF
Article Synopsis
  • Understanding health disparities is crucial for creating fair precision health initiatives, as traditional race and ethnicity definitions may not accurately reflect disease burdens in specific communities.
  • The study proposes using electronic health records (EHRs) and genomic data to analyze links between genetic ancestry and health outcomes, identifying 17 communities in NYC with shared genetic backgrounds.
  • Findings reveal significant health outcome variations linked to these communities, highlighting the importance of integrating genomic data with EHRs for better monitoring and prediction of health risks across different populations.
View Article and Find Full Text PDF

Summary: Finding informative predictive features in high-dimensional biological case-control datasets is challenging. The Extreme Pseudo-Sampling (EPS) algorithm offers a solution to the challenge of feature selection via a combination of deep learning and linear regression models. First, using a variational autoencoder, it generates complex latent representations for the samples.

View Article and Find Full Text PDF

Whole transcriptome studies typically yield large amounts of data, with expression values for all genes or transcripts of the genome. The search for genes of interest in a particular study setting can thus be a daunting task, usually relying on automated computational methods. Moreover, most biological questions imply that such a search should be performed in a multivariate setting, to take into account the inter-genes relationships.

View Article and Find Full Text PDF