Objective: Several studies have shown how sets of single-nucleotide polymorphisms (SNPs) can help to classify subjects on the basis of their continental origins, with applications to case-control studies and population genetics. However, most of these studies use dimensionality-reduction methods, such as principal component analysis, or clustering methods that result in unipartite (either subjects or SNPs) representations of the data. Such analyses conceal important bipartite relationships, such as how subject and SNP clusters relate to each other, and the genotypes that determine their cluster memberships.

Methods: To overcome the limitations of current methods of analyzing SNP data, the authors used three bipartite analytical representations (bipartite network, heat map with dendrograms, and Circos ideogram) that enable the simultaneous visualization and analysis of subjects, SNPs, and subject attributes.

Results: The results demonstrate (1) novel insights into SNP data that are difficult to derive from purely unipartite views of the data, (2) the strengths and limitations of each method, revealing the role that each play in revealing novel insights, and (3) implications for how the methods can be used for the analysis of SNPs in genomic studies associated with disease.

Conclusion: The results suggest that bipartite representations can reveal new patterns in SNP data compared with existing unipartite representations. However, the novel insights require multiple representations to discover, verify, and comprehend the complex relationships. The results therefore motivate the need for a complementary visual analytical framework that guides the use of multiple bipartite representations to analyze complex relationships in SNP data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3392853PMC
http://dx.doi.org/10.1136/amiajnl-2011-000745DOI Listing

Publication Analysis

Top Keywords

snp data
16
novel insights
12
visual analytical
8
analytical representations
8
analysis snps
8
subjects snps
8
bipartite representations
8
complex relationships
8
representations
7
bipartite
6

Similar Publications

This study aimed to identify splicing quantitative trait loci (cis-sQTL) in Nelore cattle muscle tissue and explore the involvement of spliced genes (sGenes) in immune system-related biological processes. Genotypic data from 80 intact male Nelore cattle were obtained using SNP-Chip technology, while RNA-Seq analysis was performed to measure gene expression levels, enabling the integration of genomic and transcriptomic datasets. The normalized expression levels of spliced transcripts were associated with single nucleotide polymorphisms (SNPs) through an analysis of variance using an additive linear model with the MatrixEQTL package.

View Article and Find Full Text PDF

It has been debated whether endometriosis (EMS) adversely affects oocyte quality, potentially leading to a higher incidence of genetically unbalanced embryos or other egg factors that affect the developmental potential. In this study, we explored the effects of endometriosis on risk of chromosomally aberrant in miscarried products of conception (POC) after assisted reproductive treatment (ART), including fresh and frozen cycles. Miscarried POCs were collected from EMS patients (N = 102) and non-EMS patients (N = 441).

View Article and Find Full Text PDF

Association between ESR1 and COL1A1 gene polymorphisms and skeletal fluorosis in Tibetan, Kazakh, Mongolian and Russian populations, China.

Environ Pollut

January 2025

Center for Endemic Disease Control, Chinese Center for Disease Control and Prevention, Harbin Medical University, Harbin, People's Republic of China; NHC Key Laboratory of Etiology and Epidemiology(Harbin Medical University); Joint Key Laboratory of Endemic Diseases(Harbin Medical University, Guizhou Medical University, Xi'an Jiaotong University); Center for Chronic Disease Prevention and Control, Harbin Medical University, Harbin, People's Republic of China. Electronic address:

Background: Skeletal fluorosis is a chronic metabolic bone disease caused by excessive accumulation of fluoride in the bones. Previous studies have found that when the intake of tea fluoride is similar, the prevalence of skeletal fluorosis varies greatly among different ethnic groups, which may be related to different genetic backgrounds. Single nucleotide polymorphisms (SNPs) of estrogen receptor 1 (ESR1) and collagen type 1 α1 (COL1A1) were strongly associated with bone metabolism as well as bone growth and development, but their association with the risk of skeletal fluorosis has not been reported.

View Article and Find Full Text PDF

Introduction: The exponential growth of genomic datasets necessitates advanced analytical tools to effectively identify genetic loci from large-scale high throughput sequencing data. This study presents Deep-Block, a multi-stage deep learning framework that incorporates biological knowledge into its AI architecture to identify genetic regions as significantly associated with Alzheimer's disease (AD). The framework employs a three-stage approach: (1) genome segmentation based on linkage disequilibrium (LD) patterns, (2) selection of relevant LD blocks using sparse attention mechanisms, and (3) application of TabNet and Random Forest algorithms to quantify single nucleotide polymorphism (SNP) feature importance, thereby identifying genetic factors contributing to AD risk.

View Article and Find Full Text PDF

In recent years, black beans (Phaseolus vulgaris L.) have gained popularity in the U.S.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!