A PHP Error was encountered

Severity: Warning

Message: file_get_contents(https://...@pubfacts.com&api_key=b8daa3ad693db53b1410957c26c9a51b4908&a=1): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests

Filename: helpers/my_audit_helper.php

Line Number: 176

Backtrace:

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 176
Function: file_get_contents

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 250
Function: simplexml_load_file_from_url

File: /var/www/html/application/helpers/my_audit_helper.php
Line: 3122
Function: getPubMedXML

File: /var/www/html/application/controllers/Detail.php
Line: 575
Function: pubMedSearch_Global

File: /var/www/html/application/controllers/Detail.php
Line: 489
Function: pubMedGetRelatedKeyword

File: /var/www/html/index.php
Line: 316
Function: require_once

Normalization of gene counts affects principal components-based exploratory analysis of RNA-sequencing data. | LitMetric

Normalization of gene counts affects principal components-based exploratory analysis of RNA-sequencing data.

Biochim Biophys Acta Gene Regul Mech

Laboratory of Systems and Synthetic Biology, Wageningen University & Research, the Netherlands. Electronic address:

Published: December 2024

AI Article Synopsis

  • - Normalization of gene expression data is crucial for analyzing RNA-sequencing data, particularly in differential expression studies, but its effects on multivariate analysis, like PCA, are less understood.
  • - This study examines twelve normalization methods on both simulated and real data to see how they influence PCA outcomes, including sample clustering and gene ranking.
  • - Results show that while PCA score plots might look similar across different normalization techniques, the biological insights drawn from these models can vary significantly based on the normalization choice.

Article Abstract

Normalization of gene expression count data is an essential step of in the analysis of RNA-sequencing data. Its statistical analysis has been mostly addressed in the context of differential expression analysis, that is in the univariate setting. However, relationships among genes and samples are better explored and quantified using multivariate exploratory data analysis tools like Principal Component Analysis (PCA). In this study we investigate how normalization impacts the PCA model and its interpretation, considering twelve different widely used normalization methods that were applied on simulated and experimental data. Correlation patterns in the normalized data were explored using both summary statistics and Covariance Simultaneous Component Analysis. The impact of normalization on the PCA solution was assessed by exploring the model complexity, the quality of sample clustering in the low-dimensional PCA space and gene ranking in the model fit to normalized data. PCA models upon normalization were interpreted in the context gene enrichment pathway analysis. We found that although PCA score plots are often similar independently form the normalization used, biological interpretation of the models can depend heavily on the normalization method applied.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.bbagrm.2024.195058DOI Listing

Publication Analysis

Top Keywords

normalization
8
normalization gene
8
analysis
8
analysis rna-sequencing
8
rna-sequencing data
8
component analysis
8
analysis pca
8
normalized data
8
data
7
pca
6

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!