Microarray Data Processing Techniques for Genome-Scale Network Inference from Large Public Repositories.

Microarrays (Basel)

School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA.

Published: September 2016

Pre-processing of microarray data is a well-studied problem. Furthermore, all popular platforms come with their own recommended best practices for differential analysis of genes. However, for genome-scale network inference using microarray data collected from large public repositories, these methods filter out a considerable number of genes. This is primarily due to the effects of aggregating a diverse array of experiments with different technical and biological scenarios. Here we introduce a pre-processing pipeline suitable for inferring genome-scale gene networks from large microarray datasets. We show that partitioning of the available microarray datasets according to biological relevance into tissue- and process-specific categories significantly extends the limits of downstream network construction. We demonstrate the effectiveness of our pre-processing pipeline by inferring genome-scale networks for the model plant Arabidopsis thaliana using two different construction methods and a collection of 11,760 Affymetrix ATH1 microarray chips. Our pre-processing pipeline and the datasets used in this paper are made available at http://alurulab.cc.gatech.edu/microarray-pp.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5040970PMC
http://dx.doi.org/10.3390/microarrays5030023DOI Listing

Publication Analysis

Top Keywords

microarray data
12
pre-processing pipeline
12
genome-scale network
8
network inference
8
large public
8
public repositories
8
inferring genome-scale
8
microarray datasets
8
microarray
6
data processing
4

Similar Publications

Background: The accurate staging of multiple myeloma (MM) is essential for optimizing treatment strategies, while predicting the progression of asymptomatic patients, also referred to as monoclonal gammopathy of undetermined significance (MGUS), to symptomatic MM remains a significant challenge due to limited data. This study aimed to develop machine learning models to enhance MM staging accuracy and stratify asymptomatic patients by their risk of progression.

Methods: We utilized gene expression microarray datasets to develop machine learning models, combined with various data transformations.

View Article and Find Full Text PDF

Background: Argonautes (AGOs) are a type of protein that degrade specific messenger RNAs, consequently reducing the expression of a specific gene. These proteins consist of small, single-stranded RNA or DNA and may provide a route for detecting and silencing complementary mobile genetic elements. In this research, we investigated which AGO(s) were involved in Kawasaki disease (KD).

View Article and Find Full Text PDF

Objective: Distinguishing grade 3 pancreatic neuroendocrine tumors (PanNETs) from neuroendocrine carcinomas (PanNECs) is sometimes challenging. Recently, a diffuse p16-positive pattern was reported in PanNECs but not in grade 3 PanNETs, suggesting that p16 could help differentiate these entities. This study aimed to investigate p16 expression in PanNETs of various grades and its association with clinicopathologic features.

View Article and Find Full Text PDF

Revealing the role of natural killer cells in ankylosing spondylitis: identifying diagnostic biomarkers and therapeutic targets.

Ann Med

December 2025

Department of Rheumatology and Immunology, The Seventh Affiliated Hospital, Sun Yat-sen University, Shenzhen, Guangdong Province, People's Republic of China.

Background: Ankylosing spondylitis (AS) is a chronic autoimmune disease that primarily affects the axial joints. Immune cells play a key role in the pathogenesis of AS. This study integrated bioinformatics methods with experimental validation to explore the role of natural killer (NK) cells in AS.

View Article and Find Full Text PDF

Background: Thymidine kinases (TKs) are key enzymes involved in DNA synthesis and repair, with alterations in their expression associated with various cancers. Thymidine kinase 1 (TK1) and TK2 are cytosolic enzyme proteins that catalyze the addition of a gamma-phosphate group to thymidine. The existing literature on TK1 in cervical squamous cell carcinoma (CESC) fails to address the clinical role of TK1 overexpression and its possible molecular mechanism in CESC.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!