Background: Genomic analysis will greatly benefit from considering in a global way various sources of molecular data with the related biological knowledge. It is thus of great importance to provide useful integrative approaches dedicated to ease the interpretation of microarray data.

Results: Here, we introduce a data-mining approach, Multiple Factor Analysis (MFA), to combine multiple data sets and to add formalized knowledge. MFA is used to jointly analyse the structure emerging from genomic and transcriptomic data sets. The common structures are underlined and graphical outputs are provided such that biological meaning becomes easily retrievable. Gene Ontology terms are used to build gene modules that are superimposed on the experimentally interpreted plots. Functional interpretations are then supported by a step-by-step sequence of graphical representations.

Conclusion: When applied to genomic and transcriptomic data and associated Gene Ontology annotations, our method prioritize the biological processes linked to the experimental settings. Furthermore, it reduces the time and effort to analyze large amounts of 'Omics' data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2636827PMC
http://dx.doi.org/10.1186/1471-2164-10-32DOI Listing

Publication Analysis

Top Keywords

data sets
12
biological knowledge
8
multiple factor
8
factor analysis
8
genomic transcriptomic
8
transcriptomic data
8
gene ontology
8
data
6
simultaneous analysis
4
analysis distinct
4

Similar Publications

Background: This two-stage individual patient data meta-analysis (IPD-MA) compared the efficacy of a shorter duration (≤ 2 days) of vasoactive (VA) drug therapy to standard duration (3-5 days) after acute variceal bleeding (AVB) in patients with liver cirrhosis.

Patients And Methods: Randomized clinical trials on patients with cirrhosis and AVB undergoing endoscopic band ligation which compared a short duration versus the standard duration of VA therapy were included. The primary outcome was 5-day rebleeding rate.

View Article and Find Full Text PDF

Background: The differential impact of serum lipids and their targets for lipid modification on cardiometabolic disease risk is debated. This study used Mendelian randomization to investigate the causal relationships and underlying mechanisms.

Methods: Genetic variants related to lipid profiles and targets for lipid modification were sourced from the Global Lipids Genetics Consortium.

View Article and Find Full Text PDF

Background: Osteoporosis is a common age-related disease with disabling consequences, the early diagnosis of which is difficult due to its long and hidden course, which often leads to diagnosis only after a fracture. In this regard, great expectations are placed on advanced developments in machine learning technologies aimed at predicting osteoporosis at an early stage of development, including the use of large data sets containing information on genetic and clinical predictors of the disease. Nevertheless, the inclusion of DNA markers in prediction models is fraught with a number of difficulties due to the complex polygenic and heterogeneous nature of the disease.

View Article and Find Full Text PDF

Proteins can be rapidly prototyped with cell-free expression (CFE) but in most cases there is a lack of probes or assays to measure their function directly in the cell lysate, thereby limiting the throughput of these screens. Increased throughput is needed to build standardized, sequence to function data sets to feed machine learning guided protein optimization. Herein, we describe the use of fluorescent single-walled carbon nanotubes (SWCNT) as effective probes for measuring protease activity directly in cell-free lysate.

View Article and Find Full Text PDF

Fully Synthetic Data for Complex Surveys.

Surv Methodol

December 2024

Department of Statistical Science, 214a Old Chemistry Building, Duke University, Durham, NC 27708-0251.

When seeking to release public use files for confidential data, statistical agencies can generate fully synthetic data. We propose an approach for making fully synthetic data from surveys collected with complex sampling designs. Our approach adheres to the general strategy proposed by Rubin (1993).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!