Data integration in genetics and genomics: methods and challenges.

Hum Genomics Proteomics

Biostatistics Methodology Unit, The Hospital for Sick Children Research Institute, 555 University Avenue, Toronto, ON, Canada M5G 1X8.

Published: January 2009

Due to rapid technological advances, various types of genomic and proteomic data with different sizes, formats, and structures have become available. Among them are gene expression, single nucleotide polymorphism, copy number variation, and protein-protein/gene-gene interactions. Each of these distinct data types provides a different, partly independent and complementary, view of the whole genome. However, understanding functions of genes, proteins, and other aspects of the genome requires more information than provided by each of the datasets. Integrating data from different sources is, therefore, an important part of current research in genomics and proteomics. Data integration also plays important roles in combining clinical, environmental, and demographic data with high-throughput genomic data. Nevertheless, the concept of data integration is not well defined in the literature and it may mean different things to different researchers. In this paper, we first propose a conceptual framework for integrating genetic, genomic, and proteomic data. The framework captures fundamental aspects of data integration and is developed taking the key steps in genetic, genomic, and proteomic data fusion. Secondly, we provide a review of some of the most commonly used current methods and approaches for combining genomic data with focus on the statistical aspects.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2950414PMC
http://dx.doi.org/10.4061/2009/869093DOI Listing

Publication Analysis

Top Keywords

data integration
16
data
12
genomic proteomic
12
proteomic data
12
genomic data
8
genetic genomic
8
genomic
5
integration genetics
4
genetics genomics
4
genomics methods
4

Similar Publications

Introduction: The Society for Pediatric Anesthesia Quality and Safety Committee developed the Pediatric Regional Anesthesia Time-Out Checklist, consisting of 14 safety items intended to be reviewed by an anesthesia team prior to a regional anesthetic. Primarily, we hypothesized that use of this Checklist would increase the number of safety items performed compared with no checklist, evaluating the usefulness of this tool. Secondarily, we hypothesized that, after checklist training, subjects would show better clinical judgment by electing to perform a regional anesthetic in scenarios in which no programmed error existed and electing to not perform a regional anesthetic in scenarios in which a programmed error did exist.

View Article and Find Full Text PDF

Caution when using network partners for target identification in drug discovery.

HGG Adv

January 2025

Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, Québec, Canada; Department of Human Genetics, McGill University, Montréal, Québec, Canada; 5 Prime Sciences Inc, Montréal, Quebec, Canada; Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, QC, Canada; Department of Medicine, McGill University, Montréal, Québec, Canada; Department of Twin Research, King's College London, London, UK. Electronic address:

Identifying novel, high-yield drug targets is challenging and often results in a high failure rate. However, recent data indicates that leveraging human genetic evidence to identify and validate these targets significantly increases the likelihood of success in drug development. Two recent papers from Open Targets claimed that around half of FDA-approved drugs had targets with direct human genetic evidence.

View Article and Find Full Text PDF

Aim: To explore the meaning of adaptation after visceral transplantation in terms of patient experiences, symptoms, self-efficacy, transplant-specific and mental well-being.

Design: A convergent parallel mixed-methods study, consisting of interviews and generic as well as transplant-specific questionnaires. Results were integrated using meta-inference.

View Article and Find Full Text PDF

Background: Despite the high acuity of coronary care unit (CCU) patients and their risk of deterioration, little is known about how nurses assess them.

Aim: Increase understanding of the scope of nurses' assessments of deteriorating CCU patients.

Design: Online mixed methods survey.

View Article and Find Full Text PDF

This study investigates the relationship between SARS-CoV-2 RT-PCR cycle threshold (Ct) values and key COVID-19 transmission and outcome metrics across five years of the pandemic in Jalisco, Mexico. Utilizing a comprehensive time-series analysis, we evaluated weekly median Ct values as proxies for viral load and their temporal associations with positivity rates, reproduction numbers (Rt), hospitalizations, and mortality. Cross-correlation and lagged regression analyses revealed significant lead-lag relationships, with declining Ct values consistently preceding surges in positivity rates and hospitalizations, particularly during the early phases of the pandemic.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!