Driven by the recent advances of next generation sequencing (NGS) technologies and an urgent need to decode complex human diseases, a multitude of large-scale studies were conducted recently that have resulted in an unprecedented volume of whole transcriptome sequencing (RNA-seq) data, such as the Genotype Tissue Expression project (GTEx) and The Cancer Genome Atlas (TCGA). While these data offer new opportunities to identify the mechanisms underlying disease, the comparison of data from different sources remains challenging, due to differences in sample and data processing. Here, we developed a pipeline that processes and unifies RNA-seq data from different studies, which includes uniform realignment, gene expression quantification, and batch effect removal. We find that uniform alignment and quantification is not sufficient when combining RNA-seq data from different sources and that the removal of other batch effects is essential to facilitate data comparison. We have processed data from GTEx and TCGA and successfully corrected for study-specific biases, enabling comparative analysis between TCGA and GTEx. The normalized datasets are available for download on figshare.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5903355PMC
http://dx.doi.org/10.1038/sdata.2018.61DOI Listing

Publication Analysis

Top Keywords

data sources
12
rna-seq data
12
data
9
unifying cancer
4
cancer normal
4
normal rna
4
rna sequencing
4
sequencing data
4
sources driven
4
driven advances
4

Similar Publications

Background: In 2024, the Korean Ministry of Health and Welfare enforced a policy to increase the number of medical school students by 2,000 over the next 5 years, despite opposition from doctors. This study aims to predict the trend of excess or shortage of medical personnel in Korea due to the policy of increasing the number of medical school students by 2035.

Methods: Data from multiple sources, including the Ministry of Health and Welfare, National Health Insurance Corporation, and the Korean Medical Association, were used to estimate supply and demand.

View Article and Find Full Text PDF

Background: It was our impression that safety outcome trials were getting more frequent, raising ethical issues mainly related to patient autonomy. We and others had also proposed this autonomy would be best served if wording of the informed consents would be in the public domain.

Methods: Initially two observers and an arbiter tabulated the main aims of randomized controlled trials (RCTs) published in 1990-1991 vs.

View Article and Find Full Text PDF

Background And Objectives: Gingivitis and periodontitis are common periodontal diseases that can significantly harm overall oral health, affecting the teeth and their supporting tissues, along with the surrounding anatomical structures, and if left untreated, leading to the total destruction of the alveolar bone and the connective tissues, tooth loss, and other more serious systemic health issues. Numerous studies have shown that propolis can help reduce gum inflammation, inhibit the growth of pathogenic bacteria, and promote tissue regeneration, but with varying degrees of success reported. For this reason, this comprehensive systematic review aims at finding out the truth concerning the efficacy of propolis mouthwashes in treating gingivitis and periodontitis, as its main objective.

View Article and Find Full Text PDF

Turning to critical illness is a common stage of various diseases and injuries before death. Patients usually have complex health conditions, while the treatment process involves a wide range of content, along with high requirements for doctor's professionalism and multi-specialty teamwork, as well as a great demand for time-sensitive treatments. However, this is not matched with critical care professionals and the current state of medical care in China.

View Article and Find Full Text PDF

Global insight into rare disease and orphan drug definitions: a systematic literature review.

BMJ Open

January 2025

Centre for Public Health, Institute of Clinical Sciences B, Royal Victoria Hospital, Queen's University Belfast School of Medicine, Dentistry and Biomedical Sciences, Belfast, UK.

Objectives: This study sheds light on the available global definitions, classifications, and criteria used for rare diseases (RDs), ultrarare diseases (URDs), orphan drugs (ODs) and ultraorphan drugs (UODs) and provides insights into the rationale behind these definitions.

Design: A systematic literature review was conducted to identify existing definitions and the criteria used to define RDs, ODs and their subtypes.

Data Sources: Searches were performed in the PubMed/Medline, Embase, Scopus and Web of Science (Science and Social Sciences Citation Index) databases covering articles published from 1985 to 2021.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!