Molecular subtypes, such as defined by The Cancer Genome Atlas (TCGA), delineate a cancer's underlying biology, bringing hope to inform a patient's prognosis and treatment plan. However, most approaches used in the discovery of subtypes are not suitable for assigning subtype labels to new cancer specimens from other studies or clinical trials. Here, we address this barrier by applying five different machine learning approaches to multi-omic data from 8,791 TCGA tumor samples comprising 106 subtypes from 26 different cancer cohorts to build models based upon small numbers of features that can classify new samples into previously defined TCGA molecular subtypes-a step toward molecular subtype application in the clinic.
View Article and Find Full Text PDFThe National Institutes of Health-US Food and Drug Administration Joint Leadership Council Next-Generation Sequencing and Radiomics Working Group was formed by the National Institutes of Health-Food and Drug Administration Joint Leadership Council to promote the development and validation of innovative next-generation sequencing tests, radiomic tools, and associated data analysis and interpretation enhanced by artificial intelligence and machine learning technologies. A 2-day workshop was held on September 29-30, 2021, to convene members of the scientific community to discuss how to overcome the "ground truth" gap that has frequently been acknowledged as 1 of the limiting factors impeding high-quality research, development, validation, and regulatory science in these fields. This report provides a summary of the resource gaps identified by the working group and attendees, highlights existing resources and the ways they can potentially be employed to accelerate growth in these fields, and presents opportunities to support next-generation sequencing and radiomic tool development and validation using technologies such as artificial intelligence and machine learning.
View Article and Find Full Text PDFData-driven basic, translational, and clinical research has resulted in improved outcomes for children, adolescents, and young adults (AYAs) with pediatric cancers. However, challenges in sharing data between institutions, particularly in research, prevent addressing substantial unmet needs in children and AYA patients diagnosed with certain pediatric cancers. Systematically collecting and sharing data from every child and AYA can enable greater understanding of pediatric cancers, improve survivorship, and accelerate development of new and more effective therapies.
View Article and Find Full Text PDFDifferential mRNA expression between ancestry groups can be explained by both genetic and environmental factors. We outline a computational workflow to determine the extent to which germline genetic variation explains cancer-specific molecular differences across ancestry groups. Using multi-omics datasets from The Cancer Genome Atlas (TCGA), we enumerate ancestry-informative markers colocalized with cancer-type-specific expression quantitative trait loci (e-QTLs) at ancestry-associated genes.
View Article and Find Full Text PDFFully automated machine learning (AutoML) for predictive modeling is becoming a reality, giving rise to a whole new field. We present the basic ideas and principles of Just Add Data Bio (JADBio), an AutoML platform applicable to the low-sample, high-dimensional omics data that arise in translational medicine and bioinformatics applications. In addition to predictive and diagnostic models ready for clinical use, JADBio focuses on knowledge discovery by performing feature selection and identifying the corresponding biosignatures, i.
View Article and Find Full Text PDFCellular and molecular aberrations contribute to the disparity of human cancer incidence and etiology between ancestry groups. Multiomics profiling in The Cancer Genome Atlas (TCGA) allows for querying of the molecular underpinnings of ancestry-specific discrepancies in human cancer. Here, we provide a protocol for integrative associative analysis of ancestry with molecular correlates, including somatic mutations, DNA methylation, mRNA transcription, miRNA transcription, and pathway activity, using TCGA data.
View Article and Find Full Text PDFWhen it comes to precision oncology, proteogenomics may provide better prospects to the clinical characterization of tumors, help make a more accurate diagnosis of cancer, and improve treatment for patients with cancer. This perspective describes the significant contributions of The Cancer Genome Atlas and the Clinical Proteomic Tumor Analysis Consortium to precision oncology and makes the case that proteogenomics needs to be fully integrated into clinical trials and patient care in order for precision oncology to deliver the right cancer treatment to the right patient at the right dose and at the right time.
View Article and Find Full Text PDFA small fraction of cancer patients with advanced disease survive significantly longer than patients with clinically comparable tumors. Molecular mechanisms for exceptional responses to therapy have been identified by genomic analysis of tumor biopsies from individual patients. Here, we analyzed tumor biopsies from an unbiased cohort of 111 exceptional responder patients using multiple platforms to profile genetic and epigenetic aberrations as well as the tumor microenvironment.
View Article and Find Full Text PDFGaps in the translation of research findings to clinical management have been recognized for decades. They exist for the diagnosis as well as the management of cancer. The international standards for cancer diagnosis are contained within the World Health Organization (WHO) Classification of Tumours, published by the International Agency for Research on Cancer (IARC) and known worldwide as the WHO Blue Books.
View Article and Find Full Text PDFWe evaluated ancestry effects on mutation rates, DNA methylation, and mRNA and miRNA expression among 10,678 patients across 33 cancer types from The Cancer Genome Atlas. We demonstrated that cancer subtypes and ancestry-related technical artifacts are important confounders that have been insufficiently accounted for. Once accounted for, ancestry-associated differences spanned all molecular features and hundreds of genes.
View Article and Find Full Text PDFBackground: Tumor molecular profiling from patients experiencing exceptional responses to systemic therapy may provide insights into cancer biology and improve treatment tailoring. This pilot study evaluates the feasibility of identifying exceptional responders retrospectively, obtaining pre-exceptional response treatment tumor tissues, and analyzing them with state-of-the-art molecular analysis tools to identify potential molecular explanations for responses.
Methods: Exceptional response was defined as partial (PR) or complete (CR) response to a systemic treatment with population PR or CR rate less than 10% or an unusually long response (eg, duration >3 times published median).
We present the genome-wide chromatin accessibility profiles of 410 tumor samples spanning 23 cancer types from The Cancer Genome Atlas (TCGA). We identify 562,709 transposase-accessible DNA elements that substantially extend the compendium of known cis-regulatory elements. Integration of ATAC-seq (the assay for transposase-accessible chromatin using sequencing) with TCGA multi-omic data identifies a large number of putative distal enhancers that distinguish molecular subtypes of cancers, uncovers specific driving transcription factors via protein-DNA footprints, and nominates long-range gene-regulatory interactions in cancer.
View Article and Find Full Text PDFMalignant pleural mesothelioma (MPM) is a highly lethal cancer of the lining of the chest cavity. To expand our understanding of MPM, we conducted a comprehensive integrated genomic study, including the most detailed analysis of alterations to date. We identified histology-independent molecular prognostic subsets, and defined a novel genomic subtype with and mutations and extensive loss of heterozygosity.
View Article and Find Full Text PDFGlioma diagnosis is based on histomorphology and grading; however, such classification does not have predictive clinical outcome after glioblastomas have developed. To date, no bona fide biomarkers that significantly translate into a survival benefit to glioblastoma patients have been identified. We previously reported that the IDH mutant G-CIMP-high subtype would be a predecessor to the G-CIMP-low subtype.
View Article and Find Full Text PDFThis SnapShot provides a list of the tumor types characterized by The Cancer Genome Atlas (TCGA) program. Key findings shown are the most relevant discoveries described in each marker paper for the tumor type.
View Article and Find Full Text PDFThe Cancer Genome Atlas (TCGA) has catalyzed systematic characterization of diverse genomic alterations underlying human cancers. At this historic junction marking the completion of genomic characterization of over 11,000 tumors from 33 cancer types, we present our current understanding of the molecular processes governing oncogenesis. We illustrate our insights into cancer through synthesis of the findings of the TCGA PanCancer Atlas project on three facets of oncogenesis: (1) somatic driver mutations, germline pathogenic variants, and their interactions in the tumor; (2) the influence of the tumor genome and epigenome on transcriptome and proteome; and (3) the relationship between tumor and the microenvironment, including implications for drugs targeting driver events and immunotherapies.
View Article and Find Full Text PDFThe Cancer Genome Atlas (TCGA) team now presents the Pan-Cancer Atlas, investigating different aspects of cancer biology by analyzing the data generated during the 10+ years of the TCGA project.
View Article and Find Full Text PDFThymic epithelial tumors (TETs) are one of the rarest adult malignancies. Among TETs, thymoma is the most predominant, characterized by a unique association with autoimmune diseases, followed by thymic carcinoma, which is less common but more clinically aggressive. Using multi-platform omics analyses on 117 TETs, we define four subtypes of these tumors defined by genomic hallmarks and an association with survival and World Health Organization histological subtype.
View Article and Find Full Text PDF