Publications by authors named "Jean Zenklusen"

Molecular subtypes, such as defined by The Cancer Genome Atlas (TCGA), delineate a cancer's underlying biology, bringing hope to inform a patient's prognosis and treatment plan. However, most approaches used in the discovery of subtypes are not suitable for assigning subtype labels to new cancer specimens from other studies or clinical trials. Here, we address this barrier by applying five different machine learning approaches to multi-omic data from 8,791 TCGA tumor samples comprising 106 subtypes from 26 different cancer cohorts to build models based upon small numbers of features that can classify new samples into previously defined TCGA molecular subtypes-a step toward molecular subtype application in the clinic.

View Article and Find Full Text PDF
Article Synopsis
  • The study examines gene regulatory changes associated with cancer by analyzing chromatin accessibility across eight different tumor types, revealing the influence of copy number alterations on tumor characteristics.
  • Researchers found specific chromatin signatures in cancer that are closely related to healthy cell types, particularly noting similarities between basal-like breast cancer and secretory-type luminal epithelial cells.
  • Advanced neural network models highlighted the significance of noncoding mutations near cancer-associated genes, suggesting that widely dispersed mutations in cancer have important functional roles in gene regulation.
View Article and Find Full Text PDF

The National Institutes of Health-US Food and Drug Administration Joint Leadership Council Next-Generation Sequencing and Radiomics Working Group was formed by the National Institutes of Health-Food and Drug Administration Joint Leadership Council to promote the development and validation of innovative next-generation sequencing tests, radiomic tools, and associated data analysis and interpretation enhanced by artificial intelligence and machine learning technologies. A 2-day workshop was held on September 29-30, 2021, to convene members of the scientific community to discuss how to overcome the "ground truth" gap that has frequently been acknowledged as 1 of the limiting factors impeding high-quality research, development, validation, and regulatory science in these fields. This report provides a summary of the resource gaps identified by the working group and attendees, highlights existing resources and the ways they can potentially be employed to accelerate growth in these fields, and presents opportunities to support next-generation sequencing and radiomic tool development and validation using technologies such as artificial intelligence and machine learning.

View Article and Find Full Text PDF
Article Synopsis
  • The National Cancer Institute (NCI) has developed various data commons since 2014 to support cancer research, focusing on sharing genomic, proteomic, imaging, and clinical data from NCI-funded studies.
  • This review provides an overview of the different data commons, highlighting their specific features, achievements, and associated challenges.
  • It also addresses how these commons adhere to FAIR principles (Findable, Accessible, Interoperable, Reusable) and align with the NIH's new Data Management and Sharing Policy.
View Article and Find Full Text PDF

Data-driven basic, translational, and clinical research has resulted in improved outcomes for children, adolescents, and young adults (AYAs) with pediatric cancers. However, challenges in sharing data between institutions, particularly in research, prevent addressing substantial unmet needs in children and AYA patients diagnosed with certain pediatric cancers. Systematically collecting and sharing data from every child and AYA can enable greater understanding of pediatric cancers, improve survivorship, and accelerate development of new and more effective therapies.

View Article and Find Full Text PDF
Article Synopsis
  • * A study involved whole-genome sequencing of 230 BL and 295 diffuse large B-cell lymphoma (DLBCL) tumors, revealing key mutated genes and new subgroups of BL with specific genetic traits.
  • * The research findings indicate significant genetic and clinical disparities between pediatric and adult BL, suggesting that identifying these subtypes could inform better approaches to diagnosis and treatment.
View Article and Find Full Text PDF

Differential mRNA expression between ancestry groups can be explained by both genetic and environmental factors. We outline a computational workflow to determine the extent to which germline genetic variation explains cancer-specific molecular differences across ancestry groups. Using multi-omics datasets from The Cancer Genome Atlas (TCGA), we enumerate ancestry-informative markers colocalized with cancer-type-specific expression quantitative trait loci (e-QTLs) at ancestry-associated genes.

View Article and Find Full Text PDF

Fully automated machine learning (AutoML) for predictive modeling is becoming a reality, giving rise to a whole new field. We present the basic ideas and principles of Just Add Data Bio (JADBio), an AutoML platform applicable to the low-sample, high-dimensional omics data that arise in translational medicine and bioinformatics applications. In addition to predictive and diagnostic models ready for clinical use, JADBio focuses on knowledge discovery by performing feature selection and identifying the corresponding biosignatures, i.

View Article and Find Full Text PDF

Cellular and molecular aberrations contribute to the disparity of human cancer incidence and etiology between ancestry groups. Multiomics profiling in The Cancer Genome Atlas (TCGA) allows for querying of the molecular underpinnings of ancestry-specific discrepancies in human cancer. Here, we provide a protocol for integrative associative analysis of ancestry with molecular correlates, including somatic mutations, DNA methylation, mRNA transcription, miRNA transcription, and pathway activity, using TCGA data.

View Article and Find Full Text PDF

When it comes to precision oncology, proteogenomics may provide better prospects to the clinical characterization of tumors, help make a more accurate diagnosis of cancer, and improve treatment for patients with cancer. This perspective describes the significant contributions of The Cancer Genome Atlas and the Clinical Proteomic Tumor Analysis Consortium to precision oncology and makes the case that proteogenomics needs to be fully integrated into clinical trials and patient care in order for precision oncology to deliver the right cancer treatment to the right patient at the right dose and at the right time.

View Article and Find Full Text PDF

A small fraction of cancer patients with advanced disease survive significantly longer than patients with clinically comparable tumors. Molecular mechanisms for exceptional responses to therapy have been identified by genomic analysis of tumor biopsies from individual patients. Here, we analyzed tumor biopsies from an unbiased cohort of 111 exceptional responder patients using multiple platforms to profile genetic and epigenetic aberrations as well as the tumor microenvironment.

View Article and Find Full Text PDF

Gaps in the translation of research findings to clinical management have been recognized for decades. They exist for the diagnosis as well as the management of cancer. The international standards for cancer diagnosis are contained within the World Health Organization (WHO) Classification of Tumours, published by the International Agency for Research on Cancer (IARC) and known worldwide as the WHO Blue Books.

View Article and Find Full Text PDF

We evaluated ancestry effects on mutation rates, DNA methylation, and mRNA and miRNA expression among 10,678 patients across 33 cancer types from The Cancer Genome Atlas. We demonstrated that cancer subtypes and ancestry-related technical artifacts are important confounders that have been insufficiently accounted for. Once accounted for, ancestry-associated differences spanned all molecular features and hundreds of genes.

View Article and Find Full Text PDF

Background: Tumor molecular profiling from patients experiencing exceptional responses to systemic therapy may provide insights into cancer biology and improve treatment tailoring. This pilot study evaluates the feasibility of identifying exceptional responders retrospectively, obtaining pre-exceptional response treatment tumor tissues, and analyzing them with state-of-the-art molecular analysis tools to identify potential molecular explanations for responses.

Methods: Exceptional response was defined as partial (PR) or complete (CR) response to a systemic treatment with population PR or CR rate less than 10% or an unusually long response (eg, duration >3 times published median).

View Article and Find Full Text PDF
Article Synopsis
  • The analysis focuses on syncing a large multi-omic dataset from The Cancer Genome Atlas (TCGA) with the updated human reference genome (GRCh38), aiming to measure similarities and differences with the older version (GRCh37).
  • Comprehensive studies were conducted across five molecular data types to assess the degree of consistency between the old and new reference genomes and to identify remaining differences.
  • The findings reveal that both datasets are highly consistent, providing guidelines for researchers on how to effectively utilize either version while considering possible discrepancies in biological interpretation.
View Article and Find Full Text PDF

We present the genome-wide chromatin accessibility profiles of 410 tumor samples spanning 23 cancer types from The Cancer Genome Atlas (TCGA). We identify 562,709 transposase-accessible DNA elements that substantially extend the compendium of known cis-regulatory elements. Integration of ATAC-seq (the assay for transposase-accessible chromatin using sequencing) with TCGA multi-omic data identifies a large number of putative distal enhancers that distinguish molecular subtypes of cancers, uncovers specific driving transcription factors via protein-DNA footprints, and nominates long-range gene-regulatory interactions in cancer.

View Article and Find Full Text PDF

Malignant pleural mesothelioma (MPM) is a highly lethal cancer of the lining of the chest cavity. To expand our understanding of MPM, we conducted a comprehensive integrated genomic study, including the most detailed analysis of alterations to date. We identified histology-independent molecular prognostic subsets, and defined a novel genomic subtype with and mutations and extensive loss of heterozygosity.

View Article and Find Full Text PDF
Article Synopsis
  • * Our analysis revealed unique molecular patterns for the main TGCT histologic subtypes: seminoma, embryonal carcinoma, yolk sac tumor, and teratoma, with notable differences in DNA methylation and microRNA expression influencing these subtypes.
  • * We identified potential biomarkers for risk assessment and diagnosis, including specific miRNAs linked to teratomas and distinct methylation patterns that could help identify embryonal carcinomas.
View Article and Find Full Text PDF

Glioma diagnosis is based on histomorphology and grading; however, such classification does not have predictive clinical outcome after glioblastomas have developed. To date, no bona fide biomarkers that significantly translate into a survival benefit to glioblastoma patients have been identified. We previously reported that the IDH mutant G-CIMP-high subtype would be a predecessor to the G-CIMP-low subtype.

View Article and Find Full Text PDF

This SnapShot provides a list of the tumor types characterized by The Cancer Genome Atlas (TCGA) program. Key findings shown are the most relevant discoveries described in each marker paper for the tumor type.

View Article and Find Full Text PDF

The Cancer Genome Atlas (TCGA) has catalyzed systematic characterization of diverse genomic alterations underlying human cancers. At this historic junction marking the completion of genomic characterization of over 11,000 tumors from 33 cancer types, we present our current understanding of the molecular processes governing oncogenesis. We illustrate our insights into cancer through synthesis of the findings of the TCGA PanCancer Atlas project on three facets of oncogenesis: (1) somatic driver mutations, germline pathogenic variants, and their interactions in the tumor; (2) the influence of the tumor genome and epigenome on transcriptome and proteome; and (3) the relationship between tumor and the microenvironment, including implications for drugs targeting driver events and immunotherapies.

View Article and Find Full Text PDF

The Cancer Genome Atlas (TCGA) team now presents the Pan-Cancer Atlas, investigating different aspects of cancer biology by analyzing the data generated during the 10+ years of the TCGA project.

View Article and Find Full Text PDF

Thymic epithelial tumors (TETs) are one of the rarest adult malignancies. Among TETs, thymoma is the most predominant, characterized by a unique association with autoimmune diseases, followed by thymic carcinoma, which is less common but more clinically aggressive. Using multi-platform omics analyses on 117 TETs, we define four subtypes of these tumors defined by genomic hallmarks and an association with survival and World Health Organization histological subtype.

View Article and Find Full Text PDF