TULIP: An RNA-seq-based Primary Tumor Type Prediction Tool Using Convolutional Neural Networks.

Cancer Inform

Frederick National Laboratory for Cancer Research, Cancer Data Science Initiatives, Cancer Research Technology Program, Rockville, MD, USA.

Published: December 2022

Background: With cancer as one of the leading causes of death worldwide, accurate primary tumor type prediction is critical in identifying genetic factors that can inhibit or slow tumor progression. There have been efforts to categorize primary tumor types with gene expression data using machine learning, and more recently with deep learning, in the last several years.

Methods: In this paper, we developed four 1-dimensional (1D) Convolutional Neural Network (CNN) models to classify RNA-seq count data as one of 17 highly represented primary tumor types or 32 primary tumor types regardless of imbalanced representation. Additionally, we adapted the models to take as input either all Ensembl genes (60,483) or protein coding genes only (19,758). Unlike previous work, we avoided selection bias by not filtering genes based on expression values. RNA-seq count data expressed as FPKM-UQ of 9,025 and 10,940 samples from The Cancer Genome Atlas (TCGA) were downloaded from the Genomic Data Commons (GDC) corresponding to 17 and 32 primary tumor types respectively for training and validating the models.

Results: All 4 1D-CNN models had an overall accuracy of 94.7% to 97.6% on the test dataset. Further evaluation indicates that the models with protein coding genes only as features performed with better accuracy compared to the models with all Ensembl genes for both 17 and 32 primary tumor types. For all models, the accuracy by primary tumor type was above 80% for most primary tumor types.

Conclusions: We packaged all 4 models as a Python-based deep learning classification tool called TULIP (TUmor CLassIfication Predictor) for performing quality control on primary tumor samples and characterizing cancer samples of unknown tumor type. Further optimization of the models is needed to improve the accuracy of certain primary tumor types.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9729992PMC
http://dx.doi.org/10.1177/11769351221139491DOI Listing

Publication Analysis

Top Keywords

primary tumor
44
tumor types
24
tumor type
16
tumor
14
primary
11
type prediction
8
convolutional neural
8
deep learning
8
models
8
rna-seq count
8

Similar Publications

Objective: This study aimed to evaluate and compare the clinicopathologic features of primary fallopian tubal carcinoma (PFTC) and high-grade serous ovarian cancer (HGSOC) and explore the prognostic factors of these two malignant tumors.

Methods: Fifty-seven patients diagnosed with PFTC from 2006 to 2015 and 60 patients diagnosed with HGSOC from 2014 to 2015 with complete prognostic information were identified at Women's Hospital of Zhejiang University. The clinicopathological and surgical data were collected, and the survival of the patients was followed for 5 years after surgery.

View Article and Find Full Text PDF

Background: Adenoid cystic carcinoma of the breast is a rare subtype, constituting less than 3.5% of primary breast carcinomas. Despite being categorized as a type of triple-negative breast cancer, it generally has a favorable prognosis.

View Article and Find Full Text PDF

Background: To date, there remains a paucity of comparative investigations pertaining to preoperative immunochemotherapy and conventional chemotherapy in the context of limited-stage small-cell lung cancer (LS-SCLC) patients. This study conducted a comprehensive comparative assessment concerning the safety and efficacy profiles of preoperative immunochemotherapy and chemotherapy in individuals diagnosed with stage I-IIIB SCLC.

Methods: This investigation collected 53 consecutive patients diagnosed with LS-SCLC spanning stage I to IIIB who underwent preoperative immunochemotherapy or conventional chemotherapy at our hospital from January 2019 to July 2021.

View Article and Find Full Text PDF

New treatment approaches are warranted for patients with advanced melanoma refractory to immune checkpoint blockade (ICB) or BRAF-targeted therapy. We designed BNT221, a personalized, neoantigen-specific autologous T cell product derived from peripheral blood, and tested this in a 3 + 3 dose-finding study with two dose levels (DLs) in patients with locally advanced or metastatic melanoma, disease progression after ICB, measurable disease (Response Evaluation Criteria in Solid Tumors version 1.1) and, where appropriate, BRAF-targeted therapy.

View Article and Find Full Text PDF

Malaria vaccines consisting of metabolically active Plasmodium falciparum (Pf) sporozoites can offer improved protection compared with currently deployed subunit vaccines. In a previous study, we demonstrated the superior protective efficacy of a three-dose regimen of late-arresting genetically attenuated parasites administered by mosquito bite (GA2-MB) compared with early-arresting counterparts (GA1-MB) against a homologous controlled human malaria infection. Encouraged by these results, we explored the potency of a single GA2-MB immunization in a placebo-controlled randomized trial.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!