Publications by John E Evangelista | LitMetric

Publications by authors named "John E Evangelista"


Harmonizome 3.0: integrated knowledge about genes and proteins from diverse multi-omics resources.

Ido Diamant, Daniel J B Clarke, John Erol Evangelista, Nathania Lingam, Avi Ma'ayan

Nucleic Acids Res

November 2024

By processing and abstracting diverse omics datasets into associations between genes and their attributes, the Harmonizome database enables researchers to explore and integrate knowledge about human genes from many central omics resources. Here, we introduce Harmonizome 3.0, a significant upgrade to the original Harmonizome database.

View Article and Find Full Text PDF

Regulatory elements in (7q21.3) locus contribute to genetic control of coronal nonsyndromic craniosynostosis and bone density-related traits.

Paola Nicoletti, Samreen Zafer, Lital Matok, Inbar Irron, Meidva Patrick, John Erol Evangelista

Genet Med Open

May 2024

Purpose: The etiopathogenesis of coronal nonsyndromic craniosynostosis (cNCS), a congenital condition defined by premature fusion of 1 or both coronal sutures, remains largely unknown.

Methods: We conducted the largest genome-wide association study of cNCS followed by replication, fine mapping, and functional validation of the most significant region using a zebrafish animal model.

Results: Genome-wide association study identified 6 independent genome-wide-significant risk alleles, 4 on chromosome 7q21.


Rummagene: massive mining of gene sets from supporting materials of biomedical research publications.

Daniel J B Clarke, Giacomo B Marino, Eden Z Deng, Zhuorui Xie, John Erol Evangelista

Commun Biol

April 2024

Many biomedical research publications contain gene sets in their supporting tables, and these sets are currently not available for search and reuse. By crawling PubMed Central, the Rummagene server provides access to hundreds of thousands of such mammalian gene sets. So far, we scanned 5,448,589 articles to find 121,237 articles that contain 642,389 gene sets.


Pan-cancer proteogenomics characterization of tumor immunity.

Francesca Petralia, Weiping Ma, Tomer M Yaron, Francesca Pia Caruso, Nicole Tignor, John Erol Evangelista

Cell

February 2024

Despite the successes of immunotherapy in cancer treatment over recent decades, fewer than 10%-20% of cancer cases have demonstrated durable responses from immune checkpoint blockade. To enhance the efficacy of immunotherapies, combination therapies suppressing multiple immune evasion mechanisms are increasingly contemplated. To better understand immune cell surveillance and diverse immune evasion responses in tumor tissues, we comprehensively characterized the immune landscape of more than 1,000 tumors across ten different cancers using CPTAC pan-cancer proteogenomic data.


Toxicology knowledge graph for structural birth defects.

John Erol Evangelista, Daniel J B Clarke, Zhuorui Xie, Giacomo B Marino, Vivian Utti

Commun Med (Lond)

July 2023

Article Synopsis

- Birth defects affect about 1 in 33 births in the U.S., stemming from genetic and various environmental factors, but their exact causes are often unknown.
- Researchers created the Reproductive Toxicity Knowledge Graph (ReproTox-KG) to analyze connections between drugs, genes, and birth defects, gathering data from multiple scientific sources.
- This tool scored over 30,000 small molecules for their potential to cause birth defects and identified over 500 relevant associations, providing a valuable resource for understanding the underlying mechanisms of drug-induced birth defects.


Enrichr-KG: bridging enrichment analysis across multiple libraries.

John Erol Evangelista, Zhuorui Xie, Giacomo B Marino, Nhi Nguyen, Daniel J B Clarke

Nucleic Acids Res

July 2023

Gene and protein set enrichment analysis is a critical step in the analysis of data collected from omics experiments. Enrichr is a popular gene set enrichment analysis web-server search engine that contains hundreds of thousands of annotated gene sets. While Enrichr has been useful in providing enrichment analysis with many gene set libraries from different categories, integrating enrichment results across libraries and domains of knowledge can further hypothesis generation.


A multi-omic analysis of MCF10A cells provides a resource for integrative assessment of ligand-mediated molecular and phenotypic responses.

Sean M Gross, Mark A Dane, Rebecca L Smith, Kaylyn L Devlin, Ian C McLean, John Erol Evangelista

Commun Biol

October 2022

The phenotype of a cell and its underlying molecular state is strongly influenced by extracellular signals, including growth factors, hormones, and extracellular matrix proteins. While these signals are normally tightly controlled, their dysregulation leads to phenotypic and molecular states associated with diverse diseases. To develop a detailed understanding of the linkage between molecular and phenotypic changes, we generated a comprehensive dataset that catalogs the transcriptional, proteomic, epigenomic and phenotypic responses of MCF10A mammary epithelial cells after exposure to the ligands EGF, HGF, OSM, IFNG, TGFB and BMP2.


Transforming L1000 profiles to RNA-seq-like profiles with deep learning.

Minji Jeon, Zhuorui Xie, John E Evangelista, Megan L Wojciechowicz, Daniel J B Clarke

BMC Bioinformatics

September 2022

The L1000 technology, a cost-effective high-throughput transcriptomics technology, has been applied to profile a collection of human cell lines for their gene expression response to > 30,000 chemical and genetic perturbations. In total, there are currently over 3 million available L1000 profiles. Such a dataset is invaluable for the discovery of drug and target candidates and for inferring mechanisms of action for small molecules.


Getting Started with LINCS Datasets and Tools.

Zhuorui Xie, Eryk Kropiwnicki, Megan L Wojciechowicz, Kathleen M Jagodnik, Ingrid Shu, John Erol Evangelista

Curr Protoc

July 2022

The Library of Integrated Network-based Cellular Signatures (LINCS) was an NIH Common Fund program that aimed to expand our knowledge about human cellular responses to chemical, genetic, and microenvironment perturbations. Responses to perturbations were measured by transcriptomics, proteomics, cellular imaging, and other high content assays. The second phase of the LINCS program, which lasted 7 years, involved the engagement of six data and signature generation centers (DSGCs) and one data coordination and integration center (DCIC).


SigCom LINCS: data and metadata search engine for a million gene expression signatures.

John Erol Evangelista, Daniel J B Clarke, Zhuorui Xie, Alexander Lachmann, Minji Jeon

Nucleic Acids Res

July 2022

Millions of transcriptome samples were generated by the Library of Integrated Network-based Cellular Signatures (LINCS) program. When these data are processed into searchable signatures along with signatures extracted from Genotype-Tissue Expression (GTEx) and Gene Expression Omnibus (GEO), connections between drugs, genes, pathways and diseases can be illuminated. SigCom LINCS is a webserver that serves over a million gene expression signatures processed, analyzed, and visualized from LINCS, GTEx, and GEO.


Gene and drug landing page aggregator.

Daniel J B Clarke, Maxim V Kuleshov, Zhuorui Xie, John E Evangelista, Marilyn R Meyers

Bioinform Adv

February 2022

Motivation: Many biological and biomedical researchers commonly search for information about genes and drugs to gather knowledge from these resources. For the most part, such information is served as landing pages in disparate data repositories and web portals.

Results: The Gene and Drug Landing Page Aggregator (GDLPA) provides users with access to 50 gene-centric and 19 drug-centric repositories, enabling them to retrieve landing pages corresponding to their gene and drug queries.


KEA3: improved kinase enrichment analysis via data integration.

Maxim V Kuleshov, Zhuorui Xie, Alexandra B K London, Janice Yang, John Erol Evangelista

Nucleic Acids Res

July 2021

Phosphoproteomics and proteomics experiments capture a global snapshot of the cellular signaling network, but these methods do not directly measure kinase state. Kinase Enrichment Analysis 3 (KEA3) is a webserver application that infers overrepresentation of upstream kinases whose putative substrates are in a user-inputted list of proteins. KEA3 can be applied to analyze data from phosphoproteomics and proteomics studies to predict the upstream kinases responsible for observed differential phosphorylations.


Drugmonizome and Drugmonizome-ML: integration and abstraction of small molecule attributes for drug enrichment analysis and machine learning.

Eryk Kropiwnicki, John E Evangelista, Daniel J Stein, Daniel J B Clarke, Alexander Lachmann

Database (Oxford)

March 2021

Understanding the underlying molecular and structural similarities between seemingly heterogeneous sets of drugs can aid in identifying drug repurposing opportunities and assist in the discovery of novel properties of preclinical small molecules. A wealth of information about drug and small molecule structure, targets, indications and side effects; induced gene expression signatures; and other attributes are publicly available through web-based tools, databases and repositories. By processing, abstracting and aggregating information from these resources into drug set libraries, knowledge about novel properties of drugs and small molecules can be systematically imputed with machine learning.


Gene Set Knowledge Discovery with Enrichr.

Zhuorui Xie, Allison Bailey, Maxim V Kuleshov, Daniel J B Clarke, John E Evangelista

Curr Protoc

March 2021

Profiling samples from patients, tissues, and cells with genomics, transcriptomics, epigenomics, proteomics, and metabolomics ultimately produces lists of genes and proteins that need to be further analyzed and integrated in the context of known biology. Enrichr (Chen et al., 2013; Kuleshov et al.


Predicting Lyme Disease From Patients' Peripheral Blood Mononuclear Cells Profiled With RNA-Sequencing.

Daniel J B Clarke, Alison W Rebman, Allison Bailey, Megan L Wojciechowicz, Sherry L Jenkins, John E Evangelista

Front Immunol

April 2021

Although widely prevalent, Lyme disease is still under-diagnosed and misunderstood. Here we followed 73 acute Lyme disease patients and uninfected controls over a period of a year. At each visit, RNA-sequencing was applied to profile patients' peripheral blood mononuclear cells in addition to extensive clinical phenotyping.


Appyters: Turning Jupyter Notebooks into data-driven web apps.

Daniel J B Clarke, Minji Jeon, Daniel J Stein, Nicole Moiseyev, Eryk Kropiwnicki, John Erol Evangelista

Patterns (N Y)

March 2021

Jupyter Notebooks have transformed the communication of data analysis pipelines by facilitating a modular structure that brings together code, markdown text, and interactive visualizations. Here, we extended Jupyter Notebooks to broaden their accessibility with Appyters. Appyters turn Jupyter Notebooks into fully functional standalone web-based bioinformatics applications.


The COVID-19 Drug and Gene Set Library.

Maxim V Kuleshov, Daniel J Stein, Daniel J B Clarke, Eryk Kropiwnicki, Kathleen M Jagodnik, John E Evangelista

Patterns (N Y)

September 2020

In a short period, many research publications that report sets of experimentally validated drugs as potential COVID-19 therapies have emerged. To organize this accumulating knowledge, we developed the COVID-19 Drug and Gene Set Library (https://amp.pharm.


The COVID-19 Gene and Drug Set Library.

Maxim V Kuleshov, Daniel J B Clarke, Eryk Kropiwnicki, Kathleen M Jagodnik, Alon Bartal, John E Evangelista

Res Sq

May 2020

The coronavirus (CoV) severe acute respiratory syndrome (SARS)-CoV-2 (COVID-19) pandemic has received a rapid response from the research community, which has offered suggestions for repurposing approved drugs and worked to improve our understanding of the molecular mechanisms of the COVID-19 viral life cycle. In a short period, tens of thousands of research preprints and other publications have emerged, including those that report lists of experimentally validated drugs and compounds as potential COVID-19 therapies. In addition, gene sets from interacting COVID-19 virus-host proteins and differentially expressed genes when comparing infected to uninfected cells are being published at a fast rate.


Towards Intelligent Integration and Sharing of Stem Cell Research Data.

Kirill Borziak, Tianye Qi, John Erol Evangelista, Daniel J B Clarke, Avi Ma'ayan

Stud Health Technol Inform

June 2020

Advancements in regenerative medicine have brought to the fore the need for increased standardization and sharing of stem cell product characterization to help drive these innovative interventions toward public availability. Although numerous attempts have been made to store this data, there is still a lack of a platform that incorporates heterogeneous stem cell information into a harmonized project-based framework. The aim of this project was to introduce and pilot-test an intelligent informatics solution which integrates diverse stem cell product characteristics with study subject and omics information.


LINCS Data Portal 2.0: next generation access point for perturbation-response signatures.

Vasileios Stathias, John Turner, Amar Koleti, Dusica Vidovic, Daniel Cooper, John Erol Evangelista

Nucleic Acids Res

January 2020

The Library of Integrated Network-Based Cellular Signatures (LINCS) is an NIH Common Fund program with the goal of generating a large-scale and comprehensive catalogue of perturbation-response signatures by utilizing a diverse collection of perturbations across many model systems and assay types. The LINCS Data Portal (LDP) has been the primary access point for the compendium of LINCS data and has been widely utilized. Here, we report the first major update of LDP (http://lincsportal.
