The activity of CRISPR-Cas9 target sites can be measured experimentally through phenotypic assays or mutation rate and used to build computational models to predict activity of novel target sites. However, currently published models have been reported to perform poorly in situations other than their training conditions. In this study, we hence investigate how different sources of data influence predictive power and identify the best data set for the most robust predictive model. We use the activity of 28,606 target sites and a machine learning approach to train a predictive model of CRISPR-Cas9 activity, outperforming other published methods by an average increase in accuracy of 80% for prediction of the degree of activity and 13% for classification into active and inactive categories. We find that using data sets that measure CRISPR-Cas9 activity through sequencing provides more accurate predictions of activity. Our model, dubbed TUSCAN, is highly scalable, predicting the activity of 5000 target sites in under 7 s, making it suitable for genome-wide screens. We conclude that sophisticated machine learning methods can classify binary CRISPR-Cas9 activity; however, predicting fine-scale activity scores will require larger data sets directly measuring Indel insertion rate.

Download full-text PDF

Source
http://dx.doi.org/10.1089/crispr.2017.0021DOI Listing

Publication Analysis

Top Keywords

target sites
16
crispr-cas9 activity
12
activity
10
predictive model
8
machine learning
8
data sets
8
crispr-cas9
5
high activity
4
activity target-site
4
target-site identification
4

Similar Publications

Pancreatic cancer is a lethal disease with an insidious onset, and little is known about its early molecular events. Here, we found that the sterol regulatory element-binding protein 1 (SREBP1) expression is gradually upregulated during the initiation of pancreatic cancer. Through in vitro 3D culture of pancreatic acinar cells and experiments in LSL-Kras;Pdx1-Cre (KC) mice, we found that pharmacological inhibition of SREBP1 suppressed pancreatic tumorigenesis.

View Article and Find Full Text PDF

Background: Acute lung injury (ALI) is a severe condition with multifaceted causes, including inflammation and oxidative stress. This research investigates the influence of m6A (N6-methyladenosine) modification on GBP4, a protein pivotal for macrophage polarization, a critical immune response in ALI.

Methods: Utilizing a mouse model to induce ALI, the study analyzed GBP4 expression in alveolar macrophages.

View Article and Find Full Text PDF

The prevalence and death due to cancer have been rising over the past few decades, and eliminating tumour cells without sacrificing healthy cells remains a difficult task. Due to the low specificity and solubility of drug molecules, patients often require high dosages to achieve the desired therapeutic effects. Silica nanoparticles (SiNPs) can effectively deliver therapeutic agents to targeted sites in the body, addressing these challenges.

View Article and Find Full Text PDF

Tongue squamous cell carcinoma-targeting Au-HN-1 nanosystem for CT imaging and photothermal therapy.

Int J Oral Sci

January 2025

State Key Laboratory for Diagnosis and Treatment of Severe Zoonotic Infectious Diseases, Key Laboratory for Zoonosis Research of the Ministry of Education, College of Basic Medical Science, Jilin University, Changchun, China.

Tongue squamous cell carcinoma (TSCC) is a prevalent malignancy that afflicts the head and neck area and presents a high incidence of metastasis and invasion. Accurate diagnosis and effective treatment are essential for enhancing the quality of life and the survival rates of TSCC patients. The current treatment modalities for TSCC frequently suffer from a lack of specificity and efficacy.

View Article and Find Full Text PDF

Proximity labeling (PL) has emerged as a powerful technique for the in situ elucidation of biomolecular interaction networks. However, PL methods generally rely on single-biological-hierarchy control of spatial localization at the labeling site, which limits their application in multi-tiered biological systems. Here, we introduced another enzymatic reaction upstream of an enzyme-based PL reaction and targeted the two enzymes to markers indicating different biological hierarchies, establishing a two-level spatially localized proximity labeling (P2L) platform for in situ molecular measurement and manipulation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!