A fusion framework of deep learning and machine learning for predicting sgRNA cleavage efficiency.

Comput Biol Med

Department of Biomedical Informatics, MOE Key Lab of Cardiovascular Sciences, School of Basic Medical Sciences, Peking University, Beijing, China. Electronic address:

Published: October 2023

CRISPR/Cas9 system is a powerful tool for genome editing. Numerous studies have shown that sgRNAs can strongly affect the efficiency of editing. However, it is still not clear what rules should be followed for designing sgRNA with high cleavage efficiency. At present, several machine learning or deep learning methods have been developed to predict the cleavage efficiency of sgRNAs, however, the prediction accuracy of these tools is still not satisfactory. Here we propose a fusion framework of deep learning and machine learning, which first deals with the primary sequence and secondary structure features of the sgRNAs using both convolutional neural network (CNN) and recurrent neural network (RNN), and then uses the features extracted by the deep neural network to train a conventional machine learning model with LGBM. As a result, the new approach overwhelmed previous methods. The Spearman's correlation coefficient between predicted and measured sgRNA cleavage efficiency of our model (0.917) is improved by over 5% compared with the most advanced method (0.865), and the mean square error reduces from 7.89 × 10 to 4.75 × 10. Finally, we developed an online tool, CRISep (http://www.cuilab.cn/CRISep), to evaluate the availability of sgRNAs based on our models.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2023.107476DOI Listing

Publication Analysis

Top Keywords

machine learning
16
cleavage efficiency
16
deep learning
12
neural network
12
fusion framework
8
framework deep
8
learning machine
8
sgrna cleavage
8
learning
7
efficiency
5

Similar Publications

Background: Developing drugs for treating Alzheimer's disease (AD) has been extremely challenging and costly due to limited knowledge on underlying biological mechanisms and therapeutic targets. Repurposing drugs or their combination has shown potential in accelerating drug development due to the reduced drug toxicity while targeting multiple pathologies.

Method: To address the challenge in AD drug development, we developed a multi-task machine learning pipeline to integrate a comprehensive knowledge graph on biological/pharmacological interactions and multi-level evidence on drug efficacy, to identify repurposable drugs and their combination candidates RESULT: Using the drug embedding from the heterogeneous graph representation model, we ranked drug candidates based on evidence from post-treatment transcriptomic patterns, mechanistic efficacy in preclinical models, population-based treatment effect, and Phase 2/3 clinical trials.

View Article and Find Full Text PDF

Drug Development.

Alzheimers Dement

December 2024

AbbVie Inc., North Chicago, IL, USA.

Background: In Alzheimer's Disease (AD) trials, clinical scales are used to assess treatment effect in patients. Minimizing statistical uncertainty of trial outcomes is an important consideration to increase statistical power. Machine learning models can leverage baseline data to create AI-generated digital twins - individualized predictions (or prognostic scores) of how each patient's clinical outcomes may change during a trial assuming they received placebo.

View Article and Find Full Text PDF

Background: The prohibitive costs of drug development for Alzheimer's Disease (AD) emphasize the need for alternative in silico drug repositioning strategies. Graph learning algorithms, capable of learning intrinsic features from complex network structures, can leverage existing databases of biological interactions to improve predictions in drug efficacy. We developed a novel machine learning framework, the PreSiBOGNN, that integrates muti-modal information to predict cognitive improvement at the subject level for precision medicine in AD.

View Article and Find Full Text PDF

Dementia Care Research and Psychosocial Factors.

Alzheimers Dement

December 2024

Imperial College London, London, United Kingdom; UK Dementia Research Institute, Care Research and Technology Centre, London, United Kingdom.

Background: Close to 23% of unplanned hospital admissions for people living with dementia (PLWD) are due to potentially preventable causes such as severe urinary tract infections (UTIs), falls, and respiratory problems. These affect the well-being of PLWD, cause stress to carers and increase pressure on healthcare services.

Method: We use routinely collected in-home sensory data to monitor nocturnal activity and sleep data.

View Article and Find Full Text PDF

Dementia Care Research and Psychosocial Factors.

Alzheimers Dement

December 2024

Department of Psychology & Language Sciences, University College London, London, United Kingdom.

Background: Dysphagia is an important feature of neurodegenerative diseases and potentially life-threatening in primary progressive aphasia (PPA), but remains poorly characterised in these syndromes. We hypothesised that dysphagia would be more prevalent in nonfluent/agrammatic variant (nfv)PPA than other PPA syndromes, predicted by accompanying motor features and associated with atrophy affecting regions implicated in swallowing control.

Methods: In a retrospective case-control study at our tertiary referral centre, we recruited 56 patients with PPA (21 nfvPPA, 22 semantic variant (sv)PPA, 13 logopenic variant (lv)PPA).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!