AttCRISPR: a spacetime interpretable model for prediction of sgRNA on-target activity.

BMC Bioinformatics

School of Computer Science and Technology, East China Normal University, Shanghai, 200062, China.

Published: December 2021

AI Article Synopsis

  • More Cas9 variants are being created to improve specificity and reduce off-target effects, but existing machine learning methods struggle with accuracy and interpretability.
  • A new method called AttCRISPR is introduced, which combines deep learning with ensemble learning to provide high performance while maintaining interpretability in predicting on-target activity.
  • AttCRISPR uses attention modules for better understanding of decision-making processes and reveals nucleotide preferences in sgRNA sequences, offering guidance for designing more effective sgRNAs.

Article Abstract

Background: More and more Cas9 variants with higher specificity are developed to avoid the off-target effect, which brings a significant volume of experimental data. Conventional machine learning performs poorly on these datasets, while the methods based on deep learning often lack interpretability, which makes researchers have to trade-off accuracy and interpretability. It is necessary to develop a method that can not only match deep learning-based methods in performance but also with good interpretability that can be comparable to conventional machine learning methods.

Results: To overcome these problems, we propose an intrinsically interpretable method called AttCRISPR based on deep learning to predict the on-target activity. The advantage of AttCRISPR lies in using the ensemble learning strategy to stack available encoding-based methods and embedding-based methods with strong interpretability. Comparison with the state-of-the-art methods using WT-SpCas9, eSpCas9(1.1), SpCas9-HF1 datasets, AttCRISPR can achieve an average Spearman value of 0.872, 0.867, 0.867, respectively on several public datasets, which is superior to these methods. Furthermore, benefits from two attention modules-one spatial and one temporal, AttCRISPR has good interpretability. Through these modules, we can understand the decisions made by AttCRISPR at both global and local levels without other post hoc explanations techniques.

Conclusion: With the trained models, we reveal the preference for each position-dependent nucleotide on the sgRNA (short guide RNA) sequence in each dataset at a global level. And at a local level, we prove that the interpretability of AttCRISPR can be used to guide the researchers to design sgRNA with higher activity.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8667445PMC
http://dx.doi.org/10.1186/s12859-021-04509-6DOI Listing

Publication Analysis

Top Keywords

on-target activity
8
conventional machine
8
machine learning
8
based deep
8
deep learning
8
good interpretability
8
attcrispr
7
methods
6
interpretability
6
learning
5

Similar Publications

Advancing Chimeric Antigen Receptor T-Cell Therapy for Acute Myeloid Leukemia: Current Limitations and Emerging Strategies.

Pharmaceuticals (Basel)

December 2024

Division of Hematology and Stem Cell Transplantation, University Hospital, 33100 Udine, Italy.

Chimeric antigen receptor (CAR) T-cell therapy represents one of the most impressive advances in anticancer therapy of the last decade. While CAR T-cells are gaining ground in various B cell malignancies, their use in acute myeloid leukemia (AML) remains limited, and no CAR-T product has yet received approval for AML. The main limitation of CAR-T therapy in AML is the lack of specific antigens that are expressed in leukemic cells but not in their healthy counterparts, such as hematopoietic stem cells (HSCs), as their targeting would result in an on-target/off-tumor toxicity.

View Article and Find Full Text PDF

Grape pomace, the solid residue from winemaking, is a rich source of polyphenolic compounds with significant antioxidant properties. However, the efficient extraction of these valuable compounds remains a challenge. This study focuses on optimizing the conditions for the enzyme-assisted extraction of polyphenolic compounds from red grape pomace using cellulase, hemicellulase, and pectinase.

View Article and Find Full Text PDF

Telomerase and telomeres are crucial in cancer cell immortalization, making them key targets for anticancer therapies. Currently, 6-thio-dG (THIO) combined with the anti-PD-1 inhibitor Cemiplimab is under phase II clinical investigation (NCT05208944) in NSCLC patients resistant to prior immunotherapies. This study presents the design, synthesis, and evaluation of novel bimodular conjugate molecules combining telomere-targeting nucleoside analogs and phosphatidyl diglyceride groups.

View Article and Find Full Text PDF

Background: Atypical teratoid rhabdoid tumor (ATRT) is the most common malignant brain tumor in infants, and more than 60% of children with ATRT die from their tumor. ATRT is associated with mutational inactivation/deletion of , a member of the SWI/SNF chromatin remodeling complex, suggesting that epigenetic events play a critical role in tumor development and progression. Moreover, disruption of SWI/SNF allows unopposed activity of epigenetic repressors, which contribute to tumorigenicity.

View Article and Find Full Text PDF

SUMOylation involves covalent attachment of small ubiquitin-like modifier (SUMO) proteins to specific lysine residues on target proteins and regulates various aspects of their function. Sentrin-specific proteases (SENPs) are key players in both the conjugation reaction of SUMO proteins to their targets and the subsequent deconjugation of SUMO-conjugated substrates. Here, we provide the first comprehensive prenatal description of a lethal syndrome linked to a novel homozygous stop-gain variant in SENP7 c.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!