Protein engineering is an emerging field in biotechnology that has the potential to revolutionize various areas, such as antibody design, drug discovery, food security, ecology, and more. However, the mutational space involved is too vast to be handled through experimental means alone. Leveraging accumulative protein databases, machine learning (ML) models, particularly those based on natural language processing (NLP), have considerably expedited protein engineering. Moreover, advances in topological data analysis (TDA) and artificial intelligence-based protein structure prediction, such as AlphaFold2, have made more powerful structure-based ML-assisted protein engineering strategies possible. This review aims to offer a comprehensive, systematic, and indispensable set of methodological components, including TDA and NLP, for protein engineering and to facilitate their future development.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516362PMC
http://dx.doi.org/10.1093/bib/bbad289DOI Listing

Publication Analysis

Top Keywords

protein engineering
20
protein
8
topological data
8
data analysis
8
engineering
5
artificial intelligence-aided
4
intelligence-aided protein
4
engineering topological
4
analysis deep
4
deep protein
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!