Deep learning technologies have been adopted to predict the functions of newly identified proteins in silico. However, most current models are not suitable for poorly characterized proteins because they require diverse information on target proteins. We designed a binary classification deep learning program requiring only sequence information. This program was named 'FUTUSA' (function teller using sequence alone). It applied sequence segmentation during the sequence feature extraction process, by a convolution neural network, to train the regional sequence patterns and their relationship. This segmentation process improved the predictive performance by 49% than the full-length process. Compared with a baseline method, our approach achieved higher performance in predicting oxidoreductase activity. In addition, FUTUSA also showed dramatic performance in predicting acetyltransferase and demethylase activities. Next, we tested the possibility that FUTUSA can predict the functional consequence of point mutation. After trained for monooxygenase activity, FUTUSA successfully predicted the impact of point mutations on phenylalanine hydroxylase, which is responsible for an inherited metabolic disease PKU. This deep-learning program can be used as the first-step tool for characterizing newly identified or poorly studied proteins.•We proposed new deep learning program to predict protein functions in silico that requires nothing more than the protein sequence information.•Due to application of sequence segmentation, the efficiency of prediction is improved.•This method makes prediction of the clinical impact of mutations or polymorphisms possible.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8790617 | PMC |
http://dx.doi.org/10.1016/j.mex.2022.101622 | DOI Listing |
BMC Res Notes
December 2024
Department of Computer Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran.
This dataset contains demographic, morphological and pathological data, endoscopic images and videos of 191 patients with colorectal polyps. Morphological data is included based on the latest international gastroenterology classification references such as Paris, Pit and JNET classification. Pathological data includes the diagnosis of the polyps including Tubular, Villous, Tubulovillous, Hyperplastic, Serrated, Inflammatory and Adenocarcinoma with Dysplasia Grade & Differentiation.
View Article and Find Full Text PDFBMC Genomics
December 2024
College of Physics and Electronic Information, Gannan Normal University, Ganzhou, 341000, Jiangxi, China.
Long non-coding RNAs (lncRNAs) play crucial roles in numerous biological processes and are involved in complex human diseases through interactions with proteins. Accurate identification of lncRNA-protein interactions (LPI) can help elucidate the functional mechanisms of lncRNAs and provide scientific insights into the molecular mechanisms underlying related diseases. While many sequence-based methods have been developed to predict LPIs, efficiently extracting and effectively integrating potential feature information that reflects functional attributes from lncRNA and protein sequences remains a significant challenge.
View Article and Find Full Text PDFAcad Radiol
December 2024
Radiomics and Augmented Intelligence Laboratory (RAIL), Department of Radiology and the Norman Fixel Institute for Neurological Diseases, University of Florida College of Medicine, Gainesville, FL (M.H-S., H.S.S., A.G.R., S.E.M., J.C.P., E.Y.A., B.H., R.F.); Department of Radiology, University of Florida College of Medicine, Gainesville, FL (M.H-S., H.S.S., A.G.R., J.C.P., E.Y.A., B.H., R.F.); Division of Medical Physics, University of Florida College of Medicine, Gainesville, FL (R.F.); Department of Neurology, Division of Movement Disorders, University of Florida College of Medicine, Gainesville, FL (R.F.); Department of Otolaryngology - Head and Neck Surgery, McGill University, Montreal, Quebec, Canada (R.F.); Department of Radiology, AdventHealth Medical Group, Maitland, FL (R.F.). Electronic address:
Rationale And Objectives: To evaluate and compare image quality of different energy levels of virtual monochromatic images (VMIs) using standard versus strong deep learning spectral reconstruction (DLSR) on dual-energy CT pulmonary angiogram (DECT-PA).
Materials And Methods: A retrospective study was performed on 70 patients who underwent DECT-PA (15 PE present; 55 PE absent) scans. VMIs were reconstructed at different energy levels ranging from 35 to 200 keV using standard and strong levels with deep learning spectral reconstruction.
Am J Pathol
December 2024
Department of Computer Science, Faculty of Engineering Sciences, University College London, Gower Street, London, WC1E 6BT, United Kingdom.
Understanding the tumor hypoxic microenvironment is crucial for grasping tumor biology, clinical progression, and treatment responses. This study presents a novel application of AI in computational histopathology to evaluate hypoxia in breast cancer. Weakly Supervised Deep Learning (WSDL) models can accurately detect morphological changes associated with hypoxia in routine Hematoxylin and Eosin (H&E) whole slide images (WSI).
View Article and Find Full Text PDFNeurobiol Dis
December 2024
Department of Neurology, University Hospital of Wuerzburg, Germany. Electronic address:
DYT-THAP1 dystonia is a monogenetic form of dystonia, a movement disorder characterized by the involuntary co-contraction of agonistic and antagonistic muscles. The disease is caused by mutations in the THAP1 gene, although the precise mechanisms by which these mutations contribute to the pathophysiology of dystonia remain unclear. The incomplete penetrance of DYT-THAP1 dystonia, estimated at 40 to 60 %, suggests that an environmental trigger may be required for the manifestation of the disease in genetically predisposed individuals.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!