AI Article Synopsis

  • A novel method for creating 3D protein descriptors is introduced, relying on various mathematical mappings and dissimilarity matrices to capture interactions among amino acids.
  • The study proposes a new local-fragment approach focusing on amino acid types and groups to highlight key regions in proteins and applies normalization techniques to better analyze inter-amino acid distances.
  • The effectiveness of these new descriptors is validated through predictive models using Multiple Linear Regression and Support Vector Machine, achieving high accuracy compared to existing methods for predicting folding rates and secondary structures of proteins.

Article Abstract

Novel 3D protein descriptors based on bilinear, quadratic and linear algebraic maps in R are proposed. The latter employs the k 2-tuple (dis) similarity matrix to codify information related to covalent and non-covalent interactions in these biopolymers. The calculation of the inter-amino acid distances is generalized by using several dis-similarity coefficients, where normalization procedures based on the simple stochastic and mutual probability schemes are applied. A new local-fragment approach based on amino acid-types and amino acid-groups is proposed to characterize regions of interest in proteins. Topological and geometric macromolecular cutoffs are defined using local and total indices to highlight non-covalent interactions existing between the side-chains of each amino acid. Moreover, local and total indices calculations are generalized considering a LEGO approach, by using several aggregation operators. Collinearity and variability analyses are performed to evaluate every generalizing component applied to the definition of these novel indices. These experiments are oriented to reduce the number of MDs obtained for performing prediction models. The predictive power of the proposed indices was evaluated using two benchmark datasets, folding rate and secondary structural classification of proteins. The proposed MDs are modeled using the following strategies: Multiple Linear Regression (MLR) and Support Vector Machine (SVM), respectively. The best regression model developed for the folding rate of proteins yields a cross-validation coefficient of 0.875 (Test Set) and the best model developed for secondary structural classification obtained 98% of instances correctly classified (Test Set). These statistical parameters are superior to the ones obtained with existing MDs reported in the literature. Overall, the new theoretical generalization enhanced the information extraction into the MDs, allowing a better correlation between these two evaluated benchmark datasets and the proposed indices. The optimal theoretical configurations defined for the calculation of these MDs consider low collinearity and less information redundancy among them. These theoretical configurations and the software are available at http://tomocomd.com/mulims-mcompas.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jtbi.2019.110039DOI Listing

Publication Analysis

Top Keywords

linear algebraic
8
non-covalent interactions
8
local total
8
total indices
8
proposed indices
8
evaluated benchmark
8
benchmark datasets
8
folding rate
8
secondary structural
8
structural classification
8

Similar Publications

Unlabelled: Automated reasoning techniques based on computer algebra have seen renewed interest in recent years and are for example heavily used in formal verification of arithmetic circuits. However, the verification process might contain errors. Generating and checking proof certificates is important to increase the trust in automated reasoning tools.

View Article and Find Full Text PDF

Dynamical symmetries, time-dependent operators that almost commute with the Hamiltonian, extend the role of ordinary symmetries. Motivated by progress in quantum technologies, we illustrate a practical algebraic approach to computing such time-dependent operators. Explicitly we expand them as a linear combination of time-independent operators with time-dependent coefficients.

View Article and Find Full Text PDF

Small-angle rigid-unit modes requiring linear strain compensation.

Acta Crystallogr A Found Adv

January 2025

Department of Physics and Astronomy, Brigham Young University, Provo, Utah 84602, USA.

Group-theoretical and linear-algebraic methods and tools have recently been developed that aim to exhaustively identify the small-angle rotational rigid-unit modes (RUMs) of a given framework material. But in their current form, they fail to detect RUMs that require a compensating lattice strain which grows linearly with the amplitude of the rigid-unit rotations. Here, we present a systematic approach to including linear strain compensation within the linear-algebraic RUM-search method, so that any geometrically possible small-angle RUM can be detected.

View Article and Find Full Text PDF

In light of the ponderomotive force, this article focuses on establishing the exact wave structures of the ion sound system. It is the result of non-linear force and affects a charged particle oscillating in an inhomogeneous electromagnetic field. By using the Riemann-Liouville operator, -operator, and Atangana-Baleanu fractional analysis, the examined equation-which consists of the normalized electric field of the Langmuir oscillation and normalized density perturbation-is thoroughly examined.

View Article and Find Full Text PDF

Susceptibility formulation of density matrix perturbation theory.

J Chem Phys

December 2024

Division of Scientific Computing, Department of Information Technology, Uppsala University, Box 337, SE-751 05 Uppsala, Sweden.

Density matrix perturbation theory based on recursive Fermi-operator expansions provides a computationally efficient framework for time-independent response calculations in quantum chemistry and materials science. From a perturbation in the Hamiltonian, we can calculate the first-order perturbation in the density matrix, which then gives us the linear response in the expectation values for some chosen set of observables. We present an alternative, dual formulation, where we instead calculate the static susceptibility of an observable, which then gives us the linear response in the expectation values for any number of different Hamiltonian perturbations.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!