Motivation: Increase the discriminatory power of PROSITE profiles to facilitate function determination and provide biologically relevant information about domains detected by profiles for the annotation of proteins.
Summary: We have created a new database, ProRule, which contains additional information about PROSITE profiles. Notably, ProRule contains the positions of structurally and/or functionally critical amino acids, as well as the conditions they must fulfill to play their biological role. These supplementary data should help function determination and annotation of the UniProt Swiss-Prot knowledgebase. ProRule also contains information, in the Swiss-Prot line format, about the domain detected by each profile. Hence, ProRule can be used to make Swiss-Prot annotation more homogeneous and consistent. The format of ProRule can be extended to provide information about combinations of domains.
Availability: ProRule can be accessed through ScanProsite at http://www.expasy.org/tools/scanprosite. A file containing the rules will be made available under the PROSITE copyright conditions on our ftp site (ftp://www.expasy.org/databases/prosite/) by the next PROSITE release.
DOI: http://dx.doi.org/10.1093/bioinformatics/bti614
Bioinformatics
July 2018
Department of Computer Science and Engineering, Seoul National University, Seoul, Korea.
Motivation: Next-generation sequencing technologies generate a large number of newly sequenced proteins, and assigning biochemical functions to these proteins is an important task. However, biological experiments are too expensive to characterize such a large number of protein sequences, so protein function prediction is primarily done by computational modeling methods, such as the profile Hidden Markov Model (pHMM) and k-mer based methods. Nevertheless, existing methods have limitations: k-mer based methods are not accurate enough to assign protein functions, and pHMM is not fast enough to handle the large number of protein sequences from numerous genome projects.
Sci Rep
February 2018
Climate Change Cluster (C3), University of Technology Sydney, PO Box 123 Broadway, NSW 2007, Australia.
Seagrasses and aquatic plants are important clades of higher plants, significant for carbon sequestration and marine ecological restoration. They are valuable in the sense that they allow us to understand how plants have developed traits to adapt to high salinity and photosynthetically challenged environments. Here, we present a large-scale phylogenetically profiled transcriptomics repository covering seagrasses and aquatic plants.
Virus Genes
April 2017
Department of Molecular Biology, Umeå University, 901 87, Umeå, Sweden.
Proteins harbor domains or short linear motifs, which facilitate their functions and interactions. Finding functional motifs in protein sequences could predict the putative cellular roles or characteristics of hypothetical proteins. In this study, we present Shetti-Motif, which is an interactive tool to (i) map UniProt and PROSITE flat files, (ii) search for multiple pre-defined consensus patterns or experimentally validated functional motifs in large datasets of protein sequences (proteome-wide), (iii) search for motifs containing repeated residues (low-complexity regions, e.
J Proteome Res
January 2015
Institute of Modern Biopharmaceuticals, State Key Laboratory Breeding Base of Eco-Environment and Bio-Resource of the Three Gorges Area, Key Laboratory of Eco-environments in Three Gorges Reservoir Region, Ministry of Education, School of Life Sciences, Southwest University, Beibei, Chongqing 400715, China.
Protein lysine succinylation, an emerging protein post-translational modification widespread among eukaryotic and prokaryotic cells, represents an important regulator of cellular processes. However, the extent and function of lysine succinylation in Mycobacterium tuberculosis, especially in extensively drug-resistant strains, remain elusive. Combining protein/peptide prefractionation, immunoaffinity enrichment, and LC-MS/MS analysis, a total of 686 succinylated proteins and 1739 succinylation sites of M.
Mol Biol Rep
December 2014
Human Genome Centre, School of Medical Sciences, Universiti Sains Malaysia, 16150, Kubang Kerian, Kelantan, Malaysia.
Computational epigenetics is a new area of research focused on exploring how DNA methylation patterns affect transcription factor binding, which in turn affects gene expression patterns. The aim of this study was to produce a new protocol for the detection of DNA methylation patterns using computational analysis, which can be further confirmed by bisulfite PCR with serial pyrosequencing. The upstream regulatory element and pre-initiation complex relative to CpG islets within the methylenetetrahydrofolate reductase gene were determined via computational analysis and online databases.