DWARF--a data warehouse system for analyzing protein families.

BMC Bioinformatics

Institute of Technical Biochemistry, University of Stuttgart, Allmandring 31, D-70569, Germany.

Published: November 2006

Background: The emerging field of integrative bioinformatics provides the tools to organize and systematically analyze vast amounts of highly diverse biological data, and thus makes it possible to gain a novel understanding of complex biological systems. The data warehouse DWARF applies integrative bioinformatics approaches to the analysis of large protein families.

Description: The data warehouse system DWARF integrates data on sequence, structure, and functional annotation for protein fold families. The underlying relational data model consists of three major sections representing entities related to the protein (biochemical function, source organism, classification into homologous families and superfamilies), the protein sequence (position-specific annotation, mutant information), and the protein structure (secondary structure information, superimposed tertiary structure). Tools for extracting, transforming, and loading data from publicly available resources (ExPDB, GenBank, DSSP) are provided to populate the database. The data can be accessed through an interface for searching and browsing, and through analysis tools that operate on annotation, sequence, or structure. We applied DWARF to the family of alpha/beta-hydrolases to host the Lipase Engineering Database. Release 2.3 contains 6138 sequences and 167 experimentally determined protein structures, which are assigned to 37 superfamilies and 103 homologous families.
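The three-section data model described above can be illustrated with a small relational sketch. The following is a minimal example using Python's built-in sqlite3 module, not the actual DWARF schema: every table name, column name, and demo row is hypothetical and chosen only to mirror the protein, sequence, and structure sections named in the abstract. Mutant information and superimposed tertiary structures are omitted for brevity.

```python
import sqlite3

# Hypothetical, simplified schema. Table and column names are illustrative
# assumptions and are not taken from the DWARF data model itself.
SCHEMA = """
CREATE TABLE superfamily (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL
);
CREATE TABLE homologous_family (
    id INTEGER PRIMARY KEY,
    superfamily_id INTEGER NOT NULL REFERENCES superfamily(id),
    name TEXT NOT NULL
);
-- Protein section: function, source organism, family classification
CREATE TABLE protein (
    id INTEGER PRIMARY KEY,
    family_id INTEGER NOT NULL REFERENCES homologous_family(id),
    biochemical_function TEXT,
    source_organism TEXT
);
-- Sequence section: residues plus position-specific annotation
CREATE TABLE protein_sequence (
    id INTEGER PRIMARY KEY,
    protein_id INTEGER NOT NULL REFERENCES protein(id),
    residues TEXT NOT NULL
);
CREATE TABLE position_annotation (
    sequence_id INTEGER NOT NULL REFERENCES protein_sequence(id),
    position INTEGER NOT NULL,  -- 1-based residue position
    annotation TEXT NOT NULL
);
-- Structure section: secondary structure string per structure entry
CREATE TABLE protein_structure (
    id INTEGER PRIMARY KEY,
    sequence_id INTEGER NOT NULL REFERENCES protein_sequence(id),
    secondary_structure TEXT
);
"""


def build_demo_db():
    """Create an in-memory database holding one made-up protein."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute("INSERT INTO superfamily VALUES (1, 'demo superfamily')")
    conn.execute("INSERT INTO homologous_family VALUES (1, 1, 'demo family')")
    conn.execute("INSERT INTO protein VALUES (1, 1, 'lipase', 'demo organism')")
    conn.execute("INSERT INTO protein_sequence VALUES (1, 1, 'MKSA')")
    conn.execute("INSERT INTO position_annotation VALUES (1, 3, 'active site')")
    conn.execute("INSERT INTO protein_structure VALUES (1, 1, 'HHEE')")
    return conn


def annotated_positions(conn, superfamily_name):
    """Return (protein id, position, residue, annotation) for one superfamily."""
    return conn.execute(
        """
        SELECT p.id, a.position, substr(s.residues, a.position, 1), a.annotation
        FROM superfamily sf
        JOIN homologous_family hf ON hf.superfamily_id = sf.id
        JOIN protein p ON p.family_id = hf.id
        JOIN protein_sequence s ON s.protein_id = p.id
        JOIN position_annotation a ON a.sequence_id = s.id
        WHERE sf.name = ?
        ORDER BY p.id, a.position
        """,
        (superfamily_name,),
    ).fetchall()
```

Calling `annotated_positions(build_demo_db(), "demo superfamily")` joins across the protein and sequence sections to list annotated residues for every protein in a superfamily, which is the kind of cross-section query a warehouse layout like this is meant to make straightforward.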

Conclusion: DWARF has been designed for constructing databases of large structurally related protein families and for evaluating their sequence-structure-function relationships by a systematic analysis of sequence, structure and functional annotation. It has been applied to predict biochemical properties from sequence, and serves as a valuable tool for protein engineering.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1647292
DOI: http://dx.doi.org/10.1186/1471-2105-7-495


