Comparison of false-discovery rates of various decoy databases.

Proteome Sci

Center for Supercomputing Applications, Korea Institute of Science and Technology Information, Daejeon, 34141, Republic of Korea.

Published: September 2021

Background: The target-decoy strategy effectively estimates the false-discovery rate (FDR) by creating a decoy database with a size identical to that of the target database. Decoy databases are created by various methods, such as, the reverse, pseudo-reverse, shuffle, pseudo-shuffle, and the de Bruijn methods. FDR is sometimes over- or under-estimated depending on which decoy database is used because the ratios of redundant peptides in the target databases are different, that is, the numbers of unique (non-redundancy) peptides in the target and decoy databases differ.

Results: We used two protein databases (the UniProt Saccharomyces cerevisiae protein database and the UniProt human protein database) to compare the FDRs of various decoy databases. When the ratio of redundant peptides in the target database is low, the FDR is not overestimated by any decoy construction method. However, if the ratio of redundant peptides in the target database is high, the FDR is overestimated when the (pseudo) shuffle decoy database is used. Additionally, human and S. cerevisiae six frame translation databases, which are large databases, also showed outcomes similar to that from the UniProt human protein database.

Conclusion: The FDR must be estimated using the correction factor proposed by Elias and Gygi or that by Kim et al. when (pseudo) shuffle decoy databases are used.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8449453PMC
http://dx.doi.org/10.1186/s12953-021-00179-7DOI Listing

Publication Analysis

Top Keywords

decoy databases
20
peptides target
16
decoy database
12
target database
12
redundant peptides
12
decoy
9
databases
9
database
8
protein database
8
uniprot human
8

Similar Publications

The increasing utilization of deep learning models in drug repositioning has proven to be highly efficient and effective. In this study, we employed an integrated deep-learning model followed by traditional drug screening approach to screen a library of FDA-approved drugs, aiming to identify novel inhibitors targeting the TNF-α converting enzyme (TACE). TACE, also known as ADAM17, plays a crucial role in the inflammatory response by converting pro-TNF-α to its active soluble form and cleaving other inflammatory mediators, making it a promising target for therapeutic intervention in diseases such as rheumatoid arthritis.

View Article and Find Full Text PDF

Plasmapheresis for extracorporeal membrane oxygenation (ECMO)-induced hemolysis in infants.

J Extra Corpor Technol

December 2024

Division of Pediatric Nephrology, Joe DiMaggio Children's Hospital, 1131 N35th Ave, Hollywood, FL 33021, USA - Charles E. Schmidt College of Medicine at Florida Atlantic University, 777 Glades Rd BC-71, Boca Raton, FL 33431, USA.

Background: Intravascular hemolysis is a known complication of extracorporeal membrane oxygenation (ECMO). Characterized by elevated plasma-free hemoglobin (PFH), intravascular hemolysis is associated with cytotoxic effects leading to renal replacement therapy (RRT), longer ECMO runs, and mortality. Therapeutic plasma exchange (TPE) in tandem with ECMO was described as a therapy for various pathologic conditions, but there are no Extracorporeal Life Support Organization (ELSO) guidelines for the treatment of ECMO-induced hemolysis.

View Article and Find Full Text PDF

Redesigning error control in cross-linking mass spectrometry enables more robust and sensitive protein-protein interaction studies.

Mol Syst Biol

January 2025

Research group "Structural Interactomics", Leibniz Forschungsinstitut für Molekulare Pharmakologie, Robert-Rössle-Str. 10, 13125, Berlin, Germany.

Cross-linking mass spectrometry (XL-MS) allows characterizing protein-protein interactions (PPIs) in native biological systems by capturing cross-links between different proteins (inter-links). However, inter-link identification remains challenging, requiring dedicated data filtering schemes and thorough error control. Here, we benchmark existing data filtering schemes combined with error rate estimation strategies utilizing concatenated target-decoy protein sequence databases.

View Article and Find Full Text PDF

Developing selective kinase inhibitors remains a formidable challenge in drug discovery because of the highly conserved structural information on adenosine triphosphate (ATP) binding sites across the kinase family. Tailoring docking protocols to identify promising kinase inhibitor candidates for optimization has long been a substantial obstacle to drug discovery. Therefore, we introduced "Kinase-Bench," a pioneering benchmark suite designed for an advanced virtual screen, to improve the selectivity and efficacy of kinase inhibitors.

View Article and Find Full Text PDF

Even though COVID-19 is no longer the primary focus of the global scientific community, its high mutation rate (nearly 30 substitutions per year) poses a threat of a potential comeback. Effective vaccines have been developed and administered to the population, ending the pandemic. Nonetheless, reinfection by newly emerging subvariants, particularly the latest JN.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!