Publications by authors named "RyangGuk Kim"

Cancer sequencing studies are increasingly comprehensive and well powered, returning long lists of somatic mutations that can be difficult to sort and interpret. Diligent analysis and quality control can require multiple computational tools of distinct utility and producing disparate output, creating additional challenges for the investigator. The Cancer-Related Analysis of Variants Toolkit (CRAVAT) is an evolving suite of informatics tools for mutation interpretation that includes mutation mapping and quality control, impact prediction and extensive annotation, gene- and mutation-level interpretation, including joint prioritization of all nonsilent mutation consequence types, and structural and mechanistic visualization.

View Article and Find Full Text PDF

Mammalian chromosome replication starts from distinct sites; however, the principles governing initiation site selection are unclear because proteins essential for DNA replication do not exhibit sequence-specific DNA binding. Here we identify a replication-initiation determinant (RepID) protein that binds a subset of replication-initiation sites. A large fraction of RepID-binding sites share a common G-rich motif and exhibit elevated replication initiation.

View Article and Find Full Text PDF

Background: Next-generation sequencing techniques such as ChIP-seq allow researchers to investigate the genomic position of nuclear components and events. These experiments provide researchers with thousands of regions of interest to probe in order to identify biological relevance. As the cost of sequencing decreases and its robustness increases, more and more researchers turn to genome-wide studies to better understand the genomic elements they are studying.

View Article and Find Full Text PDF

The Mus81 endonuclease resolves recombination intermediates and mediates cellular responses to exogenous replicative stress. Here, we show that Mus81 also regulates the rate of DNA replication during normal growth by promoting replication fork progression while reducing the frequency of replication initiation events. In the absence of Mus81 endonuclease activity, DNA synthesis is slowed and replication initiation events are more frequent.

View Article and Find Full Text PDF

Mutation position imaging toolbox (MuPIT) interactive is a browser-based application for single-nucleotide variants (SNVs), which automatically maps the genomic coordinates of SNVs onto the coordinates of available three-dimensional (3D) protein structures. The application is designed for interactive browser-based visualization of the putative functional relevance of SNVs by biologists who are not necessarily experts either in bioinformatics or protein structure. Users may submit batches of several thousand SNVs and review all protein structures that cover the SNVs, including available functional annotations such as binding sites, mutagenesis experiments, and common polymorphisms.

View Article and Find Full Text PDF

Mammalian DNA replication starts at distinct chromosomal sites in a tissue-specific pattern coordinated with transcription, but previous studies have not yet identified a chromatin modification that correlates with the initiation of DNA replication at particular genomic locations. Here we report that a distinct fraction of replication initiation sites in the human genome are associated with a high frequency of dimethylation of histone H3 lysine K79 (H3K79Me2). H3K79Me2-containing chromatin exhibited the highest genome-wide enrichment for replication initiation events observed for any chromatin modification examined thus far (23.

View Article and Find Full Text PDF

Background: One of the crucial steps in regulation of gene expression is the binding of transcription factor(s) to specific DNA sequences. Knowledge of the binding affinity and specificity at a structural level between transcription factors and their target sites has important implications in our understanding of the mechanism of gene regulation. Due to their unique functions and binding specificity, there is a need for a transcription factor-specific, structure-based database and corresponding web service to facilitate structural bioinformatics studies of transcription factor-DNA interactions, such as development of knowledge-based interaction potential, transcription factor-DNA docking, binding induced conformational changes, and the thermodynamics of protein-DNA interactions.

View Article and Find Full Text PDF

Summary: SpliceSeq is a resource for RNA-Seq data that provides a clear view of alternative splicing and identifies potential functional changes that result from splice variation. It displays intuitive visualizations and prioritized lists of results that highlight splicing events and their biological consequences. SpliceSeq unambiguously aligns reads to gene splice graphs, facilitating accurate analysis of large, complex transcript variants that cannot be adequately represented in other formats.

View Article and Find Full Text PDF

Background: Structural insight from transcription factor-DNA (TF-DNA) complexes is of paramount importance to our understanding of the affinity and specificity of TF-DNA interaction, and to the development of structure-based prediction of TF binding sites. Yet the majority of the TF-DNA complexes remain unsolved despite the considerable experimental efforts being made. Computational docking represents a promising alternative to bridge the gap.

View Article and Find Full Text PDF

This report investigates the mechanisms by which mammalian cells coordinate DNA replication with transcription and chromatin assembly. In yeast, DNA replication initiates within nucleosome-free regions, but studies in mammalian cells have not revealed a similar relationship. Here, we have used genome-wide massively parallel sequencing to map replication initiation events, thereby creating a database of all replication initiation sites within nonrepetitive DNA in two human cell lines.

View Article and Find Full Text PDF

Background: Protein sequence insertions/deletions (indels) can be introduced during evolution or through alternative splicing (AS). Alternative splicing is an important biological phenomenon and is considered as the major means of expanding structural and functional diversity in eukaryotes. Knowledge of the structural changes due to indels is critical to our understanding of the evolution of protein structure and function.

View Article and Find Full Text PDF

Background: Knowledge of protein-DNA interactions at the structural-level can provide insights into the mechanisms of protein-DNA recognition and gene regulation. Although over 1400 protein-DNA complex structures have been deposited into Protein Data Bank (PDB), the structural details of protein-DNA interactions are generally not available. In addition, current approaches to comparison of protein-DNA complexes are mainly based on protein sequence similarity while the DNA sequences are not taken into account.

View Article and Find Full Text PDF

The prediction of the binding free energy between a ligand and a protein is an important component in the virtual screening and lead optimization of ligands for drug discovery. To determine the quality of current binding free energy estimation programs, we examined FlexX, X-Score, AutoDock, and BLEEP for their performance in binding free energy prediction in various situations including cocrystallized complex structures, cross docking of ligands to their non-cocrystallized receptors, docking of thermally unfolded receptor decoys to their ligands, and complex structures with "randomized" ligand decoys. In no case was there a satisfactory correlation between the experimental and estimated binding free energies over all the datasets tested.

View Article and Find Full Text PDF

The representation system for protein conformation has a crucial effect on the speed of various protein-related simulations, including ab initio protein structure prediction and protein-protein docking simulation. Usually, the finer a representation system, the longer is the computational time required to employ the representation system in simulations. On the other hand, very coarse lattice systems cannot be directly applied to the simulation problems with real proteins.

View Article and Find Full Text PDF