Proteins with domains that recognize and bind post-translational modifications (PTMs) of histones are collectively termed epigenetic readers. Numerous interactions between specific reader protein domains and histone PTMs and their regulatory outcomes have been reported, but little is known about how reader proteins may in turn be modulated by these interactions. Tripartite motif-containing protein 24 (TRIM24) is a histone reader aberrantly expressed in multiple cancers.
Reprogramming to induced pluripotent stem cells (iPSCs) and differentiation of pluripotent stem cells (PSCs) are regulated by epigenetic machinery. Tripartite motif protein 28 (TRIM28), a universal mediator of Krüppel-associated box domain zinc fingers (KRAB-ZNFs), is known to regulate both processes; however, the exact mechanism and identity of participating KRAB-ZNF genes remain unknown. Here, using a reporter system, we show that TRIM28/KRAB-ZNFs alter DNA methylation patterns in addition to H3K9me3 to cause stable gene repression during reprogramming.
The expression of Tripartite motif-containing protein 28 (TRIM28)/Krüppel-associated box (KRAB)-associated protein 1 (KAP1) is elevated in at least 14 tumor types, including solid and hematopoietic tumors. A high level of TRIM28 is associated with the triple-negative subtype of breast cancer (TNBC), which shows higher aggressiveness and lower survival rates. Interestingly, TRIM28 is essential for maintaining the pluripotent phenotype in embryonic stem cells.
Current treatment regimens for pancreatic ductal adenocarcinoma (PDAC) yield poor 5-year survival, emphasizing the critical need to identify druggable targets essential for PDAC maintenance. We developed an unbiased and in vivo target discovery approach to identify molecular vulnerabilities in low-passage and patient-derived PDAC xenografts or genetically engineered mouse model-derived allografts. Focusing on epigenetic regulators, we identified WDR5, a core member of the COMPASS histone H3 Lys4 (H3K4) MLL (1-4) methyltransferase complex, as a top tumor maintenance hit required across multiple human and mouse tumors.
The SWI/SNF multisubunit complex modulates chromatin structure through the activity of two mutually exclusive catalytic subunits, SMARCA2 and SMARCA4, which both contain a bromodomain and an ATPase domain. Using RNAi, cancer-specific vulnerabilities have been identified in SWI/SNF-mutant tumors, including SMARCA4-deficient lung cancer; however, the contribution of conserved, druggable protein domains to this anticancer phenotype is unknown. Here, we functionally deconstruct the SMARCA2/4 paralog dependence of cancer cells using bioinformatics, genetic, and pharmacologic tools.
Our current understanding of cancer genetics is grounded on the principle that cancer arises from a clone that has accumulated the requisite somatically acquired genetic aberrations, leading to malignant transformation. It also results in aberrant gene and protein expression. Next generation sequencing (NGS) or deep sequencing platforms are being used to create large catalogues of changes in copy numbers, mutations, structural variations, gene fusions, gene expression, and other types of information for cancer patients.
Androgen deprivation is the standard treatment for advanced prostate cancer (PCa), but most patients ultimately develop resistance and tumor recurrence. We found that MYB is transcriptionally activated by androgen deprivation therapy or genetic silencing of the androgen receptor (AR). MYB silencing inhibited PCa growth in culture and xenografts in mice.
Background: Multiple myeloma (MM) is a malignant proliferation of plasma B cells. Based on recurrent aneuploidy such as copy number alterations (CNAs), myeloma is divided into two subtypes with different CNA patterns and patient survival outcomes. How aneuploidy events arise and whether they contribute to cancer cell evolution are actively studied.
Multiple myeloma (MM) is a cancer of antibody-making plasma cells. It frequently harbors alterations in DNA and chromosome copy numbers, and can be divided into two major subtypes, hyperdiploid (HMM) and non-hyperdiploid multiple myeloma (NHMM). The two subtypes have different survival prognoses, possibly due to different but converging paths to oncogenesis.
Background & Objective: Genome-wide profiles of tumors obtained using functional genomics platforms are being deposited to the public repositories at an astronomical scale, as a result of focused efforts by individual laboratories and large projects such as the Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium. Consequently, there is an urgent need for reliable tools that integrate and interpret these data in light of current knowledge and disseminate results to biomedical researchers in a user-friendly manner. We have built the canEvolve web portal to meet this need.
Bortezomib therapy has proven successful for the treatment of relapsed/refractory, relapsed, and newly diagnosed multiple myeloma (MM); however, dose-limiting toxicities and the development of resistance limit its long-term utility. Here, we show that P5091 is an inhibitor of deubiquitylating enzyme USP7, which induces apoptosis in MM cells resistant to conventional and bortezomib therapies. Biochemical and genetic studies show that blockade of HDM2 and p21 abrogates P5091-induced cytotoxicity.
We describe here a novel method for integrating gene and miRNA expression profiles in cancer using feed-forward loops (FFLs) consisting of transcription factors (TFs), miRNAs and their common target genes. The dChip-GemiNI (Gene and miRNA Network-based Integration) method statistically ranks computationally predicted FFLs by their explanatory power to account for differential gene and miRNA expression between two biological conditions such as normal and cancer. GemiNI integrates not only gene and miRNA expression data but also computationally derived information about TF-target gene and miRNA-mRNA interactions.
Over the last decade, multiple functional genomic datasets studying chromosomal aberrations and their downstream effects on gene expression have accumulated for several cancer types. A vast majority of them are in the form of paired gene expression profiles and somatic copy number alterations (CNA) information on the same patients identified using microarray platforms. In response, many algorithms and software packages are available for integrating these paired data.
Background: Target-specific antibodies are pivotal for the design of vaccines, immunodiagnostic tests, studies on proteomics for cancer biomarker discovery, identification of protein-DNA and other interactions, and small and large biochemical assays. Therefore, it is important to understand the properties of protein sequences that are important for antigenicity and to identify small peptide epitopes and large regions in the linear sequence of the proteins whose utilization results in specific antibodies.
Results: Our analysis using protein properties suggested that sequence composition combined with evolutionary information and predicted secondary structure, as well as solvent accessibility is sufficient to predict successful peptide epitopes.
Systematic annotation of gene regulatory elements is a major challenge in genome science. Direct mapping of chromatin modification marks and transcriptional factor binding sites genome-wide has successfully identified specific subtypes of regulatory elements. In Drosophila several pioneering studies have provided genome-wide identification of Polycomb response elements, chromatin states, transcription factor binding sites, RNA polymerase II regulation and insulator elements; however, comprehensive annotation of the regulatory genome remains a significant challenge.
Background: Genome-wide expression signatures are emerging as potential markers for overall survival and disease recurrence risk, as evidenced by the recent commercialization of gene expression-based biomarkers in breast cancer. Similar predictions have recently been carried out using genome-wide copy number alterations and microRNAs. Existing software packages for microarray data analysis provide functions to define expression-based survival gene signatures.
Insulators are DNA sequences that control the interactions among genomic regulatory elements and act as chromatin boundaries. A thorough understanding of their location and function is necessary to address the complexities of metazoan gene regulation. We studied by ChIP-chip the genome-wide binding sites of 6 insulator-associated proteins (dCTCF, CP190, BEAF-32, Su(Hw), Mod(mdg4), and GAF) to obtain the first comprehensive map of insulator elements in Drosophila embryos.
Motivation: The highly coordinated expression of thousands of genes in an organism is regulated by the concerted action of transcription factors, chromatin proteins and epigenetic mechanisms. High-throughput experimental data for genome wide in vivo protein-DNA interactions and epigenetic marks are becoming available from large projects, such as the model organism ENCyclopedia Of DNA Elements (modENCODE) and from individual labs. Dissemination and visualization of these datasets in an explorable form is an important challenge.
We demonstrate an integrated approach to the study of a transcriptional regulatory cascade involved in the progression of breast cancer and we identify a protein associated with disease progression. Using chromatin immunoprecipitation and genome tiling arrays, whole genome mapping of transcription factor-binding sites was combined with gene expression profiling to identify genes involved in the proliferative response to estrogen (E2). Using RNA interference, selected ERalpha and c-MYC gene targets were knocked down to identify mediators of E2-stimulated cell proliferation.
Systematically annotating the function of enzymes that belong to large protein families encoded in a single eukaryotic genome is a very challenging task. We carried out such an exercise to annotate function for the serine-protease family of the trypsin fold in Drosophila melanogaster, with an emphasis on annotating serine-protease homologues (SPHs) that may have lost their catalytic function. Our approach involves data mining and data integration to provide function annotations for 190 Drosophila gene products containing serine-protease-like domains, of which 35 are SPHs.
Motivation: Availability of large volumes of genomic and enzymatic data for taxonomically and phenotypically diverse organisms allows for exploration of the adaptive mechanisms that led to diversification of enzymatic functions. We present Chisel, a computational framework and a pipeline for an automated, high-resolution analysis of evolutionary variations of enzymes. Chisel allows automatic as well as interactive identification and characterization of enzymatic sequences.
In a genome-wide analysis, we have identified 85 human genes encoding 103 protein isoforms that resemble retroviral Gag proteins. These genes were domesticated from retrotransposons in at least five independent events during vertebrate evolution and were subsequently duplicated further in mammals. Structural insights into the mammalian proteins can be inferred by homology to Gag from viruses such as HIV; in turn, the cellular roles of the mammalian Gag homologs, such as apoptosis-related functions and binding to ubiquitin ligases, might hint at further functionality of viral Gag itself.
Motivation: Generation of alternative transcripts from the same gene is an important biological event because it contributes to functional diversity in eukaryotes. In this work, we address the task of extracting information on this complex topic using a two-step procedure involving machine learning and information extraction.
Results: In the first step, we trained a classifier that inductively learns to identify sentences about physiological transcript diversity from the MEDLINE abstracts.
Transcript diversity generated by alternative splicing and associated mechanisms contributes heavily to the functional complexity of biological systems. The numerous examples of the mechanisms and functional implications of these events are scattered throughout the scientific literature. Thus, it is crucial to have a tool that can automatically extract the relevant facts and collect them in a knowledge base that can aid the interpretation of data from high-throughput methods.