Recent studies have demonstrated extensive transcriptional activity across the human genome, a substantial fraction of which is not associated with any functional annotation. However, very little is known regarding the post-transcriptional processes that operate within the different classes of RNA molecules. To characterize the post-transcriptional properties of expressed sequences from human chromosome 21 (HSA21), we separated RNA molecules from three cell lines (GM06990, HeLa S3, and SK-N-AS) according to their ribosome content by sucrose gradient fractionation.
View Article and Find Full Text PDFTo date, the contribution of disrupted potentially cis-regulatory conserved non-coding sequences (CNCs) to human disease is most likely underestimated, as no systematic screens for putative deleterious variations in CNCs have been conducted. As a model for monogenic disease we studied the involvement of genetic changes of CNCs in the cis-regulatory domain of FOXL2 in blepharophimosis syndrome (BPES). Fifty-seven molecularly unsolved BPES patients underwent high-resolution copy number screening and targeted sequencing of CNCs.
View Article and Find Full Text PDFRapid amplification of cDNA ends (RACE) is a widely used approach for transcript identification. Random clone selection from the RACE mixture, however, is an ineffective sampling strategy if the dynamic range of transcript abundances is large. To improve sampling efficiency of human transcripts, we hybridized the products of the RACE reaction onto tiling arrays and used the detected exons to delineate a series of reverse-transcriptase (RT)-PCRs, through which the original RACE transcript population was segregated into simpler transcript populations.
View Article and Find Full Text PDFFunctional RNA structures play an important role both in the context of noncoding RNA transcripts as well as regulatory elements in mRNAs. Here we present a computational study to detect functional RNA structures within the ENCODE regions of the human genome. Since structural RNAs in general lack characteristic signals in primary sequence, comparative approaches evaluating evolutionary conservation of structures are most promising.
View Article and Find Full Text PDFBackground: The GENCODE consortium was formed to identify and map all protein-coding genes within the ENCODE regions. This was achieved by a combination of initial manual annotation by the HAVANA team, experimental validation by the GENCODE consortium and a refinement of the annotation based on these experimental results.
Results: The GENCODE gene features are divided into eight different categories of which only the first two (known and novel coding sequence) are confidently predicted to be protein-coding genes.
Background: We present the results of EGASP, a community experiment to assess the state-of-the-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions.
View Article and Find Full Text PDFCartilage-hair hypoplasia (CHH) is a pleiotropic disease caused by recessive mutations in the RMRP gene that result in a wide spectrum of manifestations including short stature, sparse hair, metaphyseal dysplasia, anemia, immune deficiency, and increased incidence of cancer. Molecular diagnosis of CHH has implications for management, prognosis, follow-up, and genetic counseling of affected patients and their families. We report 20 novel mutations in 36 patients with CHH and describe the associated phenotypic spectrum.
View Article and Find Full Text PDFThere are five members of the RFX family of transcription factors in mammals. While RFX5 plays a well-defined role in the immune system, the functions of RFX1 to RFX4 remain largely unknown. We have generated mice with a deletion of the Rfx3 gene.
View Article and Find Full Text PDFAnalysis of the human and mouse genomes identified an abundance of conserved non-genic sequences (CNGs). The significance and evolutionary depth of their conservation remain unanswered. We have quantified levels and patterns of conservation of 191 CNGs of human chromosome 21 in 14 mammalian species.
View Article and Find Full Text PDFA primary motivation for sequencing the mouse genome was to accelerate the discovery of mammalian genes by using sequence conservation between mouse and human to identify coding exons. Achieving this goal proved challenging because of the large proportion of the mouse and human genomes that is apparently conserved but apparently does not code for protein. We developed a two-stage procedure that exploits the mouse and human genome sequences to produce a set of genes with a much higher rate of experimental verification than previously reported prediction methods.
View Article and Find Full Text PDFGenome-wide expression analyses have a crucial role in functional genomics. High resolution methods, such as RNA in situ hybridization provide an accurate description of the spatiotemporal distribution of transcripts as well as a three-dimensional 'in vivo' gene expression overview. We set out to analyse systematically the expression patterns of genes from an entire chromosome.
View Article and Find Full Text PDFThe use of comparative genomics to infer genome function relies on the understanding of how different components of the genome change over evolutionary time. The aim of such comparative analysis is to identify conserved, functionally transcribed sequences such as protein-coding genes and non-coding RNA genes, and other functional sequences such as regulatory regions, as well as other genomic features. Here, we have compared the entire human chromosome 21 with syntenic regions of the mouse genome, and have identified a large number of conserved blocks of unknown function.
View Article and Find Full Text PDFThe sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences.
View Article and Find Full Text PDFWilliams-Beuren syndrome (WBS) is a developmental disorder associated with haploinsufficiency of multiple genes at 7q11.23. Here, we report the characterization of WBSCR16, WBSCR17, WBSCR18, WBSCR20A, WBSCR20B, WBSCR20C, WBSCR21, WBSCR22, and WBSCR23, nine novel genes contained in the WBS commonly deleted region or its flanking sequences.
View Article and Find Full Text PDFThe identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of HC21 open reading frames (C21orfs) identified by mapping spliced expressed sequence tags (ESTs) and predicted genes (PREDs), identified only in silico.
View Article and Find Full Text PDFA supernumerary copy of human chromosome 21 (HC21) causes Down syndrome. To understand the molecular pathogenesis of Down syndrome, it is necessary to identify all HC21 genes. The first annotation of the sequence of 21q confirmed 127 genes, and predicted an additional 98 previously unknown "anonymous" genes (predictions (PREDs) and open reading frames (C21orfs)), which were foreseen by exon prediction programs and/or spliced expressed sequence tags.
View Article and Find Full Text PDFWe earlier proposed that a human endogenous retroviral (HERV) superantigen (SAg) IDDMK(1,2)22 may cause type I diabetes by activating autoreactive T cells. Viral infections and induction of interferon-alpha (IFN-alpha) are tightly associated with the onset of autoimmunity. Here we establish a link between viral infections and IFN-alpha-regulated SAg expression of the polymorphic and defective HERV-K18 provirus.
View Article and Find Full Text PDFMajor histocompatibility complex class II (MHC-II) molecules present peptide antigens to CD4-positive T cells and are of critical importance for the immune response. The MHC-II transactivator CIITA is essential for all aspects of MHC-II gene expression examined so far and thus constitutes a master regulator of MHC-II expression. In this study, we generated and analyzed mutant CIITA molecules which are able to suppress endogenous MHC-II expression in a dominant negative manner for both constitutive and inducible MHC-II expression.
View Article and Find Full Text PDFCongenital MHC class II deficiency or bare lymphocyte syndrome (BLS; McKusick 209920) is caused by defects in trans-acting regulatory factors that control MHC class II expression and is therefore a disease of gene regulation. There are at least four complementation groups and the genetic and molecular dissection of this rare disease has contributed considerably to our current understanding of the molecular mechanisms governing MHC class II expression. Identification of the gene that is defective in BLS complementation group A, CIITA (MHC class II transactivator), has led to the discovery that CIITA acts as a master control factor of MHC class II expression.
View Article and Find Full Text PDFRFX transcription factors constitute a highly conserved family of site-specific DNA binding proteins involved in the expression of a variety of cellular and viral genes, including major histocompatibility complex class II genes and genes in human hepatitis B virus. Five members of the RFX gene family have been isolated from human and mouse, and all share a highly characteristic DNA binding domain that is distinct from other known DNA binding motifs. The human RFX1 and RFX2 genes have been assigned by in situ hybridization to chromosome 19p13.
View Article and Find Full Text PDF