Publications by authors named "Shawn J Cokus"

Article Synopsis
  • The genus Quercus began diversifying about 55 million years ago, resulting in around 450 species, including the California oak Quercus lobata, which has a high-quality genome assembly that showcases its evolutionary advantages.
  • Analysis of the oak's genome revealed a large effective population size despite a historical decline, with extensive gene duplications contributing to its genetic and phenotypic diversity.
  • Unique patterns of DNA methylation connected to transposable elements indicate a presence of heterochromatin similar to grasses, supporting the idea that these genetic features enhance adaptability to environmental changes.
View Article and Find Full Text PDF

Epithelial cells are the building blocks of many organs, including skin. The vertebrate skin initially consists of two epithelial layers, the outer periderm and inner basal cell layers, which have distinct properties, functions, and fates. The embryonic periderm ultimately disappears during development, whereas basal cells proliferate to form the mature, stratified epidermis.

View Article and Find Full Text PDF

Microalgae have potential to help meet energy and food demands without exacerbating environmental problems. There is interest in the unicellular green alga , because it produces lipids for biofuels and a highly valuable carotenoid nutraceutical, astaxanthin. To advance understanding of its biology and facilitate commercial development, we present a chromosome-level nuclear genome, organelle genomes, and transcriptome from diverse growth conditions.

View Article and Find Full Text PDF

Previously, we have shown that loss of the histone 3 lysine 27 (H3K27) monomethyltransferases ARABIDOPSIS TRITHORAX-RELATED 5 (ATXR5) and ATXR6 (ATXR6) results in the overreplication of heterochromatin. Here we show that the overreplication results in DNA damage and extensive chromocenter remodeling into unique structures we have named "overreplication-associated centers" (RACs). RACs have a highly ordered structure with an outer layer of condensed heterochromatin, an inner layer enriched in the histone variant H2AX, and a low-density core containing foci of phosphorylated H2AX (a marker of double-strand breaks) and the DNA-repair enzyme RAD51.

View Article and Find Full Text PDF

Background: Improvement in the performance of eukaryotic microalgae for biofuel and bioproduct production is largely dependent on characterization of metabolic mechanisms within the cell. The marine diatom which was originally identified in the Aquatic Species Program, is a promising strain of microalgae for large-scale production of biofuel and bioproducts, such as omega-3 fatty acids.

Results: We sequenced the nuclear genome and methylome of this oleaginous diatom to identify the genetic traits that enable substantial accumulation of triacylglycerol.

View Article and Find Full Text PDF

DNA methylation is an epigenetic mechanism that has important functions in transcriptional silencing and is associated with repressive histone methylation (H3K9me). To further investigate silencing mechanisms, we screened a mutagenized Arabidopsis thaliana population for expression of SDCpro-GFP, redundantly controlled by DNA methyltransferases DRM2 and CMT3. Here, we identify the hypomorphic mutant mthfd1-1, carrying a mutation (R175Q) in the cytoplasmic bifunctional methylenetetrahydrofolate dehydrogenase/methenyltetrahydrofolate cyclohydrolase (MTHFD1).

View Article and Find Full Text PDF

Background: Reference transcriptomes provide valuable resources for understanding evolution within and among species. We de novo assembled and annotated a reference transcriptome for Quercus lobata and Q. garryana and identified single-nucleotide polymorphisms (SNPs) to provide resources for forest genomicists studying this ecologically and economically important genus.

View Article and Find Full Text PDF

Chromosomes form 3D structures that are critical to the regulation of cellular and genetic processes. Here, we present a study of global chromatin interaction patterns in Arabidopsis thaliana. Our genome-wide approach confirmed interactions that were previously observed by other methods as well as uncovered long-range interactions such as those among small heterochromatic regions embedded in euchromatic arms.

View Article and Find Full Text PDF

Transposable elements (TEs) and DNA repeats are commonly targeted by DNA and histone methylation to achieve epigenetic gene silencing. We isolated mutations in two Arabidopsis genes, AtMORC1 and AtMORC6, which cause derepression of DNA-methylated genes and TEs but no losses of DNA or histone methylation. AtMORC1 and AtMORC6 are members of the conserved Microrchidia (MORC) adenosine triphosphatase (ATPase) family, which are predicted to catalyze alterations in chromosome superstructure.

View Article and Find Full Text PDF

Algae have recently gained attention as a potential source for biodiesel; however, much is still unknown about the biological triggers that cause the production of triacylglycerols. We used RNA-Seq as a tool for discovering genes responsible for triacylglycerol (TAG) production in Chlamydomonas and for the regulatory components that activate the pathway. Three genes encoding acyltransferases, DGAT1, DGTT1, and PDAT1, are induced by nitrogen starvation and are likely to have a role in TAG accumulation based on their patterns of expression.

View Article and Find Full Text PDF

Background: Progress in genome sequencing is proceeding at an exponential pace, and several new algal genomes are becoming available every year. One of the challenges facing the community is the association of protein sequences encoded in the genomes with biological function. While most genome assembly projects generate annotations for predicted protein sequences, they are usually limited and integrate functional terms from a limited number of databases.

View Article and Find Full Text PDF

De novo DNA methylation in Arabidopsis thaliana is catalyzed by the methyltransferase DRM2, a homolog of the mammalian de novo methyltransferase DNMT3. DRM2 is targeted to DNA by small interfering RNAs (siRNAs) in a process known as RNA-directed DNA Methylation (RdDM). While several components of the RdDM pathway are known, a functional understanding of the underlying mechanism is far from complete.

View Article and Find Full Text PDF

Nucleosomes compact and regulate access to DNA in the nucleus, and are composed of approximately 147 bases of DNA wrapped around a histone octamer. Here we report a genome-wide nucleosome positioning analysis of Arabidopsis thaliana using massively parallel sequencing of mononucleosomes. By combining this data with profiles of DNA methylation at single base resolution, we identified 10-base periodicities in the DNA methylation status of nucleosome-bound DNA and found that nucleosomal DNA was more highly methylated than flanking DNA.

View Article and Find Full Text PDF

Background: Bisulfite sequencing using next generation sequencers yields genome-wide measurements of DNA methylation at single nucleotide resolution. Traditional aligners are not designed for mapping bisulfite-treated reads, where the unmethylated Cs are converted to Ts. We have developed BS Seeker, an approach that converts the genome to a three-letter alphabet and uses Bowtie to align bisulfite-treated reads to a reference genome.

View Article and Find Full Text PDF

Cytosine DNA methylation is a heritable epigenetic mark present in many eukaryotic organisms. Although DNA methylation likely has a conserved role in gene silencing, the levels and patterns of DNA methylation appear to vary drastically among different organisms. Here we used shotgun genomic bisulfite sequencing (BS-Seq) to compare DNA methylation in eight diverse plant and animal genomes.

View Article and Find Full Text PDF

Epigenetic reprogramming including demethylation of DNA occurs in mammalian primordial germ cells (PGCs) and in early embryos, and is important for the erasure of imprints and epimutations, and the return to pluripotency. The extent of this reprogramming and its molecular mechanisms are poorly understood. We previously showed that the cytidine deaminases AID and APOBEC1 can deaminate 5-methylcytosine in vitro and in Escherichia coli, and in the mouse are expressed in tissues in which demethylation occurs.

View Article and Find Full Text PDF

Background: Many of the functional units in cells are multi-protein complexes such as RNA polymerase, the ribosome, and the proteasome. For such units to work together, one might expect a high level of regulation to enable co-appearance or repression of sets of complexes at the required time. However, this type of coordinated regulation between whole complexes is difficult to detect by existing methods for analyzing mRNA co-expression.

View Article and Find Full Text PDF

Copy number variants (CNVs) are genomic segments which are duplicated or deleted among different individuals. CNVs have been implicated in both Mendelian and complex traits, including immune and behavioral disorders, but the study of the mechanisms by which CNVs influence gene expression and clinical phenotypes in humans is complicated by the limited access to tissues and by population heterogeneity. We now report studies of the effect of 19 CNVs on gene expression and metabolic traits in a mouse intercross between strains C57BL/6J and C3H/HeJ.

View Article and Find Full Text PDF

Cytosine DNA methylation is important in regulating gene expression and in silencing transposons and other repetitive sequences. Recent genomic studies in Arabidopsis thaliana have revealed that many endogenous genes are methylated either within their promoters or within their transcribed regions, and that gene methylation is highly correlated with transcription levels. However, plants have different types of methylation controlled by different genetic pathways, and detailed information on the methylation status of each cytosine in any given genome is lacking.

View Article and Find Full Text PDF

Acetylation of histones plays an important role in regulating transcription. Histone acetylation is mediated partly by the recruitment of specific histone acetyltransferases (HATs) and deacetylases (HDACs) to genomic loci by transcription factors, resulting in modulation of gene expression. Although several specific interactions between transcription factors and HATs and HDACs have been elaborated in Saccharomyces cerevisiae, the full regulatory network remains uncharacterized.

View Article and Find Full Text PDF

The wealth of available genomic data has spawned a corresponding interest in computational methods that can impart biological meaning and context to these experiments. Traditional computational methods have drawn relationships between pairs of proteins or genes based on notions of equality or similarity between their patterns of occurrence or behavior. For example, two genes displaying similar variation in expression, over a number of experiments, may be predicted to be functionally related.

View Article and Find Full Text PDF

A major focus of genome research is to decipher the networks of molecular interactions that underlie cellular function. We describe a computational approach for identifying detailed relationships between proteins on the basis of genomic data. Logic analysis of phylogenetic profiles identifies triplets of proteins whose presence or absence obey certain logic relationships.

View Article and Find Full Text PDF