SEM: size-based expectation maximization for characterizing nucleosome positions and subtypes.

bioRxiv

Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA, USA.

Published: October 2023

Genome-wide nucleosome profiles are predominantly characterized using MNase-seq, which involves extensive MNase digestion and size selection to enrich for mono-nucleosome-sized fragments. Most available MNase-seq analysis packages assume that nucleosomes uniformly protect 147 bp DNA fragments. However, some nucleosomes with atypical histone or chemical compositions protect shorter lengths of DNA. The rigid assumptions imposed by current nucleosome analysis packages ignore variation in nucleosome lengths, potentially blinding investigators to regulatory roles played by atypical nucleosomes. To enable the characterization of different nucleosome types from MNase-seq data, we introduce the Size-based Expectation Maximization (SEM) nucleosome calling package. SEM employs a hierarchical Gaussian mixture model to estimate the positions and subtype identity of nucleosomes from MNase-seq fragments. Nucleosome subtypes are automatically identified based on the distribution of protected DNA fragment lengths at nucleosome positions. Benchmark analysis indicates that SEM is on par with existing packages in terms of standard nucleosome-calling accuracy metrics, while uniquely providing the ability to characterize nucleosome subtype identities. Using SEM on a low-dose MNase H2B MNase-ChIP-seq dataset from mouse embryonic stem cells, we identified three nucleosome types: short-fragment nucleosomes, canonical nucleosomes, and di-nucleosomes. The short-fragment nucleosomes can be divided further into two subtypes based on their chromatin accessibility. Interestingly, the subset of short-fragment nucleosomes in accessible regions exhibits high MNase sensitivity and displays distribution patterns around transcription start sites (TSSs) and CTCF peaks similar to those of the previously reported "fragile nucleosomes". These SEM-defined accessible short-fragment nucleosomes are found not just in promoters, but also in enhancers and other regulatory regions. Additional investigations reveal their co-localization with the chromatin remodelers Chd6, Chd8, and Ep400. In summary, SEM provides an effective platform for distinguishing various nucleosome subtypes, paving the way for future exploration of non-standard nucleosomes.
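
The abstract describes identifying nucleosome subtypes from the distribution of protected fragment lengths using a Gaussian mixture fitted by expectation maximization. The sketch below illustrates only that general idea on simulated data; it is not SEM's implementation. SEM's model is hierarchical and also estimates nucleosome positions, which this example omits. The function names (`simulate_lengths`, `fit_length_mixture`), the simulated means of 110, 147, and 300 bp, the spreads, and the starting values are all assumptions chosen for illustration, not values from the paper.

```python
import math
import random


def normal_pdf(x, mean, sd):
    """Density of a normal distribution at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def simulate_lengths(seed=0):
    """Draw made-up fragment lengths (bp) from three hypothetical subtypes."""
    rng = random.Random(seed)
    # (count, mean length, sd) per subtype; illustrative values only.
    spec = [(300, 110.0, 8.0), (600, 147.0, 8.0), (100, 300.0, 15.0)]
    return [rng.gauss(mu, sd) for n, mu, sd in spec for _ in range(n)]


def fit_length_mixture(lengths, init_means, init_sd=15.0, n_iter=100, min_sd=1.0):
    """Fit a 1-D Gaussian mixture to fragment lengths with EM.

    Returns a list of (weight, mean, sd) tuples, one per component.
    """
    k = len(init_means)
    weights = [1.0 / k] * k
    means = list(init_means)
    sds = [init_sd] * k

    for _ in range(n_iter):
        # E-step: responsibility of each component for each fragment.
        resp = []
        for x in lengths:
            dens = [w * normal_pdf(x, m, s) for w, m, s in zip(weights, means, sds)]
            total = sum(dens) or 1e-300  # guard against division by zero
            resp.append([d / total for d in dens])

        # M-step: re-estimate weight, mean, and sd from the responsibilities.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-9:
                continue  # leave an effectively empty component unchanged
            weights[j] = nj / len(lengths)
            means[j] = sum(r[j] * x for r, x in zip(resp, lengths)) / nj
            var = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, lengths)) / nj
            sds[j] = max(math.sqrt(var), min_sd)  # floor the sd to avoid collapse

    return list(zip(weights, means, sds))


if __name__ == "__main__":
    fitted = fit_length_mixture(simulate_lengths(), init_means=[100.0, 150.0, 280.0])
    for weight, mean, sd in fitted:
        print(f"weight={weight:.2f} mean={mean:.1f} bp sd={sd:.1f} bp")
```

In this toy setting the number of components is fixed in advance through `init_means`; the abstract states that SEM identifies subtypes automatically, so component selection is another aspect the sketch does not attempt.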

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10614873
DOI: http://dx.doi.org/10.1101/2023.10.17.562727

Similar Publications

Size-based expectation maximization for characterizing nucleosome positions and subtypes.

Genome Res

October 2024

Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA;

Genome-wide nucleosome profiles are predominantly characterized using MNase-seq, which involves extensive MNase digestion and size selection to enrich for mononucleosome-sized fragments. Most available MNase-seq analysis packages assume that nucleosomes uniformly protect 147 bp DNA fragments. However, some nucleosomes with atypical histone or chemical compositions protect shorter lengths of DNA.

DeNOPA: decoding nucleosome positions sensitively with sparse ATAC-seq data.

Brief Bioinform

January 2022

CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, and China National Center for Bioinformation, Beijing 100101, China.

As the basic building blocks of chromatin, nucleosomes, through their dynamics and arrangement, fundamentally shape higher-order chromatin architecture and thereby affect almost all nuclear processes. Thanks to its rather simple protocol, assay for transposase-accessible chromatin using sequencing (ATAC-seq) has been rapidly adopted as a major tool for chromatin accessibility profiling at both bulk and single-cell levels; however, resolving the arrangement of nucleosomes themselves remains a challenge with ATAC-seq. In the present work, we introduce a novel ATAC-seq analysis toolkit, named decoding nucleosome organization profile based on ATAC-seq data (deNOPA), to predict nucleosome positions.

Fine-Resolution Mapping of TF Binding and Chromatin Interactions.

Cell Rep

March 2018

School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel; Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel.

Transcription factor (TF) binding to DNA is crucial for transcriptional regulation. There are multiple methods for mapping such binding. These methods balance input requirements, spatial resolution, and compatibility with high-throughput automation.
