CoBRA: Containerized Bioinformatics Workflow for Reproducible ChIP/ATAC-seq Analysis.

Genomics Proteomics Bioinformatics

Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA. Electronic address:

Published: August 2021

Chromatin immunoprecipitation sequencing (ChIP-seq) and the Assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq) have become essential technologies to effectively measure protein-DNA interactions and chromatin accessibility. However, there is a need for a scalable and reproducible pipeline that incorporates proper normalization between samples, correction of copy number variations, and integration of new downstream analysis tools. Here we present Containerized Bioinformatics workflow for Reproducible ChIP/ATAC-seq Analysis (CoBRA), a modularized computational workflow which quantifies ChIP-seq and ATAC-seq peak regions and performs unsupervised and supervised analyses. CoBRA provides a comprehensive state-of-the-art ChIP-seq and ATAC-seq analysis pipeline that can be used by scientists with limited computational experience. This enables researchers to gain rapid insight into protein-DNA interactions and chromatin accessibility through sample clustering, differential peak calling, motif enrichment, comparison of sites to a reference database, and pathway analysis. CoBRA is publicly available online at https://bitbucket.org/cfce/cobra.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9039557PMC
http://dx.doi.org/10.1016/j.gpb.2020.11.007DOI Listing

Publication Analysis

Top Keywords

containerized bioinformatics
8
bioinformatics workflow
8
workflow reproducible
8
reproducible chip/atac-seq
8
chip/atac-seq analysis
8
protein-dna interactions
8
interactions chromatin
8
chromatin accessibility
8
analysis cobra
8
chip-seq atac-seq
8

Similar Publications

Article Synopsis
  • - CAKUT (Congenital Anomalies of the Kidney and Urinary Tract) is a major cause of chronic kidney diseases in both children and adults, with its development influenced by genetic and environmental factors, though the exact mechanisms remain unclear.
  • - Researchers analyzed the omics data (miRNome, peptidome, and proteome) from over 30 amniotic fluid samples of patients with varying severity of CAKUT, making this data FAIR for better integration with external resources.
  • - Using bioinformatics, the study highlighted the significance of collagen and the PI3K-AKT signaling pathway in CAKUT, identifying key genes and microRNAs, and developed workflows that can be used for studying other rare
View Article and Find Full Text PDF

synphage: a pipeline for phage genome synteny graphics focused on gene conservation.

Bioinform Adv

August 2024

Department of Microbiology, Biotechnical Faculty, University of Ljubljana, 1000 Ljubljana, Slovenia.

Motivation: Visualization and comparison of genome maps of bacteriophages can be very effective, but none of the tools available on the market allow visualization of gene conservation between multiple sequences at a glance. In addition, most bioinformatic tools running locally are command line only, making them hard to setup, debug, and monitor.

Results: To address these motivations, we developed synphage, an easy-to-use and intuitive tool to generate synteny diagrams from GenBank files.

View Article and Find Full Text PDF

Background: Determining the appropriate computational requirements and software performance is essential for efficient genomic surveillance. The lack of standardized benchmarking complicates software selection, especially with limited resources.

Methods: We developed a containerized benchmarking pipeline to evaluate seven long-read assemblers-Canu, GoldRush, MetaFlye, Strainline, HaploDMF, iGDA, and RVHaplo-for viral haplotype reconstruction, using both simulated and experimental Oxford Nanopore sequencing data of HIV-1 and other viruses.

View Article and Find Full Text PDF

GENTANGLE: integrated computational design of gene entanglements.

Bioinformatics

July 2024

Global Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, CA 94550, United States.

Article Synopsis
  • The design of overlapping genes in microbes is a new method aimed at enhancing control and stability in genetically engineered organisms.
  • GENTANGLE is a computational pipeline that helps design and test these overlapping gene pairs, making the process more efficient.
  • The software and its related data are freely accessible on platforms like GitHub and Singularity, and comprehensive resources for users are provided in the repository wiki and manual.
View Article and Find Full Text PDF

Recent developments in machine-learning (ML) and deep-learning (DL) have immense potential for applications in proteomics, such as generating spectral libraries, improving peptide identification, and optimizing targeted acquisition modes. Although new ML/DL models for various applications and peptide properties are frequently published, the rate at which these models are adopted by the community is slow, which is mostly due to technical challenges. We believe that, for the community to make better use of state-of-the-art models, more attention should be spent on making models easy to use and accessible by the community.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!