Background: Whole genome duplication (WGD) events are common in the evolutionary history of many living organisms. For decades, researchers have been trying to understand the genetic and epigenetic impact of WGD and its underlying molecular mechanisms. Particular attention was given to allopolyploid study systems, species resulting from an hybridization event accompanied by WGD. Investigating the mechanisms behind the survival of a newly formed allopolyploid highlighted the key role of DNA methylation. With the improvement of high-throughput methods, such as whole genome bisulfite sequencing (WGBS), an opportunity opened to further understand the role of DNA methylation at a larger scale and higher resolution. However, only a few studies have applied WGBS to allopolyploids, which might be due to lack of genomic resources combined with a burdensome data analysis process. To overcome these problems, we developed the Automated Reproducible Polyploid EpiGenetic GuIdance workflOw (ARPEGGIO): the first workflow for the analysis of epigenetic data in polyploids. This workflow analyzes WGBS data from allopolyploid species via the genome assemblies of the allopolyploid's parent species. ARPEGGIO utilizes an updated read classification algorithm (EAGLE-RC), to tackle the challenge of sequence similarity amongst parental genomes. ARPEGGIO offers automation, but more importantly, a complete set of analyses including spot checks starting from raw WGBS data: quality checks, trimming, alignment, methylation extraction, statistical analyses and downstream analyses. A full run of ARPEGGIO outputs a list of genes showing differential methylation. ARPEGGIO was made simple to set up, run and interpret, and its implementation ensures reproducibility by including both package management and containerization.

Results: We evaluated ARPEGGIO in two ways. First, we tested EAGLE-RC's performance with publicly available datasets given a ground truth, and we show that EAGLE-RC decreases the error rate by 3 to 4 times compared to standard approaches. Second, using the same initial dataset, we show agreement between ARPEGGIO's output and published results. Compared to other similar workflows, ARPEGGIO is the only one supporting polyploid data.

Conclusions: The goal of ARPEGGIO is to promote, support and improve polyploid research with a reproducible and automated set of analyses in a convenient implementation. ARPEGGIO is available at https://github.com/supermaxiste/ARPEGGIO .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8285871PMC
http://dx.doi.org/10.1186/s12864-021-07845-2DOI Listing

Publication Analysis

Top Keywords

arpeggio
10
automated reproducible
8
reproducible polyploid
8
polyploid epigenetic
8
epigenetic guidance
8
guidance workflow
8
role dna
8
dna methylation
8
wgbs data
8
set analyses
8

Similar Publications

Article Synopsis
  • As romantic relationships develop, partners align on goals, enhance teamwork, and share emotions, but the brain mechanisms behind these experiences are not fully understood.
  • In this study, researchers used RNA-sequencing to analyze the nucleus accumbens in prairie voles, focusing on their pairing dynamics in social or mating contexts.
  • Findings revealed that prairie voles show synchronized gene expression in their brain, particularly in cells linked to myelin production, which is tied to their social behaviors and responds to being apart, suggesting that shared experiences can biologically strengthen their bond.
View Article and Find Full Text PDF

Several animal species prefer consonant over dissonant sounds, a building block of musical scales and harmony. Could consonance and dissonance be linked, beyond music, to the emotional valence of vocalizations? We extracted the fundamental frequency from calls of young chickens with either positive or negative emotional valence, i.e.

View Article and Find Full Text PDF

Protein Arginine Methyltransferase 5 (PRMT5) regulates RNA splicing and transcription by symmetric dimethylation of arginine residues (Rme2s/SDMA) in many RNA binding proteins. However, the mechanism by which PRMT5 couples splicing to transcriptional output is unknown. Here, we demonstrate that a major function of PRMT5 activity is to promote chromatin escape of a novel, large class of mRNAs that we term Genomically Retained Incompletely Processed Polyadenylated Transcripts (GRIPPs).

View Article and Find Full Text PDF

Breathing for singing is a highly contested issue in singing pedagogy with a wide variety of strategies recommended by teachers and the tendency for individuals to find more success with some strategies than others. The concept of body type as a determining factor has been suggested and supported by Hixon and Hoit, but little research has been conducted on this topic since and especially little research has been conducted using biologically female subjects. The investigators recruited eight female, classically trained singers and evaluated their body composition based on several anthropometric measurements (height, body mass, body fat percentage, and ectomorphy as determined by the Heath-Carter Somatotype system).

View Article and Find Full Text PDF

In experiments with significant perturbations to transcription, nascent RNA sequencing protocols are dependent on external spike-ins for reliable normalization. Unlike in RNA-seq, these spike-ins are not standardized and, in many cases, depend on a run-on reaction that is assumed to have constant efficiency across samples. To assess the validity of this assumption, we analyze a large number of published nascent RNA spike-ins to quantify their variability across existing normalization methods.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!