Sequencing databases contain enormous amounts of functional genomics data, making them an extensive resource for genome-scale analysis. Reanalyzing publicly available data and integrating it with new, project-specific data sets can be invaluable. With current technologies, genomic experiments have become feasible for virtually any species of interest. However, using and integrating this data comes with challenges, such as keeping the analysis standardized and reproducible. Seq2science is a multi-purpose workflow that covers preprocessing, quality control, visualization, and analysis of functional genomics sequencing data. It facilitates downloading sequencing data from all major databases, including NCBI SRA, EBI ENA, DDBJ, GSA, and ENCODE. Furthermore, it automates the retrieval of any genome assembly available from Ensembl, NCBI, and UCSC. It has been tested on a variety of species and includes diverse workflows such as ATAC-, RNA-, and ChIP-seq. It consists of both generic and advanced steps, the latter including differential gene expression or peak accessibility analysis and differential motif analysis. Seq2science is built on the Snakemake workflow language and can therefore be run on a range of computing infrastructures. It is available at https://github.com/vanheeringen-lab/seq2science.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10656911
DOI: http://dx.doi.org/10.7717/peerj.16380
