Background: While next-generation sequencing (NGS) costs have fallen in recent years, the cost and complexity of computation remain substantial obstacles to the use of NGS in bio-medical care and genomic research. The rapidly increasing amounts of data available from the new high-throughput methods have made data processing infeasible without automated pipelines. The integration of data and analytic resources into workflow systems provides a solution to the problem by simplifying the task of data analysis.

Results: To address this challenge, we developed a cloud-based workflow management system, Closha, to provide fast and cost-effective analysis of massive genomic data. We implemented complex workflows making optimal use of high-performance computing clusters. Closha allows users to create multi-step analyses using drag and drop functionality and to modify the parameters of pipeline tools. Users can also import the Galaxy pipelines into Closha. Closha is a hybrid system that enables users to use both analysis programs providing traditional tools and MapReduce-based big data analysis programs simultaneously in a single pipeline. Thus, the execution of analytics algorithms can be parallelized, speeding up the whole process. We also developed a high-speed data transmission solution, KoDS, to transmit a large amount of data at a fast rate. KoDS has a file transfer speed of up to 10 times that of normal FTP and HTTP. The computer hardware for Closha is 660 CPU cores and 800 TB of disk storage, enabling 500 jobs to run at the same time.

Conclusions: Closha is a scalable, cost-effective, and publicly available web service for large-scale genomic data analysis. Closha supports the reliable and highly scalable execution of sequencing analysis workflows in a fully automated manner. Closha provides a user-friendly interface to all genomic scientists to try to derive accurate results from NGS platform data. The Closha cloud server is freely available for use from http://closha.kobic.re.kr/ .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5836837PMC
http://dx.doi.org/10.1186/s12859-018-2019-3DOI Listing

Publication Analysis

Top Keywords

data
11
closha
10
analysis massive
8
genomic data
8
analysis programs
8
data analysis
8
analysis
6
closha bioinformatics
4
bioinformatics workflow
4
workflow system
4

Similar Publications

Comprehensive data on the epidemiology of cancer-related thrombosis in Africa has been sparse until recently. Thus, this review was aimed to investigate the magnitude of cancer-related thrombosis in Africa. To obtain key articles, comprehensive search was conducted using various databases.

View Article and Find Full Text PDF

Over the last decade, Hippo signaling has emerged as a major tumor-suppressing pathway. Its dysregulation is associated with abnormal expression of and -family genes. Recent works have highlighted the role of YAP1/TEAD activity in several cancers and its potential therapeutic implications.

View Article and Find Full Text PDF

Background: Despite extensive studies of the Mesozoic-Cenozoic magmatic history of Svalbard, little has been done on the Paleozoic magmatism due to fewer available outcrops.

Methods: 2D seismic reflection data were used to study magmatic intrusions in the subsurface of eastern Svalbard.

Results: This work presents seismic evidence for west-dipping, Middle Devonian-Mississippian sills in eastern Spitsbergen, Svalbard.

View Article and Find Full Text PDF

Artificial intelligence-based framework for early detection of heart disease using enhanced multilayer perceptron.

Front Artif Intell

January 2025

Department of Computer Science and Artificial Intelligence, College of Computing and Information Technology, University of Bisha, Bisha, Saudi Arabia.

Cardiac disease refers to diseases that affect the heart such as coronary artery diseases, arrhythmia and heart defects and is amongst the most difficult health conditions known to humanity. According to the WHO, heart disease is the foremost cause of mortality worldwide, causing an estimated 17.8 million deaths every year it consumes a significant amount of time as well as effort to figure out what is causing this, especially for medical specialists and doctors.

View Article and Find Full Text PDF

Objective: To assess the effects of inferior vena cava and/or hepatic vein (IVC±HV) venoplasty on liver volumetry and function in individuals with Budd Chiari syndrome (BCS) who present with ascites and at least one patent hepatic vein.

Methods: A retrospective analysis was conducted on the clinical data of 17 patients with BCS (6 males and 11 females, average age of 42.3 ± 11.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!