A highly optimized grid deployment: the metagenomic analysis example.

Stud Health Technol Inform

Universidad Politécnica de Valencia - ITACA.

Published: October 2008

Computational resources and the demands of computationally expensive processes do not grow at the same rate. The availability of large amounts of computing resources in Grid infrastructures does not make efficiency any less important. The whole process must be analyzed to improve partitioning and submission schemes, especially in the most critical experiments. Metagenomic analysis is one such case, and this text describes the work done to optimize a Grid deployment, which has reduced both the response time and the failure rates. Metagenomic studies process samples containing multiple specimens to extract the genes and proteins that belong to the different species. In many cases, sequencing the DNA of microorganisms is hindered by the impossibility of growing significant samples of isolated specimens. Many bacteria cannot survive alone and require interaction with other organisms. In such cases, the available DNA information belongs to different kinds of organisms. One important stage in metagenomic analysis consists of the extraction of fragments, followed by the comparison and analysis of their function. Fragments can be classified by comparing them to existing chains whose function is well known. This process is computationally intensive and requires several iterations of alignment and phylogeny classification steps. Source samples contain several million sequences, each of which can be up to thousands of nucleotides long. These sequences are compared to a selected part of the "Non-redundant" database that includes only information from eukaryotic species. After this first analysis, a refining process is performed and the alignment analysis is restarted from the results. This process requires several CPU years. The article describes and analyzes the difficulties of fragmenting, automating and checking the above operations in current Grid production environments. This environment has been tuned on the basis of an experimental study that tested the most efficient and reliable resources, the optimal job size, and the overhead of data transfer and database reindexing. The environment should resubmit faulty jobs, detect endless tasks, and ensure that results are correctly retrieved and the workflow is synchronised. The paper outlines the structure of the system and the preparation steps performed to deal with this experiment.
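The partitioning and resubmission ideas described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' system: the names `partition`, `run_with_resubmission`, `submit`, `job_size` and `max_attempts` are hypothetical, and the `submit` callable stands in for whatever Grid submission mechanism is used. It is assumed to raise an exception when a job fails or exceeds its time limit.

```python
from typing import Callable, Dict, Iterator, List, Sequence, Tuple


def partition(sequences: Sequence[str], job_size: int) -> Iterator[List[str]]:
    """Yield consecutive chunks of at most job_size sequences (one chunk per job)."""
    if job_size <= 0:
        raise ValueError("job_size must be positive")
    for start in range(0, len(sequences), job_size):
        yield list(sequences[start:start + job_size])


def run_with_resubmission(
    jobs: Sequence[List[str]],
    submit: Callable[[List[str]], object],
    max_attempts: int = 3,
) -> Tuple[Dict[int, object], List[int]]:
    """Run every job, resubmitting a job whenever submit() raises.

    submit is a hypothetical stand-in for the real submission call; it is
    assumed to raise on failure or timeout (the "endless task" case).
    Returns the results keyed by job index and the indices that never succeeded.
    """
    results: Dict[int, object] = {}
    failed: List[int] = []
    for job_id, job in enumerate(jobs):
        for attempt in range(1, max_attempts + 1):
            try:
                results[job_id] = submit(job)
                break  # job succeeded, move on to the next one
            except Exception:
                if attempt == max_attempts:
                    failed.append(job_id)  # give up after the last attempt
    return results, failed
```

In such a sketch, `job_size` is the knob that an experimental study like the one described would tune: smaller jobs lose less work on failure but add more per-job overhead, for example for data transfer and database reindexing.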

Similar Publications

Metagenomic next-generation sequencing and galactomannan testing for the diagnosis of invasive pulmonary aspergillosis.

Sci Rep

December 2024

Department of Respiratory and Critical Care Medicine, Zhengzhou University People's Hospital, Henan Provincial People's Hospital, Weiwu Road No. 7, Zhengzhou, 450003, Henan, China.

To evaluate the diagnostic value of metagenomic next-generation sequencing (mNGS) and galactomannan (GM) testing in invasive pulmonary aspergillosis (IPA) and to compare mNGS with other diagnostic approaches (serum/bronchoalveolar lavage fluid (BALF)-GM and conventional microbiological tests (CMTs) including sputum smears and culture, BALF fungal culture, and bronchial brushing). In all, 237 patients were enrolled in this retrospective study, including 120 patients with IPA and 117 with non-IPA pulmonary infections treated at Henan Provincial People's Hospital between June 2021 and February 2024. The diagnostic performance of mNGS was compared to conventional diagnostic methods including serum GM, BALF-GM, sputum smear microscopy, sputum culture, bronchial brushings, and BALF culture.

Profiling and comprehensive analysis of microbiome and ARGs of nurses and nursing workers in China: a cross-sectional study.

Sci Rep

December 2024

Cancer Center, Department of Pulmonary and Critical Care Medicine, Zhejiang Provincial People's Hospital (Affiliated People's Hospital), Hangzhou Medical College, Hangzhou, 310014, Zhejiang, China.

Hospital-acquired infection (HAI) and antimicrobial resistance (AMR) represent major challenges in healthcare systems. Although numerous studies have assessed environmental and patient samples, very few have explored the microbiome and resistome profiles of medical staff, including nursing workers. This cross-sectional study was performed in a tertiary hospital in China and involved 25 nurses (NSs), 25 nursing workers (NWs), and 55 non-medical controls (NCs).

Short-chain fatty acids play a key role in antibody response to SARS-CoV-2 infection in people living with HIV.

Sci Rep

December 2024

State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310000, China.

High SARS-CoV-2-specific antibody levels can protect against SARS-CoV-2 reinfection. The gut microbiome can affect a host's immune response. However, its role in the antibody response to SARS-CoV-2 in people living with HIV (PLWH) remains poorly understood.

Deciphering the key role of biofilm and mechanisms in high-strength nitrogen removal within the anammox coupled partial S-driven autotrophic denitrification system.

Bioresour Technol

December 2024

Key Laboratory of Environmental Remediation and Ecological Health, Ministry of Industry and Information Technology, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, Jiangsu, 210094, China; Engineering Research Centre of Chemical Pollution Control, Ministry of Education, School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing, Jiangsu, 210094, China.

Anammox coupled partial S-driven autotrophic denitrification (PSAD) technology represents an innovative approach for removing nitrogen from wastewater. The research highlighted the crucial role of biofilm on sulfur particles in the nitrogen removal process. Further analysis revealed that sulfur-oxidizing bacteria (SOB) are primarily distributed in the inner layer of the biofilm, while anammox bacteria (AnAOB) are relatively evenly distributed in inner and outer layers, with Thiobacillus and Candidatus Brocadia being the dominant species, respectively.

Background: India has a high incidence of gallstones, which can cause chronic inflammation and increase the risk of gallbladder cancer. Understanding the age and composition of gallstones can provide insights into their formation and growth. This study used ¹⁴C dating, FTIR, and metagenome analysis to explore the natural history, deposition rate, and microbial/chemical composition of gallstones.
