Cataloguing the taxonomic origins of sequences from a heterogeneous sample using phylogenomics: applications in adventitious agent detection.

PDA J Pharm Sci Technol

Sanofi Pasteur, Analytical Research and Development Europe, Marcy L'Étoile, France.

Published: January 2018

Abstract: We have designed and implemented a software system, named PhyloID™, that can be used to detect putative adventitious agents in biological samples characterized by next-generation sequencing. PhyloID runs in two steps, each a self-contained automated process amenable to GMP validation. The first module, MiLY, assembles individual sequence reads into contigs and annotates every sequence with a unique sequence identifier, the number of reads in each contig, and the sequence length. The trimmed, assembled and annotated data are then processed by PhyloID's second module, NGmapper. NGmapper takes the FASTA-formatted output from MiLY and identifies the taxonomic origins of the contigs and singletons therein. To find the best fit, it compares each sequence's BLASTN hit profile against the patterns of evolutionary relationships described within phylogenomic distance matrices for each taxonomic group. NGmapper then produces lists of taxonomic assignments in both summarized and detailed form, and tree files for viewing results graphically. We optimized PhyloID's parameters and measured its performance using simulated metagenomic data and subsets of the reference phylogenies. Precision and recall in identifying simulated sequences were measured by information retrieval analysis, focusing on read length, read number, sequence accuracy, background complexity, taxonomy and reference data coverage. We found PhyloID to be highly accurate and quantitative in its taxonomic mapping of sequences, with excellent precision, sensitivity and robustness. The degree of taxonomic representation in publicly available databases remains an issue, as expected, for any sequence classifier, but community sequencing efforts are poised to overcome this problem.
To illustrate real-world usage of the application, we also describe some simple spike-recovery experiments as well as a multi-site comparative characterization of a viral suspension. These data illustrate, corroborate, and extend the results obtained with simulated data.
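
The precision and recall measurements mentioned above can be illustrated with a minimal sketch. It assumes a hypothetical evaluation in which every simulated sequence has a known true taxon and the classifier either assigns a taxon or leaves the sequence unassigned (`None`). The function name, the data layout and the taxon labels are illustrative assumptions and are not taken from PhyloID.

```python
from typing import Dict, Optional, Tuple


def precision_recall(truth: Dict[str, str],
                     predicted: Dict[str, Optional[str]],
                     taxon: str) -> Tuple[float, float]:
    """Precision and recall for one taxon over a set of simulated sequences.

    truth     maps sequence id -> taxon the sequence was simulated from
    predicted maps sequence id -> taxon assigned by the classifier,
              or None when the sequence was left unassigned
    (hypothetical layout, chosen only for this illustration)
    """
    tp = fp = fn = 0
    for seq_id, true_taxon in truth.items():
        assigned = predicted.get(seq_id)  # missing or None means unassigned
        if assigned == taxon and true_taxon == taxon:
            tp += 1  # correctly assigned to the taxon
        elif assigned == taxon:
            fp += 1  # assigned to the taxon but simulated from another
        elif true_taxon == taxon:
            fn += 1  # belongs to the taxon but missed or misassigned
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Toy data: three sequences simulated from one family, one from another.
truth = {"c1": "Circoviridae", "c2": "Circoviridae",
         "c3": "Retroviridae", "c4": "Circoviridae"}
predicted = {"c1": "Circoviridae", "c2": None,
             "c3": "Circoviridae", "c4": "Circoviridae"}

p, r = precision_recall(truth, predicted, "Circoviridae")
print(f"precision={p:.2f} recall={r:.2f}")
```

In this toy data set two sequences are assigned correctly, one is a false positive and one is missed, so both values come out at two thirds. Repeating such a calculation while varying one factor at a time (read length, read number, error rate and so on) is one plausible way to build the kind of performance profile the abstract describes.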

Lay Abstract: In order to address gaps in the detection of contaminating viruses and microorganisms in vaccines and other biologicals, manufacturers are exploring the use of new technologies that promise greater sensitivity and breadth of coverage. One challenge in implementing such new methods is the complexity of analysis of the "big data" generated by these new instruments: hundreds of millions of sequence reads (segments of genetic material from viruses and cells) need to be compared against a vast and growing number of entries in genetic databases, in order to come up with a confident identification. These large-scale analyses must furthermore be carried out within the strict regulatory environment that governs the industry. We have developed an automated software pipeline named PhyloID™ that is capable of identifying viruses and microorganisms from large-scale sequence data. Using simulated data as well as real samples, we show that PhyloID is both sensitive and accurate in identifying any type of potential contaminant. Such a powerful new assay will be an important addition to the adventitious agent testing package, providing further assurance about product safety.

Source
http://dx.doi.org/10.5731/pdajpst.2014.01023
