Publications by authors named "Chaumeil P"

Background: Descriptive metadata are vital for reporting, discovering, leveraging, and mobilizing research datasets. However, resolving metadata issues as part of a data management plan can be complex for data producers. To organize and document data, various descriptive metadata must be created.

View Article and Find Full Text PDF

The Genome Taxonomy Database (GTDB) provides a species to domain classification of publicly available genomes based on average nucleotide identity (ANI) (for species) and a concatenated gene phylogeny normalized by evolutionary rates (for genus to phylum), which has been widely adopted by the scientific community. Here, we use the Genome UNClutterer (GUNC) software to identify putatively contaminated genomes in GTDB release 07-RS207. We found that GUNC reported 35,723 genomes as putatively contaminated, comprising 11.

View Article and Find Full Text PDF

The Genome Taxonomy Database (GTDB) is a taxonomic framework that defines prokaryotic taxa as monophyletic groups in concatenated protein reference trees according to systematic criteria. This has resulted in a substantial number of changes to existing classifications (https://gtdb.ecogenomic.

View Article and Find Full Text PDF

Summary: The Genome Taxonomy Database (GTDB) and associated taxonomic classification toolkit (GTDB-Tk) have been widely adopted by the microbiology community. However, the growing size of the GTDB bacterial reference tree has resulted in GTDB-Tk requiring substantial amounts of memory (∼320 GB) which limits its adoption and ease of use. Here, we present an update to GTDB-Tk that uses a divide-and-conquer approach where user genomes are initially placed into a bacterial reference tree with family-level representatives followed by placement into an appropriate class-level subtree comprising species representatives.

View Article and Find Full Text PDF

The Genome Taxonomy Database (GTDB; https://gtdb.ecogenomic.org) provides a phylogenetically consistent and rank normalized genome-based taxonomy for prokaryotic genomes sourced from the NCBI Assembly database.

View Article and Find Full Text PDF

The accrual of genomic data from both cultured and uncultured microorganisms provides new opportunities to develop systematic taxonomies based on evolutionary relationships. Previously, we established a bacterial taxonomy through the Genome Taxonomy Database. Here, we propose a standardized archaeal taxonomy that is derived from a 122-concatenated-protein phylogeny that resolves polyphyletic groups and normalizes ranks based on relative evolutionary divergence.

View Article and Find Full Text PDF

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

View Article and Find Full Text PDF

The Genome Taxonomy Database is a phylogenetically consistent, genome-based taxonomy that provides rank-normalized classifications for ~150,000 bacterial and archaeal genomes from domain to genus. However, almost 40% of the genomes in the Genome Taxonomy Database lack a species name. We address this limitation by using commonly accepted average nucleotide identity criteria to set bounds on species and propose species clusters that encompass all publicly available bacterial and archaeal genomes.

View Article and Find Full Text PDF

Summary: The GTDB Toolkit (GTDB-Tk) provides objective taxonomic assignments for bacterial and archaeal genomes based on the Genome Taxonomy Database (GTDB). GTDB-Tk is computationally efficient and able to classify thousands of draft genomes in parallel. Here we demonstrate the accuracy of the GTDB-Tk taxonomic assignments by evaluating its performance on a phylogenetically diverse set of 10,156 bacterial and archaeal metagenome-assembled genomes.

View Article and Find Full Text PDF

Viruses of bacteria and archaea are important players in global carbon cycling as well as drivers of host evolution, yet the taxonomic classification of viruses remains a challenge due to their genetic diversity and absence of universally conserved genes. Traditional classification approaches employ a combination of phenotypic and genetic information which is no longer scalable in the era of bulk viral genome recovery through metagenomics. Here, we evaluate a phylogenetic approach for the classification of tailed double-stranded DNA viruses from the order Caudovirales by inferring a phylogeny from the concatenation of 77 single-copy protein markers using a maximum-likelihood method.

View Article and Find Full Text PDF

We investigate chloroplast DNA variation in a hyperdiverse community of tropical rainforest trees in French Guiana, focusing on patterns of intraspecific and interspecific variation. We test whether a species genetic diversity is higher when it has congeners in the community with which it can exchange genes and if shared haplotypes are more frequent in genetically diverse species, as expected in the presence of introgression.We sampled a total of 1,681 individual trees from 472 species corresponding to 198 genera and sequenced them at a noncoding chloroplast DNA fragment.

View Article and Find Full Text PDF

Taxonomy is an organizing principle of biology and is ideally based on evolutionary relationships among organisms. Development of a robust bacterial taxonomy has been hindered by an inability to obtain most bacteria in pure culture and, to a lesser extent, by the historical use of phenotypes to guide classification. Culture-independent sequencing technologies have matured sufficiently that a comprehensive genome-based taxonomy is now possible.

View Article and Find Full Text PDF

In the original version of this Article, the authors stated that the archaeal phylum Parvarchaeota was previously represented by only two single-cell genomes (ARMAN-4_'5-way FS' and ARMAN-5_'5-way FS'). However, these are in fact unpublished, low-quality metagenome-assembled genomes (MAGs) obtained from Richmond Mine, California. In addition, the authors overlooked two higher-quality published Parvarchaeota MAGs from the same habitat, ARMAN-4 (ADCE00000000) and ARMAN-5 (ADHF00000000) (B.

View Article and Find Full Text PDF

Summary: ArachnoServer is a manually curated database that consolidates information on the sequence, structure, function and pharmacology of spider-venom toxins. Although spider venoms are complex chemical arsenals, the primary constituents are small disulfide-bridged peptides that target neuronal ion channels and receptors. Due to their high potency and selectivity, these peptides have been developed as pharmacological tools, bioinsecticides and drug leads.

View Article and Find Full Text PDF

Challenges in cultivating microorganisms have limited the phylogenetic diversity of currently available microbial genomes. This is being addressed by advances in sequencing throughput and computational techniques that allow for the cultivation-independent recovery of genomes from metagenomes. Here, we report the reconstruction of 7,903 bacterial and archaeal genomes from >1,500 public metagenomes.

View Article and Find Full Text PDF

Diatoms are micro-algal indicators of freshwater pollution. Current standardized methodologies are based on microscopic determinations, which is time consuming and prone to identification uncertainties. The use of DNA-barcoding has been proposed as a way to avoid these flaws.

View Article and Find Full Text PDF

Differential expression of soluble proteins was explored in roots of metallicolous (M) and non-M (NM) plants of Agrostis capillaris L. exposed to increasing Cu to partially identify molecular mechanisms underlying higher Cu tolerance in M plants. Plants were cultivated for 2 months on perlite with a CuSO4 (1-30 μM) spiked-nutrient solution.

View Article and Find Full Text PDF

The recent emergence of barcoding approaches coupled to those of next-generation sequencing (NGS) has raised new perspectives for studying environmental communities. In this framework, we tested the possibility to derive accurate inventories of diatom communities from pyrosequencing outputs with an available DNA reference library. We used three molecular markers targeting the nuclear, chloroplast and mitochondrial genomes (SSU rDNA, rbcL and cox1) and three samples of a mock community composed of 30 known diatom strains belonging to 21 species.

View Article and Find Full Text PDF
Article Synopsis
  • Water deficit negatively impacts tree growth and wood production, prompting a study of Eucalyptus genotypes' adaptation mechanisms under varying water availability in the Republic of Congo.
  • Significant effects were found at the phenotypic level related to genotype and treatment adjustments on biomass traits, alongside molecular changes noted in protein abundance across leaves and stems using two-dimensional gel electrophoresis.
  • Proteins showing genetic control over their plasticity (G × T effects) were identified, revealing adaptation strategies like cell wall modification and osmoregulation, which could be targets for enhancing drought resistance in Eucalyptus.
View Article and Find Full Text PDF

Background: Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.

View Article and Find Full Text PDF

Microsatellites have been popular molecular markers ever since their advent in the late eighties. Despite growing competition from new genotyping and sequencing techniques, the use of these versatile and cost-effective markers continues to increase, boosted by successive technical advances. First, methods for multiplexing PCR have considerably improved over the last years, thereby decreasing genotyping costs and increasing throughput.

View Article and Find Full Text PDF

ArachnoServer (www.arachnoserver.org) is a manually curated database providing information on the sequence, structure and biological activity of protein toxins from spider venoms.

View Article and Find Full Text PDF

Eucalyptus globulus (Labill.) is used for pulp and paper production worldwide. In this report we studied changes in protein expression in one osmotically stressed elite clone widely used in industrial plantations in Spain.

View Article and Find Full Text PDF

The seasonal effect is the most significant external source of variation affecting vascular cambial activity and the development of newly divided cells, and hence wood properties. Here, the effect of edapho-climatic conditions on the phenotypic and molecular plasticity of differentiating secondary xylem during a growing season was investigated. Wood-forming tissues of maritime pine (Pinus pinaster) were collected from the beginning to the end of the growing season in 2003.

View Article and Find Full Text PDF

A large quantity of high quality RNA is often required in the analysis of gene expression. However, RNA extraction from samples taken from woody plants is generally complex, and represents the main limitation to study gene expression, particularly in refractory species like conifers. Standard RNA extraction protocols are available but they are highly time consuming, and not adapted to large scale extraction.

View Article and Find Full Text PDF