Publications by authors named "Pablo H C G Sa"

The reduction in the cost of DNA sequencing and in the time needed to perform it has significantly increased the amount of biological information deposited in public databases such as the NCBI (National Center for Biotechnology Information). The large volumes of data produced per run have created a need for algorithms capable of handling data at this scale and assisting in analyses such as the assembly and annotation of prokaryotic genomes. Over the years, several pipelines and computational tools have been developed to automate this task and consequently reduce the time needed to determine the genetic content of a given organism, especially non-model organisms, thereby helping to identify possible targets with biotechnological applicability.

PAN2HGENE is a computational tool that enables two main analyses. First, it can identify gene products absent from the original prokaryotic genome sequence. Second, it enables automated comparative analysis of both complete and draft genomes.

Genome annotation consists of inferring and assigning biological information to gene products. Over the years, numerous pipelines and computational tools have been developed to automate this task and help researchers gain knowledge about the genes they study. However, even with these technological advances, manual annotation or manual curation is still necessary to verify and enrich the information attributed to the gene products.

The Next-Generation Sequencing (NGS) platforms provide a major approach to obtaining millions of short reads from samples. NGS has been used in a wide range of analyses, such as for determining genome sequences, analyzing evolutionary processes, identifying gene expression and resolving metagenomic analyses. Usually, the quality of NGS data impacts the final study conclusions.
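
As a rough illustration of what read quality filtering can look like, the sketch below keeps only reads whose mean Phred score meets a threshold. It is not taken from the article: the Phred+33 encoding, the threshold of 20, and the names `mean_phred` and `filter_reads` are all assumptions chosen for the example.

```python
from typing import Iterable, List, Tuple

Read = Tuple[str, str, str]  # (name, sequence, quality string)


def mean_phred(quality: str, offset: int = 33) -> float:
    """Mean Phred score of a quality string (Phred+33 encoding assumed)."""
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def filter_reads(reads: Iterable[Read], min_mean_quality: float = 20.0) -> List[Read]:
    """Keep reads whose mean Phred score is at least min_mean_quality.

    The default threshold of 20 is an illustrative assumption.
    Reads with an empty quality string are dropped.
    """
    return [
        (name, seq, qual)
        for name, seq, qual in reads
        if qual and mean_phred(qual) >= min_mean_quality
    ]


# Hypothetical reads: 'I' encodes Phred 40, '#' encodes Phred 2.
example_reads = [
    ("read1", "ACGT", "IIII"),
    ("read2", "ACGT", "####"),
]
kept = filter_reads(example_reads)
print([name for name, _, _ in kept])
```

Real pipelines usually also trim low-quality read ends and remove adapters, so a mean-score filter like this one is only a starting point.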

The availability of biological information in public databases has increased exponentially. To ensure the accuracy of this information, researchers have adopted several methods and refinements to avoid the dissemination of incorrect information; for example, several automated tools are available for annotation processes. However, manual curation ensures and enriches biological information.

The genomes of four strains (MB11, MB14, MB30, and MB66) of the species Corynebacterium pseudotuberculosis biovar equi were sequenced on the Ion Torrent PGM platform and completely assembled, and their gene content and structure were analyzed. The strains were isolated from horses with distinct signs of infection, including ulcerative lymphangitis, external abscesses on the chest, or internal abscesses on the liver, kidneys, and lungs. The average size of the genomes was 2.

In this work, we report the complete genome sequence of Corynebacterium pseudotuberculosis strain PA02 isolated from an ovine host. The genome contains 2,328,435 bp, a 52.2% G+C content, 2,035 coding sequences, 12 rRNA operons, 45 tRNAs, and 14 predicted pseudogenes.
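
For readers unfamiliar with the G+C figure reported above, it is the percentage of bases in the sequence that are guanine or cytosine. The following is a minimal sketch of that calculation, not the authors' pipeline; the function name `gc_content` and the choice to ignore ambiguous symbols such as `N` are assumptions made for the example.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C percentage of a DNA sequence.

    Only A, C, G, and T are counted; ambiguous symbols such as N
    are ignored (an assumption of this sketch).
    """
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return 100.0 * gc / total


# Toy sequence for illustration only: 5 of its 10 bases are G or C.
print(gc_content("ATGCGCGTTA"))
```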

The advent of NGS (Next Generation Sequencing) technologies has resulted in an exponential increase in the number of complete genomes available in biological databases. This advance has allowed the development of several computational tools enabling analyses of large amounts of data in each of the various steps, from processing and quality filtering to gap filling and manual curation. The tools developed for gap closure are very useful as they result in more complete genomes, which will influence downstream analyses of genomic plasticity and comparative genomics.

Corynebacterium pseudotuberculosis is the etiological agent of caseous lymphadenitis. Herein, we present the first complete genome sequence of C. pseudotuberculosis strain 226, isolated from an abscess of the sub-iliac lymph node of a goat from California (USA).

Corynebacterium pseudotuberculosis is the etiological agent of caseous lymphadenitis disease. In this work, we present the first complete genome sequence of Corynebacterium pseudotuberculosis strain PA01, isolated in northern Brazil from an infected sheep. The genome length is 2,337,920 bp, and 2,003 coding sequences (CDS), 12 rRNAs, and 49 tRNAs were predicted.

Corynebacterium pseudotuberculosis causes significant loss to goat and sheep farmers because it is the causal agent of the infectious disease caseous lymphadenitis, which may lead to outcomes ranging from skin injury to animal death (Ruiz et al., 2011) [1]. This bacterium was grown under osmotic (2 M), acid (pH) and heat (50 °C) stress and under control (Normal-BHI brain heart infusion) conditions, which simulate the conditions faced by the bacteria during the infectious process.

Bacteria are highly diverse organisms that are able to adapt to a broad range of environments and hosts due to their high genomic plasticity. Horizontal gene transfer plays a pivotal role in this genome plasticity and in evolution by leaps through the incorporation of large blocks of genome sequences, ordinarily known as genomic islands (GEIs). GEIs may harbor genes encoding virulence, metabolism, antibiotic resistance and symbiosis-related functions, namely pathogenicity islands (PAIs), metabolic islands (MIs), resistance islands (RIs) and symbiotic islands (SIs).

Background: Since the emergence of large-scale sequencing platforms in 2005, methods for decoding DNA sequences have undergone a major revolution, which has also affected quantitative and qualitative gene expression analyses through the RNA-Sequencing technique. However, issues related to the amount of data required for the analyses have been raised because they affect the reliability of the experiments. Thus, RNA depletion during sample preparation may influence the results.

Background: Exiguobacterium antarcticum strain B7 is a Gram-positive psychrotrophic bacterial species isolated in Antarctica. Although this bacterium has been poorly studied, its genome has already been sequenced. Therefore, it is an appropriate model for the study of thermal adaptation.

The genome of Corynebacterium pseudotuberculosis MB20 bv. equi was sequenced using the Ion Personal Genome Machine (PGM) platform, and showed a size of 2,363,089 bp, with 2,365 coding sequences and a GC content of 52.1%.

Lactococcus lactis subsp. lactis NCDO 2118 is a nondairy lactic acid bacterium, a xylose fermenter, and a gamma-aminobutyric acid (GABA) producer isolated from frozen peas. Here, we report the complete genome sequence of L.

Background: The completion of whole-genome sequencing for Corynebacterium pseudotuberculosis strain 1002 has contributed to major advances in research aimed at understanding the biology of this microorganism. This bacterium causes significant loss to goat and sheep farmers because it is the causal agent of the infectious disease caseous lymphadenitis, which may lead to outcomes ranging from skin injury to animal death. In the current study, we simulated the conditions experienced by the bacteria during host infection.

Next-generation sequencing technologies have increased the amount of biological data generated. Thus, bioinformatics has become important because new methods and algorithms are necessary to manipulate and process such data. However, certain challenges have emerged, such as genome assembly using short reads and high-throughput platforms.
