Publications by authors named "Minh Duc Cao"

(1) Background: Pediatric urinary tract infections (UTIs) pose significant challenges due to drug-resistant () strains. This study utilizes whole-genome sequencing to analyze temporal trends in antibiotic resistance genes (ARGs) in clinical isolates from pediatric UTI cases in central Vietnam. (2) Methods: We conducted whole-genome sequencing on 71 isolates collected from pediatric UTI patients between 2018 and 2020.

View Article and Find Full Text PDF

Machine learning has the potential to be a powerful tool in the fight against antimicrobial resistance (AMR), a critical global health issue. Machine learning can identify resistance mechanisms from DNA sequence data without prior knowledge. The first step in building a machine learning model is a feature extraction from sequencing data.

View Article and Find Full Text PDF

Pangenome inference is an indispensable step in bacterial genomics, yet its scalability poses a challenge due to the rapid growth of genomic collections. This paper presents PanTA, a software package designed for constructing pangenomes of large bacterial datasets, showing unprecedented efficiency levels multiple times higher than existing tools. PanTA introduces a novel mechanism to construct the pangenome progressively without rebuilding the accumulated collection from scratch.

View Article and Find Full Text PDF

Whole genome analysis for microbial genomics is critical to studying and monitoring antimicrobial resistance strains. The exponential growth of microbial sequencing data necessitates a fast and scalable computational pipeline to generate the desired outputs in a timely and cost-effective manner. Recent methods have been implemented to integrate individual genomes into large collections of specific bacterial populations and are widely employed for systematic genomic surveillance.

View Article and Find Full Text PDF

We have developed AMRViz, a toolkit for analyzing, visualizing, and managing bacterial genomics samples. The toolkit is bundled with the current best practice analysis pipeline allowing researchers to perform comprehensive analysis of a collection of samples directly from raw sequencing data with a single command line. The analysis results in a report showing the genome structure, genome annotations, antibiotic resistance and virulence profile for each sample.

View Article and Find Full Text PDF

Whole genome sequencing has increasingly become the essential method for studying the genetic mechanisms of antimicrobial resistance and for surveillance of drug-resistant bacterial pathogens. The majority of bacterial genomes sequenced to date have been sequenced with Illumina sequencing technology, owing to its high-throughput, excellent sequence accuracy, and low cost. However, because of the short-read nature of the technology, these assemblies are fragmented into large numbers of contigs, hindering the obtaining of full information of the genome.

View Article and Find Full Text PDF

Checkpoint inhibitor (CPI) therapies provide limited benefit to patients with tumors of low immune reactivity. T cell-inducing vaccines hold promise to exert long-lasting disease control in combination with CPI therapy. Safety, tolerability and recommended phase 2 dose (RP2D) of an individualized, heterologous chimpanzee adenovirus (ChAd68) and self-amplifying mRNA (samRNA)-based neoantigen vaccine in combination with nivolumab and ipilimumab were assessed as primary endpoints in an ongoing phase 1/2 study in patients with advanced metastatic solid tumors (NCT03639714).

View Article and Find Full Text PDF

Background: Circulating cell-free DNA (cfDNA) in the plasma of cancer patients contains cell-free tumour DNA (ctDNA) derived from tumour cells and it has been widely recognized as a non-invasive source of tumour DNA for diagnosis and prognosis of cancer. Molecular profiling of ctDNA is often performed using targeted sequencing or low-coverage whole genome sequencing (WGS) to identify tumour specific somatic mutations or somatic copy number aberrations (sCNAs). However, these approaches cannot efficiently detect all tumour-derived genomic changes in ctDNA.

View Article and Find Full Text PDF

A streaming assembly pipeline utilising real-time Oxford Nanopore Technology (ONT) sequencing data is important for saving sequencing resources and reducing time-to-result. A previous approach implemented in npScarf provided an efficient streaming algorithm for hybrid assembly but was relatively prone to mis-assemblies compared to other graph-based methods. Here we present npGraph, a streaming hybrid assembly tool using the assembly graph instead of the separated pre-assembly contigs.

View Article and Find Full Text PDF

Despite advances in sequencing technologies, assembly of complex plant genomes remains elusive due to polyploidy and high repeat content. Here we report PolyGembler for grouping and ordering contigs into pseudomolecules by genetic linkage analysis. Our approach also provides an accurate method with which to detect and fix assembly errors.

View Article and Find Full Text PDF

Phage display methodologies offer a versatile platform for the isolation of single-chain Fv (scFv) molecules which may be rebuilt into monoclonal antibodies. Herein, we report on a complete workflow termed PhageXpress, for rapid selection of single-chain Fv sequences by leveraging electrohydrodynamic-manipulation of a solution containing phage library particles to enhance target binding whilst minimizing non-specific interactions. Our PhageXpress technique is combined with Oxford Nanopore Technologies' MinION sequencer and custom bioinformatics to achieve high-throughput screening of phage libraries.

View Article and Find Full Text PDF

Background: Polymyxin B and E (colistin) have been pivotal in the treatment of XDR Gram-negative bacterial infections; however, resistance has emerged. A structurally related lipopeptide, octapeptin C4, has shown significant potency against XDR bacteria, including polymyxin-resistant strains, but its mode of action remains undefined.

Objectives: We sought to compare and contrast the acquisition of resistance in an XDR Klebsiella pneumoniae (ST258) clinical isolate in vitro with all three lipopeptides to potentially unveil variations in their mode of action.

View Article and Find Full Text PDF

The majority of human chromosome ends remain incompletely assembled due to their highly repetitive structure. In this study, we use BioNano data to anchor and extend chromosome ends from two European trios as well as two unrelated Asian genomes. At least 11 BioNano assembled chromosome ends are structurally divergent from the reference genome, including both missing sequence and extensions.

View Article and Find Full Text PDF

Background: Tandem repeats comprise significant proportion of the human genome including coding and regulatory regions. They are highly prone to repeat number variation and nucleotide mutation due to their repetitive and unstable nature, making them a major source of genomic variation between individuals. Despite recent advances in high throughput sequencing, analysis of tandem repeats in the context of complex diseases is still hindered by technical limitations.

View Article and Find Full Text PDF

Background: Detection of genomic inversions remains challenging. Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored.

Result: We present npInv, a novel tool specifically for detecting and genotyping NAHR inversion using long read sub-alignment of long read sequencing data.

View Article and Find Full Text PDF

Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step.

View Article and Find Full Text PDF

Extensively drug-resistant Klebsiella pneumoniae (XDR-KP) infections cause high mortality and are disseminating globally. Identifying the genetic basis underpinning resistance allows for rapid diagnosis and treatment. XDR isolates sourced from Greece and Brazil, including 19 polymyxin-resistant and five polymyxin-susceptible strains, were subjected to whole genome sequencing.

View Article and Find Full Text PDF

Motivation: Targeted sequencing using capture probes has become increasingly popular in clinical applications due to its scalability and cost-effectiveness. The approach also allows for higher sequencing coverage of the targeted regions resulting in better analysis statistical power. However, because of the dynamics of the hybridization process, it is difficult to evaluate the efficiency of the probe design prior to the experiments which are time consuming and costly.

View Article and Find Full Text PDF

Motivation: The recent introduction of a barcoding protocol for Oxford Nanopore sequencing has increased the versatility of the technology. Several bioinformatics tools have been developed to demultiplex barcoded reads, but none of them supports streaming analysis. This limits the use of multiplexed sequencing in real-time applications, which is one of the main advantages of the technology.

View Article and Find Full Text PDF

Third generation sequencing technologies provide the opportunity to improve genome assemblies by generating long reads spanning most repeat sequences. However, current analysis methods require substantial amounts of sequence data and computational resources to overcome the high error rates. Furthermore, they can only perform analysis after sequencing has completed, resulting in either over-sequencing, or in a low quality assembly due to under-sequencing.

View Article and Find Full Text PDF

The recently introduced Oxford Nanopore MinION platform generates DNA sequence data in real-time. This has great potential to shorten the sample-to-results time and is likely to have benefits such as rapid diagnosis of bacterial infection and identification of drug resistance. However, there are few tools available for streaming analysis of real-time sequencing data.

View Article and Find Full Text PDF

Klebsiella quasipneumoniae subsp. similipneumoniae strain ATCC 700603, formerly known as K. pneumoniae K6, is known for producing extended-spectrum β-lactamase (ESBL) enzymes that can hydrolyze oxyimino-β-lactams, resulting in resistance to these drugs.

View Article and Find Full Text PDF

More than 30 human genetic diseases are linked to tri-nucleotide repeat expansions. There is no known mechanism that explains repeat expansions in full, but changes in the epigenetic state of the associated locus has been implicated in the disease pathology for a growing number of examples. A comprehensive comparative analysis of the genomic features associated with diverse repeat expansions has been lacking.

View Article and Find Full Text PDF

Methods for measuring genetic distances in phylogenetics are known to be sensitive to the evolutionary model assumed. However, there is a lack of established methodology to accommodate the trade-off between incorporating sufficient biological reality and avoiding model overfitting. In addition, as traditional methods measure distances based on the observed number of substitutions, their tend to underestimate distances between diverged sequences due to backward and parallel substitutions.

View Article and Find Full Text PDF