The pernicious nature of low-quality sequencing data warrants improvement in the bioinformatics workflow for profiling microbial diversity. The conventional merging approach, which drops a copious amount of sequencing reads when processing low-quality amplicon data, requires alternative methods. In this study, a computational workflow, a combination of merging and direct-joining where the paired-end reads lacking overlaps are concatenated and pooled with the merged sequences, is proposed to handle the low-quality amplicon data. The proposed computational strategy was compared with two workflows; the merging approach where the paired-end reads are merged, and the direct-joining approach where the reads are concatenated. The results showed that the merging approach generates a significantly low number of amplicon sequences, limits the microbiome inference, and obscures some microbial associations. In comparison to other workflows, the combination of merging and direct-joining strategy reduces the loss of amplicon data, improves the taxonomy classification, and importantly, abates the misleading results associated with the merging approach when analysing the low-quality amplicon data. The mock community analysis also supports the findings. In summary, the researchers are suggested to follow the merging and direct-joining workflow to avoid problems associated with low-quality data while profiling the microbial community structure.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s10529-024-03509-9 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!