Publications by Siu Yiu

misFinder: identify mis-assemblies in an unbiased manner using reference and paired-end reads.

Xiao Zhu, Henry C M Leung, Rongjie Wang, Francis Y L Chin, Siu Ming Yiu

BMC Bioinformatics

November 2015

Background: Because high-throughput sequencing reads are short, genome assembly introduces errors that may adversely affect downstream data analysis. Several tools have been developed to eliminate these errors by either 1) comparing the assembled sequences with a similar reference genome, or 2) analyzing paired-end reads aligned to the assembled sequences and identifying inconsistent features along mis-assembled sequences. However, the former approach cannot distinguish mis-assemblies from real structural variations between the target genome and the reference genome, while the latter approach can produce many false positives (correctly assembled sequences reported as mis-assembled).

Introduction to selected papers from GIW/InCoB 2015.

Paul Horton, Christian Schönbach, Shoba Ranganathan, Siu Ming Yiu

J Bioinform Comput Biol

October 2015

PERGA: a paired-end read guided de novo assembler for extending contigs using SVM and look ahead approach.

Xiao Zhu, Henry C M Leung, Francis Y L Chin, Siu Ming Yiu, Guangri Quan

PLoS One

January 2016

Since the read lengths of high-throughput sequencing (HTS) technologies are short, de novo assembly, which plays a significant role in many applications, remains a great challenge. Most state-of-the-art approaches are based on either the de Bruijn graph strategy or the overlap-layout strategy. However, these approaches, which depend on k-mers or read overlaps, do not fully utilize the information in paired-end and single-end reads when resolving branches.

Computational identification of protein binding sites on RNAs using high-throughput RNA structure-probing data.

Xihao Hu, Thomas K F Wong, Zhi John Lu, Ting Fung Chan, Terrence Chi Kong Lau, Siu Ming Yiu

Bioinformatics

April 2014

Motivation: High-throughput sequencing has been used to probe RNA structures by treating RNAs with reagents that preferentially cleave or mark certain nucleotides according to their local structures, then sequencing the resulting fragments. The data produced contain valuable information for studying various RNA properties.

Results: We developed methods for statistically modeling these structure-probing data and extracting structural features from them.

MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning.

Yi Wang, Henry Leung, Siu Yiu, Francis Chin

BMC Genomics

November 2014

Background: Taxonomic annotation of reads is an important problem in metagenomic analysis. Existing annotation tools, which rely on aligning each read to the taxonomic structure, are unable to annotate many reads efficiently and accurately because reads are short (~100 bp) and most of them come from unknown genomes. Previous work has suggested assembling the reads into longer contigs before annotation.
