FastUniq: a fast de novo duplicates removal tool for paired short reads.

PLoS One

National Engineering Laboratory for Breeding of Endangered Medicinal Materials, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People's Republic of China.

Published: June 2013

The presence of duplicates introduced by PCR amplification is a major issue in paired short reads from next-generation sequencing platforms. These duplicates might have a serious impact on research applications, such as scaffolding in whole-genome sequencing and discovering large-scale genome variations, and are usually removed. We present FastUniq as a fast de novo tool for removal of duplicates in paired short reads. FastUniq identifies duplicates by comparing sequences between read pairs and does not require complete genome sequences as prerequisites. FastUniq is capable of simultaneously handling reads with different lengths and results in highly efficient running time, which increases linearly at an average speed of 87 million reads per 10 minutes. FastUniq is freely available at http://sourceforge.net/projects/fastuniq/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527383PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0052249PLOS

Publication Analysis

Top Keywords

paired short
12
short reads
12
fastuniq fast
8
fast novo
8
fastuniq
5
duplicates
5
reads
5
novo duplicates
4
duplicates removal
4
removal tool
4

Similar Publications

Dose effect of corticosteroids on peripheral lymphocyte profiles in patients with systemic lupus erythematosus.

Clin Rheumatol

January 2025

Department of Rheumatology and Immunology, Peking University People's Hospital, 11 Xizhimen South Street, Beijing, 100044, China.

Objective: To investigate the dose effect of methylprednisolone (MP) on peripheral lymphocyte profiles in patients with systemic lupus erythematosus (SLE). This study investigated the impact of varied MP doses on peripheral lymphocyte subtypes in SLE patients.

Methods: We conducted a prospective study involving 51 SLE patients, categorized into four groups (40 mg/day, 80 mg/day, 500 mg/day, and 1000 mg/day) based on the administered MP dosage during hospitalization.

View Article and Find Full Text PDF

Novel High-Quality Amoeba Genomes Reveal Widespread Codon Usage Mismatch Between Giant Viruses and Their Hosts.

Genome Biol Evol

January 2025

Centre for Microbiology and Environmental Systems Science, Division of Microbial Ecology, University of Vienna, Vienna 1030, Austria.

The need for high-quality protist genomes has prevented in-depth computational and experimental studies of giant virus-host interactions. In addition, our current knowledge of host range is highly biased due to the few hosts used to isolate novel giant viruses. This study presents 6 high-quality amoeba genomes from known and potential giant virus hosts belonging to 2 distinct eukaryotic clades: Amoebozoa and Discoba.

View Article and Find Full Text PDF

Rapid detection of by recombinase-aided amplification combined with the CRISPR/Cas12a system.

Front Cell Infect Microbiol

January 2025

State Key Laboratory for Animal Disease Control and Prevention, Harbin Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Harbin, China.

() is one of the primary agents involved in porcine respiratory disease complex, and circulates in the swine industry worldwide. The prevention and control of is complicated. Thus, a recombinase-aided amplification (RAA) assay coupled with the clustered regularly-interspaced short palindromic repeats (CRISPR)/Cas12a system was established for the detection of .

View Article and Find Full Text PDF

Background: The neural mechanisms underlying freezing of gait (FOG) in Parkinson's disease (PD) have not been completely comprehended. Sensory-motor integration dysfunction was proposed as one of the contributing factors. Here, we investigated short-latency afferent inhibition (SAI) and long-latency afferent inhibition (LAI), and analyzed their association with gait performance in FOG PD patients, to further validate the role of sensorimotor integration in the occurrence of FOG in PD.

View Article and Find Full Text PDF

Motivation: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-Cas9 system is a ground-breaking genome editing tool, which has revolutionized cell and gene therapies. One of the essential components involved in this system that ensures its success is the design of an optimal single-guide RNA (sgRNA) with high on-target cleavage efficiency and low off-target effects. This is challenging as many conditions need to be considered, and empirically testing every design is time-consuming and costly.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!