Metagenome fragment classification based on multiple motif-occurrence profiles.

PeerJ

Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University, Yamadaoka, Suita, Osaka , Japan.

Published: September 2014

A vast amount of metagenomic data has been obtained by extracting multiple genomes simultaneously from microbial communities, including genomes from uncultivable microbes. By analyzing these metagenomic data, novel microbes are discovered and new microbial functions are elucidated. The first step in analyzing these data is sequenced-read classification into reference genomes from which each read can be derived. The Naïve Bayes Classifier is a method for this classification. To identify the derivation of the reads, this method calculates a score based on the occurrence of a DNA sequence motif in each reference genome. However, large differences in the sizes of the reference genomes can bias the scoring of the reads. This bias might cause erroneous classification and decrease the classification accuracy. To address this issue, we have updated the Naïve Bayes Classifier method using multiple sets of occurrence profiles for each reference genome by normalizing the genome sizes, dividing each genome sequence into a set of subsequences of similar length and generating profiles for each subsequence. This multiple profile strategy improves the accuracy of the results generated by the Naïve Bayes Classifier method for simulated and Sargasso Sea datasets.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4157293PMC
http://dx.doi.org/10.7717/peerj.559DOI Listing

Publication Analysis

Top Keywords

naïve bayes
12
bayes classifier
12
classifier method
12
metagenomic data
8
reference genomes
8
reference genome
8
classification
5
metagenome fragment
4
fragment classification
4
classification based
4

Similar Publications

Introduction: Available therapies for peripheral nerve injury (PNI) include surgical and non-surgical treatments. Surgical treatment includes neurorrhaphy, grafting (allografts and autografts) and tissue-engineered grafting (artificial nerve guide conduits), while non-surgical treatment methods include electrical stimulation, magnetic stimulation, laser phototherapy and administration of nerve growth factors. However, the treatments currently available to best manage the different PNI manifestations remain undetermined.

View Article and Find Full Text PDF

Predicting lack of clinical improvement following varicose vein ablation using machine learning.

J Vasc Surg Venous Lymphat Disord

December 2024

Department of Surgery, University of Toronto, Canada; Division of Vascular Surgery, St. Michael's Hospital, Unity Health Toronto, Canada; Institute of Medical Science, University of Toronto, Canada; Temerty Centre for Artificial Intelligence Research and Education in Medicine (T-CAIREM), University of Toronto, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Canada; Department of Surgery, King Faisal Specialist Hospital and Research Center, Saudi Arabia. Electronic address:

Objective: Varicose vein ablation is generally indicated in patients with active/healed venous ulcers. However, patient selection for intervention in individuals without venous ulcers is less clear. Tools that predict lack of clinical improvement (LCI) following vein ablation may help guide clinical decision-making but remain limited.

View Article and Find Full Text PDF

Health care decisions are increasingly informed by clinical decision support algorithms, but these algorithms may perpetuate or increase racial and ethnic disparities in access to and quality of health care. Further complicating the problem, clinical data often have missing or poor quality racial and ethnic information, which can lead to misleading assessments of algorithmic bias. We present novel statistical methods that allow for the use of probabilities of racial/ethnic group membership in assessments of algorithm performance and quantify the statistical bias that results from error in these imputed group probabilities.

View Article and Find Full Text PDF

A new prediction model based on deep learning for pig house environment.

Sci Rep

December 2024

School of Mechanical and Electrical Engineering, Qiqihar University, Qiqihar, 161006, China.

A prediction model of the pig house environment based on Bayesian optimization (BO), squeeze and excitation block (SE), convolutional neural network (CNN) and gated recurrent unit (GRU) is proposed to improve the prediction accuracy and animal welfare and take control measures in advance. To ensure the optimal model configuration, the model uses a BO algorithm to fine-tune hyper-parameters, such as the number of GRUs, initial learning rate and L2 normal form regularization factor. The environmental data are fed into the SE-CNN block, which extracts the local features of the data through convolutional operations.

View Article and Find Full Text PDF

Dengue fever poses a significant public health burden in tropical regions, including Thailand, where periodic epidemics strain healthcare resources. Effective disease surveillance is essential for timely intervention and resource allocation. Various methods exist for spatiotemporal cluster detection, but their comparative performance remains unclear.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!