Metagenome fragment classification based on multiple motif-occurrence profiles.

Naoki Matsushita Shigeto Seno Yoichi Takenaka Hideo Matsuda

PeerJ

Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University, Yamadaoka, Suita, Osaka, Japan.

Published: September 2014

A vast amount of metagenomic data has been obtained by extracting multiple genomes simultaneously from microbial communities, including genomes from uncultivable microbes. Analysis of these data leads to the discovery of novel microbes and the elucidation of new microbial functions. The first step in the analysis is to classify each sequenced read to the reference genome from which it may be derived. The Naïve Bayes Classifier is one method for this classification. To identify the origin of a read, it calculates a score based on the occurrence of DNA sequence motifs in each reference genome. However, large differences in the sizes of the reference genomes can bias the scoring of the reads, which can cause erroneous classification and decrease classification accuracy. To address this issue, we have updated the Naïve Bayes Classifier method to use multiple sets of occurrence profiles for each reference genome: we normalize genome sizes by dividing each genome sequence into a set of subsequences of similar length and generating a profile for each subsequence. This multiple-profile strategy improves the accuracy of the Naïve Bayes Classifier method on simulated and Sargasso Sea datasets.
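The multiple-profile idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that a "motif" is a fixed-length k-mer, that each profile stores add-one-smoothed log-probabilities, that a trailing chunk shorter than `chunk_len` is dropped, and that a read is assigned to the genome owning the best-scoring chunk profile. All function names and parameters (`build_profile`, `build_multi_profiles`, `classify`, `k`, `chunk_len`) are hypothetical.

```python
import math
from collections import Counter
from itertools import product


def build_profile(seq, k):
    """Return smoothed log-probabilities of every DNA k-mer in seq."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # Add-one smoothing (an assumption) so unseen k-mers keep a finite score.
    total = sum(counts[m] for m in kmers) + len(kmers)
    return {m: math.log((counts[m] + 1) / total) for m in kmers}


def build_multi_profiles(genomes, k, chunk_len):
    """Split each genome into chunks of equal length and profile each chunk."""
    profiles = {}
    for name, seq in genomes.items():
        chunks = [seq[i:i + chunk_len] for i in range(0, len(seq), chunk_len)]
        # Drop a short trailing chunk so all profiles cover similar lengths;
        # fall back to the whole sequence if the genome is shorter than a chunk.
        chunks = [c for c in chunks if len(c) == chunk_len] or [seq]
        profiles[name] = [build_profile(c, k) for c in chunks]
    return profiles


def score(read, profile, k):
    """Naive Bayes log-score: sum of log-probabilities of the read's k-mers."""
    kmers = (read[i:i + k] for i in range(len(read) - k + 1))
    # K-mers containing characters other than A, C, G, T are skipped.
    return sum(profile[m] for m in kmers if m in profile)


def classify(read, profiles, k):
    """Assign the read to the genome whose best chunk profile scores highest."""
    best_score, best_name = max(
        (score(read, p, k), name)
        for name, plist in profiles.items()
        for p in plist
    )
    return best_name


# Toy reference genomes of very different sizes (made-up sequences).
genomes = {"at_rich": "AT" * 200, "gc_rich": "GC" * 50}
profiles = build_multi_profiles(genomes, k=3, chunk_len=100)
print(classify("ATATATAT", profiles, k=3))
print(classify("GCGCGCGC", profiles, k=3))
```

Because every profile is built from a chunk of the same length, the 400-base and 100-base toy genomes contribute profiles estimated from the same amount of sequence, which is the size normalization the abstract describes. Taking the maximum over chunk scores is only one possible way to combine them.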

Full text (PMC): http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4157293
DOI: http://dx.doi.org/10.7717/peerj.559

Publication Analysis

Top Keywords

naïve bayes

bayes classifier

classifier method

metagenomic data

reference genomes

reference genome

classification

metagenome fragment

fragment classification

classification based

Similar Publications

Treatment for peripheral nerve injury: a protocol for a systematic review and Bayesian network meta-analysis.

BMJ Open

December 2024

National Clinical Research Center for Chinese Medicine Acupuncture and Moxibustion, Tianjin, China

Yongke Yang Wenlong Gu Shuting Xu Shuai Wang Huiyan Shi

Introduction: Available therapies for peripheral nerve injury (PNI) include surgical and non-surgical treatments. Surgical treatment includes neurorrhaphy, grafting (allografts and autografts) and tissue-engineered grafting (artificial nerve guide conduits), while non-surgical treatment methods include electrical stimulation, magnetic stimulation, laser phototherapy and administration of nerve growth factors. However, the treatments currently available to best manage the different PNI manifestations remain undetermined.


Predicting lack of clinical improvement following varicose vein ablation using machine learning.

J Vasc Surg Venous Lymphat Disord

December 2024

Department of Surgery, University of Toronto, Canada; Division of Vascular Surgery, St. Michael's Hospital, Unity Health Toronto, Canada; Institute of Medical Science, University of Toronto, Canada; Temerty Centre for Artificial Intelligence Research and Education in Medicine (T-CAIREM), University of Toronto, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Canada; Department of Surgery, King Faisal Specialist Hospital and Research Center, Saudi Arabia.

Ben Li Naomi Eisenberg Derek Beaton Douglas S Lee Leen Al-Omran

Objective: Varicose vein ablation is generally indicated in patients with active/healed venous ulcers. However, patient selection for intervention in individuals without venous ulcers is less clear. Tools that predict lack of clinical improvement (LCI) following vein ablation may help guide clinical decision-making but remain limited.


De-biasing the bias: methods for improving disparity assessments with noisy group measurements.

Biometrics

October 2024

RAND Corporation, Pittsburgh, PA 15213, United States.

Solvejg Wastvedt Joshua Snoke Denis Agniel Julie Lai Marc N Elliott

Health care decisions are increasingly informed by clinical decision support algorithms, but these algorithms may perpetuate or increase racial and ethnic disparities in access to and quality of health care. Further complicating the problem, clinical data often have missing or poor quality racial and ethnic information, which can lead to misleading assessments of algorithmic bias. We present novel statistical methods that allow for the use of probabilities of racial/ethnic group membership in assessments of algorithm performance and quantify the statistical bias that results from error in these imputed group probabilities.


A new prediction model based on deep learning for pig house environment.

Sci Rep

December 2024

School of Mechanical and Electrical Engineering, Qiqihar University, Qiqihar, 161006, China.

Zhidong Wu Kaixiang Xu Yanwei Chen Yonglan Liu Wusheng Song

A prediction model of the pig house environment based on Bayesian optimization (BO), squeeze and excitation block (SE), convolutional neural network (CNN) and gated recurrent unit (GRU) is proposed to improve the prediction accuracy and animal welfare and take control measures in advance. To ensure the optimal model configuration, the model uses a BO algorithm to fine-tune hyper-parameters, such as the number of GRUs, initial learning rate and L2 normal form regularization factor. The environmental data are fed into the SE-CNN block, which extracts the local features of the data through convolutional operations.


Comparative evaluation of spatiotemporal methods for effective dengue cluster detection with a case study of national surveillance data in Thailand.

Sci Rep

December 2024

Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand.

Chawarat Rotejanaprasert Kawin Chinpong Andrew B Lawson Richard J Maude

Dengue fever poses a significant public health burden in tropical regions, including Thailand, where periodic epidemics strain healthcare resources. Effective disease surveillance is essential for timely intervention and resource allocation. Various methods exist for spatiotemporal cluster detection, but their comparative performance remains unclear.
