SEK: sparsity exploiting k-mer-based estimation of bacterial community composition.

Bioinformatics

Department of Communication Theory, KTH Royal Institute of Technology, Stockholm, Sweden, Department of Mathematics, Oregon State University, Corvallis, OR, USA, Systems Biology program, KTH Royal Institute of Technology, Sweden, Aalto University, Esbo, Finland, Department of Computational Biology, KTH Royal Institute of Technology, Stockholm, Sweden, Department of Mathematics and Statistics, University of Helsinki, Helsinki, Finland, Department of Physics, Tsinghua University, Beijing, China, Department of Signal Processing and Department of Information and Computer Science, Aalto University, Esbo, Finland.

Published: September 2014

Motivation: Estimation of bacterial community composition from a high-throughput sequenced sample is an important task in metagenomics applications. As the sample sequence data typically harbors reads of variable lengths and different levels of biological and technical noise, accurate statistical analysis of such data is challenging. Currently popular estimation methods are typically time-consuming in a desktop computing environment.

Results: Using sparsity enforcing methods from the general sparse signal processing field (such as compressed sensing), we derive a solution to the community composition estimation problem by a simultaneous assignment of all sample reads to a pre-processed reference database. A general statistical model based on kernel density estimation techniques is introduced for the assignment task, and the model solution is obtained using convex optimization tools. Further, we design a greedy algorithm solution for a fast solution. Our approach offers a reasonably fast community composition estimation method, which is shown to be more robust to input data variation than a recently introduced related method.

Availability And Implementation: A platform-independent Matlab implementation of the method is freely available at http://www.ee.kth.se/ctsoftware; source code that does not require access to Matlab is currently being tested and will be made available later through the above Web site.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bioinformatics/btu320DOI Listing

Publication Analysis

Top Keywords

community composition
16
estimation bacterial
8
bacterial community
8
composition estimation
8
estimation
6
sek sparsity
4
sparsity exploiting
4
exploiting k-mer-based
4
k-mer-based estimation
4
community
4

Similar Publications

This study sought to compare bacterial abundance and diversity in milk and feces of healthy lactating women with patients suffering from lactation mastitis, explore the pathogenesis of lactation mastitis, and develop new ideas for its treatment and prevention from a microbiological perspective. A total of 19 lactating mastitis patients and 19 healthy lactating women were recruited. Milk and fecal Specimens were obtained from both groups, and microbial community structure was analyzed using 16S rRNA gene sequencing.

View Article and Find Full Text PDF

This study investigated the relationship between stepping-defined daily activity levels, time spent in different postures, and the patterns and intensities of stepping behaviour. Using a thigh-mounted triaxial accelerometer, physical activity data from 3547 participants with seven days of valid data were analysed. We classified days based on step count and quantified posture and stepping behaviour, distinguishing between indoor, community, and recreation stepping.

View Article and Find Full Text PDF

Background/objectives: The unique properties of iron oxide nanoparticles have attracted significant interest within the biomedical community, particularly for magnetic hyperthermia applications. Various synthesis methods have been developed to optimize these nanoparticles.

Methods: In this study, we employed a powdered coconut water (PCW)-assisted sol-gel method to produce magnetite nanoparticles for the first time.

View Article and Find Full Text PDF

Continuous Cropping of Patchouli Alleviate Soil Properties, Enzyme Activities, and Bacterial Community Structures.

Plants (Basel)

December 2024

School of Breeding and Multiplication (Sanya Institute of Breeding and Multiplication), School of Tropical Agriculture and Forestry, Hainan University, Sanya 572025, China.

(Patchouli), an essential medicinal plant in the family, faces significant challenges under continuous cropping (CC) obstacles. This study examined the rhizospheric soil bacterial communities of patchouli under four different CC years, zero (CK), one (T1), two (T2), and three (T3) years through high-throughput 16S rRNA gene amplicon sequencing. Results showed long-term CC led to significant soil properties and enzyme activity shifts.

View Article and Find Full Text PDF

Non-targeted screening (NTS) technology has been showing significant potential in identifying contaminants of emerging concern (CECs) in water and has attracted great attention in academia in recent years. It is a method that analyzes samples without pre-selecting substances, enabling the detection and identification of unknown compounds, which is crucial for environmental health and public protection. This study uses the Bibliometrix package in R 4.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!