Similarity-Based Segmentation of Multi-Dimensional Signals.

Sci Rep

Bioinformatics Group, Department of Computer Science, Interdisciplinary Center for Bioinformatics, German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, University Leipzig, Härtelstrasse 16-18, D-04107, Leipzig, Germany.

Published: September 2017

The segmentation of time series and genomic data is a common problem in computational biology. With increasingly complex measurement procedures individual data points are often not just numbers or simple vectors in which all components are of the same kind. Analysis methods that capitalize on slopes in a single real-valued data track or that make explicit use of the vectorial nature of the data are not applicable in such scenaria. We develop here a framework for segmentation in arbitrary data domains that only requires a minimal notion of similarity. Using unsupervised clustering of (a sample of) the input yields an approximate segmentation algorithm that is efficient enough for genome-wide applications. As a showcase application we segment a time-series of transcriptome sequencing data from budding yeast, in high temporal resolution over ca. 2.5 cycles of the short-period respiratory oscillation. The algorithm is used with a similarity measure focussing on periodic expression profiles across the metabolic cycle rather than coverage per time point.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5617875PMC
http://dx.doi.org/10.1038/s41598-017-12401-8DOI Listing

Publication Analysis

Top Keywords

data
6
similarity-based segmentation
4
segmentation multi-dimensional
4
multi-dimensional signals
4
signals segmentation
4
segmentation time
4
time series
4
series genomic
4
genomic data
4
data common
4

Similar Publications

This study examined the interplay between physical workload, psychological stress, and the prevalence of work-related musculoskeletal disorders (WMSDs) among construction workers in Indonesia. This cross-sectional study used a purposive sampling technique to gather quantitative data from 409 respondents working in four construction companies through structured questionnaires. Data collection tools included the Copenhagen Psychosocial Questionnaire III (COPSOQ III), the K10 scale for psychosocial distress, and the Nordic Body Map for musculoskeletal symptoms.

View Article and Find Full Text PDF

Objectives: Nepenthes, sometimes known as tropical pitcher plants or monkey cups, is a carnivorous plant genus that contains more than 160 species. Nepenthes khasiana, India's sole representative of the genus, is a rare and endangered dioecious plant endemic to North-east India. Despite the fact that it is a prominent insectivorous plant in the Nepenthaceae family, genomic resources for the species are limited, making genomic breeding and understanding the genetic basis of botanical carnivory difficult.

View Article and Find Full Text PDF

Background: Human immunodeficiency virus continues to be a major global public health issue. Body mass index is a general indicator of nutritional status and has emerged as a powerful predictor of morbidity and mortality among adult PLHIV initiating antiretroviral therapy in resource-limited settings. However, there is a dearth of information regarding longitudinal changes in body mass index and its predictors among adult PLHIV in Ethiopia, particularly in the study area.

View Article and Find Full Text PDF

Purpose: This study aims to assess the risks associated with drug-induced macular edema and to examine the epidemiological characteristics of this condition.

Methods: This study analyzed data from the U.S.

View Article and Find Full Text PDF

Background: The Anticholinergic Risk Scale and Total Anticholinergic Load were developed to assess the risks associated with anticholinergic drugs. Recently, the Japan Anticholinergic Risk Scale was introduced; however, the total anticholinergic load for adverse events has not been clarified, and the criteria for risk assessment in clinical practice have not been established. In this study, we used data from the Japanese Adverse Drug Event Report (JADER) database provided by the Pharmaceuticals and Medical Devices Agency to determine the total anticholinergic load associated with reported adverse events related to anticholinergic syndrome.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!