Feature selection for unsupervised machine learning of accelerometer data physical activity clusters - A systematic review.

Gait Posture

Diabetes Research Centre, University of Leicester, Leicester General Hospital, Gwendolen Road, Leicester LE5 4PW, UK; NIHR Leicester Biomedical Research Centre, Leicester General Hospital, Gwendolen Road, Leicester, LE5 4PW, UK; Alliance for Research in Exercise, Nutrition and Activity (ARENA), Sansom Institute for Health Research, Division of Health Sciences, University of South Australia, Adelaide, Australia.

Published: October 2021

Background: Identifying clusters of physical activity (PA) from accelerometer data is important to identify levels of sedentary behaviour and physical activity associated with risks of serious health conditions and time spent engaging in healthy PA. Unsupervised machine learning models can capture PA in everyday free-living activity without the need for labelled data. However, there is scant research addressing the selection of features from accelerometer data. The aim of this systematic review is to summarise feature selection techniques applied in studies concerned with unsupervised machine learning of accelerometer-based device obtained physical activity, and to identify commonly used features identified through these techniques. Feature selection methods can reduce the complexity and computational burden of these models by removing less important features and assist in understanding the relative importance of feature sets and individual features in clustering.

Method: We conducted a systematic search of Pubmed, Medline, Google Scholar, Scopus, Arxiv and Web of Science databases to identify studies published before January 2021 which used feature selection methods to derive PA clusters using unsupervised machine learning models.

Results: A total of 13 studies were eligible for inclusion within the review. The most popular feature selection techniques were Principal Component Analysis (PCA) and correlation-based methods, with k-means frequently used in clustering accelerometer data. Cluster quality evaluation methods were diverse, including both external (e.g. cluster purity) or internal evaluation measures (silhouette score most frequently). Only four of the 13 studies had more than 25 participants and only four studies included two or more datasets.

Conclusion: There is a need to assess multiple feature selection methods upon large cohort data consisting of multiple (3 or more) PA datasets. The cut-off criteria e.g. number of components, pairwise correlation value, explained variance ratio for PCA, etc. should be expressly stated along with any hyperparameters used in clustering.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.gaitpost.2021.08.007DOI Listing

Publication Analysis

Top Keywords

feature selection
24
unsupervised machine
16
machine learning
16
accelerometer data
16
physical activity
16
selection methods
12
systematic review
8
selection techniques
8
feature
7
data
6

Similar Publications

Nanoencapsulated Optical Fiber-Based PEC Microelectrode: Highly Sensitive and Specific Detection of NT-proBNP and Its Implantable Performance.

Anal Chem

January 2025

Hunan Provincial Key Laboratory of Micro & Nano Materials Interface Science, College of Chemistry and Chemical Engineering, Central South University, Changsha 410083, China.

Microelectrodes offer exceptional sensitivity, rapid response, and versatility, making them ideal for real-time detection and monitoring applications. Photoelectrochemical (PEC) sensors have shown great value in many fields due to their high sensitivity, fast response, and ease of operation. Nevertheless, conventional PEC sensing relies on cumbersome external light sources and bulky electrodes, hindering its miniaturization and implantation, thereby limiting its application in real-time disease monitoring.

View Article and Find Full Text PDF

Metabolic dysfunction-associated steatotic liver disease (MASLD) describes liver diseases caused by the accumulation of triglycerides in hepatocytes (steatosis) as well as the resulting inflammation and fibrosis. Previous studies have demonstrated that accumulation of fat in visceral adipose tissue compartments and the liver is associated with alterations in the circulating levels of some amino acids, notably glutamate. This study aimed to investigate the associations between circulating amino acids, particularly glutamate, and MASLD.

View Article and Find Full Text PDF

Unlabelled: Motivated behaviors are regulated by distributed forebrain networks. Traditional approaches have often focused on individual brain regions and connections that do not capture the topographic organization of forebrain connectivity. We performed co-injections of anterograde and retrograde tract tracers in rats to provide novel high-spatial resolution evidence of topographic connections that elaborate a previously identified closed-loop forebrain circuit implicated in affective and motivational processes.

View Article and Find Full Text PDF

Unlabelled: To overcome the paucity of known tumor-specific surface antigens in pediatric high-grade glioma (pHGG), we contrasted splicing patterns in pHGGs and normal brain samples. Among alternative splicing events affecting extracellular protein domains, the most pervasive alteration was the skipping of ≤30 nucleotide-long microexons. Several of these skipped microexons mapped to L1-IgCAM family members, such as .

View Article and Find Full Text PDF

Cyclin F, a non-canonical member of the cyclin protein family, plays a critical role in regulating the precise transitions of cell-cycle events. Unlike canonical cyclins, which bind and activate cyclin-dependent kinases (CDKs), Cyclin F functions as a substrate receptor protein within the Skp1-Cullin-F box (SCF) E3 ubiquitin ligase complex, enabling the ubiquitylation of target proteins. The structural features that distinguish Cyclin F as a ligase adaptor and the mechanisms underlying its selective substrate recruitment over Cyclin A, which functions in complex with CDK2 at a similar time in the cell cycle, remain largely unexplored.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!