Semiautomatic robust regression clustering of international trade data.

Stat Methods Appl

Dipartimento di Scienze Economiche e Aziendali and Interdepartmental Centre for Robust Statistics, University of Parma, Via Kennedy 6, 43125 Parma, Italy.

Published: June 2021

This paper shows how, in regression clustering, to choose the most relevant solutions, analyze their stability, and identify the best combinations of the number of groups, the restriction factor on the error variances across groups, and the level of trimming. The procedure has two steps. First, we generalize the information criteria of constrained robust multivariate clustering to clustering weighted models. Unlike traditional approaches, which pick the single solution that minimizes an information criterion (e.g. BIC), we concentrate on the so-called optimal stable solutions. Second, using the monitoring approach, we select the best value of the trimming factor. Finally, we validate the solution using a confirmatory forward search approach. A motivating example based on a novel dataset on European Union trade in face masks shows the limitations of existing procedures. The suggested approach is first applied to a set of well-known datasets from the robust regression clustering literature. We then turn to a set of international trade datasets and provide a novel, informative way of updating the subset in the random start approach. The Supplementary material, in the spirit of the Special Issue, deepens the analysis of trade data and compares the suggested approach with existing ones in the literature.
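
To make the three tuning quantities named in the abstract concrete (number of groups, restriction factor, trimming level), here is a minimal Python sketch of trimmed clusterwise linear regression. It is an illustration under stated assumptions, not the authors' procedure: the function name `trimmed_regression_clustering` and all parameter names are hypothetical, groups have equal weights, the variance constraint is a simple clipping rule rather than an optimal truncation, and no information criterion or stability analysis is computed.

```python
import numpy as np


def trimmed_regression_clustering(X, y, k, alpha=0.1, c=10.0,
                                  n_starts=20, n_iter=30, seed=0):
    """Illustrative trimmed clusterwise regression (hypothetical helper).

    X     : (n, p) design matrix (include a column of ones for an intercept)
    y     : (n,) response
    k     : number of groups
    alpha : trimming level, the fraction of units left unassigned
    c     : restriction factor, an upper bound on max/min error variance (c >= 1)
    Returns (labels, betas, sig2, objective); trimmed units get label -1.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    n_keep = n - int(np.floor(alpha * n))
    best = None

    def log_density(betas, sig2):
        resid = y[:, None] - X @ betas.T          # (n, k) residuals
        return -0.5 * (np.log(2 * np.pi * sig2) + resid ** 2 / sig2)

    for _ in range(n_starts):
        # Random start: fit each group on a small random subset.
        betas = np.empty((k, p))
        for g in range(k):
            idx = rng.choice(n, size=p + 1, replace=False)
            betas[g] = np.linalg.lstsq(X[idx], y[idx], rcond=None)[0]
        sig2 = np.ones(k)

        for _ in range(n_iter):
            ld = log_density(betas, sig2)
            labels = ld.argmax(axis=1)
            # Keep the n_keep units with the largest best-group log-density.
            keep = np.argsort(ld.max(axis=1))[n - n_keep:]
            mask = np.zeros(n, dtype=bool)
            mask[keep] = True
            for g in range(k):
                m = mask & (labels == g)
                if m.sum() > p:                   # enough points to refit
                    betas[g] = np.linalg.lstsq(X[m], y[m], rcond=None)[0]
                    sig2[g] = np.mean((y[m] - X[m] @ betas[g]) ** 2)
            # Simplified constraint: raise small variances so max/min <= c.
            sig2 = np.maximum(sig2, max(sig2.max() / c, 1e-12))

        # Final assignment and trimmed classification log-likelihood.
        ld = log_density(betas, sig2)
        labels = ld.argmax(axis=1)
        order = np.argsort(ld.max(axis=1))
        labels[order[:n - n_keep]] = -1           # trimmed units
        objective = ld.max(axis=1)[order[n - n_keep:]].sum()
        if best is None or objective > best[3]:
            best = (labels, betas.copy(), sig2.copy(), objective)
    return best


# Synthetic example: two regression lines plus a few shifted outliers.
rng = np.random.default_rng(1)
x = rng.uniform(0, 10, 200)
group = rng.integers(0, 2, 200)
y = np.where(group == 0, 1 + 2 * x, 20 - x) + rng.normal(0, 0.5, 200)
y[:10] += 40                                      # contamination
X = np.column_stack([np.ones_like(x), x])
labels, betas, sig2, obj = trimmed_regression_clustering(
    X, y, k=2, alpha=0.125, c=5.0)
```

Rerunning such a fit over a grid of `k`, `c`, and `alpha` values and tracking how the solutions change is the kind of exploration the abstract refers to; the paper's own criteria for ranking and validating those solutions are not reproduced here.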


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8193608
DOI: http://dx.doi.org/10.1007/s10260-021-00569-3

Publication Analysis

Top Keywords

regression clustering: 12
robust regression: 8
international trade: 8
trade data: 8
suggested approach: 8
clustering: 5
approach: 5
semiautomatic robust: 4
clustering international: 4
trade: 4

Similar Publications

Detection and quantification of disease-related biomarkers in wastewater samples, denominated Wastewater-based Surveillance (WBS), has proven a valuable strategy for studying the prevalence of infectious diseases within populations in a time- and resource-efficient manner, as wastewater samples are representative of all cases within the catchment area, whether they are clinically reported or not. However, analysis and interpretation of WBS datasets for decision-making during public health emergencies, such as the COVID-19 pandemic, remains an area of opportunity. In this article, a database obtained from wastewater sampling at wastewater treatment plants (WWTPs) and university campuses in Monterrey and Mexico City between 2021 and 2022 was used to train simple clustering- and regression-based risk assessment models to allow for informed prevention and control measures in high-affluence facilities, even if working with low-dimensionality datasets and a limited number of observations.


Profiles of 71 Human Milk Oligosaccharides and Novel Sub-Clusters of Type I Milk: Results from the Ulm SPATZ Health Study.

Nutrients

January 2025

Pediatric Epidemiology, Department of Pediatrics, Medical Faculty, Leipzig University, Liebigstr 20a, Haus 6, 04103 Leipzig, Germany.

Background/objectives: Although approximately 160 human milk oligosaccharides (HMOs) have been identified, current studies on HMO quantitation are limited to the 10-19 most abundant HMOs. We assessed the variations in the relative concentrations of 71 HMO structures over lactation in human milk samples by an advanced liquid chromatography-mass spectrometry approach.

Methods: Samples were collected from 64 mothers at 6 weeks, 6 months, and 12 months of lactation in the Ulm SPATZ Health Study, a German birth cohort.


(1) Background: Surra is a debilitating disease of wild and domestic animals caused by Trypanosoma evansi (T. evansi), resulting in significant mortality and production losses in the affected animals. This study is the first to assess the genetic relationships of T. evansi in naturally affected buffaloes from Multan district, Pakistan, using ITS-1 primers, and to evaluate the effects of parasitemia and oxidative stress on DNA damage and hematobiochemical changes in infected buffaloes. (2) Methods: Blood samples were collected from 167 buffaloes using a multi-stage cluster sampling strategy, and trypomastigote identification was performed through microscopy and PCR targeting RoTat 1.


New-onset postoperative atrial fibrillation (POAF) is the most common complication after cardiac surgery, occurring in approximately one-third of patients. This study considered all-comer patients who underwent cardiac surgery to build a predictive model for POAF. A total of 3467 (Center 1) consecutive patients were used as a derivation cohort to build the model.


Predictive Diagnostic Power of Anthropometric Indicators for Metabolic Syndrome: A Comparative Study in Korean Adults.

J Clin Med

January 2025

School of Global Sport Studies, Korea University, 2511, Sejong-ro, Sejong-si 30019, Republic of Korea.

Metabolic syndrome (MetS) is a cluster of risk factors that significantly increase the risk of cardiovascular disease, type 2 diabetes, and related conditions. Assessing the predictive diagnostic power of anthropometric indicators for MetS is crucial for the early identification and prevention of related health issues. This study focuses on the Korean adult population while providing insights that may be applicable to broader global contexts.

