The main aim of this paper is to propose a novel method (RMD-MRCD-PCA) of identification of High Leverage Points (HLPs) in high-dimensional sparse data. It is to address the weakness of the Robust Mahalanobis Distance (RMD) method which is based on the Minimum Regularized Covariance Determinant (RMD-MRCD), which indicates a decrease in its performance as the number of independent variables () increases. The RMD-MRCD-PCA is developed by incorporating the Principal Component Analysis (PCA) in the MRCD algorithm whereby this robust approach shrinks the covariance matrix to make it invertible and thus, can be employed to compute the RMD for high dimensional data. A simulation study and two real data sets are used to illustrate the merit of our proposed method compared to the RMD-MRCD and Robust PCA (ROBPCA) methods. Findings show that the performance of the RMD-MRCD is similar to the performance of the RMD-MRCD-PCA for close to 200. However, its performance tends to decrease when the number of is more than 200 and worsens at equals 700 and larger. On the other hand, the ROBPCA is not effective for less than 20% contamination as it suffers from serious swamping problems.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10606823PMC
http://dx.doi.org/10.1080/02664763.2022.2093842DOI Listing

Publication Analysis

Top Keywords

minimum regularized
8
regularized covariance
8
covariance determinant
8
principal component
8
identification high
8
high leverage
8
leverage points
8
high dimensional
8
sparse data
8
determinant principal
4

Similar Publications

Goal: A lack of healthcare worker well-being is a serious threat to patient care quality and safety, as well as to the overall operational performance of hospitals in the US healthcare delivery system. Extreme resilience depletion and compassion fatigue are known to negatively influence individual well-being and have contributed to the rise in turnover in the healthcare workforce. The primary aim of this research was to identify interventions that health system leaders can use to combat resilience depletion and exhaustion among healthcare workers.

View Article and Find Full Text PDF

This study is the application of a recurrent neural networks with Bayesian regularization optimizer (RNNs-BRO) to analyze the effect of various physical parameters on fluid velocity, temperature, and mass concentration profiles in the Darcy-Forchheimer flow of propylene glycol mixed with carbon nanotubes model across a stretched cylinder. This model has significant applications in thermal systems such as in heat exchangers, chemical processing, and medical cooling devices. The data-set of the proposed model has been generated with variation of various parameters such as, curvature parameter, inertia coefficient, Hartmann number, porosity parameter, Eckert number, Prandtl number, radiation parameter, activation energy variable, Schmidt number and reaction rate parameter for different scenarios.

View Article and Find Full Text PDF

Forward calculation of airborne gamma 3D radiation fields based on rapid coupling method of point kernel integrals.

J Environ Radioact

December 2024

College of Nuclear Technology and Automation Engineering, Chengdu University of Technology, Chengdu, 610000, China; Applied Nuclear Technology in Geosciences Key Laboratory of Sichuan Province, Chengdu, 610000, China. Electronic address:

Airborne gamma ray spectrum detection technology is an effective means to measure the concentration and spatial distribution of natural radionuclides in environmental media such as surface rocks and soil during aviation flight. Therefore, it is vital to fully explore the radiation information related to mineralization in airborne gamma spectrometry data and explore the dose distribution law of gamma radiation field of radionuclides in the detection area. This paper is based on the theoretical calculation model of ground-air interface gamma radiation field.

View Article and Find Full Text PDF

Minimum carbon dioxide is a key predictor of the respiratory health of pigs in climate-controlled housing systems.

Porcine Health Manag

December 2024

Animal Nutrition and Feed Science Laboratory, Department of Animal Science and Technology, Sunchon National University, Suncheon, 57922, Republic of Korea.

Background: Respiratory disease is an economically important disease in the swine industry. Housing air quality control is crucial for maintaining the respiratory health of pigs. However, maintaining air quality is a limitation of current housing systems.

View Article and Find Full Text PDF

Brucellosis is an infectious zoonotic disease. The disease is one of the major concerns in developing societies due to its great importance for public health and economic losses in the animal industry. The principal target of the study was to detect the prevalence of brucellosis and associated risk factors in cattle from Hamedan (western Iran) using different laboratory techniques.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!