A Hybrid Missing Data Imputation Method for Batch Process Monitoring Dataset.

Sensors (Basel)

Big Data Analysis and Fusion Application Technology Engineering Laboratory of Sichuan Province, Chengdu 610065, China.

Published: October 2023

AI Article Synopsis

  • Batch process monitoring datasets often have missing data, which can hurt fault identification and control performance.
  • The proposed hybrid imputation method categorizes missing data into five types and then uses tailored strategies, including interpolation and multivariate regression, to fill in the gaps.
  • Experiments show that this method outperforms existing techniques in accurately imputing various types of missing data in real-world scenarios.

Article Abstract

Batch process monitoring datasets usually contain missing data, which decreases the performance of data-driven modeling for fault identification and optimal control. Many methods have been proposed to impute missing data; however, they do not fulfill the need for data quality, especially in sensor datasets with different types of missing data. We propose a hybrid missing data imputation method for batch process monitoring datasets with multi-type missing data. In this method, the missing data is first classified into five categories based on the continuous missing duration and the number of variables missing simultaneously. Then, different categories of missing data are step-by-step imputed considering their unique characteristics. A combination of three single-dimensional interpolation models is employed to impute transient isolated missing values. An iterative imputation based on a multivariate regression model is designed for imputing long-term missing variables, and a combination model based on single-dimensional interpolation and multivariate regression is proposed for imputing short-term missing variables. The Long Short-Term Memory (LSTM) model is utilized to impute both short-term and long-term missing samples. Finally, a series of experiments for different categories of missing data were conducted based on a real-world batch process monitoring dataset. The results demonstrate that the proposed method achieves higher imputation accuracy than other comparative methods.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10650138PMC
http://dx.doi.org/10.3390/s23218678DOI Listing

Publication Analysis

Top Keywords

missing data
36
batch process
16
process monitoring
16
missing
14
data
10
hybrid missing
8
data imputation
8
imputation method
8
method batch
8
monitoring dataset
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!