Data-driven Soft Sensors in the process industry

doi:10.1016/j.compchemeng.2008.12.012

Computers & Chemical Engineering

Volume 33, Issue 4, 21 April 2009, Pages 795-814

https://doi.org/10.1016/j.compchemeng.2008.12.012 Get rights and content

Abstract

In the last two decades Soft Sensors established themselves as a valuable alternative to the traditional means for the acquisition of critical process variables, process monitoring and other tasks which are related to process control. This paper discusses characteristics of the process industry data which are critical for the development of data-driven Soft Sensors. These characteristics are common to a large number of process industry fields, like the chemical industry, bioprocess industry, steel industry, etc. The focus of this work is put on the data-driven Soft Sensors because of their growing popularity, already demonstrated usefulness and huge, though yet not completely realised, potential. A comprehensive selection of case studies covering the three most important Soft Sensor application fields, a general introduction to the most popular Soft Sensor modelling techniques as well as a discussion of some open issues in the Soft Sensor development and maintenance and their possible solutions are the main contributions of this work.

Introduction

Industrial processing plants are usually heavily instrumented with a large number of sensors. The primary purpose of the sensors is to deliver data for process monitoring and control. But approximately two decades ago researchers started to make use of the large amounts of data being measured and stored in the process industry by building predictive models based on this data. In the context of process industry, these predictive models are called Soft Sensors. This term is a combination of the words “software”, because the models are usually computer programs, and “sensors”, because the models are delivering similar information as their hardware counterparts. Other common terms for predictive sensors in the process industry are inferential sensors (see e.g. Jordaan et al., 2004, Qin et al., 1997), virtual on-line analyser as they are called in the Six-Sigma context (Han & Lee, 2002) and observer-based sensors (Goodwin, 2000).

At a very general level one can distinguish two different classes of Soft Sensors, namely model-driven and data-driven. The model-driven family of Soft Sensors is most commonly based on First Principle Models (FPM) but model-driven Soft Sensors based on extended Kalman filter (Welch & Bishop, 2001) or adaptive observer (Bastin & Dochain, 1990) have also been published (e.g. Chruy, 1997, Jos de Assis and Maciel Filho, 2000). First Principle Models describe the physical and chemical background of the process. These models are developed primarily for the planning and design of the processing plants, and therefore usually focus on the description of the ideal steady-states of the processes which is only one of their drawbacks which makes it difficult to base Soft Sensors on them. As a solution the data-driven Soft Sensors gained increasing popularity in the process industry. Because data-driven models are based on the data measured within the processing plants, and thus describe the real process conditions, they are, compared to the model-driven Soft Sensors, more reality related and describe the true conditions of the process in a better way. Nevertheless there is a lot of different issues which have to be dealt with while developing data-driven Soft Sensors. These issues will be discussed later on in this paper. The most popular modelling techniques applied to data-driven Soft Sensors are the Principle Component Analysis (PCA) (Jolliffe, 2002) in a combination with a regression model, Partial Least Squares (Wold, Sjstrm, & Eriksson, 2001), Artificial Neural Networks Bishop, 1995, Principe et al., 2000, Hastie et al., 2001, Neuro-Fuzzy Systems Jang et al., 1997, Lin and Lee, 1996 and Support Vector Machines (SVMs) (Vapnik, 1998).

The range of tasks fulfilled by Soft Sensors is broad. The original and still most dominant application area of Soft Sensors is the prediction of process variables which can be determined either at low sampling rates or through off-line analysis only. Because these variables are often related to the process output quality, they are very important for the process control and management. For these reasons it is of great interest to deliver additional information about these variables at higher sampling rate and/or at lower financial burden, which is exactly the role of the Soft Sensors. The modelling methods applied to this kind of applications are either statistical or soft computing supervised learning approaches. This Soft Sensor application field is further on referred to as on-line prediction. Other important application fields of Soft Sensors are those of process monitoring and process fault detection. These tasks refer to detection of the state of the process and in the case of a deviation from the normal conditions to identification of the deviation source. Traditionally, the process state is monitored by process operators in the control rooms of the processing plants. The observation and interpretation of the process state is often based on univariate statistics and it is up to the experience of the process operator to put the particular variables into relations and to make decisions about the process state. The role of process monitoring Soft Sensors is, based on the historical data, to build multivariate features which are relevant for the description of the process state. By presenting the predicted process state or the multivariate features the Soft Sensor can support the process operators and allow them to make faster, better and more objective decisions. Process monitoring Soft Sensors are usually based on the Principle Component Analysis and Self Organizing Maps (Kohonen, 1997). It was already mentioned that processing plants embody large number of various sensors, therefore there is a certain probability that a sensor can occasionally fail. Detection of this failure is the next application area of Soft Sensors. In more general terms this application field can be described as sensor fault detection and reconstruction. Once a faulty sensor is detected and identified, it can be either reconstructed or the hardware sensor can be replaced by another Soft Sensor, which is trained to act as a back-up Soft Sensor of the hardware measuring device. If the back-up sensor proves to be an adequate replacement of the physical sensor, this idea can be driven even further and the Soft Sensor can replace the measuring device also in normal working conditions. The software tool can be easier maintained and is not subject to mechanical failures and therefore such a substitution can provide a financial advantage for the process owner.

Despite all the previously listed Soft Sensor application fields and the high number of publications dealing with Soft Sensor applications, there are still some unaddressed issues of the Soft Sensor development and maintenance. A lot of the origins of these issues are in the process data which is used for the Soft Sensor building. Common effects present in the data are measurement noise, missing values, data outliers, co-linear features and varying sampling rates. To solve these problems, there is typically a large amount of manual work needed. Another problem is that the processing plants are rather dynamic environments. Often they develop gradually during the operation time but there can be also sudden abrupt changes of the process, for example, if the quality of the process input changes. It is very difficult for the Soft Sensors to react to these changes which usually results in prediction accuracy deterioration. At present time, these issues are solved in a rather ad hoc manner, which leads to unnecessary high costs of the Soft Sensor development and maintenance. Further on in this work, all the aspects, which have been briefly outlined in this section, are going to be reviewed in a more comprehensive way. The rest of the paper is organized as follows. Section 2 gives an overview of different process types and deals with their aspects from the Soft Sensor modelling point of view. Section 3 focuses on data-driven Soft Sensors, namely on their development methodology, on the methods which are commonly applied to soft sensing and on open issues of the Soft Sensor modelling. A review of publications dealing with Soft Sensor application to diverse processes is also given in Section 3. Section 4 provides a brief description of most popular data-driven pre-processing and modelling techniques to soft sensing. Section 5 contains a discussion of the most important open issues of Soft Sensor development and maintenance as well as an outline of the future research directions in the Soft Sensors field. Finally, the work is concluded in Section 6.

Section snippets

Industrial processes

This section deals with the process industry environment. First, the two different types of industrial processes and their distinguishing characteristics are discussed in Section 2.1. This is followed by a detailed discussions of the data produced in the process industry in Section 2.2.

Soft Sensors in the process industry

This section deals with Soft Sensors in a detailed way. After distinguishing two types of them in Section 3.1 a discussion of a state-of-the-art Soft Sensor development methodology is given in Section 3.2. Section 3.3 provides a comprehensive overview of published Soft Sensor application case studies.

Data-driven methods for soft sensing

This section outlines and provides further references to the most popular techniques for Soft Sensor development as they were identified in Section 3.3.4. These are the multivariate Principle Component Analysis (Section 4.1), Partial Least Squares (Section 4.2), Artificial Neural Networks (Section 4.3), Neuro-Fuzzy Systems (Section 4.4) and Support Vector Machines (Section 4.5). Finally, the last part of the section deals with the modelling of batch processes (Section 4.6).

Open issues and future steps of Soft Sensor development

There are two main issues in the Soft Sensor development and maintenance, respectively. At the development phase, there has to be a lot of effort spent on the manual pre-processing of the data as well as on the model selection and validation steps. To be able to deal with issues like missing values, data outliers, etc., discussed in Section 2.2, the model developer has to manually try different pre-processing approaches and select the one giving the best performance as estimated on the training

Summary

Fig. 6 provides a summary of this review. We focus on two main aspects of the Soft Sensor development: (i) on the process industry and (ii) on the most common computational learning techniques applied for the Soft Sensor modelling.

This review mainly focused on data-driven and grey-box Soft Sensors. The data for the training, evaluation and testing of the models is delivered by the process industry. The industrial data has some common properties like missing values and outliers, which are listed

References (163)

H. Abdi
Partial least squares (PLS) regression
Encyclopedia of social sciences, research methods
(2003)
E.S.A. Alhoniemi
Process monitoring and modeling using the self-organizing map
Integrated Computer-Aided Engineering
(1999)
Amazouz, M., & Pantea, R. (2006). Use of multivariate data analysis for lumber drying process monitoring and fault...
P. Angelov et al.
Identification of evolving fuzzy rule-based models
IEEE Transactions on Fuzzy Systems
(2002)
P. Angelov et al.
Evolving computational intelligence systems
P.P. Angelov et al.
Flexible models with evolving structure
International Journal of Intelligent Systems
(2004)
M.J. Arazo-Bravo et al.
Automatization of a penicillin production process with soft sensors and an adaptive controller based on neuro fuzzy systems
Control Engineering Practice
(2004)
C.G. Atkeson et al.
Locally weighted learning
Artificial Intelligence Review
(1997)
G. Bastin et al.
On-line estimation and adaptive control of bioreactors
(1990)
E. Bauer et al.
An empirical comparison of voting classification algorithms: Bagging, boosting, and variants
Machine Learning
(1999)

C.M. Bishop

Neural networks for pattern recognition

(1995)

D. Bonne et al.

Data-driven modeling of batch processes

L. Breiman

Bagging predictors

Machine Learning

(1996)

R. Bro

Multiway calibration. Multilinear PLS

Journal of Chemometrics

(1996)

A. Casali et al.

Particle size distribution soft-sensor for a grinding circuit

Powder Technology

(1998)

Champagne, M., Dudzic, M., Inc, T., & Temiscaming, Q. (2002). Industrial use of multivariate statistical analysis for...

L. Chen et al.

Hybrid modelling of biotechnological processes using neural networks

Control Engineering Practice

(2000)

J.M. Chen et al.

System parameter estimation with input/output noisy data and missing measurements

IEEE Transactions on Signal Processing [see also IEEE Transactions on Acoustics, Speech, and Signal Processing]

(2000)

X. Chen et al.

A soft-sensor development for melt-flow-length measurement during injection mold filling

Materials Science & Engineering A

(2004)

J. Chen et al.

On-line batch process monitoring using dynamic PCA and dynamic PLS models

Chemical Engineering Science

(2002)

L.Z. Chen et al.

Soft sensors for on-line biomass measurements

Bioprocess and Biosystems Engineering

(2004)

S.W. Choi et al.

Adaptive multivariate statistical process control for monitoring time-varying processes

Industrial & Engineering Chemistry Research

(2006)

A. Chruy

Software sensors in bioprocess engineering

Journal of Biotechnology

(1997)

L. Davies et al.

The identification of multiple outliers

Journal of the American Statistical Association

(1993)

B.S. Dayal et al.

Recursive exponentially weighted PLS and its applications to adaptive control and prediction

Journal of Process Control

(1997)

S. De Wolf et al.

Model predictive control of a slurry polymerisation reactor

Computers and Chemical Engineering

(1996)

K. Desai et al.

Soft-sensor development for fed-batch bioreactors using support vector regression

Biochemical Engineering Journal

(2006)

D. Devogelaere et al.

Application of feedforward neural networks for soft sensors in the sugar industry

F. Ding et al.

Modeling and identification for multirate systems

Acta Automatica Sinica

(2005)

D. Dong et al.

Nonlinear principal component analysis-based on principal curves and neural networks

Computers and Chemical Engineering

(1996)

D. Dong et al.

Emission monitoring using multivariate soft sensors

Y. Dote et al.

Industrial applications of soft computing: A review

F.J. Doyle

Nonlinear inferential control for process applications

Journal of Process Control

(1998)

R. Dunia et al.

Joint diagnosis of process and sensor faults using principal component analysis

Control Engineering Practice

(1998)

R. Dunia et al.

Subspace approach to multidimensional identification and reconstruction

AIChE Journal

(1998)

R. Dunia et al.

Sensor fault identification and reconstruction using principal component analysis

L. Eriksson et al.

Multi-and megavariate data analysis: Principles and applications

(2001)

M. Fellner et al.

Functional nodes in dynamic neural networks for bioprocess modelling

Bioprocess and Biosystems Engineering

(2003)

Feng, R., Shen, W., & Shao, H. (2003). A soft sensor modeling approach using support vector machines. In Proceedings of...

L. Fortuna

Soft sensors for monitoring and control of industrial processes

(2007)

L. Fortuna et al.

Soft sensors for product quality monitoring in debutanizer distillation columns

Control Engineering Practice

(2005)

I.E. Frank et al.

A statistical view of some chemometrics regression tools

Technometrics

(1993)

Y. Freund et al.

A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences

(1997)

K. Funahashi

On the approximate realization of continuous mappings by neural networks

Neural Networks

(1989)

J. Gabrielsson et al.

Combining process and spectroscopic data to improve batch modeling

AIChE Journal

(2006)

B. Gabrys

Neuro-fuzzy approach to processing inputs with missing values in pattern recognition problems

International Journal of Approximate Reasoning

(2002)

B. Gabrys

Learning hybrid neuro-fuzzy classifier models from data: To combine or not to combine?

Fuzzy Sets and Systems

(2004)

B. Gabrys et al.

Neural networks based decision support in presence of uncertainties

Journal of Water Resources Planning and Management

(1999)

B. Gabrys et al.

Genetic algorithms in classifier fusion

Applied Soft Computing

(2006)

J. Gama et al.

Learning with drift detection

Cited by (1467)

A task-oriented deep learning framework based on target-related transformer network for industrial quality prediction applications
2024, Engineering Applications of Artificial Intelligence
Executing various production tasks is critical to the safe operation and efficient production of industrial processes. As one of them, the detection task of key quality variables directly affects the operation optimization and decision-making of industrial processes, but it is severely limited by the harsh environment and detection instruments. Therefore, the real-time prediction task of key quality variables becomes the basis for optimal control of industrial processes. To address this issue, this paper proposes a task-oriented deep learning framework based on a target-related transformer (TR-Former) network for industrial quality prediction tasks. Specifically, a new target-related self-attention (TR-SA) mechanism is developed to guide feature learning by adding attention scores between task-related target variables and other variables. As a result, the learned features in this instance will be guaranteed to be relevant to the target variable and useful for the quality prediction task. Moreover, the long-range dynamics of industrial process data can also be captured, which can further improve the prediction performance of the model. Finally, extensive experiments were conducted on two industrial processes to validate the superiority of the proposed method in terms of quality prediction tasks. The experimental results demonstrate that the proposed TR-Former method exhibits an improvement ranging from 3% to 13% in the mean absolute error indicator compared to the traditional transformer and other state-of-the-art methods.
Residual-aware deep attention graph convolutional network via unveiling data latent interactions for product quality prediction in industrial processes
2024, Expert Systems with Applications
Data-driven modeling based on deep learning is crucial for online product quality prediction in industrial processes. Extracting latent data interactions from sensor variables lies at the core of various data-driven modeling applications. Typically, the observed variables exhibit non-stationary properties due to changes in operating conditions and sensor tuning issues. These fluctuations inevitably affect the reliability of traditional feature extraction methods, thus hindering their applications. Hence, this paper proposes a novel method named residual-aware deep attention graph convolutional network (RaDA-GCN) to explore potential interactions among sensor variables. RaDA-GCN ingeniously incorporates an attention mechanism into the graph convolution layer to extract nonlinear variable-related features according to their importance. Then, a novel residual-aware connection module is designed to reduce data uncertainty and alleviate over-smoothing. By skillfully stacking multiple attention map convolutional layers with the integration of residual-aware connections, deep structural features are obtained to facilitate the effective quantification and revelation of potential relationships among data variables. Finally, the application of the predictive modeling framework based on the proposed method is used to validate its effectiveness in actual industrial process data. The experimental results demonstrate that the proposed RaDA-GCN method exhibits a 23% improvement in the R-squared (R²) indicator and a 13% reduction in the root mean square error (RMSE) compared to the traditional graph convolutional network method.
Quick prediction of thinning in stretch forming of hat-shaped profiles during multi-stage hot sheet metal forming
2024, Journal of Materials Processing Technology
A novel quick model predicting the thinning distribution in hot stretch forming of hat-shaped profiles as a building block for a closed-loop control of a multi-stage forming process operated at a stroke rate of up to 12 strokes/min (5 s per stroke) is presented. The key characteristic of this new model is that it considers both a spatially as well as time-variant temperature distribution within the blank during forming. This is achieved by using temperature sensors and applying the information as input for the model, which is based on force equilibria and the theory of plasticity. The model predicts the geometry of the formed part which can be used as crucial input information to the process control. For the quick model, an element-based time-discrete approach was chosen. By assuming a plane strain condition and a decoupling of thermal and mechanical computation as well as by adapting further assumptions, the calculation time on a current desktop PC is 4 s. The achieved calculation time is sufficiently short to realize the control at the given stroke rates. The model is validated by experiments using a hot stretch forming setup designed to simulate the multi-stage process. The model successfully replicates the influence of various stroke rates on the final thinning distribution and predicts the effects of diverse pre-cooling scenarios.
Monitoring the evolution of dimensional accuracy and product properties in property-controlled forming processes
2024, Advances in Industrial and Manufacturing Engineering
As recent trends in manufacturing engineering disciplines show a clear development in the sustainable as well as economically efficient design of forming processes, monitoring techniques have been gaining in relevance. In terms of monitoring of product properties, most processes are currently open-loop controlled, entailing that the microstructure evolution, which determines the final product properties, is not considered. However, a closed-loop control that can adjust and manipulate the process actuators according to the required product properties of the component will lead to a considerable increase in efficiency of the processes regarding resources and will decrease postproduction of the component. For most forming processes, one set of component dimensions will result in a certain set of product properties. However, to successfully establish closed-loop property controls for the processes, a systematic understanding of the reciprocity of the dimensions after forming and final product properties must be established. This work investigates the evolution of dimensional accuracy as well as product properties for a series of forming processes that utilize different degrees of freedom for process control.
Process structure-based fully connected neural network for the modelling of chemical processes: A comparison between global and modular configurations
2024, Journal of the Taiwan Institute of Chemical Engineers
Chemical process modeling is essential for the optimization, precise control and prediction of processes performance. As the modeling based on the first principles is time-consuming and labor-intensive, utilizing neural networks for the process modeling was proposed. However, the poor interpretability of the neural network models is may a problem in the modelling of chemical processes where the accuracy and reliability of the model are highly required.
Inspired by the functional partitioning of neural networks in human brain, in this paper, global fully connected neural network (GFNN) and modular fully connected neural network (MFNN) models are constructed for the production process of ethylbenzene and that of isopropanol with cycle process, respectively. For the chemical processes with complex cycle processes, the quality of the product can be easily affected by small feed changes due to the highly nonlinear model which is called snowball effect. To avoid snowball effect, the cycle-avoidance MFNN is constructed, and it shows better modelling performance than GFNN.
MFNN model has more advantages than GFNN model in the modelling of large-scale chemical processes without cycle processes. In addition, it is clear that MFNN model has a partial interpretability which means the local physical meaning for different parts of the whole process.
Process expert knowledge is essential in creating value from data-driven industrial soft sensors
2024, Computers and Chemical Engineering
The objective of Industry 5.0 is to (re)centre the human operator amidst digital process automation. This requires new data processing technologies that extract human expertise and integrate it with advanced data modelling techniques to enhance human-computer collaboration. In this work, we present an integrated and systematic approach that combines contemporary data modelling technology with process knowledge gained from engineers. Specifically, we develop, investigate, and compare data- and expert-driven approaches for selecting process variables for a real-time product quality prediction for three parallel-operated dairy processing lines. We show how and why data-driven selections are outperformed by expert-driven selections, but that they can improve upon expert-driven selections when used as a secondary selection step. This leads to explainable soft-sensors that are optimal both in accuracy and in parsimony. This highlights the significance of capturing and preserving human expertise, which will enhance the quality and sustainability of industrial processing.

View all citing articles on Scopus

View full text

ReviewData-driven Soft Sensors in the process industry

Abstract

Introduction

Section snippets

Industrial processes

Soft Sensors in the process industry

Data-driven methods for soft sensing

Open issues and future steps of Soft Sensor development

Summary

Partial least squares (PLS) regression

Encyclopedia of social sciences, research methods

Process monitoring and modeling using the self-organizing map

Integrated Computer-Aided Engineering

Identification of evolving fuzzy rule-based models

IEEE Transactions on Fuzzy Systems

Evolving computational intelligence systems

Flexible models with evolving structure

International Journal of Intelligent Systems

Automatization of a penicillin production process with soft sensors and an adaptive controller based on neuro fuzzy systems

Control Engineering Practice

Locally weighted learning

Artificial Intelligence Review

On-line estimation and adaptive control of bioreactors

An empirical comparison of voting classification algorithms: Bagging, boosting, and variants

Machine Learning

Neural networks for pattern recognition

Data-driven modeling of batch processes

Bagging predictors

Machine Learning

Multiway calibration. Multilinear PLS

Journal of Chemometrics

Particle size distribution soft-sensor for a grinding circuit

Powder Technology

Hybrid modelling of biotechnological processes using neural networks

Control Engineering Practice

System parameter estimation with input/output noisy data and missing measurements

IEEE Transactions on Signal Processing [see also IEEE Transactions on Acoustics, Speech, and Signal Processing]

A soft-sensor development for melt-flow-length measurement during injection mold filling

Materials Science & Engineering A

On-line batch process monitoring using dynamic PCA and dynamic PLS models

Chemical Engineering Science

Soft sensors for on-line biomass measurements

Bioprocess and Biosystems Engineering

Adaptive multivariate statistical process control for monitoring time-varying processes

Industrial & Engineering Chemistry Research

Software sensors in bioprocess engineering

Journal of Biotechnology

The identification of multiple outliers

Journal of the American Statistical Association

Recursive exponentially weighted PLS and its applications to adaptive control and prediction

Journal of Process Control

Model predictive control of a slurry polymerisation reactor

Computers and Chemical Engineering

Soft-sensor development for fed-batch bioreactors using support vector regression

Biochemical Engineering Journal

Application of feedforward neural networks for soft sensors in the sugar industry

Modeling and identification for multirate systems

Acta Automatica Sinica

Nonlinear principal component analysis-based on principal curves and neural networks

Computers and Chemical Engineering

Emission monitoring using multivariate soft sensors

Industrial applications of soft computing: A review

Nonlinear inferential control for process applications

Journal of Process Control

Joint diagnosis of process and sensor faults using principal component analysis

Control Engineering Practice

Subspace approach to multidimensional identification and reconstruction

AIChE Journal

Sensor fault identification and reconstruction using principal component analysis

Multi-and megavariate data analysis: Principles and applications

Functional nodes in dynamic neural networks for bioprocess modelling

Bioprocess and Biosystems Engineering

Soft sensors for monitoring and control of industrial processes

Soft sensors for product quality monitoring in debutanizer distillation columns

Control Engineering Practice

A statistical view of some chemometrics regression tools

Technometrics

A decision-theoretic generalization of on-line learning and an application to boosting

Journal of Computer and System Sciences

On the approximate realization of continuous mappings by neural networks

Review
Data-driven Soft Sensors in the process industry