Review
Data-driven Soft Sensors in the process industry

https://doi.org/10.1016/j.compchemeng.2008.12.012Get rights and content

Abstract

In the last two decades Soft Sensors established themselves as a valuable alternative to the traditional means for the acquisition of critical process variables, process monitoring and other tasks which are related to process control. This paper discusses characteristics of the process industry data which are critical for the development of data-driven Soft Sensors. These characteristics are common to a large number of process industry fields, like the chemical industry, bioprocess industry, steel industry, etc. The focus of this work is put on the data-driven Soft Sensors because of their growing popularity, already demonstrated usefulness and huge, though yet not completely realised, potential. A comprehensive selection of case studies covering the three most important Soft Sensor application fields, a general introduction to the most popular Soft Sensor modelling techniques as well as a discussion of some open issues in the Soft Sensor development and maintenance and their possible solutions are the main contributions of this work.

Introduction

Industrial processing plants are usually heavily instrumented with a large number of sensors. The primary purpose of the sensors is to deliver data for process monitoring and control. But approximately two decades ago researchers started to make use of the large amounts of data being measured and stored in the process industry by building predictive models based on this data. In the context of process industry, these predictive models are called Soft Sensors. This term is a combination of the words “software”, because the models are usually computer programs, and “sensors”, because the models are delivering similar information as their hardware counterparts. Other common terms for predictive sensors in the process industry are inferential sensors (see e.g. Jordaan et al., 2004, Qin et al., 1997), virtual on-line analyser as they are called in the Six-Sigma context (Han & Lee, 2002) and observer-based sensors (Goodwin, 2000).

At a very general level one can distinguish two different classes of Soft Sensors, namely model-driven and data-driven. The model-driven family of Soft Sensors is most commonly based on First Principle Models (FPM) but model-driven Soft Sensors based on extended Kalman filter (Welch & Bishop, 2001) or adaptive observer (Bastin & Dochain, 1990) have also been published (e.g. Chruy, 1997, Jos de Assis and Maciel Filho, 2000). First Principle Models describe the physical and chemical background of the process. These models are developed primarily for the planning and design of the processing plants, and therefore usually focus on the description of the ideal steady-states of the processes which is only one of their drawbacks which makes it difficult to base Soft Sensors on them. As a solution the data-driven Soft Sensors gained increasing popularity in the process industry. Because data-driven models are based on the data measured within the processing plants, and thus describe the real process conditions, they are, compared to the model-driven Soft Sensors, more reality related and describe the true conditions of the process in a better way. Nevertheless there is a lot of different issues which have to be dealt with while developing data-driven Soft Sensors. These issues will be discussed later on in this paper. The most popular modelling techniques applied to data-driven Soft Sensors are the Principle Component Analysis (PCA) (Jolliffe, 2002) in a combination with a regression model, Partial Least Squares (Wold, Sjstrm, & Eriksson, 2001), Artificial Neural Networks Bishop, 1995, Principe et al., 2000, Hastie et al., 2001, Neuro-Fuzzy Systems Jang et al., 1997, Lin and Lee, 1996 and Support Vector Machines (SVMs) (Vapnik, 1998).

The range of tasks fulfilled by Soft Sensors is broad. The original and still most dominant application area of Soft Sensors is the prediction of process variables which can be determined either at low sampling rates or through off-line analysis only. Because these variables are often related to the process output quality, they are very important for the process control and management. For these reasons it is of great interest to deliver additional information about these variables at higher sampling rate and/or at lower financial burden, which is exactly the role of the Soft Sensors. The modelling methods applied to this kind of applications are either statistical or soft computing supervised learning approaches. This Soft Sensor application field is further on referred to as on-line prediction. Other important application fields of Soft Sensors are those of process monitoring and process fault detection. These tasks refer to detection of the state of the process and in the case of a deviation from the normal conditions to identification of the deviation source. Traditionally, the process state is monitored by process operators in the control rooms of the processing plants. The observation and interpretation of the process state is often based on univariate statistics and it is up to the experience of the process operator to put the particular variables into relations and to make decisions about the process state. The role of process monitoring Soft Sensors is, based on the historical data, to build multivariate features which are relevant for the description of the process state. By presenting the predicted process state or the multivariate features the Soft Sensor can support the process operators and allow them to make faster, better and more objective decisions. Process monitoring Soft Sensors are usually based on the Principle Component Analysis and Self Organizing Maps (Kohonen, 1997). It was already mentioned that processing plants embody large number of various sensors, therefore there is a certain probability that a sensor can occasionally fail. Detection of this failure is the next application area of Soft Sensors. In more general terms this application field can be described as sensor fault detection and reconstruction. Once a faulty sensor is detected and identified, it can be either reconstructed or the hardware sensor can be replaced by another Soft Sensor, which is trained to act as a back-up Soft Sensor of the hardware measuring device. If the back-up sensor proves to be an adequate replacement of the physical sensor, this idea can be driven even further and the Soft Sensor can replace the measuring device also in normal working conditions. The software tool can be easier maintained and is not subject to mechanical failures and therefore such a substitution can provide a financial advantage for the process owner.

Despite all the previously listed Soft Sensor application fields and the high number of publications dealing with Soft Sensor applications, there are still some unaddressed issues of the Soft Sensor development and maintenance. A lot of the origins of these issues are in the process data which is used for the Soft Sensor building. Common effects present in the data are measurement noise, missing values, data outliers, co-linear features and varying sampling rates. To solve these problems, there is typically a large amount of manual work needed. Another problem is that the processing plants are rather dynamic environments. Often they develop gradually during the operation time but there can be also sudden abrupt changes of the process, for example, if the quality of the process input changes. It is very difficult for the Soft Sensors to react to these changes which usually results in prediction accuracy deterioration. At present time, these issues are solved in a rather ad hoc manner, which leads to unnecessary high costs of the Soft Sensor development and maintenance. Further on in this work, all the aspects, which have been briefly outlined in this section, are going to be reviewed in a more comprehensive way. The rest of the paper is organized as follows. Section 2 gives an overview of different process types and deals with their aspects from the Soft Sensor modelling point of view. Section 3 focuses on data-driven Soft Sensors, namely on their development methodology, on the methods which are commonly applied to soft sensing and on open issues of the Soft Sensor modelling. A review of publications dealing with Soft Sensor application to diverse processes is also given in Section 3. Section 4 provides a brief description of most popular data-driven pre-processing and modelling techniques to soft sensing. Section 5 contains a discussion of the most important open issues of Soft Sensor development and maintenance as well as an outline of the future research directions in the Soft Sensors field. Finally, the work is concluded in Section 6.

Section snippets

Industrial processes

This section deals with the process industry environment. First, the two different types of industrial processes and their distinguishing characteristics are discussed in Section 2.1. This is followed by a detailed discussions of the data produced in the process industry in Section 2.2.

Soft Sensors in the process industry

This section deals with Soft Sensors in a detailed way. After distinguishing two types of them in Section 3.1 a discussion of a state-of-the-art Soft Sensor development methodology is given in Section 3.2. Section 3.3 provides a comprehensive overview of published Soft Sensor application case studies.

Data-driven methods for soft sensing

This section outlines and provides further references to the most popular techniques for Soft Sensor development as they were identified in Section 3.3.4. These are the multivariate Principle Component Analysis (Section 4.1), Partial Least Squares (Section 4.2), Artificial Neural Networks (Section 4.3), Neuro-Fuzzy Systems (Section 4.4) and Support Vector Machines (Section 4.5). Finally, the last part of the section deals with the modelling of batch processes (Section 4.6).

Open issues and future steps of Soft Sensor development

There are two main issues in the Soft Sensor development and maintenance, respectively. At the development phase, there has to be a lot of effort spent on the manual pre-processing of the data as well as on the model selection and validation steps. To be able to deal with issues like missing values, data outliers, etc., discussed in Section 2.2, the model developer has to manually try different pre-processing approaches and select the one giving the best performance as estimated on the training

Summary

Fig. 6 provides a summary of this review. We focus on two main aspects of the Soft Sensor development: (i) on the process industry and (ii) on the most common computational learning techniques applied for the Soft Sensor modelling.

This review mainly focused on data-driven and grey-box Soft Sensors. The data for the training, evaluation and testing of the models is delivered by the process industry. The industrial data has some common properties like missing values and outliers, which are listed

References (163)

  • H. Abdi

    Partial least squares (PLS) regression

    Encyclopedia of social sciences, research methods

    (2003)
  • E.S.A. Alhoniemi

    Process monitoring and modeling using the self-organizing map

    Integrated Computer-Aided Engineering

    (1999)
  • Amazouz, M., & Pantea, R. (2006). Use of multivariate data analysis for lumber drying process monitoring and fault...
  • P. Angelov et al.

    Identification of evolving fuzzy rule-based models

    IEEE Transactions on Fuzzy Systems

    (2002)
  • P. Angelov et al.

    Evolving computational intelligence systems

  • P.P. Angelov et al.

    Flexible models with evolving structure

    International Journal of Intelligent Systems

    (2004)
  • M.J. Arazo-Bravo et al.

    Automatization of a penicillin production process with soft sensors and an adaptive controller based on neuro fuzzy systems

    Control Engineering Practice

    (2004)
  • C.G. Atkeson et al.

    Locally weighted learning

    Artificial Intelligence Review

    (1997)
  • G. Bastin et al.

    On-line estimation and adaptive control of bioreactors

    (1990)
  • E. Bauer et al.

    An empirical comparison of voting classification algorithms: Bagging, boosting, and variants

    Machine Learning

    (1999)
  • C.M. Bishop

    Neural networks for pattern recognition

    (1995)
  • D. Bonne et al.

    Data-driven modeling of batch processes

  • L. Breiman

    Bagging predictors

    Machine Learning

    (1996)
  • R. Bro

    Multiway calibration. Multilinear PLS

    Journal of Chemometrics

    (1996)
  • A. Casali et al.

    Particle size distribution soft-sensor for a grinding circuit

    Powder Technology

    (1998)
  • Champagne, M., Dudzic, M., Inc, T., & Temiscaming, Q. (2002). Industrial use of multivariate statistical analysis for...
  • L. Chen et al.

    Hybrid modelling of biotechnological processes using neural networks

    Control Engineering Practice

    (2000)
  • J.M. Chen et al.

    System parameter estimation with input/output noisy data and missing measurements

    IEEE Transactions on Signal Processing [see also IEEE Transactions on Acoustics, Speech, and Signal Processing]

    (2000)
  • X. Chen et al.

    A soft-sensor development for melt-flow-length measurement during injection mold filling

    Materials Science & Engineering A

    (2004)
  • J. Chen et al.

    On-line batch process monitoring using dynamic PCA and dynamic PLS models

    Chemical Engineering Science

    (2002)
  • L.Z. Chen et al.

    Soft sensors for on-line biomass measurements

    Bioprocess and Biosystems Engineering

    (2004)
  • S.W. Choi et al.

    Adaptive multivariate statistical process control for monitoring time-varying processes

    Industrial & Engineering Chemistry Research

    (2006)
  • A. Chruy

    Software sensors in bioprocess engineering

    Journal of Biotechnology

    (1997)
  • L. Davies et al.

    The identification of multiple outliers

    Journal of the American Statistical Association

    (1993)
  • B.S. Dayal et al.

    Recursive exponentially weighted PLS and its applications to adaptive control and prediction

    Journal of Process Control

    (1997)
  • S. De Wolf et al.

    Model predictive control of a slurry polymerisation reactor

    Computers and Chemical Engineering

    (1996)
  • K. Desai et al.

    Soft-sensor development for fed-batch bioreactors using support vector regression

    Biochemical Engineering Journal

    (2006)
  • D. Devogelaere et al.

    Application of feedforward neural networks for soft sensors in the sugar industry

  • F. Ding et al.

    Modeling and identification for multirate systems

    Acta Automatica Sinica

    (2005)
  • D. Dong et al.

    Nonlinear principal component analysis-based on principal curves and neural networks

    Computers and Chemical Engineering

    (1996)
  • D. Dong et al.

    Emission monitoring using multivariate soft sensors

  • Y. Dote et al.

    Industrial applications of soft computing: A review

  • F.J. Doyle

    Nonlinear inferential control for process applications

    Journal of Process Control

    (1998)
  • R. Dunia et al.

    Joint diagnosis of process and sensor faults using principal component analysis

    Control Engineering Practice

    (1998)
  • R. Dunia et al.

    Subspace approach to multidimensional identification and reconstruction

    AIChE Journal

    (1998)
  • R. Dunia et al.

    Sensor fault identification and reconstruction using principal component analysis

  • L. Eriksson et al.

    Multi-and megavariate data analysis: Principles and applications

    (2001)
  • M. Fellner et al.

    Functional nodes in dynamic neural networks for bioprocess modelling

    Bioprocess and Biosystems Engineering

    (2003)
  • Feng, R., Shen, W., & Shao, H. (2003). A soft sensor modeling approach using support vector machines. In Proceedings of...
  • L. Fortuna

    Soft sensors for monitoring and control of industrial processes

    (2007)
  • L. Fortuna et al.

    Soft sensors for product quality monitoring in debutanizer distillation columns

    Control Engineering Practice

    (2005)
  • I.E. Frank et al.

    A statistical view of some chemometrics regression tools

    Technometrics

    (1993)
  • Y. Freund et al.

    A decision-theoretic generalization of on-line learning and an application to boosting

    Journal of Computer and System Sciences

    (1997)
  • K. Funahashi

    On the approximate realization of continuous mappings by neural networks

    Neural Networks

    (1989)
  • J. Gabrielsson et al.

    Combining process and spectroscopic data to improve batch modeling

    AIChE Journal

    (2006)
  • B. Gabrys

    Neuro-fuzzy approach to processing inputs with missing values in pattern recognition problems

    International Journal of Approximate Reasoning

    (2002)
  • B. Gabrys

    Learning hybrid neuro-fuzzy classifier models from data: To combine or not to combine?

    Fuzzy Sets and Systems

    (2004)
  • B. Gabrys et al.

    Neural networks based decision support in presence of uncertainties

    Journal of Water Resources Planning and Management

    (1999)
  • B. Gabrys et al.

    Genetic algorithms in classifier fusion

    Applied Soft Computing

    (2006)
  • J. Gama et al.

    Learning with drift detection

  • Cited by (1467)

    View all citing articles on Scopus
    View full text