Elsevier

Journal of Cleaner Production

Volume 283, 10 February 2021, 125287
Journal of Cleaner Production

An evolutionary-based predictive soft computing model for the prediction of electricity consumption using multi expression programming

https://doi.org/10.1016/j.jclepro.2020.125287Get rights and content

Abstract

Proper estimation of electricity consumption is one of the influential factors for sustainability and cleaner production in both developed and developing countries. Many studies have been conducted to present accurate prediction models for forecasting electricity demand. However, researchers are still working to develop models with higher accuracy. This study applies a newer branch of Genetic Programming (GP) as a soft computing technique, known as Multi Expression Programming (MEP) to predict the electricity consumption of China for the first time based on the data collected from 1991 to 2019. Specifically, a robust mathematical model was developed using MEP for this purpose. Different predictive techniques known as Gene Expression Programming (GEP) and Adaptive Neuro Fuzzy Inference System (ANFIS) were used to compare the accuracy of the model. Based on the results, the proposed MEP model is more powerful and accurate than both GEP and ANFIS. In addition, a sensitivity analysis was conducted to present the impact of each factor on the electricity consumption of China. It was shown that among the four independent factors (Population, Gross Domestic Product (GDP), Import, and Export), Population has the highest impact, followed by Export, Import and GDP, respectively.

Introduction

Electricity demand forecasting is one of the critical issues in sustainability and cleaner production (Wang et al., 2019).de Azevedo et al. (2018) stated that sustainability comprises three elements which are economy, environment and society. In the recent decades, the demand for electricity and energy resources has increased tremendously and therefore, its linkage with sustainability has received much attention.

Long-term planning of energy supply-demand must satisfy the requirements for sustainable development of a country. Accurate forecasts can help decision makers to know the volume and trend of future energy consumption to better schedule and plan the operations of the supply system. Since most countries perform financial budget planning and allocation on a yearly basis, an annual forecast of energy demand is important. A precise estimation of energy consumption is necessary to manage social and economic growth of developing countries such as China. Previous reports showed that over the past two decades, the demand (consumption) of energy has gone up almost 200% in China (Petroleum, 2009). In addition, over these years, China has faced energy shortages many times (Wang, Q. et al., 2018). In order to avoid such a problem which strongly affects economic growth, developing an accurate predictive model for energy consumption is very significant for China’s policy makers.

The related literature on energy consumption prediction reveals that many scholars have applied different approaches to estimate energy consumption. Overall, these methods are classified into three categories: 1) Time series methods; 2) Regression-based methods; 3) Soft computing methods.

Over the recent years, soft computing techniques have been widely used for simulating and estimating energy consumption. Generally, soft computing techniques can be categorized into three groups: 1) Optimization algorithms such as Ant Colony Optimization (ACO), Particle Swarm Optimization (PSO) and Genetic Algorithm (GA); 2) Neural-based Predictive Soft Computing (NPSC) methods such as Adaptive Neuro Fuzzy Inference System (ANFIS), Support Vector Machine (SVM) and Artificial Neural Network (ANN) (Zhou et al., 2020); 3) Evolutionary-based Predictive Soft Computing (EPSC) techniques such as Genetic Programming (GP) and Gene Expression Programming (GEP).

Many researchers have applied NPSC methods in their studies and indicated that these methods are powerful in their estimation, but their serious drawback is that they act as a black box system (Luo et al., 2019; Pattanaik et al., 2019). This denotes that they are not able to provide a clear mathematical equation connecting the output (energy consumption) and the input variables (such as Gross Domestic Product (GDP), Population, Import, and Export). Therefore, there are still some basic questions to be considered:

  • i)

    How useful is an intangible neural-based mechanism which only predicts energy demand without any mathematical model?

  • ii)

    How can the developed NPSC mechanism ease the forecasting procedure for decision makers if this type of technique strongly needs special knowledge?

In order to answer the aforementioned questions, EPSC techniques such as GP, GEP and Optimized GEP have been utilized by different researchers (Kaboli et al., 2017; Lu et al., 2017; Samadianfard et al., 2018). In fact, using these methods, an explicit mathematical equation that links the output and inputs can be generated. The relevant literature shows that there is a newer version of GP known as Multi Expression Programming (MEP) which is more accurate than other GP-based techniques (Alavi et al., 2010). Despite the many advantages of this technique over the other variants of GP, MEP is still very new in energy demand prediction.

The most significant aim of this paper is to generate a mathematical equation for forecasting the electricity consumption of China using MEP based on the data sets collected from 1991 to 2019. To date, the application of MEP has not been considered in the context of energy demand estimation. This is the first study that utilizes MEP which is not only applicable for China, but it can be used internationally for any country. It is a generic and universal approach. This technique can help decision makers of any country to estimate electricity consumption accurately. In order to show the accuracy of MEP, it is compared with other methods including GEP, ANFIS, and regression.

The rest of this paper is structured as follows: Section 2 reviews the relevant literature. Then, explanations of GP, GEP, MEP and the proposed steps are provided in Section 3. The developed mathematical model is presented in Section 4. Section 5 discusses the assessment results of the model. In Section 6, a sensitivity analysis is conducted. Managerial implications are described in Section 7. Finally, conclusions are given in Section 8.

Section snippets

Literature review

As mentioned earlier, there are three main categories of methods for energy demand forecasting. In this section, some of the studies are presented.

Methodology

This section comprises four sub-sections including an overview of GP, GEP, MEP and the proposed steps.

Formulation of electricity consumption of China

MEP was utilized to develop a mathematical formulation for the electricity consumption of China with respect to the data collected from 1991 to 2019. As mentioned before, the process was started by defining the most important inputs. In this study, Population (P), Gross Domestic Product (GDP), Import (Im), and Export (Ex) of China were determined as the independent variables, and Electricity Consumption (EC) was the dependent variable as illustrated in Fig. 4. The recorded data of China in the

Performance validation of the generated MEP mathematical model

This section includes four different sub-sections. Firstly, the MEP model was evaluated by applying different statistical tests. Secondly, a comparison between the accuracy of the MEP model and GEP (as one of the well-known GP variants) was done. Thirdly, a comparison between the MEP model (as an EPSC technique) and ANFIS (as one of the powerful NPSC methods) was carried out. In addition, the accuracy of the MEP model was compared with regression (as a traditional prediction technique).

Sensitivity analysis

Sensitivity analysis of the inputs or independent variables that influence the electricity consumption of China was performed to demonstrate the contributions of the utilized predictive factors in prediction. To this end, after finding the optimized parameters of MEP, the MEP structure was run thirty times to obtain the frequency of each independent variable in contributing to the fitness of the MEP programs over the thirty runs (Alavi et al., 2010). The sensitivity analysis showed that

Managerial implications

One of the important features of any prediction model is its applicability for decision makers or managers in real world. As mentioned before, ANFIS and other NPSC techniques are considered as a black box which means they can only forecast electricity consumption without providing any mathematical model. As there is no mathematical equation that links the dependent variable and independent variables, managers need to understand the neural-based structure of ANFIS. However, the structure of

Conclusions

In the past, many computer aided techniques such as ANN, ANFIS, etc., have been developed to predict energy consumption. These techniques only present a list of values for an output using a neural-based structure which consists of intangible information. Arguably, these techniques are less useful for policy makers and may make them more confused (due to the need for deep knowledge to understand a neural-based structure and its elements). In real world, a more tangible way to predict and

CRediT authorship contribution statement

Alireza Fallahpour: Writing - original draft, Writing - review & editing. Kuan Yew Wong: Writing - original draft, Writing - review & editing. Srithar Rajoo: Writing - original draft, Writing - review & editing. Guangdong Tian: Writing - original draft, Writing - review & editing.

Declaration of competing interest

The authors declare that they have no conflict of interest.

Acknowledgement

The authors are grateful to Universiti Teknologi Malaysia (UTM) (Vote Number: 20H69). In addition, this work is supported in part by Natural Science Foundation of China Under Grant No. 51775238 and 52075303, Science and technology development project of Jilin Province under Grant Nos 20180101060JC and 20180101058JC.

References (46)

  • A. Golbraikh et al.

    Beware ofq2!

    J. Mol. Graph. Model.

    (2002)
  • S.H.A. Kaboli et al.

    Long-term electric energy consumption forecasting via artificial cooperative search algorithm

    Energy

    (2016)
  • S.H.A. Kaboli et al.

    Long-term electrical energy consumption formulating and forecasting via optimized gene expression programming

    Energy

    (2017)
  • M.-R. Kazemzadeh et al.

    A hybrid data mining driven algorithm for long term electric peak load and energy demand forecasting

    Energy

    (2020)
  • M. Khashei et al.

    A novel hybridization of artificial neural networks and ARIMA models for time series forecasting

    Appl. Soft Comput.

    (2011)
  • E.S. Mostafavi et al.

    A novel machine learning approach for estimation of electricity demand: an empirical evidence from Thailand

    Energy Convers. Manag.

    (2013)
  • S.M. Mousavi et al.

    Gene expression programming as a basis for new generation of electricity demand prediction models

    Comput. Ind. Eng.

    (2014)
  • K. Muralitharan et al.

    Neural network based optimization approach for energy demand prediction in smart grid

    Neurocomputing

    (2018)
  • S.S. Pappas et al.

    Electricity demand loads modeling using AutoRegressive Moving Average (ARMA) models

    Energy

    (2008)
  • S. Samadianfard et al.

    Wavelet neural networks and gene expression programming models to predict short-term soil temperature at different depths

    Soil Tillage Res.

    (2018)
  • D.H. Vu et al.

    A variance inflation factor and backward elimination based robust regression model for forecasting monthly electricity demand using climatic variables

    Appl. Energy

    (2015)
  • Q. Wang et al.

    Forecasting energy demand in China and India: using single-linear, hybrid-linear, and non-linear time series forecast techniques

    Energy

    (2018)
  • X. Wang et al.

    Estimates of energy consumption in China using a self-adaptive multi-verse optimizer-based support vector machine with rolling cross-validation

    Energy

    (2018)
  • Cited by (24)

    • Low-dimensional representation of monthly electricity demand profiles

      2023, Engineering Applications of Artificial Intelligence
      Citation Excerpt :

      Specifically, in the retailer arena, data-based decision making has gained increasing interest in methods for long-term load forecasting, energy procurement strategies, pricing schemes, and risk management in the retail market (Liu et al., 2020; Yang et al., 2018). Some authors have done some related works on the long-term evolution of electric demand (Fallahpour et al., 2021; Sánchez-Durán et al., 2019b) and the solar production (Sánchez-Durán et al., 2019a), and also in featuring electricity demand using spectral analysis (Luque et al., 2020). The characterization of load curves has highly exploited the developments in the data mining of time series, with techniques generally categorized into representation and indexing, similarity measure, segmentation, visualization, and mining (Fu, 2011).

    View all citing articles on Scopus
    View full text