Skip to Main Content
Purpose

This paper aims to obtain accurate forecasts of the hourly residential natural gas consumption, in Egypt, taken into consideration the volatile multiple seasonal nature of the gas series. This matter helps in both minimizing the cost of energy and maintaining the reliability of the Egyptian power system as well.

Design/methodology/approach

Double seasonal autoregressive integrated moving average-generalized autoregressive conditional heteroskedasticity model is used to obtain accurate forecasts of the hourly Egyptian gas consumption series. This model captures both daily and weekly seasonal patterns apparent in the series as well as the volatility of the series.

Findings

Using the mean absolute percentage error to check the forecasting accuracy of the model, it is proved that the produced outcomes are accurate. Therefore, the proposed model could be recommended for forecasting the Egyptian natural gas consumption.

Originality/value

The contribution of this research lies in the ingenuity of using time series models that accommodate both daily and weekly seasonal patterns, which have not been taken into consideration before, in addition to the series volatility to forecast hourly consumption of natural gas in Egypt.

Natural gas constitutes one of the keystones of both economic and social development, as it plays an essential role in reducing pollution and maintaining environmental cleanness. It is commonly used for many purposes such as heating, electricity generation, transportation, cooking and cooling. All these usages give it a wide prominence in economic, social and technological developments in any country.

Pursuant to the energy information administration of the USA, Egypt is one of the largest natural gas producers in the continent. However, it is worth noting that, during the past two decades, Egyptian total demand for energy has grown at an annual rate of 4.6% and there is an expectation for more increase. Consequently, adopting energy management policies becomes very essential (African Development Bank Group, 2010).

It is well-known that decision-makers, around the world, widely use scientific forecasting methods for making such a scientific plan for development. Policymaking in the field of natural gas is not an exception. For instance, determining accurate forecasts of natural gas consumption for one day ahead is crucial for the economical and reliable operation of the distributed network.

Natural gas consumption is mainly influenced by seasonal effects (daily and weekly cycles as well as calendar holidays) and could be classified into three main types; residential, industrial and commercial consumption. The fundamental features of these types usually differ owing to the fact that each type has a seasonal cycle related to its characteristics. For instance, residential consumption is expected to increase at weekends compared to working days. While both commercial and industrial consumption is expected to increase on working days.

Moreover, it is very important to clarify that multiple seasonal patterns are also apparent in the Egyptian residential gas consumption series. In this context, it can be seen, through the series, that daily seasonal pattern is apparent in the similarity of hourly consumption from one day to the next, while a weekly seasonal pattern is apparent in the similarity of the daily consumption existing week after week. Accordingly, a double seasonal autoregressive integrated moving average (DSARIMA) model can be used to capture those seasonal patterns. Furthermore, the generalized autoregressive conditional heteroskedasticity (GARCH) model may be also used to capture the conditional volatility heteroskedasticity observed in the series.

Consequently, in this study, a DSARIMA-GARCH model is used to study the Egyptian hourly residential natural gas consumption. The DSARIMA model is used to accommodate both daily and weekly seasonal patterns whereas the conditional volatility behavior is captured by the GARCH model. The remainder of this research is organized as follows; Section 2 reviews the previous studies investigating the usage of autoregressive integrated moving average (ARIMA) model in forecasting gas consumption, Section 3 describes the Egyptian residential gas consumption series followed by an illustration of the DSARIMA-GARCH model in Section 4. In Section 5, the proposed model of the Egyptian gas consumption is introduced while Section 6 concludes the study and clarifies some points to be studied in the future.

ARIMA models are extensively used in forecasting based on previously observed values of a time series. It has been used, in many preceding studies, to obtain accurate forecasting of natural gas consumption. In this part, an investigation of some studies is introduced to clarify the essential role of this model in making an accurate forecasting.

In their study, Liu and Lin (1991) used the ARIMA model to forecast natural gas consumption in Taiwan. The model was used with some exogenous variables such as temperature and prices. The study was conducted based on monthly data and monthly as well as quarterly forecasts were obtained. In 2004, the ARIMA model was used with the sake of forecasting the natural gas consumption in some residential areas of Eskisehir in Turkey. Based on a monthly data, the ARIMA model was used with separate autoregressive (AR) models to estimate both heating and non-heating months to capture the seasonal patterns (Aras and Aras, 2004). In his research, Al-Fattah (2006) adopted the use of ARIMA models to predict the USA natural gas production and annual depletion.

In 2010, the ARIMA models were used to forecast monthly Turkish natural gas consumption based on monthly data covering the period from 1987 to 2007 (Erdogdu, 2010). Moreover, Akkurt et al. (2010) used the Seasonal ARIMA (SARIMA) model, which is an extension of ARIMA model, in a comparative study with the linear regression model to forecast Turkish natural gas consumption in a monthly and yearly base through a time-series data having single seasonal pattern.

In their study, SARIMA model provided better results compared with the regression model. In other words, the values of the mean absolute percentage error (MAPE) and the mean absolute deviation of the forecasts that produced from time series models were lower than those corresponding of the linear regression. Furthermore, based on the historical monthly gas consumption, in Bangladesh, during the period (1980–2008), the ARIMA model was used to estimate the gas consumption in 2014 (Faisal, 2012).

Recently, different statistical methods among ARIMA models are used, in two studies, to forecast natural gas consumption in Turkey using residential and commercial data. The results of these studies demonstrate the use of ARIMA models to obtain an accurate natural gas consumption forecasting (Taspinar et al., 2013; Akpinar and Yumusak, 2016).In addition, they are also proved to be accurate in forecasting gas consumption in India and Iran (Pradhan et al., 2016; Jafari and Sadigh, 2019).

ARIMA model is not used to forecast the Egyptian gas consumption. In addition, the volatile multiple seasonal nature of the gas series is not taken into consideration in the previous studies. Hence, DSARIMA, as an extension of ARIMA model, is used in this study to accommodate daily and weekly seasonal patterns and combined with GARCH model to capture the volatility of the series to obtain accurate forecasts of the Egyptian residential gas consumption.

The residential natural gas consumption, in Egypt, is analyzed based on hourly data of a one year, specifically from the first Sunday (1st of January) to the last Sunday (31th of December) of 2017. All the data, with 8,760 hourly observations, is used to estimate the model. In this context, it is worth noting that the first month of 2018, with 744 hourly observations, is also obtained for the sake of being used to evaluate the forecasting accuracy of the model.

Figure 1 presents a time series plot of a part of the data. It clarifies the Egyptian hourly residential gas consumption series from Wednesday, 1 March 2017, to Friday, 31 March 2017. It can be seen that hours from 1 to 24 represent the 24 h of the first day of this part of the time series while the hours from 24 to 48 illustrate the second day, etc. A typical daily seasonal pattern is apparent in the similarity of the consumption from one day to the next. Moreover, a weekly seasonal pattern is apparent as well. It can be shown that working days show similar patterns of consumption, while weekends have different natural gas consumption. Weekends have a residential natural gas consumption, that is higher than the corresponding of the working days.

Figure 1.

Time plot of the Egyptian hourly gas consumption series from Wednesday 1 March 2017 to Friday 31 March 2017

Figure 1.

Time plot of the Egyptian hourly gas consumption series from Wednesday 1 March 2017 to Friday 31 March 2017

Close modal

DSARIMA-GARCH models are used to model time-series data that contains two seasonal cycles. This can be conducted by representing the current values of the series as a combination of past seasonal values and past seasonal errors. However, this is with the condition that there are one or more observations, in the series, for which the variance of the current error term is a function of the previous time periods’ error or previous error variance.

A DSARIMA-GARCH model, which can be denoted as ARIMA (p,d,q)(P1,D1,Q1)s1(P2,D2,Q2)s2−GARCH(r,s), could be written as:

(1)

with error term εt splits into a stochastic piece ηt and a time-dependent standard deviation σt so that:

(2)

where ηt is independent and identical distributed with zero mean and variance equal to one and σt2 is modeled as follows:

(3)

yt is a time series, s1 and s2 are the first and second seasonal periods,∇d is the nonseasonal differencing operator while siD is the seasonal differencing operator; B is the backward shift operator such that Biyt = yt−1; ϕp(B) represents the nonseasonal AR polynomial of order p and θq(B) represents the nonseasonal moving average polynomial of order q, likewise ΦP1(Bs1) and ΩP2(Bs2) are seasonal AR polynomials of orders P1 and P2 for the first and second seasonal cycles respectively; and ΘQ1(Bs1) and ΨQ2(Bs2) are seasonal moving average polynomials of orders Q1 and Q2 respectively. r is the order of the GARCH terms and s is the order of the ARCH terms.

The multiplicative SARIMA model has been introduced by Box and Jenkins (1976) to capture only one seasonal pattern. SARIMA model is denoted as ARIMA (p, d, q) (P, D, Q)s where p and P are the number of non-seasonal and seasonal AR terms respectively, d is the order of the difference needed to render stationarity while D is the order of seasonal differencing, q is the number of moving average terms that represent the past errors, Q is the number of seasonal moving average terms and s is the seasonal period. SARIMA model could be expressed as:

(4)

SARIMA model has been extended to DSARIMA model by Taylor (2003) to accommodate two seasonal patterns (daily and weekly). Therefore, second seasonal AR and moving average terms and second seasonal differencing operator have been added to SARIMA model. The multiplicative DSARIMA model, which is denoted as ARIMA (p, d, q) (P1, D1, Q1)s1 (P2, D2, Q2)s2, is expressed as follows:

(5)

In his research, Engle (1982) introduced the autoregressive conditional heteroscedasticity (ARCH) model that has been extended later by Bollerslev (1986) to the GARCH model. The ARCH model assumes that the conditional variances of the error are described by a function of previous squared errors and could be used when the error variance is assumed to follow AR model. To model a time series using an ARCH process, let εt denote the error term. This εt consists of a stochastic piece ηt and a time-dependent standard deviation σt as follows:

(6)

where ηtN (0, 1) i.i.d. Then, σt2 is modeled by:

(7)

GARCH model is a generalization of ARCH model. It is used when the error variance assumed to follow autoregressive moving average model (ARMA) model. The conditional variance is not only explained by past squared errors but also by past conditional variances. The GARCH (r, s) model for a series yt is given by:

(8)

Stationarity and invertibilty are important concepts in analyzing time series. A time series that follows DSAIMA-GARCH model is stationary if the absolute roots of the nonseasonal and seasonal AR polynomials ϕp(B) = 0,ΦP1(Bs1)=0 and ΩP2(Bs2)=0 are greater than 1.While invertibility is a model characteristic on the contrary to stationarity, which is a time series characteristics. DSARIMA-GARCH model is invertible if the absolute roots of the nonseasonal and seasonal moving average polynomials θq(B) = 0, ΘQ1(Bs1)=0 and ΨQ2(Bs2)=0 are greater than 1 (Box et al., 1994).

In other words, there are several constraints on the parameters of the DSARIMA-GARCH model. The roots of the polynomials ϕp(B),ΦP1(Bs1), ΩP2(Bs2), θq(B), ΘQ1(Bs1) and ΨQ2(Bs2) should be outside the unit circle to satisfy the sationarity and invertibility conditions for the DSARIMA part, while the constrains αi > 0, i = 0, …, s and βj > 0, j = 1, …, r are assumed to guarantee the positivity of the conditional variance for the GARCH part (Bollerslev, 1986; Brooks, 2008; Tsay, 2010).

Different DSARIMA-GARCH models are estimated to forecast hourly residential natural gas consumption in Egypt. Firstly, to identify a suitable DSARIMA model, the estimated autocorrelation function (ACF) of the Egyptian consumption series is plotted. The ACF is a tool for describing the pattern within a time series. It measures the correlation between a set of observations and a lagged set of observations of a time series (Vandaele, 1983; Pankratz, 1983).

An important consideration in time series models is whether the data is stationary. If the series is non-stationary, one needs to stationarize it by taking differences or transformations to render stationarity. The estimated ACF of the series is used to check the stationarity. Figure 2 shows the estimated ACF of the hourly Egyptian natural gas consumption series. It is clear from the estimated ACF the presence of the daily seasonal pattern. The same pattern was repeated every 24 point, hence, a daily seasonal differencing (D1 = 1, s1 = 24) is considered to transform the non-stationary series that results from that daily pattern into a stationary one. Plotting the ACF after the daily seasonal differencing, Figure 3 shows another seasonal pattern, which is the weekly seasonal pattern. Spikes at lags equal to 168 and multiples are observed; therefore, the weekly seasonal differencing (D2 = 1, s2 = 168) is also considered.

Figure 2.

ACF of the hourly Egyptian residential consumption

Figure 2.

ACF of the hourly Egyptian residential consumption

Close modal
Figure 3.

ACF of the Egyptian residential consumption after the daily differencing

Figure 3.

ACF of the Egyptian residential consumption after the daily differencing

Close modal

Figure 4 shows the estimated ACF after the daily and weekly seasonal differencing, which indicates that the series becomes stationary after eliminating the daily and weekly seasonal patterns. Lag polynomials up to order three are considered for the seasonal AR and moving average polynomials. Different DSARIMA models are estimated using maximum likelihood method. To estimate the parameters of the models and to check the forecasting performance, the whole year of 2017 hourly gas consumption is used as in-sample period to estimate DSARIMA models’ parameter, while the first month of 2018 hourly consumption put aside as out-of-sample period to evaluate the accuracy of the forecasts.

Figure 4.

ACF of the Egyptian residential consumption after the daily and weekly differencing

Figure 4.

ACF of the Egyptian residential consumption after the daily and weekly differencing

Close modal

A Schwartz Bayesian criterion (SBC) is used to select a model from the different DSARIMA models. SBC is known as the Schwartz criterion and could be defined as:

(9)

where n is the sample data size, L ^ is the maximum likelihood estimate of the model and k is the number of parameters in the model. By choosing the model corresponding to the minimum value of SBC, one is selecting the model, that is, not over-fitted the data and has the highest probability (Schwarz, 1978). The SBC for the different orders of DSARIMA models are obtained. DSARIMA (3, 0, 2) (3, 1, 1)24(3, 1, 2)168 model is selected, as it has the lowest SBC.

Secondly, to treat the volatility of the series, the squared residuals of the selected DSARIMA model are checked. Plotting the ACF and the partial ACF(PACF) of the squared residuals, as shown in Figures 5 and 6, confirmed the existence of GARCH effect. The PACF is similar to the ACF. It is also a tool to represent the statistical relationship between sets of ordered pairs of a time series.

Figure 5.

ACF of the squared residuals of DSARIMA model

Figure 5.

ACF of the squared residuals of DSARIMA model

Close modal
Figure 6.

PACF of the squared residuals of DSARIMA model

Figure 6.

PACF of the squared residuals of DSARIMA model

Close modal

Moreover, Table 1 summarizes the results of the Lagrange multiplier (LM) test to test the ARCHeffect. It clearly emphasizes on the presence of ARCH effect. The p-value is relatively small and less than the significance level 0.05. Hence, GARCH model is used in conjunction with DSARIMA to capture the volatility of the conditional variance.

Table 1.

LM test for heteroskedasticity

LagTest statisticp-value
1538.7906<0.0001
2574.9881<0.0001
3576.8938<0.0001
4578.9294<0.0001
5578.9402<0.0001
6579.1947<0.0001

Lag order up to order two is often considered for GARCH models. Based on Schwartz criterion, DSARIMA (3, 0, 2) (3, 1, 1)24(3, 1, 2)168 + GARCH (1, 1) model yield the lowest SBC estimate among the different orders. The estimated results of this model are present in Table 2.

Table 2.

Estimated parameters for DSARIMA (3, 0, 2) (3, 1, 1)24(3, 1, 2)168-GARCH (1, 1) model

ParameterEstimateStd. errorp-value
Φ11.56690.0431<0.0001
Φ2−1.33170.0627<0.0001
Φ30.40120.0285<0.0001
Φ40.16120.0139<0.0001
Φ50.10990.0127<0.0001
Φ60.06540.0121<0.0001
Ω10.69480.1156<0.0001
Ω2−0.01360.01780.4448
Ω30.03740.01560.0168
θ11.02420.0394<0.0001
θ2−0.81940.0396<0.0001
Θ10.90090.0086<0.0001
Ψ11.51650.1159<0.0001
Ψ2−0.54060.1070<0.0001
α10.00070.0030<0.0001
α20.51400.0250<0.0001
β10.27400.0203<0.0001

Hence, the estimated equations for the mean and the variance of the selected DSARIMA-GARCH model are

(10)

and

(11)

The MAPE is used then to check the accuracy of the forecasts produced by the identified model. The MAPE is calculated based on the difference between the actual and the forecasted values as follows:

(12)

where yi and ŷi are the actual and the forecasted values respectively, while n is the number of forecasted values(Taylor, 2003; Taylor et al., 2006). The MAPE is calculated for 24 h-ahead, i.e. one day ahead, a week ahead and a month ahead. The percentages are 5.88%, 8.78% and 11.42% respectively. The MAPE for the different time periods are small. In addition, Figures 7 and 8 show how close the forecasts produced by the selected DSARIMA-GARCH model to the actual values of the Egyptian natural gas consumption series. This demonstrates the usage of DSARIMA-GARCH model that accommodates the volatile daily and weekly seasonal patterns of the Egyptian series in providing precision gas consumption forecasts to serve the energy power sector in Egypt.

Figure 7.

Time plot of the actual vs the forecasts values up to one week ahead

Figure 7.

Time plot of the actual vs the forecasts values up to one week ahead

Close modal
Figure 8.

Box plot of the actual vs the forecasts values

Figure 8.

Box plot of the actual vs the forecasts values

Close modal

It is very important, for any country, to take into account the rapid increase of the natural gas consumption in their policies. Energy policymakers use forecasting methods as essential tools for adopting management policies. Providing an accurate forecasting of natural gas consumption for few hours ahead helps a lot in both maintaining and controlling the energy sector.

An important feature of the residential natural gas consumption series is the presence of both daily and weekly seasonal cycles. In this research, it is found, through the Egyptian residential gas consumption series, that the daily gas consumption has the same pattern every day. In addition, consumption, on weekends, is observed to be similar on different weeks but different compared to the other working days. Considering this feature can improve the accuracy of the forecasting.

Accordingly, DSARIMA-GARCH model is investigated in accommodating those daily and weekly seasonal patterns and capturing the volatility behavior of the Egyptian residential gas consumption series at the same time. Different orders of DSARIMA-GARCH models are estimated using maximum likelihood method. DARIMA-GARCH model with the lowest SBC is selected. The forecasts produced by the selected model are accurate for one day ahead, one week ahead and a month ahead. Hence, the selected model is recommended for forecasting the Egyptian residential natural gas consumption. It would be of a great importance for energy policymakers in Egypt.

For future research, DSARIMA-GARCH models could be used with different external variables such as temperature and weather variables. Further, different statistical approaches such as Bayesian approaches, could be investigated and compared with this approach in forecasting the Egyptian residential gas consumption.

African Development Bank Group
(
2010
),
The Arab Republic of Egypt: Power Sector in Brief – 2010
,
African Development Bank (AFDB)
.
Akkurt
,
M.
,
Demirel
,
O.
and
Zain
,
S.
(
2010
), “
Forecasting turkey’s natural gas consumption by using time series methods
”,
European Journal of Economic and Political Studies
, Vol.
3
No.
2
, pp.
1
-
21
.
Akpinar
,
M.
and
Yumusak
,
N.
(
2016
), “
Year ahead demand forecasting city natural gas using seasonal time series methods
”,
Energies
, Vol.
9
No.
9
, pp.
1
-
17
.
Al-Fattah
,
S.M.
(
2006
), “
Time series modeling for U.S. Natural gas forecasting
”,
E-Journal of Petroleum Management Economics
, Vol.
1
, pp.
1
-
17
.
Aras
,
H.
and
Aras
,
N.
(
2004
), “
Forecasting residential natural gas demand
”,
Energy Sources
, Vol.
26
No.
5
, pp.
463
-
472
.
Bollerslev
,
T.
(
1986
), “
Generalized autoregressive conditional heteroskedasticity
”,
Journal of Econometrics
, Vol.
31
No.
3
, pp.
307
-
327
.
Box
,
G.E.
and
Jenkins
,
G.M.
(
1976
),
Time Series Analysis: Forecasting and Control
,
Revised ed
.,
Holden-day
,
San Francisco
.
Box
,
G.E.
,
Jenkins
,
G.M.
and
Reinsel
,
G.C.
(
1994
),
Time Series Analysis: Forecasting and Control
, (3rd ed.) ,
Prentice Hall
,
Englewood Cliffs, NJ
.
Brooks
,
C.
(
2008
),
Introductory Economics for Finance
, (2nd ed.) ,
Cambridge University Press
,
NewYork
.
Engle
,
R.F.
(
1982
), “
Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation
”,
Econometrica
, Vol.
50
No.
4
, pp.
987
-
1007
.
Erdogdu
,
E.
(
2010
), “
Natural gas demand in Turkey
”,
Applied Energy
, Vol.
87
No.
1
, pp.
211
-
219
.
Faisal
,
F.
(
2012
), “
Time series ARIMA forecasting of natural gas consumption in Bangladeshs power sector
”,
Elixir International Journal
, Vol.
49
, pp.
9985
-
9990
.
Jafari
,
F.D.
and
Sadigh
,
R.
(
2019
), “
Modeling and forecasting residential natural gas demand in Iran
”,
Revista Gestão and Tecnologia
, Vol.
19
No.
4
, pp.
33
-
57
.
Liu
,
L.M.
and
Lin
,
M.W.
(
1991
), “
Forecasting residential consumption of natural gas using monthly and quarterly time series
”,
International Journal of Forecasting
, Vol.
7
No.
1
, pp.
3
-
16
.
Pankratz
,
A.
(
1983
),
Forecasting with Univariate Box- Jenkins Models: Concepts and Cases
,
John Wiley and Sons
,
New York, NY
, p.
576
.
Pradhan
,
P.
,
Nayak
,
B.
and
Dhal
,
S.D.
(
2016
), “
Time series data prediction of natural gas consumption using ARIMA model
”,
International Journal of Information Technology and Management Information System
, Vol.
7
No.
3
, pp.
1
-
7
.
Schwarz
,
G.
(
1978
), “
Estimating the dimension of a model
”,
The Annals of Statistics
, Vol.
6
No.
2
, pp.
461
-
464
.
Taspinar
,
F.
,
Celebi
,
N.
and
Tutkun
,
N.
(
2013
), “
Forecasting of daily natural gas consumption on regional basis in Turkey using various computational methods
”,
Energy and Buildings
, Vol.
56
, pp.
23
-
31
.
Taylor
,
J.W.
(
2003
), “
Short-Term electricity demand forecasting using double seasonal exponential smoothing
”,
Journal of the Operational Research Society
, Vol.
54
No.
8
, pp.
799
-
805
.
Taylor
,
J.W.
,
de Menezes
,
L.M.
and
McSharry
,
P.
(
2006
), “
A comparison of univariate methods for forecasting electricity demand up to a day ahead
”,
International Journal of Forecasting
, Vol.
22
No.
1
, pp.
1
-
16
.
Tsay
,
R.S.
(
2010
),
Analysis of Financial Time Series
, (3rd ed.) ,
John Wiley and Sons
,
Hoboken, NJ
.
Vandaele
,
W.
(
1983
),
Applied Time Series and Box-Jenkins Models
,
Academic Press
,
New York, NY
.
Published in Journal of Humanities and Applied Social Sciences. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode

or Create an Account

Close Modal
Close Modal