Forecasting total energy demand based on the DGM(1,N) model

Hou, Meiru; Zhang, Jun; Dong, Siqi

doi:10.1108/MAEM-06-2025-0017

Purpose

This study aims to explore the key drivers affecting China’s total energy demand through quantitative analysis methods, and to construct a high-precision grey prediction model to forecast China’s total energy demand, with a view to providing a scientific basis for optimizing the energy demand structure and formulating sustainable development policies.

Design/methodology/approach

The study selects the national total energy demand and related macro indicators (GDP overall amount, total population, coal demand, the proportion of secondary industry output and the elasticity coefficient of energy demand) from 2010 to 2021, screens the core influencing factors through grey relational analysis and establishes a DGM(1,4) model to make predictions.

Findings

Grey relational analysis shows that GDP overall amount (correlation 0.9281), total population (0.8386) and coal demand (0.8061) have the most significant effect on total energy demand, while the elasticity coefficient of energy demand, the proportion of secondary industry output and the elasticity coefficient of energy demand have lower correlations. The prediction results of the DGM(1,4) model are well fitted to the true values, and the mean absolute percentage error (MAPE) is 0.06%, indicating that the model has high prediction accuracy.

Originality/value

GDP overall amount, total population and coal demand are the core factors driving the growth of energy demand in China, and the DGM(1,4) model is suitable for medium- and long-term energy demand forecasting, but the accuracy can be further improved by introducing time lag effects or intelligent optimization algorithms. This study provides data support for energy policymaking, and the model can be optimized in the future by combining more dynamic factors to achieve more accurate forecasting analysis.

1. Introduction

1.1 Background and motivation

Achieving carbon peak by 2030 and carbon neutrality by 2060 (Luo et al., 2024) represents China’s dual carbon goals—a crucial commitment to addressing global climate change and a key strategy for driving high-quality economic development and energy structure transformation. In this context, the development trend of China’s energy demand will profoundly affect the country’s emission reduction path, industrial policy adjustment and international climate governance cooperation. Currently, as the world’s largest energy consumer and carbon emitter, changes in China’s energy demand not only concern the success or failure of the domestic green low-carbon transition, but also have great significance for the global carbon emission reduction process. In recent years, China’s total energy consumption has continued to grow, but the energy structure has been gradually optimized, and the proportion of renewable energy has increased significantly. However, the traditional high energy-consuming industries still occupy an important position, coupled with the continuous progress of urbanization and industrialization, energy demand will continue to face growth pressure for a period of time in the future. At the same time, technological progress, policy regulation and market-based instruments (e.g. carbon trading system) are becoming the core driving forces for energy efficiency improvement and carbon emission reduction. Against this complex background, scientific forecasting of the future trend of China’s energy demand and analysis of its driving factors and potential challenges are of great significance to the formulation of reasonable emission reduction policies and optimization of energy resource allocation (Hao et al., 2018).

Based on this, this paper selects the national total energy demand and related macro indicators from 2010 to 2021, screens the core influencing factors through grey correlation analysis, and establishes a DGM(1,4) model for prediction. The study not only improves the theoretical basis for the formulation of energy structure adjustment policies, but also provides methodological support for the accurate prediction of energy demand under the goal of “dual-carbon”, which provides data support for the formulation of differentiated regional energy policies by the governmental departments. The study not only provides a theoretical basis for the formulation of energy restructuring policies, but also provides methodological support for the accurate prediction of energy demand under the goal of “dual-carbon”, which in turn provides data support for the governmental departments to formulate differentiated regional energy policies.

1.2 Literature review

Energy serves as the cornerstone of national development security. Rational analysis and precise forecasting of factors influencing energy demand and their evolving trends provide crucial reference for formulating scientific energy policies and rationally planning energy construction projects. In recent years, scholars both domestically and internationally have conducted extensive research on the determinants of energy demand, focusing primarily on economic factors, industrial structure, population and urbanization, energy prices, and regional disparities. Economic growth is the most fundamental driver of energy demand. Research indicates a significant positive correlation between energy consumption and GDP. Hidayat et al. (2024) conducted a regression analysis on panel data from over ten ASEAN countries, demonstrating that energy consumption exhibits a significant positive coefficient for real GDP growth. However, with economic restructuring and technological advancement, the correlation between energy demand and economic growth may weaken, leading to a phenomenon known as “decoupling” (Gao et al., 2020). Research indicates that industrial structure is a key structural factor influencing energy demand: the higher the proportion of secondary industries—particularly heavy and chemical industries—the greater the energy consumption intensity tends to be (Sun and Gao, 2021). Population size and urbanization processes also directly influence total energy consumption. Population growth expands the baseline demand for residential energy use (Wei and Shen, 2019). Simultaneously, urbanization significantly increases energy demand by altering lifestyles, enhancing infrastructure density, and expanding industrial activities (Zhou et al., 2018). Energy prices serve as a crucial economic lever for regulating energy demand. Theoretically, rising energy prices suppress consumption while promoting energy conservation and substitution. It is noteworthy that the factors influencing energy demand exhibit significant regional heterogeneity. Resource-rich regions, economically developed areas, and underdeveloped regions demonstrate distinct energy consumption patterns and driving mechanisms (Liu and Lu, 2024).

Energy demand forecasting serves as the foundational work for energy system planning, policy formulation, and market operations. With the acceleration of the global energy transition and the introduction of the “dual carbon” goals, energy demand exhibits complex characteristics of nonlinearity, non-stationarity, and high volatility, placing higher demands on forecasting methodologies. In recent years, research on energy demand forecasting has primarily been categorized into three major types: traditional statistical and econometric models, AI prediction model and grey models. Autoregressive Integrated Moving Average (ARIMA) models and their variants, such as SARIMA, have been widely applied in short-term energy demand forecasting. These models extrapolate based on the statistical characteristics of historical data and are suitable for stationary time series. Li et al. (2022) employed time series analysis to conduct short-term forecasts of daily coal consumption at coastal power plants, thereby enhancing the accuracy of electricity coal consumption predictions. Multiple linear regression (MLR) and econometric models predict energy demand by establishing functional relationships between demand and influencing factors such as GDP, population, and prices. Li et al., (2025) developed a stepwise regression-based double-log demand function forecasting method that accurately captures nonlinear inflection points in natural gas demand trends, demonstrating strong predictive performance. However, this model relies on variable selection and the validity of model assumptions, and is sensitive to multicollinearity and heteroscedasticity. With the increase in data volume and the advancement of computational capabilities, machine learning methods represented by artificial neural networks (ANNs) have rapidly developed in the field of energy forecasting due to their powerful nonlinear fitting capabilities. Machado et al. (2021) applied feedforward neural networks (FFNN) to short-term electricity load forecasting in industrial zones. They noted that FFNNs rely solely on current input vectors and lack internal state retention of historical sequences, resulting in insufficient memory capacity when handling load sequences with significant temporal dependencies. Shanmugam and Ramana (2024) developed multiple short-term electricity consumption forecasting models based on radial basis function networks (RBFNs). They validated the advantages of RBFNs in nonlinear approximation and rapid convergence, comparing them with traditional feedforward networks to highlight their model effectiveness. Deep learning models, owing to their unique gating mechanisms, effectively capture long-term dependencies in time series, significantly enhancing prediction accuracy. Congxiang et al. (2025) proposed integrating photovoltaic thermodynamics using LSTM networks. By training bidirectional LSTM (BiLSTM) models with thermal and meteorological features, prediction accuracy can be effectively enhanced. Support vector machines (SVMs) in machine learning are also frequently applied to energy consumption and demand forecasting (Morteza et al., 2023).

Although modern forecasting methods such as AI models demonstrate formidable capabilities in capturing complex nonlinear relationships, and traditional econometric models possess a theoretical foundation for analyzing causal relationships between variables, intelligent AI models involve computationally complex processes, while traditional models are sensitive to data fluctuations. Therefore, this study selects the DGM(1,N) model from the grey forecasting model as its core forecasting tool. Gray prediction models are suitable for systems characterized by “small samples and sparse information.” China’s energy system is undergoing rapid transformation. The introduction of the “dual carbon” goals has introduced new policy variables, disrupting the long-term regularity of certain historical data. Against the backdrop of limited data and potential structural changes, grey system theory specifically addresses uncertain systems characterized by “partially known and partially unknown information.” It does not require large sample sizes or typical probability distributions, enabling more effective modeling and trend extrapolation using limited annual macroeconomic data. This study selected GDP, total population, coal demand, the proportion of secondary industry output value, and the energy demand elasticity coefficient as independent variables for energy demand forecasting. It employed the grey correlation method to quantitatively describe the degree of correlation among variables, screened out indicators with higher correlation, and constructed a DGM(1,4) forecasting model based on the screening results to evaluate forecasting accuracy.

This paper is followed by Section 2, which introduces the theoretical knowledge of grey prediction model and grey correlation; Section 3, which utilizes the total national energy demand and related macro indicators from 2010 to 2021, screens the core influencing factors through grey correlation analysis, and establishes a DGM(1,4) model to make a prediction; and Section 4, which contains conclusions and recommendations.

2. Methodology

2.1 Grey relational analysis

Grey relational analysis is an important analytical tool in grey systems theory for quantifying the degree of dynamic correlation between factors within a system. The basic idea is to determine the magnitude of the degree of correlation by comparing the degree of geometric similarity between data series. If the trend of two sequences is closer, i.e. the higher the degree of synchronized change, the greater the degree of correlation between them is considered to be. This method does not require the data to obey a specific distribution, and is applicable to the degree of direct correlation of a small sample of data series for which part of the information is known and part of the information is unknown (Liu et al., 2014). The grey relational degree is used to portray the degree of influence of Chinese energy demand influencing factors on energy demand, and its calculation steps are as follows:

Perform dimensionless processing on the raw data:

To eliminate dimensional effects, the data was normalized. The calculation steps are as follows:

x^{'} = \frac{x_{i} - x_{\min}}{x_{\max} - x_{\min}}

(1)

Find the absolute difference between the reference sequence and the comparison sequence:

Δ x_{i}^{'} (k) = | x_{i}^{'} (k) - x_{0}^{'} (k) |

(2)

Calculate the grey relational coefficient between the reference sequence and the comparison sequence:

γ_{i} (k) = \frac{\min Δ x_{i}^{'} (k) + ρ \max Δ x_{i}^{'} (k)}{| x_{0}^{'} (k) - {x_{i}}^{'} (k) | + ρ \max Δ x_{i}^{'} (k)}

(3)

Here, $\min$ and $\max$ denote the minimum and maximum values of the data; $ρ$ is the resolution coefficient, typically set to 0.5.

Calculate the grey relational degree:

γ (X_{0}, X_{i}) = \frac{1}{n} \sum_{k = 1}^{n} γ_{i} (k)

(4)

Grey relational analysis $R_{0 i}$ can be able to portray the extent to which China’s energy demand influencing factors affect energy demand. Grey relation analysis $R_{0 i}$ greater, it indicates that the greater the degree of influence, the greater the importance to energy demand forecasting, and vice versa. Grey relational analysis not only quantitatively describes the degree of importance of the influencing factors to the energy demand forecast, but also determines the obvious multicollinearity problem based on the correlation.

In this study, a grey relational degree threshold of 0.8 was set to screen the core influencing factors. This established threshold is widely recognized in grey systems theory as indicative of a strong correlation (Liu et al., 2014). The inter-factor grey relational degrees exceeding 0.8 also signal the presence of significant multicollinearity among the potential influencing factors. Adhering to this threshold allows for the elimination of factors less critical to energy demand forecasting based on their correlation magnitude. This process not only identifies the most impactful drivers but also provides an effective safeguard for the construction of a robust multivariate forecasting model by mitigating the risk of multicollinearity.

2.2 Gray multivariate DGM(1,N) model

The gray multivariate DGM(1,N) model represents a first-order difference time series model with one response variable and N influencing factors. It constructs model expressions through differential and difference relationships between first-order cumulative generation and cumulative reduction generation sequences, thereby enabling model parameter estimation (Ma and Liu, 2015).

In the gray multivariate DGM(1,N) model, let the system’s characteristic behavior sequence be:

Y^{(0)} = {y^{(0)} (t)}, t = 1, 2, \dots, n

The first-order cumulative generating sequence for the system’s characteristic behavior sequence $Y^{(0)}$ is:

Y^{(1)} = y^{(1)} (t) = \sum_{k = 1}^{t} y^{(0)} (k)

The sequence of relevant influencing factors is as follows:

X_{i}^{(0)} = {x_{i}^{(0)} (t)}, i = 1, 2, \dots, N; t = 1, 2, \dots, n

The first-order cumulative generating sequence for the sequence X of relevant influencing factors is:

X_{i}^{(1)} = x_{i}^{(1)} (t) = \sum_{k = 1}^{t} x_{i}^{(0)} (k), i = 1, 2, \dots, N; t = 1, 2, \dots, n

The differential equation form of the DGM(1,N) model is given by

\frac{d y^{(1)} (t)}{d t} + a y^{(1)} (t) = \sum_{i = 1}^{N} b_{i} x_{i}^{(1)} (t) + u, t = 1, 2, \dots, n

(5)

Among these, parameter $a$ represents the system’s development coefficient, parameter $b_{i} x_{i}^{(1)} (t)$ denotes the system’s driving term, and parameter $b_{i}$ signifies the driving coefficient (Ma and Liu, 2015).

The discrete values of the gray derivative of $y^{(1)} (t)$ is

\frac{d y^{(1)} (t)}{d t} \approx \frac{y^{(1)} (t) - y^{(1)} (t - 1)}{t - (t - 1)} = y^{(0)} (t)

(6)

Substituting the discrete values of the gray derivative of $y^{(1)} (t)$ (6) into the differential equation form (5) of the DGM(1,N) model yields

y^{(1)} (t) = \frac{1}{a + 1} [y^{(1)} (t - 1) + \sum_{i = 1}^{N} b_{i} x_{i}^{(1)} (t) + u], t \geq 2

(7)

Let $β_{0} = \frac{1}{a + 1}, β_{i} = \frac{b_{i}}{a + 1}, μ = \frac{u}{a + 1}, i = 1, 2, \dots, N$

Then, the difference equation form of the DGM(1,N) model can be obtained as:

y^{(1)} (t) = β_{0} y^{(1)} (t - 1) + \sum_{i = 1}^{N} β_{i} x_{i}^{(1)} (t) + μ, t \geq 2

(8)

The parameter sequence $\hat{P} = {(β_{0}, β_{1}, β_{2}, \dots, β_{N}, μ)}^{T}$ in the differential equation form of the DGM(1,N) model can be estimated using the least squares method:

\hat{P} = {(β_{0}, β_{1}, β_{2}, \dots, β_{N}, μ)}^{T} = {(B^{T} B)}^{- 1} B^{T} Y

(9)

Among these,

B = [\begin{array}{c} y^{(1)} (1) & x_{1}^{(1)} (2) & \dots & x_{N}^{(1)} (2) & 1 \\ y^{(1)} (2) & x_{1}^{(1)} (3) & \dots & x_{N}^{(1)} (3) & 1 \\ ⋮ & ⋮ & ⋮ & ⋮ \\ y^{(1)} (n - 1) & x_{1}^{(1)} (n) & \dots & x_{N}^{(1)} (n) & 1 \end{array}], Y = [\begin{array}{l} y^{(1)} (2) \\ y^{(1)} (3) \\ ⋮ \\ y^{(1)} (n) \end{array}]

Based on the difference equation form of the DGM(1,1) model and the results of its parameter estimation (9), the time response sequence of the DGM(1,1) model can be obtained:

{\hat{y}}^{(1)} (t + 1) = β_{0}^{t} y^{(0)} (1) + \sum_{m = 0}^{t - 1} β_{0}^{m} [\sum_{i = 1}^{N} β_{i} x_{i}^{(1)} (t + 1 - m)] + \frac{1 - β_{0}^{t}}{1 - β_{0}} μ, t \geq 1

(10)

Reduce ${{\hat{y}}^{(1)} (t)}, t = 1, 2, \dots, n$ by successive subtractions to obtain

{\hat{y}}^{(0)} (t + 1) = {\hat{y}}^{(1)} (t + 1) - {\hat{y}}^{(1)} (t), t = 1, 2, \dots, n - 1

(11)

3. Empirical analysis

3.1 Data sources

In light of China’s energy situation, five specific indicators were selected as input variables, including Gross Domestic Product (GDP) and total energy demand, from the macro categories of population, economy, energy demand, and industrial structure. The total energy demand is taken as the reference series, and is denoted as $X_{1}$ ⁠. The factors influencing this are $X_{2}, X_{3}, X_{4}, X_{5}, X_{6}$ ⁠, and the total energy demand for 2010–2021 and the five factors are constructed as shown in Table 1 below, which is derived from The China Statistical Yearbook (2022).

Table 1

Raw data table of variables

Year	Total energy demand X₁ (tons of standard coal)	GDP overall amount X₂ (billions of dollars)	Percentage of output in the secondary sector X₃ (%)	Total population X₄ (in tens of thousands)	Coefficient of elasticity of energy demand X₅	Demand for coal X₆ (tons of standard coal)
2010	360648.00	412119.30	57.40	134091.00	0.69	249568.42
2011	387043.00	487940.20	52.00	134916.00	0.76	271704.19
2012	402138.00	538580.00	50.00	135922.00	0.49	275464.53
2013	416913.00	592963.20	48.50	136726.00	0.47	280999.36
2014	428334.00	643563.10	45.60	137646.00	0.36	281843.77
2015	434113.00	688858.20	39.70	138326.00	0.19	276964.09
2016	441492.00	746395.10	36.00	139232.00	0.25	274608.02
2017	455827.00	832035.90	34.20	140011.00	0.46	276231.16
2018	471925.00	919281.10	34.40	140541.00	0.52	278435.75
2019	487488.00	986515.20	32.60	141008.00	0.55	281280.58
2020	498314.00	1013567.00	43.30	141212.00	1.00	283540.67
2021	524000.00	1143669.70	38.40	141260.00	0.64	293440.00

Year	Total energy demand X₁ (tons of standard coal)	GDP overall amount X₂ (billions of dollars)	Percentage of output in the secondary sector X₃ (%)	Total population X₄ (in tens of thousands)	Coefficient of elasticity of energy demand X₅	Demand for coal X₆ (tons of standard coal)
2010	360648.00	412119.30	57.40	134091.00	0.69	249568.42
2011	387043.00	487940.20	52.00	134916.00	0.76	271704.19
2012	402138.00	538580.00	50.00	135922.00	0.49	275464.53
2013	416913.00	592963.20	48.50	136726.00	0.47	280999.36
2014	428334.00	643563.10	45.60	137646.00	0.36	281843.77
2015	434113.00	688858.20	39.70	138326.00	0.19	276964.09
2016	441492.00	746395.10	36.00	139232.00	0.25	274608.02
2017	455827.00	832035.90	34.20	140011.00	0.46	276231.16
2018	471925.00	919281.10	34.40	140541.00	0.52	278435.75
2019	487488.00	986515.20	32.60	141008.00	0.55	281280.58
2020	498314.00	1013567.00	43.30	141212.00	1.00	283540.67
2021	524000.00	1143669.70	38.40	141260.00	0.64	293440.00

3.2 Data processing and analysis

Pretreatment

Standardize the data according to formula (1). The standardized data are shown in Table 2 below.

Table 2

Standardized data sheets

Year	Total energy demand	GDP overall amount	Percentage of output in the secondary sector	Total population	Coefficient of elasticity of energy demand	Demand for coal
Year	X₁ (tons of standard coal)	X₂ (billions of dollars)	X₃ (%)	X₄ (in tens of thousands)	X₅	X₆ (tons of standard coal)
2010	0.0000	0.0000	1.0000	0.0000	0.6173	0.0000
2011	0.1616	0.1036	0.7823	0.1151	0.7037	0.5046
2012	0.2540	0.1729	0.7016	0.2554	0.3704	0.5903
2013	0.3444	0.2472	0.6411	0.3676	0.3457	0.7164
2014	0.4144	0.3164	0.5242	0.4959	0.2099	0.7357
2015	0.4497	0.3783	0.2863	0.5907	0.0000	0.6245
2016	0.4949	0.4569	0.1371	0.7171	0.0741	0.5707
2017	0.5827	0.5740	0.0645	0.8258	0.3333	0.6077
2018	0.6812	0.6933	0.0726	0.8997	0.4074	0.6580
2019	0.7765	0.7852	0.0000	0.9648	0.4444	0.7228
2020	0.8428	0.8222	0.4315	0.9933	1.0000	0.7744
2021	1.0000	1.0000	0.2339	1.0000	0.5556	1.0000

Year	Total energy demand	GDP overall amount	Percentage of output in the secondary sector	Total population	Coefficient of elasticity of energy demand	Demand for coal
Year	X₁ (tons of standard coal)	X₂ (billions of dollars)	X₃ (%)	X₄ (in tens of thousands)	X₅	X₆ (tons of standard coal)
2010	0.0000	0.0000	1.0000	0.0000	0.6173	0.0000
2011	0.1616	0.1036	0.7823	0.1151	0.7037	0.5046
2012	0.2540	0.1729	0.7016	0.2554	0.3704	0.5903
2013	0.3444	0.2472	0.6411	0.3676	0.3457	0.7164
2014	0.4144	0.3164	0.5242	0.4959	0.2099	0.7357
2015	0.4497	0.3783	0.2863	0.5907	0.0000	0.6245
2016	0.4949	0.4569	0.1371	0.7171	0.0741	0.5707
2017	0.5827	0.5740	0.0645	0.8258	0.3333	0.6077
2018	0.6812	0.6933	0.0726	0.8997	0.4074	0.6580
2019	0.7765	0.7852	0.0000	0.9648	0.4444	0.7228
2020	0.8428	0.8222	0.4315	0.9933	1.0000	0.7744
2021	1.0000	1.0000	0.2339	1.0000	0.5556	1.0000

Grey relational analysis

This study employed grey relational analysis to identify factors with significant influence on China’s total energy consumption. Perform calculations using MATLAB software. The values of the grey correlation coefficients between the solved reference series and the contrasting series are shown in Table 3 below; the correlation coefficients reflect the degree of correlation between each contrasting series and the reference series at each dynamic point in time.

Table 3

Grey correlation coefficient values

X₂	X₃	X₄	X₅	X₆
0.0000	1.0000	0.0000	0.6173	0.0000
0.05780	0.6207	0.0465	0.5421	0.3430
0.0811	0.4476	0.0014	0.1164	0.3363
0.0972	0.2967	0.0231	0.0012	0.3720
0.0980	0.1098	0.0815	0.2045	0.3213
0.0714	0.1634	0.1410	0.4497	0.1747
0.0380	0.3578	0.2222	0.4208	0.0758
0.0087	0.5181	0.2431	0.2493	0.0251
0.0121	0.6086	0.2185	0.2738	0.0232
0.0087	0.7765	0.1884	0.3320	0.0536
0.0206	0.4113	0.1505	0.1572	0.0684
0.0000	0.7661	0.0000	0.4444	0.0000

X₂	X₃	X₄	X₅	X₆
0.0000	1.0000	0.0000	0.6173	0.0000
0.05780	0.6207	0.0465	0.5421	0.3430
0.0811	0.4476	0.0014	0.1164	0.3363
0.0972	0.2967	0.0231	0.0012	0.3720
0.0980	0.1098	0.0815	0.2045	0.3213
0.0714	0.1634	0.1410	0.4497	0.1747
0.0380	0.3578	0.2222	0.4208	0.0758
0.0087	0.5181	0.2431	0.2493	0.0251
0.0121	0.6086	0.2185	0.2738	0.0232
0.0087	0.7765	0.1884	0.3320	0.0536
0.0206	0.4113	0.1505	0.1572	0.0684
0.0000	0.7661	0.0000	0.4444	0.0000

Following the steps for calculating the grey correlation, The grey relational degree for each indicator were calculated, with the results shown in Table 4.

Table 4

Grey correlation results table

Factor	Relatedness	Arrange in order
X₂	0.9281	1
X₄	0.8386	2
X₆	0.8061	3
X₅	0.6433	4
X₃	0.5307	5

As shown in Table 4, the top three indicators in terms of grey relational degree are GDP overall amount (0.9281), total population (0.8386), and coal demand (0.8061). Therefore, these three indicators are selected to construct a grey multivariate prediction model.

3.3 Comparison of three predictive models

Based on the grey correlation coefficient, GDP total amount, total population, and coal demand were selected as the influence factor sequence. The DGM(1,4) model, along with two comparative models—GM(1,4) and MLR—were constructed to forecast total energy demand from 2010 to 2021. The prediction results and model performance evaluations are presented in Table 5.

Table 5

Model prediction results

Year	Actual value	MLR	GM(1,4)	DGM(1,4)
2010	360,648	359,310	360,648	360,648
2011	387,043	389,557	334,937	387,347
2012	402,138	402,319	443,049	401,598
2013	416,913	416,683	430,654	416,883
2014	428,334	427,132	430,908	428,588
2015	434,113	432,066	432,731	434,282
2016	441,492	441,256	440,690	441,455
2017	455,827	457,810	457,092	455,454
2018	471,925	474,651	474,177	472,207
2019	487,488	488,577	488,349	487,847
2020	498,314	495,034	495,001	497,766
2021	524,000	523,841	525,624	524,105
MAPE	–	0.32%	0.77%	0.06%

Year	Actual value	MLR	GM(1,4)	DGM(1,4)
2010	360,648	359,310	360,648	360,648
2011	387,043	389,557	334,937	387,347
2012	402,138	402,319	443,049	401,598
2013	416,913	416,683	430,654	416,883
2014	428,334	427,132	430,908	428,588
2015	434,113	432,066	432,731	434,282
2016	441,492	441,256	440,690	441,455
2017	455,827	457,810	457,092	455,454
2018	471,925	474,651	474,177	472,207
2019	487,488	488,577	488,349	487,847
2020	498,314	495,034	495,001	497,766
2021	524,000	523,841	525,624	524,105
MAPE	–	0.32%	0.77%	0.06%

By comparing the forecast results of the three models with actual values (Table 5), it is evident that the DGM(1,4) model demonstrates the most outstanding predictive performance. Its Mean Absolute Percentage Error (MAPE) is only 0.06%, significantly lower than the 0.32% of the MLR model and the 0.77% of the GM(1,4) model. Examining the forecast values for each year reveals that the DGM(1,4) model’s prediction curve closely aligns with the actual value curve, particularly after 2015 where they nearly coincide completely, demonstrating exceptional fitting accuracy and stability. In contrast, the MLR model exhibited significant deviations in 2011 and 2020, indicating limited adaptability of linear models to abnormal fluctuations. Meanwhile, the GM(1,4) model showed substantial prediction errors during the early phase of the series (2011–2012), reflecting potential issues such as initial value sensitivity in standard grey models. This comparative analysis fully validates the superiority of the DGM(1,4) model in forecasting China’s energy demand. By incorporating differential operations, it effectively enhances model stability and predictive accuracy, providing more reliable quantitative support for energy policy formulation.

This figure visually demonstrates the fitting performance of three forecasting models for China’s total energy demand between 2010 and 2021 (Figure 1). As shown, the forecast curve of the DGM(1,4) model nearly perfectly aligns with the actual observed curve, indicating its ability to accurately capture the dynamic trends in energy demand. The GM(1,4) model exhibits noticeable fluctuations in the early stages of the sequence, with forecast values significantly deviating from actual values, revealing initial instability. While the MLR method maintains a consistent overall trend, visible deviations occur at multiple points (e.g. 2011, 2020), reflecting the limitations of linear models in fitting complex nonlinear relationships. Overall, this illustration further validates the superior performance and reliability of the DGM(1,4) model in energy demand forecasting.

Figure 1

A multi-line graph shows the comparison of the total energy demand for four models from 2010 to 2020.

View large Download slide

The vertical axis is labeled “total energy demand per (10 superscript 4 t c e)” and ranges from 300000 to 550000 in increments of 50000 units. The horizontal axis is labeled “year” and ranges from 2010 to 2022 in increments of 2 years. A legend at the top left corner indicates that the graph plots four lines. The line with asterisk data markers represents “Actual Value”, the line with circle data markers represents “M L R”, the line with triangle data markers represents “G M (1, 4)”, and the line with inverted triangles as data markers represents “D G M (1, 4)”. The lines labeled “Actual Value”, “M L R”, and “G M (1, 4)” start at (2010, 360526.316) and end at (2021, 526315.789), exhibiting a general upward trend. Some of the other data points through which these three lines pass are as follows: (2012, 402631.579), (2014, 429824.561), (2016, 443859.649), (2018, 474561.404), and (2020, 499122.807). The line labeled “D G M (1, 4)” starts at (2010, 360526.316) and ends at (2021, 526315.789). Some of the other data points through which the line passes are as follows: (2011, 336842.105), (2012, 445614.035), (2014, 434210.526), (2016, 443859.649), (2018, 474561.404), and (2020, 499122.807). Note: All numerical data values are approximated.

Model prediction results comparison line chart

4. Conclusions and recommendations

4.1 Conclusion

Through grey correlation analysis, this study clarifies the core factors affecting China’s total energy demand, which are, in order of correlation, GDP overall amount (correlation of 0.928), total population (correlation of 0.839), and coal demand (correlation of 0.806), suggesting that economic growth (GDP) is the strongest driver of energy demand growth, with population size and coal dependence coming next, and the proportion of secondary industry output, and the elasticity coefficient of energy demand have relatively weak effects.
The DGM(1,4) model shows high accuracy (MAPE = 0.06%) in forecasting total energy demand, and fits well especially in medium- and long-term forecasts. The grey correlation coefficient indicates that coal demand exerts the greatest influence on overall energy demand, reflecting China’s continued reliance on coal as the dominant energy source. Population size contributes secondarily, demonstrating that population growth remains a persistent driver of energy demand. GDP overall amount exhibits relatively low influence, likely attributable to economic restructuring—such as the increased share of the service sector—which has reduced energy intensity per unit of GDP.

4.2 Recommendations

Optimize the energy mix. Reduce dependence on coal, replace coal through clean energy (wind energy, photovoltaic) and reduce negative impacts. Promote industrial upgrading, increase the proportion of tertiary industry, and weaken the correlation between secondary industry (high energy consumption) and energy demand.
Improve the forecasting system. Dynamically adjust the model by introducing time-lag variables (e.g. policy lag effect) or segmented modeling (distinguishing between high/low economic growth stages). Integrate multi-methods and combine machine learning (LSTM, Random Forest) to deal with non-linear features and improve forecast robustness.
Strengthen data support. Expand the dimensions of the indicators to include new variables such as the proportion of renewable energy and carbon emission constraints, reflecting the trend of energy transition under the “dual-carbon” goal. Increase the frequency of data and adopt monthly or quarterly data to enhance the ability to capture short-term fluctuations.

From the above conclusions, we can see that the DGM(1, N) model has good accuracy in forecasting, but the model can still be optimized to further improve its accuracy, such as increasing the time lag and adding an intelligent optimization algorithm to search for the optimal parameters. This will be the focus of my next research, and I will strive to learn more about this topic so that I can improve the model as soon as possible to increase the accuracy of the model and make more accurate forecasts of China’s energy demand.

References

Congxiang

,

T.

,

Ziyan

,

L.

,

Xiancheng

,

L.

and

Abdalla

,

A.N.

(

2025

), “

Enhancing short-term building energy forecasting with photovoltaic thermal effects and seasonal heat transfer dynamics using deep learning models

”,

Case Studies in Thermal Engineering

, Vol.

74

, 107019, doi:

https://doi.org/10.1016/j.csite.2025.107019

.

Google Scholar

Gao

,

W.

,

Ding

,

H.

and

Zhang

,

Z.

(

2020

), “

The driving factors of China’s energy consumption and economic development decoupling

”,

Henan Science

, Vol.

38

No.

10

, pp.

1650

-

1659

.

Google Scholar

Hao

,

Y.

,

Wang

,

L.

and

Wu

,

Y.

(

2018

), “

The forecasting and prospects of China’s energy economy in the new era

”,

Journal of Beijing Institute of Technology (Social Sciences Edition)

, Vol.

20

, pp.

8

-

14

.

Google Scholar

Hidayat

,

M.

,

Rangkuty

,

D.M.

,

Ferine

,

K.F.

and

Saputra

,

J.

(

2024

), “

The influence of natural resources, energy consumption, and renewable energy on economic growth in ASEAN region countries

”,

International Journal of Energy Economics and Policy

, Vol.

14

No.

3

, pp.

332

-

338

, doi:

https://doi.org/10.32479/ijeep.15917

.

Google Scholar

Crossref

Li

,

T.

,

Wu

,

L.

and

Zhu

,

J.M.

(

2022

), “

Short-term prediction of daily coal consumption in coastal power plants based on ARIMA model

”,

Coal Economic Research

, Vol.

42

No.

8

, pp.

13

-

18

.

Google Scholar

Li

,

H.B.

,

Liu

,

Q.

,

Chen

,

Y.G.

,

Han

,

M.

and

Wu

,

X.D.

(

2025

), “

Predicting China’s natural-gas demand trend

”,

Natural Gas Technology and Economy

, Vol.

19

No.

3

, pp.

1

-

82

.

Google Scholar

Liu

,

R.H.

and

Lu

,

H.Z.

(

2024

), “

Spatial spillover effect of energy efficiency and its influencing factors——an empirical analysis based on spatial metrology

”,

Journal of industrial Technological Economics

, Vol.

43

No.

2

, pp.

135

-

141

.

Google Scholar

Liu

,

S.

,

Yang

,

Y.J.

and

Wu

,

L.F.

(

2014

),

Grey System Theory and its Applications

, (7th ed.) ,

Science Press

,

Beijing

.

Google Scholar

Luo

,

S.H.

,

Hu

,

W.H.

,

Liu

,

W.

,

Zhang

,

Z.

,

Bai

,

C.

,

Du

,

Y.

,

Huang

,

Q.

and

Chen

,

Z.

(

2024

), “

Transition pathway for China to achieve carbon neutrality by 2060 (in Chinese)

”,

Scientia Sinica Technologica

, Vol.

54

No.

1

, pp.

43

-

64

, doi:

https://doi.org/10.1360/sst-2022-0041

.

Google Scholar

Crossref

Ma

,

X.

and

Liu

,

Z.

(

2015

), “

Predicting the oil field production using the novel discrete GM(1,N) model

”,

The Journal of Grey System

, Vol.

27

No.

4

, pp.

63

-

73

.

Google Scholar

Machado

,

E.

,

Pinto

,

T.

,

Guedes

,

V.

and

Morais

,

H.

(

2021

), “

Electrical load demand forecasting using feed-forward neural networks

”,

Energies

, Vol.

14

No.

22

, p.

7644

, doi:

https://doi.org/10.3390/en14227644

.

Google Scholar

Crossref

Morteza

,

A.

,

Yahyaeian

,

A.A.

,

Mirzaeibonehkhater

,

M.

,

Sadeghi

,

S.

,

Mohaimeni

,

A.

and

Taheri

,

S.

(

2023

), “

Deep learning hyperparameter optimization: application to electricity and heat demand prediction for buildings

”,

Energy and Buildings

, Vol.

289

, 113036, doi:

https://doi.org/10.1016/j.enbuild.2023.113036

.

Google Scholar

Shanmugam

,

D.

and

Ramana

,

V.V.

(

2024

), “

Different meta-heuristic optimized radial basis function neural network models for short-term power consumption forecasting

”,

Advances in Engineering and Intelligence Systems

, Vol.

3

No.

2

, pp.

63

-

82

.

Google Scholar

Sun

,

Y.B.

and

Gao

,

S.

(

2021

), “

Study on measurement and effect of influencing factors of energy consumption intensity in China

”,

Coal Economic Research

, Vol.

41

No.

12

, pp.

11

-

19

.

Google Scholar

Wei

,

C.

and

Shen

,

Z.Y.

(

2019

), “

Determinants of residential energy consumption: a urban-rural comparison

”,

Economic Theory and Business Management

, No.

12

, pp.

4

-

16

.

Google Scholar

Zhou

,

M.

,

Xie

,

Y.Y.

,

Sun

,

Y.F.

and

Gao

,

W.

(

2018

), “

Study on the influence path of China’s urbanization development on energy consumption based on direct and indirect effect perspective

”,

Resources Science

, Vol.

40

No.

9

, pp.

1693

-

1705

.

Google Scholar

2025

Meiru Hou, Jun Zhang and Siqi Dong

Published in Marine Economics and Management. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at Link to the terms of the CC BY 4.0 licence.

Forecasting total energy demand based on the DGM(1,N) model

1. Introduction

1.1 Background and motivation

1.2 Literature review

2. Methodology

2.1 Grey relational analysis

2.2 Gray multivariate DGM(1,N) model

3. Empirical analysis

3.1 Data sources

3.2 Data processing and analysis

3.3 Comparison of three predictive models

4. Conclusions and recommendations

4.1 Conclusion

4.2 Recommendations

References

Email Alerts

Cited By

Forecasting total energy demand based on the DGM(1,N) model Open Access

1. Introduction

1.1 Background and motivation

1.2 Literature review

2. Methodology

2.1 Grey relational analysis

2.2 Gray multivariate DGM(1,N) model

3. Empirical analysis

3.1 Data sources

3.2 Data processing and analysis

3.3 Comparison of three predictive models

4. Conclusions and recommendations

4.1 Conclusion

4.2 Recommendations

References

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Forecasting total energy demand based on the DGM(1,N) model