Skip to Main Content
Purpose

Port throughput forecasting is crucial for port planning and regional economic development, while traditional models suffer from poor adaptability to small-sample data, failure to capture periodic and time-varying features and low prediction accuracy. This study aims to develop a more accurate and robust model to realize precise forecasting of China's coastal port throughputs with different time scales and dimensions.

Design/methodology/approach

This study proposes a fractional-order time-varying grey Fourier model (FTGFM) by extending the traditional grey model framework. The proposed model integrates a fractional-order accumulation operator, a time-varying correction term and a truncated Fourier series to respectively characterize new-information priority, system dynamic evolution and periodic data fluctuations. The Whale Optimization Algorithm is employed to optimize nonlinear parameters. The effectiveness of the model is evaluated using quarterly data (2017–2025) and monthly data (2023–2025) for four core port throughput indicators, and its performance is compared with six comparative models based on mean absolute percentage error (MAPE), root mean squared error and Theil's U statistic.

Findings

Empirical results indicate that FTGFM(1,1) outperforms all benchmark models, with training and testing MAPE both below 3%. It effectively captures the periodic and time-varying trends in port throughput, and forecasts suggest China's port throughput will fluctuate steadily upward in 2026–2027.

Originality/value

The study structurally improves the grey model with a time-varying correction term and embeds Fourier series into grey modeling for port throughput forecasting for the first time, combined with the fractional-order accumulation operator to optimize information processing. The proposed model thus provides an effective new tool for forecasting small-sample time series with periodic and time-varying features.

Ports, as vital gateways for the circulation of a wide range of elements, are the pivotal nexus between domestic and international markets (Ziran et al., 2022). According to the China Port Operation Analysis Report (2024), China's foreign trade sea transportation volume accounted for 30.1% of the global sea transportation volume, with the year-on-year growth rates of container throughput and foreign trade throughput reaching 4.9% and 9.6%, respectively. Port throughput forecasting exerts a significant influence on the planning of port infrastructure, the improvement of operational efficiency, the determination of investment scale, the formulation of operational strategies and the planning of development strategies (Sun et al., 2024). It also has a far-reaching impact on the economic growth of port cities and the economic development of their hinterland regions (Twrdy and Batista, 2016). As a hub of the logistics network, ports have their throughput influenced by the complex interplay of various factors, which leads to the uncertainty of port throughput and also presents challenges to port forecasting (Liu et al., 2023; Li et al., 2024).

Current research on port data forecasting primarily employs methods such as time series analysis, neural networks and hybrid forecasting models. Time series analysis methods include the cointegration error correction model (Hui Eddie et al., 2004) and the auto regressive integrated moving average (ARIMA) model (Yao, 2021). However, studies by Mokhtar et al. (2022) have found that due to the limited sample size of port throughput data, linear and seasonal ARIMA models exhibit significant errors in forecasting container volumes and freight volumes, especially in short-term predictions. Additionally, some scholars have used single models based on neural network systems to forecast port throughput (Liu et al., 2023), while others have employed hybrid machine learning models composed of multiple models, such as least squares support vector regression (LS-SVR) (Xie et al., 2013) and gated recurrent unit (GRU) neural network, refined with the Adam algorithm (Niu et al., 2018). Nevertheless, single models may face issues of overfitting and local optima, which can affect learning effectiveness. Hybrid models, on the other hand, may easily overlook the influence of exterior determinants and tend to have longer runtime.

Conventional predictive models typically demand voluminous data samples for robust training and validation. Nevertheless, port throughput forecasting is frequently plagued by the scarcity of available historical data, alongside pronounced periodic fluctuations and time-varying dynamics in its evolutionary trajectory (Xie et al., 2013). Intriguingly, the inherent sample characteristics and pervasive data uncertainty associated with port throughput align closely with the core research scope of grey system theory, a methodological framework specifically engineered to address challenges characterized by small datasets and incomplete, ambiguous information. Originated by Deng Julong in 1982, grey system theory is tailored to tackle small-sample and poor-information problems with exceptional efficacy. Among its diverse derivatives, the grey model (GM(1,1)) stands out for its capacity to accurately characterize data variation patterns and conform to the structural attributes of grey systems; accordingly, numerous scholarly efforts have been devoted to refining this model to bolster its practical applicability across real-world scenarios.

Initial conditions and background values constitute two indispensable components that are tightly correlated with parameter estimation, and extensive academic endeavors have targeted the optimization of multivariate grey models from the perspectives of Initial value optimization (Yang et al., 2025), algorithmic improvement (Tang and Zhu, 2025) and background value interpolation coefficient calibration (Wang and Wang, 2023). Beyond methodological tweaks, the intrinsic features of sample data further exert a substantial impact on the predictive error of multivariate grey models, motivating a wealth of studies to customize such models for compatibility with idiosyncratic data evolutionary trends. In the context of nonlinear datasets, Zheng et al. (2021) accounted for the inherent biases stemming from the discrete approximation of differential equations in most prevailing grey sequence models, and subsequently developed an unbiased Nonlinear Grey Bernoulli Model (NGBM(1,1)) to quantify China's hydropower consumption. For capturing seasonal volatility in time series, a spectrum of sophisticated fitting strategies has been proposed, encompassing seasonal buffer operators (Zhou et al., 2020) and seasonal dummy variables (Zhou and Ding, 2021), among other techniques. To accommodate the time-varying traits of sequential data, Qian et al. (2012) developed a grey model containing a time power term GM (1,1, tα) using the concept of grey modeling and the constant c in the grey operation term btα+c, and applied it to predict foundation settlement. Ma and Liu (2017) further innovated a time-delay polynomial grey prediction model (TDPGM(1,1)) to forecast China's natural gas demand with enhanced precision. To forecast port cargo throughput in China, Li et al. (2025) explore policy-driven mechanisms and autoregressive time lag terms, and introduce a novel self-adaptive grey prediction modeling framework.

Grey prediction models have garnered widespread scholarly attention for their superior capability in handling uncertain information and small-sample datasets. Nevertheless, conventional treatments for periodic and seasonal fluctuations in time series, such as seasonal buffer operators and seasonal dummy variables, fail to characterize flexible periodic patterns embedded in the data, thereby compromising the accuracy and reliability of forecasting outcomes. Furthermore, existing empirical validations are mostly confined to single data dimensions, uniform time scales or isolated regional indicators, lacking a comprehensive and multi-dimensional verification framework. Consequently, the generalization performance and practical applicability of such models in port throughput forecasting remain to be further verified.

Notably, port throughput forecasting is featured by sparse datasets, pronounced time-varying dynamics and periodic evolutionary trends, which impose stringent requirements on the descriptive and predictive power of forecasting models. To address these challenges, this paper innovatively develops a fractional-order time-varying grey Fourier model (FTGFM) to better fit the evolutionary trends and fluctuation characteristics of the data. The main contributions of this work are summarized as follows:

  1. Aiming at the inherent time-varying nature of time series, this study structurally improves the traditional fractional grey model (FGM(1,1)) model by innovatively introducing a time-varying correction term. This term accurately characterizes the system's dynamic evolution, and quantifies the dynamic impacts of internal uncertainties.

  2. To capture data periodicity, this paper integrates Fourier series into the grey modeling framework to enhance periodic fluctuation fitting. Fourier series decomposes complex periodic functions into basic sine and cosine components, enabling more accurate simulation and prediction of periodic time series and offering a new method for seasonal trend forecasting.

  3. A multi-dimensional empirical system is constructed to fully validate the proposed FTGFM(1,1) by combining long/short-period datasets and comparing regional and national indicators. Experimental results on port throughput data verify the model's superior prediction accuracy.

Definition 1.

Consider X(0) as the original series, formulated as: X(0)=(x(0)(1),x(0)(2),,x(0)(n)). Let X(1)=(x(1)(1),x(1)(2),,x(1)(n)) be the one-order accumulating generation series of X(0), where

(1)
Definition 2.

The grey model (GM(1,1)) can be expressed as

(2)

where a denotes the developing coefficient, b denotes grey action.

The parameter vector A=[a,b]Τ are estimated with the application of the least squares method (LSM).

(3)

where, E=[z(2)1z(3)1z(n)1], ϒ=[x1(0)(2)x1(0)(3)x1(0)(n)]

Discretization of the time response Eq. (2) yields

(4)

The predicted value Xˆ(0) of the original sequence is

(5)

When dealing with time series characterized by periodic fluctuations and time-varying dynamics, adopting an appropriate grey prediction model is critical to achieving effective fitting and forecasting. To address these challenges, a novel grey model, termed the FTGFM and abbreviated as the FTGFM(1,1) model, is proposed in this paper.

2.2.1 Model establishment

Definition 3.

The original sequence X(0) is represented by a column vector X(0)=(x(0)(1),x(0)(2),,x(0)(n)), where x(0)(k) is the element of the set and n is the number of the series.

Definition 4.

Let X(r)=(x(r)(1),x(r)(2),,x(r)(n)) be the r-order accumulating generation series of X(0). The fractional order accumulation operator D is delineated thusly:

(6)

where,

(7)

In the fractional-order accumulation operator, the effective processing of weighted information is achieved by assigning higher weights to the most recent data points, thereby realizing the principle of prioritizing new information. This means that the model gives priority to the latest data points rather than simply averaging all data points. This approach helps enhance the model's sensitivity to recent changes, thereby making the forecasting results more accurate and timely (Wu et al., 2013).

Definition 5.

Assume that X(r) is given by Definition 4. On the basis of the fractional-order univariate grey prediction model, we incorporate the time-varying term and the truncated Fourier function to establish the FTGFM(1,1) model. The whitening differential equation of the FTGFM(1,1) model is expressed as:

(8)

where a denotes the development coefficient, h1,h2 represents the time-varying term coefficient, gs,os(s=1,2,,Z) refers to the Fourier coefficient and v is an adjustable nonlinear parameter that reflects the sequence frequency to match applications with diverse evolutionary trends and oscillation states. In this work, parameter optimization is adopted to fit the complex periodicity of the sequence. Z denotes the adjustable Fourier order, representing the number of truncated terms, which is correlated with the periodic pattern.

On the one hand, a Fourier series with a small truncation order Z can only represent simple periodic patterns. On the other hand, a model with a higher order enables the fitting of more complex periodic patterns but is accompanied by the risk of overfitting. Therefore, it is crucial to select an appropriate order according to the seasonal time series. An alternative approach is to treat the Fourier truncation order as a parameter and conduct algorithmic optimization to determine its optimal value (Liu et al., 2024).

Fourier series can decompose complex periodic functions into a linear combination of multiple sine and cosine functions with distinct frequencies and amplitudes. Such decomposition capability makes the Fourier series highly suitable for capturing and analyzing the periodic components embedded in data (Xiong et al., 2024).

Definition 6.

Let Z(r) denotes the background value sequence, which is defined as Z(r)=(z(r)(1),z(r)(2),,z(r)(n)), where

(9)

The whitening differential equation is transformed into its discrete form by discretizing Eq (8), and the corresponding grey differential equation is obtained as follows:

(10)

2.2.2 Prediction formula of FTGFM (1, 1)

Solving the above equation yields:

Substitute k=1,

The time response of the FTGFM (1, 1) model is:

(11)

Finally,the sequence predicted value xˆ(r)(k) can be obtained:

(12)

2.3.1 Structure parameter

As for the FTGFM (1, 1), the development coefficient, Fourier coefficient, and time-varying coefficient must be estimated. The parameter sequence is expressed as Λ, that is,

(13)

In the realm of grey prediction, the LSM is the predominant parameter optimization technique. It identifies the optimal development coefficient and grey action by minimizing the squared deviations between simulated and predicted values. And this approach aligns with the scientific principle of simplicity. Consequently, this study employs the LSM to determine the parameter vector Λ. The parameters of the FTGFM (1, 1) model are estimated as follows:

(14)

Where,

2.3.2 Estimation of the nonlinear parameter

As for the FTGFM (1, 1), the optimal fractional order r contributes to efficiently addressing new information's priority, the optimal sequence frequency v is conducive to fitting the evolutionary trend and oscillation state of the data and the optimal Fourier truncation order Z is beneficial for fitting complex periodic patterns. Therefore, optimizing these nonlinear parameters of FTGFM (1, 1) can not only augment the forecasting accuracy but also strengthen the model's adaptive and flexible capabilities. We develop a nonlinear constrained optimization model that integrates FTGFM (1, 1) estimated parameters, enabling dynamic identification of true fluctuations. To elaborate, the object function is expressed as the minimum Mean Absolute Percentage Error (MAPE) for true data and estimated outcomes (see Eq. (15)). Under constraints, Whale Optimization Algorithm (WOA) serves to optimize the parameters r and v as real numbers, while performing integer optimization for Z, thereby achieving accurate forecasting. Specifically, the search ranges of parameters r, v and Z are set to [0,2], [−10,10] and [0,10] respectively; the population size is set to 50; and the maximum number of iterations is set to 200. The objective function and constraints of WOA are shown in the equations.

(15)

2.4.1 Criteria for validation

To assess the model's prediction exactness and examine relative and absolute errors of predicted and real values, this study selects four evaluation indicators to evaluate the model's effectiveness, involving APE, MAPE, root mean squared error (RMSE) and Theil U statistic (U). The formulas are presented in the following manner:

(16)
(17)
(18)
(19)

where, xˆ(0)(k) denotes the simulated outcomes obtained by recursion of Eq.(12), x(0)(k) denotes the real values.

2.4.2 Modeling process

The detailed modeling process is presented as follows:

  • Step 1: The system behavior sequence is determined via relevant literature and validated data sources. The dataset is split into training and testing subsets for model building and accuracy validation. To fit the exponential growth trend, r-order fractional accumulation is applied to the raw data to generate the accumulated sequence and boost model adaptability.

  • Step 2: Model coefficients are estimated via the OLS method to minimize the sum of squared deviations between estimated and actual values, ensuring parameters reflect the system's internal structure and improve fitting accuracy.

  • Step 3: To boost prediction accuracy, the WOA is adopted to fine-tune hyperparameters (fractional accumulation order and power exponent), which are vital for capturing data dynamics. The FTGFM(1,1) model is then built with the optimized hyperparameters.

  • Step 4: Time Response Derivation. Using the parameters obtained in Step3, the time response function of the FTGFM (1, 1) model is formulated to generate the predicted accumulated sequence. The purpose of this step is to reconstruct the system's dynamic evolution over time. Subsequently, the inverse accumulation process is performed to obtain the final forecast values xˆ(0)(k) in their original scale.

  • Step 5: The predicted values are used to calculate performance metrics, including MAPE, RMSE and Theil's Inequality Coefficient (U), enabling the comparison of prediction performance and robustness with other benchmark models.

To verify the effectiveness of the proposed port trade forecasting model and achieve accurate prediction of future port throughput data, multi-dimensional core port throughput indicators are selected to construct the research dataset. Specifically, the indicators are defined as follows: coastal cargo throughput (CCgT) refers to the total quantity of all types of cargo handled at coastal ports within a specified time interval, which comprehensively reflects the cargo distribution capacity and regional logistics hub status of coastal ports; coastal foreign trade cargo throughput (CFTCT) denotes the total volume of import and export goods loaded and unloaded through coastal ports during a certain period, which directly characterizes the port's foreign trade service capacity and linkage level with the global industrial chain; national port cargo throughput (NPCT) represents the total amount of cargo handled by all ports across the country within a defined period, which reflects the overall development scale of the national port system; coastal container throughput (CCnT) refers to the total number of containers handled at coastal ports within a specified interval, which measures the port's containerized transportation level and international shipping hub capacity.

Quarterly data spanning 2017 to 2025 are adopted for coastal cargo throughput and CFTCT, aiming to depict the long-term evolutionary trends of the indicators. Monthly data covering 2023–2025 are selected for NPCT and coastal container throughput, to capture short-term fluctuation characteristics and conduct comprehensive predictive analysis of regional and national port development levels (data sourced from the wind database). The dataset is divided into training and testing subsets according to data frequency. For the quarterly data of coastal cargo throughput and CFTCT, 28 consecutive quarterly samples from 2017 to 2023 are utilized as the training set to complete the optimal estimation of model parameters and deeply mine long-term trend features, while 8 quarterly samples from 2024 to 2025 are employed as the testing set to verify the extrapolation accuracy and result robustness of the model. For the monthly data of NPCT and coastal container throughput, 24 consecutive monthly samples from 2023 to 2024 are applied as the training set to effectively fit the short-term fluctuation rules of the indicators and 12 monthly samples in 2025 are used as the testing set to test the model's prediction capability and generalization performance for high-frequency data.

This study combines long- and short-cycle data, as well as compares regional and national indicators, providing rigorous and scientific data support for model effectiveness verification and future port throughput data prediction.

3.2.1 Prediction performance analysis of CCgT and CFTCT

The raw data of CCgT and CFTCT are divided into training and testing sets, and the WOA is adopted to optimize the real numbers r and v, while the Fourier truncation order Z is optimized as an integer. The FTGFM(1,1) model is then evaluated against six competing models in terms of prediction performance, based on the sequence length that achieves the highest accuracy. For CCgT, the optimal parameters are determined as r=0.47,v=0.34 and Z=9. For CFTCT, the optimal parameters are determined as r=0.62,v=0.32 and Z=6. Subsequently, simulations and predictions for CCgT and CFTCT are carried out based on the time response function and estimated parameters. Finally, the upper and lower bounds of errors are determined using the specified equations, and the average relative error is calculated accordingly.

For validation purposes, the model's forecasting outcomes were contrasted with results from other models, including back propagation neural network (BPNN), ARIMA, GM(1,1), fractional time-varying grey model (FTGM(1,1)) and seasonal grey model (SGM(1,1)) models. Table 1 provides the prediction results for the test set. The simulation curve of the new model is depicted in Figure 1. The forecast curve roughly corresponds to the actual value curve, and fits the periodicity and nonlinearity of the original sequence very well. Compared with the real values, the predicted values show excellent fitting accuracy.

Table 1

Testing set outcomes among six models in CCgT and CFTCT

QuarterReal valueFTGFM(1,1)BPNNARIMAGM(1,1)FTGM(1,1)SGM(1,1)
CCgT (Ten thousand tons)
2024–1290,172288545.38278236.55265989.92269812.27274179.10258690.98
2024–2287,027283370.65278322.85276291.72271676.56278771.99270561.47
2024–3280,269269636.43278315.49269855.87273545.05283764.46276942.88
2024–4285,000271698.75278302.95279944.29275417.76289190.96274762.71
2025–1295,089309784.62278297.33273719.09277294.69295088.99266359.50
2025–2296,504298894.91278295.51283599.54279175.85301499.26278581.86
2025–3295,405295404.82278295.06277579.66281061.26308466.04285152.45
2025–4297,710301190.92278294.99287257.42282950.91316037.43282907.65
CFTCT (Ten thousand tons)
2024–1117,001117385.78113733.15114127.51114159.86120170.34107354.28
2024–2124,352122542.43113261.11114942.02115155.35121926.15117908.55
2024–3122,261124341.13113040.71115890.06116155.14123747.89118719.52
2024–4119,658119545.53112944.35116838.10117159.24125638.27123924.56
2025–1120,204120203.99112903.53117786.14118167.68127600.10111414.55
2025–2126,760128341.62112886.52118734.18119180.47129636.27122367.99
2025–3129,668129668.59112879.50119682.21120197.64131749.77123209.64
2025–4131,789125135.86112876.62120630.25121219.20133943.69128611.54
Figure 1

The simulation curves of CCgT and CFTCT by FTGFM(1,1) model

Figure 1

The simulation curves of CCgT and CFTCT by FTGFM(1,1) model

Close modal

From the perspective of error distribution, Table 2 and Figure 2 present the MAPE, RMSE and Theil's U statistic for all prediction models in both training and testing phases, and the results show that the FTGFM(1,1) model achieves the lowest values of these three indicators among BPNN, ARIMA, GM(1,1), FTGM(1,1) and SGM(1,1) models, reflecting its distinct superiority in simulation and forecasting performance. In the training phase, FTGFM(1,1) yields very low errors for both CCgT and CFTCT datasets. In the forecasting (testing set) phase, for the CCgT dataset, FTGFM(1,1) obtained a MAPE of 2.16%, an RMSE of 8214.20 and a U of 0.03; for the CFTCT dataset, the MAPE was 1.23%, the RMSE was 2610.78 and the U was 0.02. These are the lowest values among all comparison models for both datasets, demonstrating satisfactory performance in simulating and forecasting the relevant time-series data.

Table 2

MAPE, RMSE and U among six models in CCgT and CFTCT

SetIndexFTGFM(1,1)BPNNARIMAGM(1,1)FTGM(1,1)SGM(1,1)
CCgTtrain set      
MAPE1.013.253.313.463.763.08
RMSE3486.3210620.7310168.6212073.4412104.028999.98
U0.010.040.040.050.050.04
test set      
MAPE2.164.294.844.982.925.70
RMSE8214.2013919.8015346.8615114.8210518.6718870.76
U0.030.050.060.050.040.07
CFTCTtrain set      
MAPE0.941.582.342.392.812.23
RMSE1175.242147.673048.103074.223922.682960.80
U0.010.020.030.030.040.03
test set      
MAPE1.238.655.264.992.824.76
RMSE2610.7811984.787414.867063.843966.356261.73
U0.020.110.060.060.030.05
Figure 2

MAPE of the six models in CCgT and CFTCT

Figure 2

MAPE of the six models in CCgT and CFTCT

Close modal

3.2.2 Analysis of prediction results

Predictions for these two indicators across eight quarters of 2026 are generated via the model, and both show a steady fluctuating upward trend as depicted in Figure 1 and Table 3. In 2026, CCgT and CFTCT will maintain overall steady growth, with positive quarter-on-quarter changes in most quarters, which aligns well with the model's verified low error and high stability. Specifically, CCgT climbs gradually from its Q1 2026 baseline and stays at a high level quarter by quarter; CFTCT also rises continuously with minor inter-quarter fluctuations, presenting favorable growth continuity. These results validate the model's accuracy in capturing time-series trends and reflect the resilience of the corresponding business sectors.

Table 3

Prediction results of FTGFM(1,1) in CCgT and CFTCT

QuarterCCgTCFTCT
2026–1330757.42129558.70
2026–2306423.13131085.62
2026–3309038.66127371.53
2026–4304257.30124295.55
2027–1325162.83124977.35
2027–2301625.60125717.47
2027–3315534.72126873.11
2027–4326014.33130718.30

The continuous growth of CCgT and CFTCT is driven by the combined effects of multiple factors. First, the restructuring of the global industrial chain and the recovery of international trade have underpinned the expansion of business volume, laying a solid foundation for the upward trend of the indicators. Second, technological advancements, including port digital transformation and the deployment of intelligent handling equipment, have greatly enhanced operational efficiency and service capacity, providing robust technical support for sustained growth.

The quarterly forecasts for CCgT and CFTCT hold unique value as a risk early-warning and strategic alignment tool for stakeholders across the maritime trade ecosystem. For port operators and infrastructure investors, the identified quarterly fluctuation patterns enable forward-looking allocation of deep-water berths, storage facilities and multimodal transport corridors. This allows stakeholders to pre-empt capacity bottlenecks during high-demand quarters, optimize resource utilization in slower periods and prioritize investments in high-potential growth segments, thereby improving asset efficiency and long-term operational sustainability. From a trade and macroeconomic governance perspective, these forecasts serve as critical leading indicators for assessing the trajectory of coastal foreign trade and formulating targeted stabilization policies. The steady growth of CFTCT signals sustained external demand, providing empirical evidence for policymakers to refine export support measures, streamline customs clearance procedures and strengthen regional trade cooperation. Meanwhile, the consistent expansion of CCgT underscores the vitality of domestic and international circulation, helping authorities align port development with broader industrial upgrading and regional economic integration goals, ultimately fostering more resilient and high-quality growth in coastal economic zones.

3.3.1 Prediction performance analysis of NPCT and CCnT

The raw data of NPCT and CCnT are divided into training and testing sets, and the WOA is adopted to optimize the real numbers r and v, while the Fourier truncation order Z is optimized as an integer. The FTGFM(1,1) model is then evaluated against six competing models in terms of prediction performance, based on the sequence length that achieves the highest accuracy. For NPCT, the optimal parameters are determined as r=0.10,v=6.80 and Z=6. For CCnT, the optimal parameters are determined as r=1.98,v=5.76 and Z=5. Subsequently, simulations and predictions for CCgT and CFTCT are carried out based on the time response function and estimated parameters. Finally, the upper and lower bounds of errors are determined using the specified equations, and the average relative error is calculated accordingly.

For validation purposes, the model's forecasting outcomes were contrasted with results from other models, including BPNN, ARIMA, GM(1,1), FTGM(1,1) and SGM(1,1) models. Table 4 provides the prediction results for the testing set.The simulation curve of new model is depicted in Figure 3. The forecast curve roughly corresponds to actual value curve, and fits the periodicity and nonlinearity of the original sequence very well. Compared with the real values, the predicted values show excellent fitting accuracy.

Table 4

Testing set outcomes among six models in NPCT and CCnT

MonthReal valueFTGFM(1,1)BPNNARIMAGM(1,1)FTGM(1,1)SGM(1,1)
NPCT (Ten thousand tons)
2025–1153,368152970.40152794.15151330.00152601.46161453.33150242.10
2025–2153,870154125.88152626.72152650.96153050.48162009.91150701.00
2025–3154,690155765.49152232.94152864.91153500.82162564.65150532.32
2025–4153,341154570.81151633.02153307.41153952.49163117.80151475.79
2025–5158,993158174.91151073.50153702.73154405.48163669.59151875.22
2025–6155,856159909.28150741.84154107.79154859.81164220.20152339.11
2025–7153,729152757.80150594.95154510.83155315.47164769.83152168.59
2025–8158,545160780.00150538.36154914.29155772.48165318.62153122.32
2025–9154,030156221.51150517.74155317.67156230.83165866.72153526.09
2025–10156,837161282.00150510.39155721.06156690.53166414.28153995.02
2025–11161,452163809.80150507.79156124.45157151.58166961.40153822.65
2025–12158,865164604.15150506.87156527.84157613.99167508.21154786.75
CCnT (Ten thousand standard containers)
2025–12,6122650.612544.512555.252540.892615.582554.35
2025–22,6182617.632581.732567.412552.562618.002590.58
2025–32,5812532.812596.992579.572564.292620.372587.53
2025–42,5662544.822602.162591.722576.082622.692632.27
2025–52,6722631.552603.812575.582587.922624.972601.67
2025–62,6752691.862604.312616.042599.812627.202638.57
2025–72,6442669.872604.472628.192611.762629.382635.46
2025–82,7752719.032604.522640.352623.762631.532681.03
2025–92,6222622.002604.532652.512635.822633.622649.87
2025–102,6402590.112604.542664.672647.932635.682687.46
2025–112,6812603.772604.542676.822660.102637.702684.29
2025–122,6092610.972604.542688.982672.322639.672730.70
Figure 3

The simulation curves of NPCT and CCnT by FTGFM(1,1) model

Figure 3

The simulation curves of NPCT and CCnT by FTGFM(1,1) model

Close modal

From the perspective of error distribution, Table 5 and Figure 4 present the MAPE, RMSE and Theil's U statistic for all prediction models in both training and testing phases, and the results show that the FTGFM(1,1) model achieves the lowest values of these three indicators among BPNN, ARIMA, GM(1,1), FTGM(1,1) and SGM(1,1) models, reflecting its distinct superiority in simulation and forecasting performance. In the model fitting (training set) phase, for the NPCT dataset, FTGFM(1,1) achieved a MAPE of 0.59%, an RMSE of 984.77 and a U of 0.01; for the CCnT dataset, the MAPE was 0.91%, the RMSE was 25.24 and the U was 0.01. In the forecasting (testing set) phase, for the NPCT dataset, FTGFM(1,1) obtained a MAPE of 1.37%, an RMSE of 3335.17 and a U of 0.02; for the CCnT dataset, the MAPE was 1.18%, the RMSE was 47.98 and the U was 0.01. These are the lowest values among all comparison models for both datasets, demonstrating relatively satisfactory performance in simulating and forecasting the relevant time-series data.

Table 5

MAPE, RMSE and U among six models in NPCT and CCnT

SetIndexFTGFM(1,1)BPNNARIMAGM(1,1)FTGM(1,1)SGM(1,1)
NPCTtrain set     
MAPE0.591.421.371.435.301.41
RMSE984.772415.482331.102376.658273.802321.57
U0.010.020.020.020.060.02
test set      
MAPE1.373.131.411.625.372.38
RMSE3335.177207.873364.432743.4510522.215227.56
U0.020.040.020.010.050.03
CCnTtrain set     
MAPE0.911.412.152.239.002.20
RMSE25.2438.6956.8257.83243.2357.52
U0.010.020.030.030.100.03
test set      
MAPE1.181.991.811.911.381.79
RMSE47.9883.0275.4479.8764.2772.06
U0.010.030.020.030.020.02
Figure 4

MAPE of the six models in NPCT and CCnT

Figure 4

MAPE of the six models in NPCT and CCnT

Close modal

3.3.2 Analysis of prediction results

Based on the systematic comparison of prediction performance among various models in the preceding section, the FTGFM(1,1) model achieves the highest accuracy and stability in the monthly forecasting of NPCT and coastal container throughput (CCnT). The 12-month forecast values for 2026 generated by this model (see Table 6 and Figure 3) reveal that both throughput indicators exhibit a fluctuating yet steady upward trend, which is highly consistent with the periodic patterns of historical data. Specifically, the annual range of NPCT in 2026 is 159990.41–171266.74, climbing gradually from 160183.19 in January to 168992.75 in December and peaking in November. CCnT fluctuates within the range of 2694.23–2837.87, starting at 2763.15 in early January and closing at 2787.55 at the end of the year, with its annual high recorded in July. This moderate growth trend not only reflects the seasonal patterns of port operations but also validates the FTGFM(1,1) model's ability to accurately capture both trends and fluctuations.

Table 6

Prediction results of FTGFM(1,1) in NPCT and CCnT

MonthNPCTCCnT
2026–1160183.192763.15
2026–2159990.412753.47
2026–3163379.422694.23
2026–4160725.532714.13
2026–5166261.792829.04
2026–6165087.502824.49
2026–7160988.532837.87
2026–8166999.172880.45
2026–9164145.172759.37
2026–10166750.922773.52
2026–11171266.742789.39
2026–12168992.752787.55

From the perspective of driving logic, the recovery of global trade and the resilience restoration of cross-border supply chains provide demand-side support for NPCT and CCnT. Technological innovations such as port automation upgrades and the popularization of intelligent dispatching systems release operational efficiency from the supply side, effectively underpinning throughput expansion. Coupled with the coordinated development of domestic and international economic cycles, these factors jointly drive the steady growth of national port throughput in 2026.

The above monthly forecast results for NPCT and CCnT in 2026 can provide key quantitative references for port operations, logistics planning and macroeconomic decision-making. On one hand, port operators can deploy quay cranes, storage yard capacity and collection-distribution transportation resources in advance based on the monthly fluctuation characteristics, so as to alleviate peak-season operational pressures, optimize off-season resource allocation, reduce operational costs and improve service efficiency. On the other hand, logistics and supply chain enterprises can rationally plan route layout, cabin allocation and warehouse network construction based on the steady growth trend, respond to changes in trade flow in advance and enhance supply chain resilience and risk resistance. From a macro governance standpoint, these forecasts provide empirical evidence for transportation and commerce authorities to refine port development strategies, upgrade intermodal logistics systems and design trade-stabilizing policies. They help pinpoint high-pressure months for infrastructure investment and channel security, while serving as leading indicators to assess global trade recovery and monitor progress in the dual circulation of domestic and international economies–ultimately fostering more precise alignment between port development and broader macroeconomic goals.

Ports throughput forecasting serves as the foundation for port structure optimization and infrastructure construction. Accurate prediction of future ports throughput facilitates the advancement of transportation construction in port cities and contributes significantly to the sustainable development of the transportation industry. This study focuses on the quarterly and monthly variations of China's ports throughput, adopting a novel grey prediction model capable of capturing periodic data fluctuations for forecasting. The proposed model delivers superior prediction performance.

To incorporate more uncertain information, this paper presents the FTGFM(1,1) model applicable to prediction, which alleviates the errors of the traditional GM(1,1) model regarding the new information priority principle and data feature fitting to a certain extent. In accordance with the new information priority rule in grey system theory, a fractional-order accumulation operator is introduced to construct the r-AGO sequence. Furthermore, the model integrates Fourier function terms and time-varying terms, exhibiting greater advantages in forecasting sequences with periodic and time-varying trends, especially seasonal data.

To verify the effectiveness and practicability of the model, the FTGFM(1,1) is employed to conduct forecasting via four application cases: quarterly data of coastal cargo throughput and CFTCT, as well as monthly data of NPCT and coastal container throughput. The multi-dimensional design can effectively verify the stability, adaptability and generalization ability of the proposed FTGFM(1,1) model under different data frequencies and spatial scales, which provides a referable modeling paradigm for port throughput forecasting with multi-scenario and multi-scale characteristics. The proposed model is compared with BPNN, ARIMA, GM(1,1), FTGM(1,1) and SGM(1,1) models. The results demonstrate that the FTGFM(1,1) can accurately predict sequences with time-varying and periodic trends, with MAPE values of both the training and testing sets below 3, outperforming the other five models in terms of accuracy and fluctuation similarity.

The optimized model in this study performs better than other benchmarks in port throughput forecasting, indicating that the improved grey prediction framework can also be applied to forecast data with variable characteristics. Admittedly, this research still has limitations. In many practical systems, the relationships between variables are not immediately reflected but feature a certain time-delay effect. Therefore, the time-delay effect may be considered in the construction of prediction models in future work, enabling the model to capture future variable changes more accurately.

Hui Eddie
,
C.M.
,
Seabrooke
,
W.
and
Wong Gordon
,
K.C.
(
2004
), “
Forecasting cargo throughput for the port of Hong Kong: error correction model approach
”,
Journal of Urban Planning and Development
, Vol. 
130
No. 
4
, pp. 
195
-
203
, doi: .
Li
,
X.
,
Sun
,
Y.
,
Shi
,
Y.
,
Zhao
,
Y.
and
Zhou
,
S.
(
2025
), “
A novel self-adaptive multivariate grey model with external intervention for port cargo throughput prediction
”,
Grey Systems: Theory and Application
, Vol. 
15
No. 
2
, pp. 
257
-
278
, doi: .
Li
,
W.
,
Cai
,
L.
,
He
,
L.
and
Guo
,
W.
(
2024
), “
Scheduling techniques for addressing uncertainties in container ports: a systematic literature review
”,
Applied Soft Computing
, Vol. 
162
, 111820, doi: .
Liu
,
B.
,
Wang
,
X.
and
Liang
,
X.
(
2023
), “
Neural network-based prediction system for port throughput: a case study of Ningbo-Zhoushan Port
”,
Research in Transportation Business and Management
, Vol. 
51
, 101067, doi: .
Liu
,
X.
,
Li
,
S.
and
Gao
,
M.
(
2024
), “
A discrete time-varying grey Fourier model with fractional order terms for electricity consumption forecast
”,
Energy
, Vol. 
296
, 131065, doi: .
Niu
,
M.
,
Hu
,
Y.
,
Sun
,
S.
and
Liu
,
Y.
(
2018
), “
A novel hybrid decomposition-ensemble model based on VMD and HGWO for container throughput forecasting
”,
Applied Mathematical Modelling
, Vol. 
57
, pp. 
163
-
178
, doi: .
Ma
,
X.
and
Liu
,
Z.
(
2017
), “
Application of a novel time-delayed polynomial grey model to predict the natural gas consumption in China
”,
Journal of Computational and Applied Mathematics
, Vol. 
324
, pp. 
17
-
24
, doi: .
Mokhtar
,
K.
,
Mhd Ruslan
,
S.M.
,
Abu Bakar
,
A.
,
Jeevan
,
J.
and
Othman
,
M.R.
(
2022
), “
The analysis of container terminal throughput using ARIMA and SARIMA
”, in
Design in Maritime Engineering
,
Springer International Publishing
,
Cham
, pp.
229
-
243
.
Qian
,
W.
,
Dang
,
Y.
and
Liu
,
S.
(
2012
), “
Grey GM (1, 1, tα) model with time power and its application
”,
Systems Engineering-Theory and Practice
, Vol. 
32
No. 
10
, pp. 
2247
-
2252
.
Sun
,
Y.
,
Zhang
,
Y.
and
Zhao
,
Z.
(
2024
), “
Research and application of a novel grey multivariable model in port scale prediction under the impact of free trade zone
”,
Marine Economics and Management
, Vol. 
7
No. 
1
, pp. 
79
-
101
, doi: .
Tang
,
X.
and
Zhu
,
Z.
(
2025
), “
A novel self-adaptive nonlinear discrete grey model and its application
”,
Marine Economics and Management
, Vol. 
8
No. 
2
, pp. 
127
-
161
, doi: .
Twrdy
,
E.
and
Batista
,
M.
(
2016
), “
Modeling of container throughput in northern Adriatic ports over the period 1990-2013
”,
Journal of Transport Geography
, Vol. 
52
, pp. 
131
-
142
, doi: .
Wang
,
Y.
and
Wang
,
H.
(
2023
), “
Forecasting CO2 emissions using a novel fractional discrete grey Bernoulli model: a case of Shaanxi in China
”,
Urban Climate
, Vol. 
49
, 101452, doi: .
Wu
,
L.
,
Liu
,
S.
,
Yao
,
L.
,
Yan
,
S.
and
Liu
,
D.
(
2013
), “
Grey system model with the fractional order accumulation
”,
Communications in Nonlinear Science and Numerical Simulation
, Vol. 
18
No. 
7
, pp. 
1775
-
1785
, doi: .
Xie
,
G.
,
Wang
,
S.
,
Zhao
,
Y.
and
Lai
,
K.K.
(
2013
), “
Hybrid approaches based on LSSVR model for container throughput forecasting: a comparative study
”,
Applied Soft Computing
, Vol. 
13
No. 
5
, pp. 
2232
-
2241
, doi: .
Xiong
,
P.
,
Zeng
,
X.
,
Wu
,
L.
and
Shu
,
H.
(
2024
), “
A fluctuation data grey model and its prediction of rainstorm days
”,
Applied Mathematical Modelling
, Vol. 
127
, pp. 
767
-
783
, doi: .
Yao
,
W.
(
2021
), “
Prediction of container throughput of Dalian Port Based on factor analysis and ARIMA model
”,
IOP Conference Series: Earth and Environmental Science
, Vol. 
831
No. 
1
, 012046, doi: .
Yang
,
L.
,
Yang
,
P.
,
Li
,
X.
,
Cui
,
Y.
and
Wan
,
G.
(
2025
), “
An innovative data-driven grey Bernoulli model for the port container throughput forecasting
”,
Applied Mathematical Modelling
, Vol. 
154
, 116709, doi: .
Zheng
,
C.
,
Wu
,
W.-Z.
,
Xie
,
W.
,
Li
,
Q.
and
Zhang
,
T.
(
2021
), “
Forecasting the hydroelectricity consumption of China by using a novel unbiased nonlinear grey Bernoulli model
”,
Journal of Cleaner Production
, Vol. 
278
, 123903, doi: .
Zhou
,
W.
and
Ding
,
S.
(
2021
), “
A novel discrete grey seasonal model and its applications
”,
Communications in Nonlinear Science and Numerical Simulation
, Vol. 
93
, 105493, doi: .
Zhou
,
W.
,
Wu
,
X.
,
Ding
,
S.
and
Cheng
,
Y.
(
2020
), “
Predictive analysis of the air quality indicators in the Yangtze river delta in China: an application of a novel seasonal grey model
”,
Science of The Total Environment
, Vol. 
748
, 141428, doi: .
Ziran
,
J.
,
Chunfang
,
P.
,
Huayou
,
Z.
,
Chengjin
,
W.
and
Shilin
,
Y.
(
2022
), “
Temporal and spatial evolution and influencing factors of the port system in Yangtze river delta region from the perspective of dual circulation: comparing port domestic trade throughput with port foreign trade throughput
”,
Transport Policy
, Vol. 
118
, pp. 
79
-
90
, doi: .
Published in Marine Economics and Management. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at Link to the terms of the CC BY 4.0 licence.

or Create an Account

Close Modal
Close Modal