Traditional optimisation methods often struggle to address the stochastic and dynamic characteristics of reverse logistics. Deep reinforcement learning (DRL), with its ability to learn adaptive policies under uncertainty, is well suited to this challenge. This study investigates the optimisation of green transportation strategies in reverse logistics systems through a DRL framework. By formulating the problem as a Markov decision process, this study integrates economic profitability, carbon dioxide emissions reduction and transportation efficiency into a dynamic reward function. The experimental results of the basic model and the extended model show that the superior performance of the DRL-based approach, which significantly surpasses traditional methods such as genetic algorithms and random strategies in adaptability and optimisation effectiveness. These findings provide theoretical foundations and practical insights for advancing sustainable logistics practices in complex systems.
Article navigation
1 November 2025
Research Article|
April 23 2025
Deep reinforcement learning-based optimisation of reverse logistics transport in closed-loop supply chain Available to Purchase
Hao Zou;
Hao Zou
Business School,
Sichuan University
, Chengdu, PR China
Search for other works by this author on:
Lu Yi;
Lu Yi
Professor, Business School,
Sichuan University
, Chengdu, PR China
Search for other works by this author on:
Tingjie Zhang;
Tingjie Zhang
Business School,
Sichuan University
, Chengdu, PR China
Search for other works by this author on:
Chao Yu;
Chao Yu
Business School,
Sichuan University
, Chengdu, PR China
Search for other works by this author on:
Jun Huang
Business School,
Sichuan University
, Chengdu, PR China
Corresponding author Jun Huang (huangjun1@stu.scu.edu.cn)
Search for other works by this author on:
Corresponding author Jun Huang (huangjun1@stu.scu.edu.cn)
Publisher: Emerald Publishing
Received:
December 26 2024
Accepted:
February 21 2025
Online ISSN: 1751-7710
Print ISSN: 0965-092X
Funding
Funding Group:
- Award Group:
- Funder(s): Major Programme of the National Social Science Foundation of China
- Award Id(s): 23&ZD051
- Funder(s):
- Funding Statement(s): This work was supported by the Major Programme of the National Social Science Foundation of China (grant no. 23&ZD051).
© 2025 Emerald Publishing Limited
2025
Emerald Publishing Limited
Licensed re-use rights only
Proceedings of the Institution of Civil Engineers - Transport (2025) 178 (8): 583–600.
Article history
Received:
December 26 2024
Accepted:
February 21 2025
Citation
Zou H, Yi L, Zhang T, Yu C, Huang J (2025), "Deep reinforcement learning-based optimisation of reverse logistics transport in closed-loop supply chain". Proceedings of the Institution of Civil Engineers - Transport, Vol. 178 No. 8 pp. 583–600, doi: https://doi.org/10.1680/jtran.24.00163
Download citation file:
Suggested Reading
Two-stage optimization for site selection and resource allocation of metro rescue stations
Proceedings of the Institution of Civil Engineers - Transport (December,2024)
Transport–environment–economy linkages in Yellow River’s ‘Ji-shape’ bend metro area
Proceedings of the Institution of Civil Engineers - Transport (October,2025)
Impact of Covid-19 on motorized transport modes in Tehran: competition or complementarity?
Proceedings of the Institution of Civil Engineers - Transport (October,2024)
Environmental and economical assessment of earth-retaining walls for design optimisation
Environmental Geotechnics (April,2024)
Optimization strategies for resource scheduling in vehicular network slicing oriented to differentiated web application services
International Journal of Web Information Systems (February,2026)
Related Chapters
Chapter 14 How Much Should a Nation Save? A New Answer
Economic Growth and Development
The Development of Green Sustainable Transportation in China
Sustainable Transport for Chinese Cities
A 5D Land-use Transport Model for a High Density, Rapidly Growing City
Sustainable Transport for Chinese Cities
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
