Deep reinforcement learning-based optimisation of reverse logistics transport in closed-loop supply chain

Zou, Hao; Yi, Lu; Zhang, Tingjie; Yu, Chao; Huang, Jun

doi:10.1680/jtran.24.00163

Article navigation

Research Article| April 23 2025

Deep reinforcement learning-based optimisation of reverse logistics transport in closed-loop supply chain

Hao Zou;

Hao Zou

Business School,

Sichuan University

, Chengdu, PR

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Lu Yi;

Lu Yi

Professor, Business School,

Sichuan University

, Chengdu, PR

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Tingjie Zhang;

Tingjie Zhang

Business School,

Sichuan University

, Chengdu, PR

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Chao Yu;

Chao Yu

Business School,

Sichuan University

, Chengdu, PR

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Jun Huang

Business School,

Sichuan University

, Chengdu, PR

China

Corresponding author Jun Huang (huangjun1@stu.scu.edu.cn)

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Corresponding author Jun Huang (huangjun1@stu.scu.edu.cn)

Publisher: Emerald Publishing

Received: December 26 2024

Accepted: February 21 2025

Online ISSN: 1751-7710

Print ISSN: 0965-092X

Funding

Funding Group:

Award Group:
- Funder(s):
  Major Programme of the National Social Science Foundation of China
- Award Id(s):
  23&ZD051
Funding Statement(s):
This work was supported by the Major Programme of the National Social Science Foundation of China (grant no. 23&ZD051).

2025

Emerald Publishing Limited

Licensed re-use rights only

Proceedings of the Institution of Civil Engineers - Transport (2025) 178 (8): 583–600.

https://doi.org/10.1680/jtran.24.00163

Traditional optimisation methods often struggle to address the stochastic and dynamic characteristics of reverse logistics. Deep reinforcement learning (DRL), with its ability to learn adaptive policies under uncertainty, is well suited to this challenge. This study investigates the optimisation of green transportation strategies in reverse logistics systems through a DRL framework. By formulating the problem as a Markov decision process, this study integrates economic profitability, carbon dioxide emissions reduction and transportation efficiency into a dynamic reward function. The experimental results of the basic model and the extended model show that the superior performance of the DRL-based approach, which significantly surpasses traditional methods such as genetic algorithms and random strategies in adaptability and optimisation effectiveness. These findings provide theoretical foundations and practical insights for advancing sustainable logistics practices in complex systems.

2025

Emerald Publishing Limited

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

Deep reinforcement learning-based optimisation of reverse logistics transport in closed-loop supply chain

Email Alerts

Cited By

Deep reinforcement learning-based optimisation of reverse logistics transport in closed-loop supply chain

Sign in

Client Account

ICE Member Sign In

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Sharing Unavailable