Skip to Main Content
Article navigation

Traditional optimisation methods often struggle to address the stochastic and dynamic characteristics of reverse logistics. Deep reinforcement learning (DRL), with its ability to learn adaptive policies under uncertainty, is well suited to this challenge. This study investigates the optimisation of green transportation strategies in reverse logistics systems through a DRL framework. By formulating the problem as a Markov decision process, this study integrates economic profitability, carbon dioxide emissions reduction and transportation efficiency into a dynamic reward function. The experimental results of the basic model and the extended model show that the superior performance of the DRL-based approach, which significantly surpasses traditional methods such as genetic algorithms and random strategies in adaptability and optimisation effectiveness. These findings provide theoretical foundations and practical insights for advancing sustainable logistics practices in complex systems.

Licensed re-use rights only
You do not currently have access to this content.
Don't already have an account? Register

Purchased this content as a guest? Enter your email address to restore access.

Please enter valid email address.
Email address must be 94 characters or fewer.
Pay-Per-View Access
$41.00
Rental

or Create an Account

Close Modal
Close Modal