In time-critical natural disaster scenarios, unmanned aerial vehicles (UAVs) are crucial for search and rescue. While mobile edge computing (MEC) enables real-time data processing for these UAVs, it introduces a significant challenge: balancing low-delay data analysis to locate survivors against the UAVs’ limited battery life. This paper proposes a method to minimize task processing delay in dynamic rescue environments while conserving UAV energy.
To overcome this challenge, this study proposes the multi-queue Lyapunov-guided deep reinforcement learning (MQ-LyDRL) method to minimize task processing delay by jointly optimizing task offloading and resource allocation. This method innovatively integrates Lyapunov optimization with DRL. Specifically, by constructing Lyapunov functions based on queue stability and energy constraints, MQ-LyDRL decomposes the complex multistage stochastic optimization problem into a deterministic, per-time-slot subproblem. An adaptive DRL framework is then employed to solve this subproblem, enabling it to learn the optimal policy for real-time decision-making without requiring prior knowledge of the environment’s dynamics.
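The drift-plus-penalty structure underlying this decomposition can be sketched as follows. This is a minimal illustrative example, not the paper's exact formulation: the trade-off weight `V`, the per-slot `energy_budget`, and the two candidate (delay, energy) decisions are all hypothetical values chosen for demonstration, and the greedy minimization stands in for the DRL agent that MQ-LyDRL actually trains.

```python
def per_slot_objective(delay, energy, q_energy, V):
    """Drift-plus-penalty score for one candidate decision in one slot.

    V weights the delay penalty against the virtual-queue (energy) drift;
    larger V favors low delay at the cost of slower queue stabilization.
    """
    return V * delay + q_energy * energy

def update_virtual_queue(q_energy, energy_used, energy_budget):
    """Virtual energy queue: grows when a slot exceeds its energy budget,
    drains (toward zero) when it stays under budget."""
    return max(q_energy + energy_used - energy_budget, 0.0)

# Toy per-slot decision: choose between two hypothetical offloading options.
candidates = [
    (0.8, 0.5),  # process on the UAV: higher delay, lower energy
    (0.3, 1.2),  # offload to the MEC server: lower delay, higher energy
]
V, q_energy, energy_budget = 10.0, 0.0, 0.8

for slot in range(3):
    # Deterministic per-slot subproblem: minimize the drift-plus-penalty score.
    delay, energy = min(
        candidates, key=lambda c: per_slot_objective(c[0], c[1], q_energy, V)
    )
    q_energy = update_virtual_queue(q_energy, energy, energy_budget)
    print(f"slot {slot}: delay={delay}, energy={energy}, Q={q_energy:.1f}")
```

As the virtual queue `Q` grows after over-budget slots, energy-hungry choices are penalized more heavily, which is how the per-slot subproblem enforces the long-term energy constraint without knowing future arrivals.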
Extensive simulations demonstrate that MQ-LyDRL significantly outperforms existing methods. It maintains operational stability under fluctuating conditions and reduces average delay by at least 9.21% while adhering to an energy budget. This reduction translates to faster data-to-decision cycles, accelerating life-saving interventions, while adherence to the energy budget extends the operational time of UAVs.
This work’s primary value lies in providing a blueprint for intelligent and efficient edge computing systems in high-stakes scenarios. By combining stability theory with adaptive artificial intelligence (AI), this study offers a practical framework applicable to critical missions where performance and reliability are non-negotiable.
