Reinforcement learning control for a flapping-wing micro aerial vehicle with output constraint

Huang, Haifeng; Wu, Xiaoyang; Wang, Tingting; Sun, Yongbin; Fu, Qiang

doi:10.1108/AA-05-2022-0140


Research Article | October 27, 2022


Haifeng Huang, Xiaoyang Wu, Tingting Wang, Yongbin Sun and Qiang Fu

All authors: School of Intelligence Science and Technology, University of Science and Technology Beijing, Beijing, China, and Institute of Artificial Intelligence, University of Science and Technology Beijing, Beijing, China

Qiang Fu can be contacted at: fuqiang@ustb.edu.cn


Publisher: Emerald Publishing

Received: May 27 2022

Revision Received: August 09 2022

Revision Received: September 01 2022

Accepted: September 06 2022

Online ISSN: 1758-4078

Print ISSN: 0144-5154

© 2022 Emerald Publishing Limited. Licensed re-use rights only.

Assembly Automation (2022) 42 (6): 730–741.

https://doi.org/10.1108/AA-05-2022-0140

Purpose

This paper aims to study the application of reinforcement learning (RL) in the control of an output-constrained flapping-wing micro aerial vehicle (FWMAV) with system uncertainty.

Design/methodology/approach

A six-degrees-of-freedom hummingbird model is used, neglecting the inertial effects of the wings. An RL algorithm based on the actor–critic framework is applied, consisting of an actor network with an unknown policy gradient and a critic network with an unknown value function. Because neural networks (NNs) perform well in fitting nonlinearities and have good optimization characteristics, an actor–critic NN optimization algorithm is designed in which the actor NN generates a policy and the critic NN approximates the cost functions. In addition, to ensure safe and stable flight of the FWMAV, a barrier Lyapunov function is used to keep the flight states within predefined regions. The stability of the system is analyzed using Lyapunov stability theory, and the feasibility of RL for the control of an FWMAV is verified through simulation.
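The abstract does not state which barrier Lyapunov function is used. As a minimal sketch, assuming the commonly used log-type form V(e) = 0.5 ln(b^2 / (b^2 - e^2)) for a scalar tracking error e with bound b, the following shows why such a function keeps a state inside a predefined region: V is zero at e = 0 and grows without bound as |e| approaches b. The function names and the scalar setting are illustrative assumptions, not taken from the paper.

```python
import math


def log_barrier_lyapunov(error: float, bound: float) -> float:
    """Log-type barrier Lyapunov function (assumed form, not from the paper).

    V(e) = 0.5 * ln(b^2 / (b^2 - e^2)), defined only for |e| < b.
    V(0) = 0 and V -> infinity as |e| -> b.
    """
    if abs(error) >= bound:
        raise ValueError("tracking error must stay strictly inside the bound")
    return 0.5 * math.log(bound**2 / (bound**2 - error**2))


def barrier_gradient(error: float, bound: float) -> float:
    """Derivative dV/de = e / (b^2 - e^2) of the function above."""
    if abs(error) >= bound:
        raise ValueError("tracking error must stay strictly inside the bound")
    return error / (bound**2 - error**2)


if __name__ == "__main__":
    b = 1.0  # hypothetical constraint bound on the tracking error
    for e in (0.0, 0.5, 0.9, 0.99):
        print(f"e={e:4.2f}  V={log_barrier_lyapunov(e, b):.4f}  "
              f"dV/de={barrier_gradient(e, b):.4f}")
```

In a controller built on such a function, keeping V bounded along trajectories implies that the error never reaches the bound, which is the usual route to an output-constraint guarantee.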

Findings

The proposed RL control scheme is effective in ensuring trajectory tracking of the FWMAV in the presence of output constraints and system uncertainty.

Originality/value

A novel RL algorithm based on the actor–critic framework is applied to the control of an FWMAV with system uncertainty. For stable and safe flight of the FWMAV, the output constraint problem is considered and solved by barrier Lyapunov function-based control.
