Adaptive walking speed motion control of Kuavo humanoid robot based on inverse reinforcement learning

Wu, Yucong; Wang, Song; Pan, Yang; He, Zhicheng; Leng, Xiaokun

doi:10.1108/IR-01-2025-0015


Research Article | August 08 2025


Yucong Wu, Southern University of Science and Technology, Shenzhen, China

Song Wang, Soochow University, Suzhou, China (corresponding author: wangk4386@163.com)

Yang Pan, Southern University of Science and Technology, Shenzhen, China

Zhicheng He, Harbin Institute of Technology, Shenzhen, China

Xiaokun Leng, Harbin Institute of Technology, Shenzhen, China

Author & Article Information


Publisher: Emerald Publishing

Received: February 08 2025

Revision Received: May 15 2025

Accepted: June 14 2025

Online ISSN: 1758-5791

Print ISSN: 0143-991X

Funding

This work is supported by the Shenzhen Special Fund for Future Industrial Development (No. KJZD20230923114222045).

2025 Emerald Publishing Limited. Licensed re-use rights only.

Industrial Robot (2026) 53 (2): 235–245.

https://doi.org/10.1108/IR-01-2025-0015

Purpose

This paper aims to tackle the challenges of high dimensionality, strong nonlinearity and tight coupling in motion control for the Kuavo humanoid robot by introducing a novel method based on inverse reinforcement learning (IRL).

Design/methodology/approach

To overcome the reliance of traditional approaches on precise dynamic models and manually designed controllers, the authors use IRL to learn reward functions and policies from limited expert demonstrations. An action re-targeting technique maps human expert motion data to the Kuavo’s action space, generating initial motion references. The IRL framework uses these demonstrations to learn implicit reward functions and incorporates velocity targets into the policy, formulating motion control as a velocity-conditioned Markov decision process to improve adaptability.
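The abstract does not state which IRL algorithm or reward parameterisation the authors use, so the following is only a minimal sketch of the two ideas named above, under stated assumptions: the commanded velocity is appended to the robot state to form a velocity-conditioned observation, and a linear reward is nudged toward expert behaviour by a single feature-matching gradient step. All function names, the identity feature map, and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np

def conditioned_observation(state, target_velocity):
    """Append the commanded velocity to the state (velocity-conditioned input)."""
    state = np.asarray(state, dtype=float)
    target = np.atleast_1d(np.asarray(target_velocity, dtype=float))
    return np.concatenate([state, target])

def feature_expectation(observations):
    """Mean feature vector over a batch; identity features are an assumption."""
    return np.mean(np.asarray(observations, dtype=float), axis=0)

def reward_weight_step(w, expert_obs, policy_obs, lr=0.1):
    """One feature-matching step: shift w toward features the expert visits more."""
    grad = feature_expectation(expert_obs) - feature_expectation(policy_obs)
    return w + lr * grad

def learned_reward(w, obs):
    """Linear reward r(o) = w . o (assumed form, for illustration only)."""
    return float(np.dot(w, obs))

# Toy data: two-dimensional states, one commanded forward speed of 0.5.
expert_obs = [conditioned_observation([0.0, 1.0], 0.5),
              conditioned_observation([0.2, 0.8], 0.5)]
policy_obs = [conditioned_observation([1.0, 0.0], 0.5),
              conditioned_observation([0.8, 0.2], 0.5)]

w = reward_weight_step(np.zeros(3), expert_obs, policy_obs)
```

After one step on this toy data, the learned reward ranks the expert observation above the policy observation. In a full pipeline the policy would then be re-optimised against the updated reward and the two steps alternated; that loop is omitted here.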

Findings

Experimental results demonstrate that this method effectively recovers reward functions, generates natural and stable motion control policies and improves the robot’s adaptability across different speeds and environments. The Kuavo robot achieves real-time motion adjustments according to speed commands, ensuring rapid response and stable control during variations in speed.

Originality/value

This study represents a pioneering application of IRL to humanoid robot motion control, particularly for the Kuavo robot. By leveraging limited expert demonstrations and integrating velocity-conditioned policies, the proposed method facilitates autonomous acquisition of diverse motion skills and adaptation to various tasks and environmental conditions, marking a significant advancement over traditional methods.

