Sound source localization for variable microphone arrays based on cross-domain collaborative features and multi-head graph attention mechanism

Liu, Mengran; Gong, Chuanqi; Li, Ziming; Wang, Cong; Jian, Zeming

doi:10.1108/SR-07-2025-0525

Article navigation

Research Article| November 17 2025

Sound source localization for variable microphone arrays based on cross-domain collaborative features and multi-head graph attention mechanism

Mengran Liu;

Mengran Liu

Hubei Key Laboratory of Modern Manufacturing Quantity Engineering, School of Mechanical Engineering,

Hubei University of Technology

, Wuhan,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Chuanqi Gong;

Chuanqi Gong

Hubei Key Laboratory of Modern Manufacturing Quantity Engineering, School of Mechanical Engineering,

Hubei University of Technology

, Wuhan,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Ziming Li;

Ziming Li

Hubei Key Laboratory of Modern Manufacturing Quantity Engineering, School of Mechanical Engineering,

Hubei University of Technology

, Wuhan,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Cong Wang;

Cong Wang

Hubei Key Laboratory of Modern Manufacturing Quantity Engineering, School of Mechanical Engineering,

Hubei University of Technology

, Wuhan,

China

Search for other works by this author on:

This Site

PubMed

Google Scholar

Zeming Jian

Hubei Key Laboratory of Modern Manufacturing Quantity Engineering, School of Mechanical Engineering,

Hubei University of Technology

, Wuhan,

China

Corresponding author Zeming Jian jianzemingx@163.com

Search for other works by this author on:

This Site

PubMed

Google Scholar

Author & Article Information

Corresponding author Zeming Jian jianzemingx@163.com

Publisher: Emerald Publishing

Received: July 21 2025

Revision Received: September 11 2025

Revision Received: October 02 2025

Accepted: October 05 2025

Online ISSN: 1758-6828

Print ISSN: 0260-2288

Funding

Funding Group:

Award Group:
- Funder(s):
  National Natural Science Foundation of China
- Award Id(s):
  51805154
Award Group:
- Funder(s):
  Hubei Provincial Natural Science Foundation of China
- Award Id(s):
  2022CFB473
Funding Statement(s):
This work was funded by the National Natural Science Foundation of China (Grant No.51805154) and the Hubei Provincial Natural Science Foundation of China (2022CFB473).

2025

Emerald Publishing Limited

Licensed re-use rights only

Sensor Review (2026) 46 (4): 557–568.

https://doi.org/10.1108/SR-07-2025-0525

Purpose

To address the problem of microphone count variation in complex acoustic environments with microphone arrays, traditional sound source localization methods and single features cannot achieve accurate localization, and they rely heavily on fixed microphone arrays. Once the array structure changes, re-localization is required. To solve this problem, this paper aims to propose a variable-microphone sound source localization method based on cross-domain collaborative features and multi-head graph attention mechanism.

Design/methodology/approach

First, time-frequency domain fusion features are obtained using Short-Time Fourier Transform magnitude spectra (STFT), Inter-channel Phase Difference (IPD) and Generalized Cross-Correlation with Phase Transform (GCC-PHAT). These three complementary features jointly provide richer and more stable acoustic cues. Then, k-nearest neighbor (k-NN) is used to construct graph data for the microphone array, capturing spatial relationships among microphones. Finally, a multi-head graph attention mechanism is integrated into the graph neural network to adaptively learn the weights of neighboring nodes, enabling accurate localization even when the microphone topology changes.

Findings

Simulation results show that the proposed method can accurately localize multiple sound sources and maintains high localization accuracy and low error even under challenging conditions such as reverberation and noise. Experimental results demonstrate that in real-world indoor and outdoor environments, the model achieves over 86% accuracy in multi-source localization even with damaged microphones, with localization errors within 0.47 meters.

Originality/value

The proposed method achieves accurate and robust multi-source localization in acoustically complex scenarios with varying microphone counts, making it suitable for harsh indoor and outdoor environments and offering a novel approach for advancing sound source localization technology.

2025

Emerald Publishing Limited

Licensed re-use rights only

You do not currently have access to this content.

Don't already have an account? Register

Sound source localization for variable microphone arrays based on cross-domain collaborative features and multi-head graph attention mechanism

New and popular articles

Email Alerts

Cited By

Sound source localization for variable microphone arrays based on cross-domain collaborative features and multi-head graph attention mechanism

Sign in

Client Account

ICE Member Sign In

New and popular articles

Email Alerts

Suggested Reading

Recommended for you

Cited By

Sharing Unavailable