EEG-based Auditory Attention Detection in Cocktail Party Environment Open Access

[2] Accou, M. J. Monesi, Montoya, Francart, et al., “Modeling the relationship between acoustic stimulus and EEG with a dilated convolutional neural network,” in 2020 28th European Signal Processing Conference (EUSIPCO), IEEE, 2021, 1175–1179.

[3] Ahveninen, Hämäläinen, I. P. Jääskeläinen, S. P. Ahlfors, Huang, F.-H. Lin, Raij, Sams, C. E. Vasios, and J. W. Belliveau, “Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise,” Proceedings of the National Academy of Sciences, 108, 2011, 4182–4187.

[4] Akbari, Khalighinejad, J. L. Herrero, A. D. Mehta, and Mesgarani, “Towards reconstructing intelligible speech from the human auditory cortex,” Scientific Reports, 2019.

[5] Akram, Presacco, J. Z. Simon, S. A. Shamma, and Babadi, “Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling,” NeuroImage, 124, 2016, 906–917.

[6] Alickovic, Lunner, Gustafsson, and Ljung, “A Tutorial on Auditory Attention Identification Methods,” Frontiers in Neuroscience, 2019, DOI: https://doi.org/10.3389/fnins.2019.00153, https://www.frontiersin.org/articles/10.3389/fnins.2019.00153.

[7] W. M. H. Bakay, L. A. Anderson, J. A. Garcia-Lazaro, McAlpine, and Schaette, “Hidden hearing loss selectively impairs neural adaptation to loud sound environments,” Nature Communications, 2018.

[8] Bednar, F. M. Boland, and E. C. Lalor, “Different spatio-temporal electroencephalography features drive the successful decoding of binaural and monaural cues for sound localization,” European Journal of Neuroscience, 2017, 679–689.

[9] Bednar and E. C. Lalor, “Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG,” NeuroImage, 205, 2020, 116283.

[10] Bengio, Courville, and Vincent, “Representation learning: A review and new perspectives,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2013, 1798–1828.

[11] Bertrand and Moonen, “Energy-based multi-speaker voice activity detection with an ad hoc microphone array,” in 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2010.

[12] Biesmans, Das, Francart, and Bertrand, “Auditory-inspired speech envelope extraction methods for improved EEG-based auditory attention detection in a cocktail party scenario,” IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2016, 402–412.

[13] Blank and M. H. Davis, “Prediction errors but not sharpened signals simulate multivoxel fMRI patterns during speech perception,” PLoS Biology, 2016, e1002577.

[14] M. G. Bleichner and Debener, “Concealed, unobtrusive ear-centered EEG acquisition: cEEGrids for transparent EEG,” Frontiers in Human Neuroscience, 2017, 163.

[15] M. G. Bleichner, Lundbeck, Selisky, Minow, Jäger, Emkes, Debener, and De Vos, “Exploring miniaturized EEG electrodes for brain-computer interfaces. An EEG you do not see?” Physiological Reports, 2015, e12362.

[16] J. N. de Boer, M. M. Linszen, de Vries, M. J. Schutte, M. J. Begemann, S. M. Heringa, M. M. Bohlken, Hugdahl, Aleman, F. N. Wijnen, et al., “Auditory hallucinations, top-down processing and language perception: a general population study,” Psychological Medicine, 2019, 2772–2780.

[17] A. W. Bronkhorst, “The cocktail-party problem revisited: early processing and selection of multi-talker speech,” Attention, Perception, & Psychophysics, 2015, 1465–1487.

[18] Cai, Liu, and Xie, “A Neural-Inspired Architecture for EEG-Based Auditory Attention Detection,” IEEE Transactions on Human-Machine Systems, 2022, 668–676, DOI: https://doi.org/10.1109/THMS.2022.3176212.

[19] Cai, and Xie, “Auditory Attention Detection via Cross-Modal Attention,” Frontiers in Neuroscience, 2021.

[20] Cai, Xie, and , “ESAA: An EEG-Speech Auditory Attention Detection Database,” in 2022 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), IEEE, 2022, DOI: https://doi.org/10.1109/O-COCOSDA202257103.2022.9997944.

[21] Cai, Xie, and , ESAA: an EEG-Speech auditory attention detection database, version 1.0, Zenodo, September 2022, DOI: https://doi.org/10.5281/zenodo.7078451.

[22] Cai, Xie, and , “EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention,” IEEE Transactions on Human-Machine Systems, 2021, 256–266.

[23] Cai, Xie, and , “EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention,” IEEE Transactions on Human-Machine Systems, 2022, 256–266.

[24] Cai, Sun, Schultz, and , “Low-latency auditory spatial attention detection based on spectro-spatial features from EEG,” in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), IEEE, 2021, 5812–5815.

[25] Ceolini, Hjortkjær, D. D. Wong, O’Sullivan, V. S. Raghavan, Herrero, A. D. Mehta, S.-C. Liu, and Mesgarani, “Brain-informed speech separation (BISS) for enhancement of target speaker in multitalker speech perception,” NeuroImage, 223, 2020, 117282.

[26] Chadha, Kamenov, and Cieza, “The world report on hearing, 2021,” Bulletin of the World Health Organization, 2021, 242.

[27] E. C. Cherry, “Some experiments on the recognition of speech, with one and with two ears,” The Journal of the Acoustical Society of America, 1953, 975–979.

[28] de Cheveigné, D. D. Wong, G. M. Di Liberto, Hjortkjaer, Slaney, and Lalor, “Decoding the auditory brain with canonical component analysis,” NeuroImage, 172, 2018, 206–216.

[29] Ciccarelli, Nolan, Perricone, P. T. Calamia, Haro, O’Sullivan, Mesgarani, T. F. Quatieri, and C. J. Smalt, “Comparison of two-talker attention decoding from EEG with nonlinear neural networks and linear methods,” Scientific Reports, 2019.

[30] N. E. Crone, Boatman, Gordon, and Hao, “Induced electrocorticographic gamma activity during auditory perception,” Clinical Neurophysiology, 112, 2001, 565–582.

[31] N. E. Crone, Sinai, and Korzeniewska, “High-frequency gamma oscillations and human brain mapping with electrocorticography,” Progress in Brain Research, 159, 2006, 275–295.

[32] Das, Francart, and Bertrand, Auditory Attention Detection Dataset KULeuven, Zenodo, August 2020, DOI: https://doi.org/10.5281/zenodo.3997352.

[33] Das, Zegers, Francart, Bertrand, et al., “Linear versus deep learning methods for noisy speech separation for EEG-informed attention decoding,” Journal of Neural Engineering, 2020, 046039.

[34] Dasenbrock, Blum, Debener, Hohmann, and Kayser, “A step towards neuro-steered hearing aids: Integrated portable setup for time-synchronized acoustic stimuli presentation and EEG recording,” Current Directions in Biomedical Engineering, 2021, 855–858.

[35] Dasenbrock, Blum, Maanen, Debener, Hohmann, and Kayser, “Synchronization of ear-EEG and audio streams in a portable research hearing device,” Frontiers in Neuroscience, 2022.

[36] Debener, Emkes, De Vos, and Bleichner, “Unobtrusive ambulatory EEG using a smartphone and flexible printed electrodes around the ear,” Scientific Reports, 2015.

[37] Deng, Choi, and Shinn-Cunningham, “Topographic specificity of alpha power during auditory spatial attention,” NeuroImage, 207, 2020, 116360.

[38] Denk, Grzybowski, S. M. Ernst, Kollmeier, Debener, and M. G. Bleichner, “Event-related potentials measured from in and around the ear electrodes integrated in a live hearing device for monitoring sound perception,” Trends in Hearing, 2018, 2331216518788219.

[39] Ding and J. Z. Simon, “Emergence of neural encoding of auditory objects while listening to competing speakers,” Proceedings of the National Academy of Sciences, 109, 2012, 11854–.

[40] A. K. Engel, Fries, and Singer, “Dynamic predictions: oscillations and synchrony in top-down processing,” Nature Reviews Neuroscience, 2001, 704–716.

[41] Faghihi, Cai, and A. A. Moustafa, “A neuroscience-inspired spiking neural network for EEG-based auditory spatial attention detection,” Neural Networks, 152, 2022, 555–565.

[42] Fiedler, Obleser, Lunner, and Graversen, “Ear-EEG allows extraction of neural responses in challenging listening scenarios—a future technology for hearing aids?” in 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, 2016, 5697–700.

[43] Fiedler, Wöstmann, Graversen, Brandmeyer, Lunner, and Obleser, “Single-channel in-ear-EEG detects the focus of auditory attention to concurrent tone streams and mixed speech,” Journal of Neural Engineering, 2017, 036020.

[44] Francart, Das, Van Eyndhoven, Biesmans, and Bertrand, “Neuro-steered noise suppression for auditory prostheses,” The Journal of the Acoustical Society of America, 139, 2016, 2044–2044.

[45] J. N. Frey, Mainy, J.-P. Lachaux, Müller, Bertrand, and Weisz, “Selective modulation of auditory cortical alpha activity in an audiovisual spatial attention task,” Journal of Neuroscience, 2014, 6634–6639.

[46] S. A. Fuglsang, Märcher-Rørsted, Dau, and Hjortkjær, “Effects of sensorineural hearing loss on cortical synchronization to competing speech during selective attention,” Journal of Neuroscience, 2020, 2562–2572.

[47] S. A. Fuglsang, D. D. Wong, and Hjortkjær, EEG and Audio Dataset for Auditory Attention Decoding, version 1, Zenodo, March 2018, DOI: https://doi.org/10.5281/zenodo.1199011.

[48] Garrett, Debener, and Verhulst, “Acquisition of subcortical auditory potentials with around-the-ear cEEGrid technology in normal and hearing impaired listeners,” Frontiers in Neuroscience, 2019, 730.

[49] Geirnaert, Vandecappelle, Alickovic, de Cheveigne, Lalor, B. T. Meyer, Miran, Francart, and Bertrand, “Electroencephalography-Based Auditory Attention Decoding: Toward Neurosteered Hearing Devices,” IEEE Signal Processing Magazine, 2021, –102.

[50] Geirnaert, Francart, and Bertrand, “An Interpretable Performance Metric for Auditory Attention Decoding Algorithms in a Context of Neuro-Steered Gain Control,” IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2019.

[51] Geirnaert, Francart, and Bertrand, “Fast EEG-based decoding of the directional focus of auditory attention using common spatial patterns,” IEEE Transactions on Biomedical Engineering, 2020, 1557–1568.

[52] Geirnaert, Francart, and Bertrand, “Time-adaptive Unsupervised Auditory Attention Decoding Using EEG-based Stimulus Reconstruction,” IEEE Journal of Biomedical and Health Informatics, 2022.

[53] Geirnaert, Francart, and Bertrand, “Unsupervised Self-Adaptive Auditory Attention Decoding,” IEEE Journal of Biomedical and Health Informatics, 2021, 3955–3966.

[54] S. Geirnaert, T. Francart, and A. Bertrand, “Riemannian geometry-based decoding of the directional focus of auditory attention using EEG,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, 1115–1119.

[55] Geravanchizadeh and Zakeri, “Ear-EEG-based binaural speech enhancement (ee-BSE) using auditory attention detection and audiometric characteristics of hearing-impaired subjects,” Journal of Neural Engineering, 2021, 0460d6.

[56] Haykin and Chen, “The cocktail party problem,” Neural Computation, 2005, 1875–1902.

[57] M. J. Henry, Herrmann, Kunke, and Obleser, “Aging affects the balance of neural entrainment and top-down neural modulation in the listening brain,” Nature Communications, 2017.

[58] Herff and Schultz, “Automatic speech recognition from neural signals: a focused review,” Frontiers in Neuroscience, 2016, 429.

[59] J. R. Hershey, Chen, Le Roux, and Watanabe, “Deep clustering: Discriminative embeddings for segmentation and separation,” in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2016.

[60] Hölle, Meekes, and M. G. Bleichner, “Mobile ear-EEG to study auditory attention in everyday life,” Behavior Research Methods, 2021, 2025–2036.

[61] Holtze, Rosenkranz, Jaeger, Debener, and Mirkovic, “Ear-EEG Measures of Auditory Attention to Continuous Speech,” Frontiers in Neuroscience, 2022, 539.

[62] Horton, Srinivasan, and D’Zmura, “Envelope responses in single-trial EEG indicate attended speaker in a ‘cocktail party’,” Journal of Neural Engineering, 2014, 046015.

[63] Hosseini, Celotti, and É. Plourde, “Speaker-Independent Brain Enhanced Speech Denoising,” in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021, 1310–1314, DOI: https://doi.org/10.1109/ICASSP39728.2021.9414969.

[64] Jaeger, Mirkovic, M. G. Bleichner, and Debener, “Decoding the attended speaker from EEG using adaptive evaluation intervals captures fluctuations in attentional listening,” Frontiers in Neuroscience, 2020, 603.

[65] D.-H. Jeong and Jeong, “In-ear EEG based attention state classification using echo state network,” Brain Sciences, 2020, 321.

[66] S. L. Kappel, Makeig, and Kidmose, “Ear-EEG forward models: improved head-models for ear-EEG,” Frontiers in Neuroscience, 2019, 943.

[67] Kellis, Miller, Thomson, Brown, House, and Greger, “Decoding spoken words using local field potentials recorded from the cortical surface,” Journal of Neural Engineering, 2010, 056007.

[68] Khalighinejad, G. C. da Silva, and Mesgarani, “Dynamic encoding of acoustic features in neural responses to continuous speech,” Journal of Neuroscience, 2017, 2176–2185.

[69] O.-Y. Kwon, M.-H. Lee, Guan, and S.-W. Lee, “Subject-independent brain–computer interfaces based on deep convolutional neural networks,” IEEE Transactions on Neural Networks and Learning Systems, 2019, 3839–3852.

[70] E. C. Lalor and J. J. Foxe, “Neural responses to uninterrupted natural speech can be extracted with precise temporal resolution,” European Journal of Neuroscience, 2010, 189–193.

[71] G. M. Di Liberto, J. A. O’Sullivan, and E. C. Lalor, “Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing,” Current Biology, 2015, 2457–2465, DOI: https://doi.org/10.1016/j.cub.2015.08.030, https://www.sciencedirect.com/science/article/pii/S0960982215010015.

[72] J. L. Lobo, Del Ser, Bifet, and Kasabov, “Spiking neural networks and online learning: An overview and perspectives,” Neural Networks, 121, 2020, –100.

[73] Looney, Park, Kidmose, M. L. Rank, Ungstrup, Rosenkranz, and D. P. Mandic, “An in-the-ear platform for recording electroencephalogram,” in 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE, 2011, 6882–6885.

[74] Lotte, Congedo, Lécuyer, Lamarche, and Arnaldi, “A review of classification algorithms for EEG-based brain-computer interfaces,” Journal of Neural Engineering, 2007, R1, DOI: https://doi.org/10.1088/1741-2560/4/2/R01.

[75] J. H. McDermott, “The cocktail party problem,” Current Biology, 2009, R1024–R1027.

[76] Meiser and M. G. Bleichner, “Ear-EEG compares well to cap-EEG in recording auditory ERPs: a quantification of signal loss,” Journal of Neural Engineering, 2022, 026042.

[77] Meiser, Tadel, Debener, and M. G. Bleichner, “The sensitivity of ear-EEG: evaluating the source-sensor relationship using forward modeling,” Brain Topography, 2020, 665–676.

[78] Mesgarani and E. F. Chang, “Selective cortical representation of attended speaker in multi-talker speech perception,” Nature, 485 (7397), 2012, 233.

[79] Mesgarani, S. V. David, J. B. Fritz, and S. A. Shamma, “Influence of context and behavior on stimulus reconstruction from neural activity in primary auditory cortex,” Journal of Neurophysiology, 102, 2009, 3329–3339.

[80] Mirkovic, M. G. Bleichner, De Vos, and Debener, “Target speaker detection with concealed EEG around the ear,” Frontiers in Neuroscience, 2016, 349.

[81] Mirkovic, Debener, Jaeger, and De Vos, “Decoding the attended speech stream with multi-channel EEG: implications for online, daily-life applications,” Journal of Neural Engineering, 2015, 046007.

[82] M. J. Monesi, Accou, Montoya-Martinez, Francart, and Van Hamme, “An LSTM based architecture to relate speech stimulus to EEG,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2020, 941–945.

[83] A. M. Narayanan and Bertrand, “Analysis of miniaturization effects and channel selection strategies for EEG sensor networks with application to auditory attention detection,” IEEE Transactions on Biomedical Engineering, 2019, 234–244.

[84] A. M. Narayanan, Patrinos, and Bertrand, “Optimal versus approximate channel selection methods for EEG decoding with application to topology-constrained neuro-sensor networks,” IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2020, –102.

[85] A. M. Narayanan, Zink, and Bertrand, “EEG miniaturization limits for stimulus decoding with EEG sensor networks,” Journal of Neural Engineering, 2021, 056042.

[86] Nogueira, Cosatti, Schierholz, Egger, Mirkovic, and Büchner, “Toward Decoding Selective Attention From Single-Trial EEG Data in Cochlear Implant Users,” IEEE Transactions on Biomedical Engineering, 2019.

[87] Nogueira, Dolhopiatenko, Schierholz, Büchner, Mirkovic, M. G. Bleichner, and Debener, “Decoding selective attention in normal hearing listeners and bilateral cochlear implant users with concealed ear EEG,” Frontiers in Neuroscience, 2019, 720.

[88] J. A. O’Sullivan, A. J. Power, Mesgarani, Rajaram, J. J. Foxe, B. G. Shinn-Cunningham, Slaney, S. A. Shamma, and E. C. Lalor, “Attentional selection in a cocktail party environment can be decoded from single-trial EEG,” Cerebral Cortex, 2015, 1697–1706.

[89] O’Callaghan, Kveraga, J. M. Shine, R. B. Adams Jr, and Bar, “Predictions penetrate perception: Converging insights from brain, behaviour and disorder,” Consciousness and Cognition, 2017.

[90] Parthasarathy, K. E. Hancock, Bennett, DeGruttola, and D. B. Polley, “Bottom-up and top-down neural signatures of disordered multi-talker speech perception in adults with normal hearing,” eLife, 2020, e51419, DOI: https://doi.org/10.7554/eLife.51419.

[91] Peelle and Wingfield, “How Our Brains Make Sense of Noisy Speech,” Acoustics Today, 2022.

[92] Perez, Strub, De Vries, Dumoulin, and Courville, “Film: Visual reasoning with a general conditioning layer,” in Proceedings of the AAAI Conference on Artificial Intelligence, 2018.

[93] M. K. Pichora-Fuller and G. Singh, “Effects of age on auditory and cognitive processing: implications for hearing aid fitting and audiologic rehabilitation,” Trends in Amplification, 2006.

[94] Puffay, Van Canneyt, Vanthornhout, Francart, et al., “Relating the fundamental frequency of speech with EEG using a dilated convolutional network,” arXiv preprint arXiv:2207.01963, 2022.

[95] Ramoser, Muller-Gerking, and Pfurtscheller, “Optimal spatial filtering of single trial EEG during imagined hand movement,” IEEE Transactions on Rehabilitation Engineering, 2000, 441–446.

[96] S. A. Fuglsang, Dau, and Hjortkjær, “Noise-robust cortical tracking of attended speech in real-world acoustic scenes,” NeuroImage, 156, 2017, 435–444.

[97] Schultz, Wand, Hueber, D. J. Krusienski, Herff, and J. S. Brumberg, “Biosignal-based spoken communication: A survey,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017, 2257–2271.

[98] J. Z. Simon, “Human auditory neuroscience and the cocktail party problem,” in The Auditory System at the Cocktail Party, Springer, 2017, 169–197.

[99] A. A. Stocker and E. P. Simoncelli, “Noise characteristics and prior expectations in human visual speed perception,” Nature Neuroscience, 2006, 578–585.

[100] Straetmans, Holtze, Debener, Jaeger, and Mirkovic, “Neural Tracking to go,” OpenNeuro, 2021, DOI: https://doi.org/10.18112/openneuro.ds003801.v1.0.0.

[101] Cai, Xie, and , “Auditory attention detection with EEG channel attention,” in 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), IEEE, 2021, 5804–5807.

[102] Cai, Xie, and Schultz, “STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention from EEG,” IEEE Transactions on Biomedical Engineering, 2022.

[103] S. R. Synigal, E. S. Teoh, and E. C. Lalor, “Including measures of high gamma power can improve the decoding of natural speech from EEG,” Frontiers in Human Neuroscience, 2020, 130.

[104] Taherkhani, Belatreche, Cosma, L. P. Maguire, and T. M. McGinnity, “A review of learning in biologically plausible spiking neural networks,” Neural Networks, 122, 2020, 253–272.

[105] de Taillez, Kollmeier, and B. T. Meyer, “Machine learning for decoding listeners’ attention from electroencephalography evoked by continuous speech,” European Journal of Neuroscience, 2020, 1234–1241.

[106] Tremblay, Brisson, and Deschamps, “Brain aging and speech perception: Effects of background noise and talker variability,” NeuroImage, 227, 2021, 117675.

[107] Tune, Alavash, Fiedler, and Obleser, “Neural attentional-filter mechanisms of listening success in middle-aged and older individuals,” Nature Communications, 2021.

[108] J. J. Van Berkum, “The brain is a prediction machine that cares about good and bad - any implications for neuropragmatics?” Italian Journal of Linguistics, 2010, 181–208.

[109] Van Eyndhoven, Francart, and Bertrand, “EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses,” IEEE Transactions on Biomedical Engineering, 2016, 1045–1056.

[110] Vandecappelle, Deckers, Das, A. H. Ansari, Bertrand, and Francart, “EEG-based detection of the locus of auditory attention with convolutional neural networks,” eLife, 2021, e56481.

[111] Wan, Yang, Huang, Zeng, and Liu, “A review on transfer learning in EEG signal analysis,” Neurocomputing, 421, 2021.

[112] Wang and Chen, “Supervised speech separation based on deep learning: an overview,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, 1702–1726.

[113] Wang, E. X. , and Chen, “EEG-based auditory attention decoding using speech-level-based segmented computational models,” Journal of Neural Engineering, 2021, 046066.

[114] Wöstmann, Herrmann, Maess, and Obleser, “Spatiotemporal dynamics of auditory attention synchronize with speech,” Proceedings of the National Academy of Sciences, 113, 2016, 3873–3878.

[115] Wöstmann, Herrmann, Wilsch, and Obleser, “Neural alpha dynamics in younger and older listeners reflect acoustic challenges and predictive benefits,” Journal of Neuroscience, 2015, 1458–1467.

[116] Wöstmann, Vosskuhl, Obleser, and C. S. Herrmann, “Opposite effects of lateralised transcranial alpha versus gamma stimulation on auditory spatial attention,” Brain Stimulation, 2018, 752–758.

[117] , and B.-L. , “Transfer learning for EEG-based brain-computer interfaces: A review of progress made since 2016,” IEEE Transactions on Cognitive and Developmental Systems, 2020.

[118] Yang, S. A. Sheth, C. A. Schevon, G. M. M. , and Mesgarani, “Speech reconstruction from human auditory cortex with deep neural networks,” in Sixteenth Annual Conference of the International Speech Communication Association, 2015.

[119] Kolbæk, Z.-H. Tan, and Jensen, “Permutation invariant training of deep models for speaker-independent multi-talker speech separation,” in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, 2017, 241–245.

2023

S. Cai, H. Zhu, T. Schultz and H. Li

Figure 1

The typical flow of EEG-based speech perception in a cocktail party environment: the auditory attention detection (AAD) model is first trained (green box) and then applied for real-time detection (blue box).
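The two stages in Figure 1 can be illustrated with a minimal sketch. It assumes a linear stimulus-reconstruction decoder, a common AAD baseline of the kind used for single-trial decoding in [88], which is not necessarily the model drawn in the figure. All function names, the toy data generator, the lag count, and the ridge value are hypothetical choices made for illustration.

```python
import numpy as np

def lagged(eeg, n_lags):
    """Stack time-advanced copies of the EEG: (T, C) -> (T, C * n_lags)."""
    n_samples, n_channels = eeg.shape
    X = np.zeros((n_samples, n_channels * n_lags))
    for k in range(n_lags):
        # Row t receives EEG sample t + k, because the neural response
        # follows the stimulus by a few samples.
        X[: n_samples - k, k * n_channels:(k + 1) * n_channels] = eeg[k:]
    return X

def train_decoder(eeg, attended_env, n_lags=8, ridge=1.0):
    """Training stage: ridge regression from lagged EEG to the attended envelope."""
    X = lagged(eeg, n_lags)
    gram = X.T @ X + ridge * np.eye(X.shape[1])
    return np.linalg.solve(gram, X.T @ attended_env)

def detect_attention(eeg, env_a, env_b, weights, n_lags=8):
    """Detection stage: pick the speaker whose envelope best matches the reconstruction."""
    recon = lagged(eeg, n_lags) @ weights
    r_a = np.corrcoef(recon, env_a)[0, 1]
    r_b = np.corrcoef(recon, env_b)[0, 1]
    return ("A" if r_a > r_b else "B"), r_a, r_b

def simulate_trial(rng, mix_att, mix_unatt, n_samples, delay=3):
    """Toy data (assumption): EEG = delayed attended envelope + weaker unattended + noise."""
    env_att = np.abs(rng.standard_normal(n_samples))
    env_unatt = np.abs(rng.standard_normal(n_samples))
    env_att -= env_att.mean()
    env_unatt -= env_unatt.mean()
    # np.roll wraps around at the edges, which is acceptable for synthetic data.
    eeg = (np.outer(np.roll(env_att, delay), mix_att)
           + 0.3 * np.outer(np.roll(env_unatt, delay), mix_unatt)
           + rng.standard_normal((n_samples, mix_att.size)))
    return eeg, env_att, env_unatt

rng = np.random.default_rng(0)
mix_att, mix_unatt = rng.standard_normal(16), rng.standard_normal(16)

# Training: EEG recorded while the attended speaker is known.
train_eeg, train_att, _ = simulate_trial(rng, mix_att, mix_unatt, 4000)
weights = train_decoder(train_eeg, train_att)

# Detection: in this test trial the listener attends speaker B.
test_eeg, test_att, test_unatt = simulate_trial(rng, mix_att, mix_unatt, 1000)
decision, r_a, r_b = detect_attention(
    test_eeg, env_a=test_unatt, env_b=test_att, weights=weights
)
print(decision, round(r_a, 2), round(r_b, 2))
```

The decision rule compares two correlation coefficients over one window, so a shorter window gives faster but noisier decisions. That trade-off is one reason low-latency and non-linear detectors are studied in the works cited above.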

Figure 2

A diagram illustrates the auditory pathway from the cochlea up to the auditory cortex in the brain. Starting at the bottom, sound information travels from the cochlea via CN VIII (the cochlear nerve) to the cochlear nuclei in the medulla.

The auditory processing pathway. Speech perception begins at the cochlear nucleus and proceeds through a series of relay nuclei, including the superior olivary complex, the inferior colliculus, and the medial geniculate nucleus. Each of these nuclei decodes and integrates the incoming auditory information before forwarding it to the next stage of processing. Finally, the auditory cortex receives and analyzes the integrated signals, enabling the perception of speech. (Adopted from [91].)

Figure 3

A diagram illustrates a system for neuro-steered speaker extraction. A sound mixture is generated from a target speaker and other speakers and is played to a subject wearing an EEG cap. The EEG cap captures EEG signals. Both the mixture and the EEG signals are fed into a box labeled neuro-steered speaker extraction.

A neuro-steered hearing device for speech perception in a cocktail party environment.

Figure 4

A pair of images shows two different types of ear-based EEG electrodes on a person’s ear.

Illustration of the ear-EEG design and layout: (a) ear-EEG in the ear (adopted from [15]); (b) ear-EEG around the ear (adopted from [36]).

Figure 5

A hearing aid with an ear-EEG recording platform (adopted from [35]).

Table 1

The characteristics of different AAD datasets. NH = Normal-hearing, HI = Hearing-impaired.

Dataset	NH subjects	HI subjects	Language	# cap-EEG channels	Duration per subject (min)
Das-2015 [32]	16	0	Flemish	64	48
Fuglsang-2018 [47]	18	0	Danish	64	50
Fuglsang-2020 [46]	22	22	Danish	64^a	40
ESAA [21]	20	0	Chinese	64	38
Neural Tracking to go [100]	20	0	German	24^b	30

Note: ^a In-ear EEG was also recorded for 19 of the 44 subjects. ^b A fully mobile EEG device.
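To compare the datasets by total amount of recorded data, the rows of Table 1 can be held in a small data structure, as in the sketch below. The class and field names are hypothetical. The total assumes that every listed subject contributes the full per-subject duration, which the table does not state explicitly.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class AADDataset:
    name: str
    nh_subjects: int
    hi_subjects: int
    language: str
    cap_eeg_channels: int
    minutes_per_subject: int

    @property
    def total_hours(self) -> float:
        # Assumption: every listed subject contributes the full per-subject duration.
        subjects = self.nh_subjects + self.hi_subjects
        return subjects * self.minutes_per_subject / 60

# Rows copied from Table 1.
DATASETS = [
    AADDataset("Das-2015", 16, 0, "Flemish", 64, 48),
    AADDataset("Fuglsang-2018", 18, 0, "Danish", 64, 50),
    AADDataset("Fuglsang-2020", 22, 22, "Danish", 64, 40),
    AADDataset("ESAA", 20, 0, "Chinese", 64, 38),
    AADDataset("Neural Tracking to go", 20, 0, "German", 24, 30),
]

# List the datasets from most to least total recording time.
for d in sorted(DATASETS, key=lambda d: d.total_hours, reverse=True):
    print(f"{d.name:<22} {d.total_hours:5.1f} h")

largest = max(DATASETS, key=lambda d: d.total_hours)
```

Under that assumption, Fuglsang-2020 is the largest dataset, mainly because it is the only one that includes hearing-impaired subjects in addition to normal-hearing ones.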

[1] Accou, M. J. Monesi, Francart, et al., “Predicting speech intelligibility from EEG in a non-linear classification paradigm,” Journal of Neural Engineering, 2021, 066008.

[2]

Accou

M. J.

Monesi

Montoya

Francart

, et al., “

Modeling the relationship between acoustic stimulus and EEG with a dilated convolutional neural network

,” in

2020 28th European Signal Processing Conference (EUSIPCO)

, IEEE,

2021

1175

–

1179

[3]

Ahveninen

Hämäläinen

I. P.

Jääskeläinen

S. P.

Ahlfors

Huang

F.-H.

Lin

Raij

Sams

C. E.

Vasios

, and

J. W.

Belliveau

, “

Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise

,”

Proceedings of the National Academy of Sciences

108

(

2011

4182

–

4187

[4]

Akbari

Khalighinejad

J. L.

Herrero

A. D.

Mehta

, and

Mes-Garani

, “

Towards reconstructing intelligible speech from the human auditory cortex

,”

Scientific reports

(

2019

–

[5]

Akram

Presacco

J. Z.

Simon

S. A.

Shamma

, and

Babadi

, “

Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling

,”

NeuroImage

124

2016

906

–

917

https://doi.org/10.3389/fnins.2019.00153

[6]

Alickovic

Lunner

Gustafsson

, and

Ljung

, “

A Tutorial on Auditory Attention Identification Methods

,”

Frontiers in Neuroscience

2019

, DOI:

, https://www.frontiersin.org/articles/10.3389/fnins.2019.00153.

[7]

W. M. H.

Bakay

L. A.

Anderson

J. A.

Garcia-Lazaro

McAlpine

, and

Schaette

, “

Hidden hearing loss selectively impairs neural adaptation to loud sound environments

,”

Nature Communications

(

2018

–

[8]

Bednar

F. M.

Boland

, and

E. C.

Lalor

, “

Different spatio-temporal electroencephalography features drive the successful decoding of bin-aural and monaural cues for sound localization

,”

European Journal of Neuroscience

(

2017

679

–

689

[9]

Bednar

and

E. C.

Lalor

, “

Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG

,”

NeuroImage

205

2020

116283

[10]

Bengio

Courville

, and

Vincent

, “

Representation learning: A review and new perspectives

,”

IEEE Transactions on Pattern Analysis and Machine Intelligence

(

2013

1798

–

1828

[11]

Bertrand

and

Moonen

, “

Energy-based multi-speaker voice activity detection with an ad hoc microphone array

,” in

2010 IEEE International Conference on Acoustics, Speech and Signal Processing

, IEEE,

2010

–

[12]

Biesmans

Das

Francart

, and

Bertrand

, “

Auditory-inspired speech envelope extraction methods for improved EEG-based auditory attention detection in a cocktail party scenario

,”

IEEE Transactions on Neural Systems and Rehabilitation Engineering

(

2016

402

–

412

[13]

Blank

and

M. H.

Davis

, “

Prediction errors but not sharpened signals simulate multivoxel fMRI patterns during speech perception

,”

PLoS biology

(

2016

e1002577

[14]

M. G.

Bleichner

and

Debener

, “

Concealed, unobtrusive ear-centered EEG acquisition: cEEGrids for transparent EEG

,”

Frontiers in Human Neuroscience

2017

163

[15]

M. G.

Bleichner

Lundbeck

Selisky

Minow

Jäger

Emkes

Debener

, and

De Vos

, “

Exploring miniaturized EEG electrodes for brain-computer interfaces. An EEG you do not see?

”

Physiological Reports

(

2015

, e12362.

[16]

J. N.

de Boer

M. M.

Linszen

de Vries

M. J.

Schutte

M. J.

Begemann

S. M.

Heringa

M. M.

Bohlken

Hugdahl

Aleman

F. N.

Wijnen

, et al., “

Auditory hallucinations, top-down processing and language perception: a general population study

,”

Psychological medicine

(

2019

2772

–

2780

[17]

A. W.

Bronkhorst

, “

The cocktail-party problem revisited: early processing and selection of multi-talker speech

,”

Attention, Perception, & Psychophysics

(

2015

1465

–

1487

https://doi.org/10.1109/THMS.2022.3176212

[18]

Cai

Liu

, and

Xie

, “

A Neural-Inspired Architecture for EEG-Based Auditory Attention Detection

,”

IEEE Transactions on Human-Machine Systems

(

2022

668

–

676

, DOI:

[19]

Cai

, and

Xie

, “

Auditory Attention Detection via Cross-Modal Attention

,”

Frontiers in Neuroscience

2021

[20]

Cai

Xie

, and

, “

ESAA: An EEG-Speech Auditory Attention Detection Database

,” in

2022 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA)

, IEEE,

2022

–

, DOI: https://doi.org/10.1109/O-COCOSDA202257103.2022.9997944

[21]

Cai

Xie

, and

ESAA: an EEG-Speech auditory attention detection database

version 1.0, Zenodo

September

2022

, DOI: https://doi.org/10.5281/zenodo.7078451.

[22]

Cai

Xie

, and

, “

EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention

,”

IEEE Transactions on Human-Machine Systems

(

2021

256

–

266

[23]

Cai

Xie

, and

, “

EEG-Based Auditory Attention Detection via Frequency and Channel Neural Attention

,”

IEEE Transactions on Human-Machine Systems

(

2022

256

–

266

[24]

Cai

Sun

Schultz

, and

, “

Low-latency auditory spatial attention detection based on spectro-spatial features from EEG

,” in

2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

, IEEE,

2021

5812

–

5815

[25]

Ceolini

Hjortkjær

D. D.

Wong

O’Sullivan

V. S.

Raghavan

Herrero

A. D.

Mehta

S.-C.

Liu

, and

Mesgarani

, “

Brain-informed speech separation (BISS) for enhancement of target speaker in multitalker speech perception

,”

NeuroImage

223

2020

117282

[26]

Chadha

Kamenov

, and

Cieza

, “

The world report on hearing, 2021

,”

Bulletin of the World Health Organization

(

2021

242

[27]

E. C.

Cherry

, “

Some experiments on the recognition of speech, with one and with two ears

,”

The Journal of the Acoustical Society of America

(

1953

975

–

979

[28]

de Cheveigné

D. D.

Wong

G. M.

Di Liberto

Hjortkjaer

Slaney

, and

Lalor

, “

Decoding the auditory brain with canonical component analysis

,”

NeuroImage

172

2018

206

–

216

[29]

Ciccarelli

Nolan

Perricone

P. T.

Calamia

Haro

O’Sullivan

Mesgarani

T. F.

Quatieri

, and

C. J.

Smalt

, “

Comparison of two-talker attention decoding from EEG with nonlinear neural networks and linear methods

,”

Scientific Reports

(

2019

–

[30]

N. E.

Crone

Boatman

Gordon

, and

Hao

, “

Induced electro-corticographic gamma activity during auditory perception

,”

Clinical Neurophysiology

112

(

2001

565

–

582

[31]

N. E.

Crone

Sinai

, and

Korzeniewska

, “

High-frequency gamma oscillations and human brain mapping with electrocorticography

,”

Progress in Brain Research

159

2006

275

–

295

[32]

Das

Francart

, and

Bertrand

Auditory Attention Detection Dataset KULeuven

, Zenodo,

August

2020

, DOI: https://doi.org/10.5281/zenodo.3997352.

[33]

Das

Zegers

Francart

Bertrand

, et al., “

Linear versus deep learning methods for noisy speech separation for EEG-informed attention decoding

,”

Journal of Neural Engineering

(

2020

046039

[34]

Dasenbrock

Blum

Debener

Hohmann

, and

Kayser

, “

A step towards neuro-steered hearing aids: Integrated portable setup for time-synchronized acoustic stimuli presentation and EEG recording

,”

Current Directions in Biomedical Engineering

(

2021

855

–

858

[35]

Dasenbrock

Blum

Maanen

Debener

Hohmann

, and

Kayser

, “

Synchronization of ear-EEG and audio streams in a portable research hearing device

,”

Frontiers in Neuroscience

2022

[36]

Debener

Emkes

De Vos

, and

Bleichner

, “

Unobtrusive ambulatory EEG using a smartphone and flexible printed electrodes around the ear

,”

Scientific Reports

(

2015

–

[37]

Deng

Choi

, and

Shinn-Cunningham

, “

Topographic specificity of alpha power during auditory spatial attention

,”

NeuroImage

207

2020

116360

[38]

Denk

Grzybowski

S. M.

Ernst

Kollmeier

Debener

, and

M. G.

Bleichner

, “

Event-related potentials measured from in and around the ear electrodes integrated in a live hearing device for monitoring sound perception

,”

Trends in Hearing

2018

, 2331216518788219.

[39]

Ding

and

J. Z.

Simon

, “

Emergence of neural encoding of auditory objects while listening to competing speakers

,”

Proceedings of the National Academy of Sciences

109

(

2012

11854

–

[40]

A. K.

Engel

Fries

, and

Singer

, “

Dynamic predictions: oscillations and synchrony in top-down processing

,”

Nature Reviews Neuroscience

(

2001

704

–

716

[41]

Faghihi

Cai

, and

A. A.

Moustafa

, “

A neuroscience-inspired spiking neural network for EEG-based auditory spatial attention detection

,”

Neural Networks

152

2022

555

–

565

[42]

Fiedler

Obleser

Lunner

, and

Graversen

, “

Ear-EEG allows extraction of neural responses in challenging listening scenarios—a future technology for hearing aids?

” in

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

, IEEE,

2016

5697

–

5700

[43]

Fiedler

Wöstmann

Graversen

Brandmeyer

Lunner

, and

Obleser

, “

Single-channel in-ear-EEG detects the focus of auditory attention to concurrent tone streams and mixed speech

,”

Journal of Neural Engineering

(

2017

036020

[44]

Francart

Das

Van Eyndhoven

Biesmans

, and

Bertrand

, “

Neuro-steered noise suppression for auditory prostheses

,”

The Journal of the Acoustical Society of America

139

(

2016

2044

–

2044

[45]

J. N.

Frey

Mainy

J.-P.

Lachaux

Müller

Bertrand

, and

Weisz

, “

Selective modulation of auditory cortical alpha activity in an audiovisual spatial attention task

,”

Journal of Neuroscience

(

2014

6634

–

6639

[46]

S. A.

Fuglsang

Märcher-Rørsted

Dau

, and

Hjortkjær

, “

Effects of sensorineural hearing loss on cortical synchronization to competing speech during selective attention

,”

Journal of Neuroscience

(

2020

2562

–

2572

[47]

S. A.

Fuglsang

D. D.

Wong

, and

Hjortkjær

EEG and Audio Dataset for Auditory Attention Decoding

, version 1,

Zenodo

March

2018

, DOI: https://doi.org/10.5281/zenodo.1199011.

[48]

Garrett

Debener

, and

Verhulst

, “

Acquisition of subcortical auditory potentials with around-the-ear cEEGrid technology in normal and hearing impaired listeners

,”

Frontiers in Neuroscience

2019

730

[49]

Geirnaert

Vandecappelle

Alickovic

de Cheveigné

Lalor

B. T.

Meyer

Miran

Francart

, and

Bertrand

, “

Electroencephalography-Based Auditory Attention Decoding: Toward Neurosteered Hearing Devices

,”

IEEE Signal Processing Magazine

(

2021

–

102

[50]

Geirnaert

Francart

, and

Bertrand

, “

An Interpretable Performance Metric for Auditory Attention Decoding Algorithms in a Context of Neuro-Steered Gain Control

,”

IEEE Transactions on Neural Systems and Rehabilitation Engineering

2019

[51]

Geirnaert

Francart

, and

Bertrand

, “

Fast EEG-based decoding of the directional focus of auditory attention using common spatial patterns

,”

IEEE Transactions on Biomedical Engineering

(

2020

1557

–

1568

[52]

Geirnaert

Francart

, and

Bertrand

, “

Time-adaptive Unsupervised Auditory Attention Decoding Using EEG-based Stimulus Reconstruction

,”

IEEE Journal of Biomedical and Health Informatics

2022

[53]

Geirnaert

Francart

, and

Bertrand

, “

Unsupervised Self-Adaptive Auditory Attention Decoding

,”

IEEE Journal of Biomedical and Health Informatics

(

2021

3955

–

3966

[54]

Geirnaert

Francart

, and

Bertrand

, “

Riemannian geometry-based decoding of the directional focus of auditory attention using EEG

,” in

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

2021

1115

–

1119

[55]

Geravanchizadeh

and

Zakeri

, “

Ear-EEG-based binaural speech enhancement (ee-BSE) using auditory attention detection and audiometric characteristics of hearing-impaired subjects

,”

Journal of Neural Engineering

(

2021

, 0460d6.

[56]

Haykin

and

Chen

, “

The cocktail party problem

,”

Neural Computation

(

2005

1875

–

1902

[57]

M. J.

Henry

Herrmann

Kunke

, and

Obleser

, “

Aging affects the balance of neural entrainment and top-down neural modulation in the listening brain

,”

Nature Communications

(

2017

–

[58]

Herff

and

Schultz

, “

Automatic speech recognition from neural signals: a focused review

,”

Frontiers in Neuroscience

2016

429

[59]

J. R.

Hershey

Chen

Le Roux

, and

Watanabe

, “

Deep clustering: Discriminative embeddings for segmentation and separation

,” in

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

, IEEE,

2016

–

[60]

Hölle

Meekes

, and

M. G.

Bleichner

, “

Mobile ear-EEG to study auditory attention in everyday life

,”

Behavior Research Methods

(

2021

2025

–

2036

[61]

Holtze

Rosenkranz

Jaeger

Debener

, and

Mirkovic

, “

Ear-EEG Measures of Auditory Attention to Continuous Speech

,”

Frontiers in Neuroscience

2022

539

[62]

Horton

Srinivasan

, and

D’Zmura

, “

Envelope responses in single-trial EEG indicate attended speaker in a ‘cocktail party’

,”

Journal of Neural Engineering

(

2014

046015

[63]

Hosseini

Celotti

, and

É.

Plourde

, “

Speaker-Independent Brain Enhanced Speech Denoising

,” in

ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

2021

1310

–

1314

, DOI: https://doi.org/10.1109/ICASSP39728.2021.9414969

[64]

Jaeger

Mirkovic

M. G.

Bleichner

, and

Debener

, “

Decoding the attended speaker from EEG using adaptive evaluation intervals captures fluctuations in attentional listening

,”

Frontiers in Neuroscience

2020

603

[65]

D.-H.

Jeong

and

Jeong

, “

In-ear EEG based attention state classification using echo state network

,”

Brain Sciences

(

2020

321

[66]

S. L.

Kappel

Makeig

, and

Kidmose

, “

Ear-EEG forward models: improved head-models for ear-EEG

,”

Frontiers in Neuroscience

2019

943

[67]

Kellis

Miller

Thomson

Brown

House

, and

Greger

, “

Decoding spoken words using local field potentials recorded from the cortical surface

,”

Journal of Neural Engineering

(

2010

056007

[68]

Khalighinejad

G. C.

da Silva

, and

Mesgarani

, “

Dynamic encoding of acoustic features in neural responses to continuous speech

,”

Journal of Neuroscience

(

2017

2176

–

2185

[69]

O.-Y.

Kwon

M.-H.

Lee

Guan

, and

S.-W.

Lee

, “

Subject-independent brain–computer interfaces based on deep convolutional neural networks

,”

IEEE Transactions on Neural Networks and Learning Systems

(

2019

3839

–

3852

[70]

E. C.

Lalor

and

J. J.

Foxe

, “

Neural responses to uninterrupted natural speech can be extracted with precise temporal resolution

,”

European Journal of Neuroscience

(

2010

189

–

193

[71]

G. M.

Di Liberto

J. A.

O’Sullivan

, and

E. C.

Lalor

, “

Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing

,”

Current Biology

(

2015

2457

–

2465

, DOI: https://doi.org/10.1016/j.cub.2015.08.030, https://www.sciencedirect.com/science/article/pii/S0960982215010015.

[72]

J. L.

Lobo

Del Ser

Bifet

, and

Kasabov

, “

Spiking neural networks and online learning: An overview and perspectives

,”

Neural Networks

121

2020

–

100

[73]

Looney

Park

Kidmose

M. L.

Rank

Ungstrup

Rosenkranz

, and

D. P.

Mandic

, “

An in-the-ear platform for recording electroencephalogram

,” in

2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society

, IEEE,

2011

6882

–

6885

[74]

Lotte

Congedo

Lécuyer

Lamarche

, and

Arnaldi

, “

A review of classification algorithms for EEG-based brain-computer interfaces

,”

Journal of Neural Engineering

(

2007

, R1, DOI: https://doi.org/10.1088/1741-2560/4/2/R01

[75]

J. H.

McDermott

, “

The cocktail party problem

,”

Current Biology

(

2009

, R1024-R1027.

[76]

Meiser

and

M. G.

Bleichner

, “

Ear-EEG compares well to cap-EEG in recording auditory ERPs: a quantification of signal loss

,”

Journal of Neural Engineering

(

2022

026042

[77]

Meiser

Tadel

Debener

, and

M. G.

Bleichner

, “

The sensitivity of ear-EEG: evaluating the source-sensor relationship using forward modeling

,”

Brain Topography

(

2020

665

–

676

[78]

Mesgarani

and

E. F.

Chang

, “

Selective cortical representation of attended speaker in multi-talker speech perception

,”

Nature

485

(

7397

2012

233

[79]

Mesgarani

S. V.

David

J. B.

Fritz

, and

S. A.

Shamma

, “

Influence of context and behavior on stimulus reconstruction from neural activity in primary auditory cortex

,”

Journal of Neurophysiology

102

(

2009

3329

–

3339

[80]

Mirkovic

M. G.

Bleichner

De Vos

, and

Debener

, “

Target speaker detection with concealed EEG around the ear

,”

Frontiers in Neuroscience

2016

349

[81]

Mirkovic

Debener

Jaeger

, and

De Vos

, “

Decoding the attended speech stream with multi-channel EEG: implications for online, daily-life applications

,”

Journal of Neural Engineering

(

2015

046007

[82]

M. J.

Monesi

Accou

Montoya-Martinez

Francart

, and

Van Hamme

, “

An LSTM based architecture to relate speech stimulus to EEG

,” in

ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

, IEEE,

2020

941

–

945

[83]

A. M.

Narayanan

and

Bertrand

, “

Analysis of miniaturization effects and channel selection strategies for EEG sensor networks with application to auditory attention detection

,”

IEEE Transactions on Biomedical Engineering

(

2019

234

–

244

[84]

A. M.

Narayanan

Patrinos

, and

Bertrand

, “

Optimal versus approximate channel selection methods for EEG decoding with application to topology-constrained neuro-sensor networks

,”

IEEE Transactions on Neural Systems and Rehabilitation Engineering

2020

–

102

[85]

A. M.

Narayanan

Zink

, and

Bertrand

, “

EEG miniaturization limits for stimulus decoding with EEG sensor networks

,”

Journal of Neural Engineering

(

2021

056042

[86]

Nogueira

Cosatti

Schierholz

Egger

Mirkovic

, and

Büchner

, “

Toward Decoding Selective Attention From Single-Trial EEG Data in Cochlear Implant Users

,”

IEEE Transactions on Biomedical Engineering

(

2019

–

[87]

Nogueira

Dolhopiatenko

Schierholz

Büchner

Mirkovic

M. G.

Bleichner

, and

Debener

, “

Decoding selective attention in normal hearing listeners and bilateral cochlear implant users with concealed ear EEG

,”

Frontiers in Neuroscience

2019

720

[88]

J. A.

O’Sullivan

A. J.

Power

Mesgarani

Rajaram

J. J.

Foxe

B. G.

Shinn-Cunningham

Slaney

S. A.

Shamma

, and

E. C.

Lalor

, “

Attentional selection in a cocktail party environment can be decoded from single-trial EEG

,”

Cerebral Cortex

(

2015

1697

–

1706

[89]

O’Callaghan

Kveraga

J. M.

Shine

R. B.

Adams Jr

, and

Bar

, “

Predictions penetrate perception: Converging insights from brain, behaviour and disorder

,”

Consciousness and Cognition

2017

–

[90]

Parthasarathy

K. E.

Hancock

Bennett

DeGruttola

, and

D. B.

Polley

, “

Bottom-up and top-down neural signatures of disordered multi-talker speech perception in adults with normal hearing

,”

eLife

2020

e51419

, DOI: https://doi.org/10.7554/eLife.51419

[91]

Peelle

and

Wingfield

, “

How Our Brains Make Sense of Noisy Speech

,”

Acoustics Today

(

2022

–

[92]

Perez

Strub

De Vries

Dumoulin

, and

Courville

, “

FiLM: Visual reasoning with a general conditioning layer

,” in

Proceedings of the AAAI Conference on Artificial Intelligence

, Vol.

, No.

2018

[93]

M. K.

Pichora-Fuller

and

G.

Singh

, “

Effects of age on auditory and cognitive processing: implications for hearing aid fitting and audiologic rehabilitation

,”

Trends in Amplification

(

2006

–

[94]

Puffay

Van Canneyt

Vanthornhout

Francart

, et al., “

Relating the fundamental frequency of speech with EEG using a dilated convolutional network

,”

arXiv preprint arXiv:2207.01963

2022

[95]

Ramoser

Muller-Gerking

, and

Pfurtscheller

, “

Optimal spatial filtering of single trial EEG during imagined hand movement

,”

IEEE Transactions on Rehabilitation Engineering

(

2000

441

–

446

[96]

S. A.

Fuglsang

Dau

, and

Hjortkjær

, “

Noise-robust cortical tracking of attended speech in real-world acoustic scenes

,”

NeuroImage

156

2017

435

–

444

[97]

Schultz

Wand

Hueber

D. J.

Krusienski

Herff

, and

J. S.

Brumberg

, “

Biosignal-based spoken communication: A survey

,”

IEEE/ACM Transactions on Audio, Speech, and Language Processing

(

2017

2257

–

2271

[98]

J. Z.

Simon

, “Human auditory neuroscience and the cocktail party problem,” in

The Auditory System at the Cocktail Party

Springer

2017

169

–

197

[99]

A. A.

Stocker

and

E. P.

Simoncelli

, “

Noise characteristics and prior expectations in human visual speed perception

,”

Nature Neuroscience

(

2006

578

–

585

[100]

Straetmans

Holtze

Debener

Jaeger

, and

Mirkovic

, “

Neural Tracking to go

”,

OpenNeuro

2021

, DOI: https://doi.org/10.18112/openneuro.ds003801.v1.0.0

[101]

Cai

Xie

, and

, “

Auditory attention detection with EEG channel attention

,” in

2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

, IEEE,

2021

5804

–

5807

[102]

Cai

Xie

, and

Schultz

, “

STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention from EEG

,”

IEEE Transactions on Biomedical Engineering

2022

[103]

S. R.

Synigal

E. S.

Teoh

, and

E. C.

Lalor

, “

Including measures of high gamma power can improve the decoding of natural speech from EEG

,”

Frontiers in Human Neuroscience

2020

130

[104]

Taherkhani

Belatreche

Cosma

L. P.

Maguire

, and

T. M.

McGinnity

, “

A review of learning in biologically plausible spiking neural networks

,”

Neural Networks

122

2020

253

–

272

[105]

de Taillez

Kollmeier

, and

B. T.

Meyer

, “

Machine learning for decoding listeners’ attention from electroencephalography evoked by continuous speech

,”

European Journal of Neuroscience

(

2020

1234

–

1241

[106]

Tremblay

Brisson

, and

Deschamps

, “

Brain aging and speech perception: Effects of background noise and talker variability

,”

NeuroImage

227

2021

117675

[107]

Tune

Alavash

Fiedler

, and

Obleser

, “

Neural attentional-filter mechanisms of listening success in middle-aged and older individuals

,”

Nature Communications

(

2021

–

[108]

J. J.

Van Berkum

, “

The brain is a prediction machine that cares about good and bad – any implications for neuropragmatics?

”

Italian Journal of Linguistics

2010

181

–

208

[109]

Van Eyndhoven

Francart

, and

Bertrand

, “

EEG-informed attended speaker extraction from recorded speech mixtures with application in neuro-steered hearing prostheses

,”

IEEE Transactions on Biomedical Engineering

(

2016

1045

–

1056

[110]

Vandecappelle

Deckers

Das

A. H.

Ansari

Bertrand

, and

Francart

, “

EEG-based detection of the locus of auditory attention with convolutional neural networks

,”

eLife

2021

e56481

[111]

Wan

Yang

Huang

Zeng

, and

Liu

, “

A review on transfer learning in EEG signal analysis

,”

Neurocomputing

421

2021

–

[112]

Wang

and

Chen

, “

Supervised speech separation based on deep learning: an overview

,”

IEEE/ACM Transactions on Audio, Speech, and Language Processing

(

2018

1702

–

1726

[113]

Wang

E. X.

, and

Chen

, “

EEG-based auditory attention decoding using speech-level-based segmented computational models

,”

Journal of Neural Engineering

(

2021

046066

[114]

Wöstmann

Herrmann

Maess

, and

Obleser

, “

Spatiotemporal dynamics of auditory attention synchronize with speech

,”

Proceedings of the National Academy of Sciences

113

(

2016

3873

–

3878

[115]

Wöstmann

Herrmann

Wilsch

, and

Obleser

, “

Neural alpha dynamics in younger and older listeners reflect acoustic challenges and predictive benefits

,”

Journal of Neuroscience

(

2015

1458

–

1467

[116]

Wöstmann

Vosskuhl

Obleser

, and

C. S.

Herrmann

, “

Opposite effects of lateralised transcranial alpha versus gamma stimulation on auditory spatial attention

,”

Brain Stimulation

(

2018

752

–

758

[117]

, and

B.-L.

, “

Transfer learning for EEG-based brain-computer interfaces: A review of progress made since 2016

,”

IEEE Transactions on Cognitive and Developmental Systems

(

2020

–

[118]

Yang

S. A.

Sheth

C. A.

Schevon

G. M. M.

, and

Mesgarani

, “

Speech reconstruction from human auditory cortex with deep neural networks

,” in

Sixteenth Annual Conference of the International Speech Communication Association

2015