Skip to Main Content
Close
Journals
Books
Case Studies
Collections
Open Access
Citation Manager
Journals
Books
Case Studies
Collections
Open Access
Citation Manager
Search Dropdown Menu
header search
search input
Search input auto suggest
filter your search
All Content
All Journals
APSIPA Transactions on Signal and Information Processing
Search
Advanced Search
Cart
User Tools Dropdown
Cart
Register
Sign In
Open Menu
Toggle Menu
Menu
Journal Home
Issues
Earlycite Articles
About this Journal
Open Menu
About this journal
Editorial board
Author guidelines
Indexing and metrics
Issues
Select Year
2026
2025
2024
2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
Issue
23 April - Volume 14, Issue 2, Pages 1 - 33
25 June - Volume 14, Issue 3, Pages 1 - 44
9 October - Volume 14, Issue 1, Pages 1 - 49
28 October - Volume 14, Issue 4, Pages 35 - 176
Volume 14, Issue 1
9 October 2025
All Issues
Cover Image
Cover Image
ISSN
2048-7703
EISSN
2048-7703
Close navigation menu
In this Issue
Original Paper
Overview Paper
Issue Navigation
Original Paper
Serial-OE: Anomalous Sound Detection Based on Serial Method with Outlier Exposure Capable of Using Small Amounts of Anomalous Data for Training
Ibuki Kuroyanagi
;
Tomoki Hayashi
;
Kazuya Takeda
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Serial-OE: Anomalous Sound Detection Based on Serial Method with Outlier Exposure Capable of Using Small Amounts of Anomalous Data for Training
PSHop: A Lightweight Feed-Forward Method for 3D Prostate Gland Segmentation
Yijing Yang
;
Vasileios Magoulianitis
;
Jiaxin Yang
;
Jintang Xue
;
Masatomo Kaneko
;
Giovanni Cacciamani
;
Andre Abreu
;
Vinay Duddalwar
;
C.-C. Jay Kuo
;
Inderbir S. Gill
;
Chrysostomos Nikias
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for PSHop: A Lightweight Feed-Forward Method for 3D Prostate Gland Segmentation
Target Speaker Extraction under Noisy Underdetermined Conditions Using Conditional Variational Autoencoder, Global Style Token, and Neural Postfilter
Rui Wang
;
Takuya Fujimura
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Target Speaker Extraction under Noisy Underdetermined Conditions Using Conditional Variational Autoencoder, Global Style Token, and Neural Postfilter
RTL Evaluation of
ℓ
2
-Norm Approximation with Rotated
ℓ
1
-Norm for 2-Tuple Arrays
Shu Abe
;
Yuya Kodama
;
Hiroyoshi Yamada
;
Shogo Muramatsu
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for RTL Evaluation of <em>ℓ</em><sub>2</sub>-Norm Approximation with Rotated <em>ℓ</em><sub>1</sub>-Norm for 2-Tuple Arrays
Unsupervised Pitch-Timbre-Variation Disentanglement of Monophonic Music Signals Based on Random Perturbation and Re-entry Training
Keitaro Tanaka
;
Kazuyoshi Yoshii
;
Simon Dixon
;
Shigeo Morishima
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Unsupervised Pitch-Timbre-Variation Disentanglement of Monophonic Music Signals Based on Random Perturbation and Re-entry Training
Speech Emotion Recognition Using Sequences of Fine-grained Emotion Labels with Phoneme Class Attributes
Ryotaro Nagase
;
Takahiro Fukumori
;
Yoichi Yamashita
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Speech Emotion Recognition Using Sequences of Fine-grained Emotion Labels with Phoneme Class Attributes
Robust ICU Mortality Prediction with Multi-Task Diffusion and Contrastive Learning Frameworks
Namtip Buranaburustam
;
Wuttipong Kumwilaisak
;
Chatchawarn Hansakunbuntheung
;
Nattanun Thatphithakkul
;
Kanya Kumwilaisak
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Robust ICU Mortality Prediction with Multi-Task Diffusion and Contrastive Learning Frameworks
Asymptotics of Proximity Operator for Squared Loss and Performance Prediction of Nonconvex Sparse Signal Recovery
Ryo Hayakawa
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Asymptotics of Proximity Operator for Squared Loss and Performance Prediction of Nonconvex Sparse Signal Recovery
Improvement of Sound Quality in Visual Microphone by Manipulation of Focused Area
Hayata Nakano
;
Yuting Geng
;
Kenta Iwai
;
Takanobu Nishiura
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Improvement of Sound Quality in Visual Microphone by Manipulation of Focused Area
Sequence-to-sequence Voice Conversion-based Techniques for Electrolaryngeal Speech Enhancement in Noisy and Reverberant Conditions
Ding Ma
;
Yeonjong Choi
;
Takuya Fujimura
;
Fengji Li
;
Chao Xie
;
Kazuhiro Kobayashi
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Sequence-to-sequence Voice Conversion-based Techniques for Electrolaryngeal Speech Enhancement in Noisy and Reverberant Conditions
Nested Frequency Diverse Array for Co-located MIMO Radar using Grid-free DOA and Range Estimation Method
Beizuo Zhu
;
Kazunori Hayashi
;
Hiroki Mori
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Nested Frequency Diverse Array for Co-located MIMO Radar using Grid-free DOA and Range Estimation Method
How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
Sahibzada Adil Shahzad
;
Ammarah Hashmi
;
Yan-Tsung Peng
;
Yu Tsao
;
Hsin-Min Wang
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for How Good is ChatGPT at Audiovisual Deepfake Detection: A Comparative Study of ChatGPT, AI Models and Human Perception
Text- and Speech-style Control for Lecture Speech Generation Focusing on Disfluency
Daiki Yoshioka
;
Yuuto Nakata
;
Yusuke Yasuda
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Text- and Speech-style Control for Lecture Speech Generation Focusing on Disfluency
Scene Understanding by Fused Hu’s Invariant Moments and Deep Learning Features
Michael Nachipyangu
;
Jiangbin Zheng
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Scene Understanding by Fused Hu’s Invariant Moments and Deep Learning Features
An Investigation of Noisy-to-noisy Voice Conversion Performance in Various Noisy Conditions
Chao Xie
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for An Investigation of Noisy-to-noisy Voice Conversion Performance in Various Noisy Conditions
A Brain-inspired Multi-Detector Machine for Fake Speech Detection
Chang Feng
;
Xiaolong Wu
;
Mingxing Xu
;
Thomas Fang Zheng
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for A Brain-inspired Multi-Detector Machine for Fake Speech Detection
PPMamba: A Pyramid Pooling Local Auxiliary SSM-based Model for Remote Sensing Image Semantic Segmentation
Yin Hu
;
Xianping Ma
;
Jialu Sui
;
Man-On Pun
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for PPMamba: A Pyramid Pooling Local Auxiliary SSM-based Model for Remote Sensing Image Semantic Segmentation
Learning Separated Representations for Instrument-based Music Similarity
Yuka Hashizume
;
Li Li
;
Atsushi Miyashita
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Learning Separated Representations for Instrument-based Music Similarity
Improving Anomalous Sound Detection Through Pseudo-anomalous Set Selection and Pseudo-label Utilization Under Unlabeled Conditions
Ibuki Kuroyanagi
;
Takuya Fujimura
;
Kazuya Takeda
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Improving Anomalous Sound Detection Through Pseudo-anomalous Set Selection and Pseudo-label Utilization Under Unlabeled Conditions
Analysis and Extension of Noisy-target Training for Unsupervised Target Signal Enhancement
Takuya Fujimura
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Analysis and Extension of Noisy-target Training for Unsupervised Target Signal Enhancement
Music Bleeding-sound Reduction Based on Time-channel Nonnegative Matrix Factorization
Yusaku Mizobuchi
;
Daichi Kitamura
;
Tomohiko Nakamura
;
Norihiro Takamune
;
Hiroshi Saruwatari
;
Yu Takahashi
;
Kazunobu Kondo
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Music Bleeding-sound Reduction Based on Time-channel Nonnegative Matrix Factorization
Audio Difference Learning Framework for Audio Captioning
Tatsuya Komatsu
;
Kazuya Takeda
;
Tomoki Toda
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Audio Difference Learning Framework for Audio Captioning
Time-domain Separation Priority Pipeline-based Cascaded Multi-task Learning for Monaural Noisy and Reverberant Speech Separation
Shaoxiang Dang
;
Tetsuya Matsumoto
;
Yoshinori Takeuchi
;
Hiroaki Kudo
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Time-domain Separation Priority Pipeline-based Cascaded Multi-task Learning for Monaural Noisy and Reverberant Speech Separation
Multi-attribute Learning for Multi-level Emotion Recognition from Speech
Yuan Gao
;
Hao Shi
;
Chenhui Chu
;
Tatsuya Kawahara
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Multi-attribute Learning for Multi-level Emotion Recognition from Speech
ASVSpoof 2021: Detecting Spoofed Utterances Through Hybrid Features
Ramesh K. Bhukya
;
Aditya Raj
;
Anshul Kumar
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for ASVSpoof 2021: Detecting Spoofed Utterances Through Hybrid Features
MR-EEGWaveNet: Multiresolutional EEGWaveNet for Seizure Detection from Long EEG Recordings
Kazi Mahmudul Hassan
;
Xuyang Zhao
;
Hidenori Sugano
;
Toshihisa Tanaka
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for MR-EEGWaveNet: Multiresolutional EEGWaveNet for Seizure Detection from Long EEG Recordings
Two-stage Pipeline for Automated Cell Segmentation: Integrating Semantic and Instance Learning
Thanh-Ha Do
;
Hoang Minh-Huong Dang
;
Thanh-Lam Tran
;
Van-De Nguyen
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Two-stage Pipeline for Automated Cell Segmentation: Integrating Semantic and Instance Learning
Spatial Active Noise Control Based on Kernel Interpolation With Individual Directional Weighting
Kazuyuki Arikawa
;
Shoichi Koyama
;
Hiroshi Saruwatari
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Spatial Active Noise Control Based on Kernel Interpolation With Individual Directional Weighting
Research and Standards in 3D Scene Description Technologies: A Survey
Dong-shin Lim
;
Dong-hun Lee
;
Dong-hwi Kim
;
Jeong-hun Hong
;
Aro Kim
;
Chae-yeong Song
;
Bosung Baek
;
Dabin Kang
;
Myeong-jin Jang
;
Jinwoo Jeong
;
Sungjei Kim
;
Sang-hyo Park
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Research and Standards in 3D Scene Description Technologies: A Survey
Stabilizing and Enhancing Remixing-based Unsupervised Sound Source Separation
Kohei Saijo
;
Tetsuji Ogawa
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Stabilizing and Enhancing Remixing-based Unsupervised Sound Source Separation
Target Speaker Extractor Training with Diverse Speaker Conditions and Synthetic Data
Yun Liu
;
Xuechen Liu
;
Xiaoxiao Miao
;
Junichi Yamagishi
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Target Speaker Extractor Training with Diverse Speaker Conditions and Synthetic Data
Estimation of Geometric Transformation Matrices Using Grid-shaped Pilot Signals
Rinka Kawano
;
Masaki Kawamura
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Estimation of Geometric Transformation Matrices Using Grid-shaped Pilot Signals
Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-supervised Training of Sound Events With Partial Labels
Keisuke Imoto
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-supervised Training of Sound Events With Partial Labels
Overview Paper
Generative Coding: Promise and Challenges
Siwei Ma
;
Shenpeng Song
;
Bolin Chen
;
Qi Mao
;
Xiaohan Fang
;
Chuanmin Jia
;
Shiqi Wang
Abstract
View article
titled, SCM.SharedControls.Infrastructure.TitleDisplayModel?.Text
Open the
PDF
for in another window
Add to Citation Manager
for Generative Coding: Promise and Challenges
Latest
Most Read
Most Cited
An investigation of the robustness of flow- and diffusion-based speech generation models on noisy transcriptions
Event camera guided visual media restoration and 3D reconstruction: a survey
Investigation of part-level perceptual music similarity by large-scale listening test
Expanded noise modeling for scalable and adaptive zero-shot speech enhancement
Email alerts
Earlycite Alert
Closed Issue Alert
Latest Published Articles Alert
Close Modal
Recommended for you
These recommendations are informed by your reading behaviors and indicated interests.
RSS
Current Issue RSS Feed
Open Issues RSS Feed
Close Modal
Close Modal
This Feature Is Available To Subscribers Only
Sign In
or
Create an Account
Close Modal
Close Modal