Sound

Authors and titles for September 2021

Total of 163 entries : 1-25 26-50 51-75 76-100 101-125 126-150 ... 151-163

Showing up to 25 entries per page: fewer | more | all

[51] arXiv:2109.11594 [pdf, other]: Title: Implementation of interactive tools for investigating fundamental frequency response of voiced sounds to auditory stimulation

Hideki Kawahara, Toshie Matsui Kohei, Yatabe Ken-Ichi Sakakibara Minoru Tsuzaki Masanori Morise Toshio Irino

Comments: Accepted for APSIPA ASC 2021

Subjects: Sound (cs.SD); Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[52] arXiv:2109.11782 [pdf, other]: Title: Causal Analysis of Carnatic Music: A Preliminary Study

Abhsihek Nandekar, Preeth Khona, Rajani M. B., Anindya Sinha, Nithin Nagaraj

Comments: 22 pages, 12 figures

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[53] arXiv:2109.11946 [pdf, other]: Title: Evaluating X-vector-based Speaker Anonymization under White-box Assessment

Pierre Champion (Inria), Denis Jouvet (Inria), Anthony Larcher (LIUM)

Journal-ref: 23rd International Conference on Speech and Computer - SPECOM 2021, Sep 2021, Saint Petersburg, Russia

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Audio and Speech Processing (eess.AS)
[54] arXiv:2109.12014 [pdf, other]: Title: A data acquisition setup for data driven acoustic design

Romana Rust, Achilleas Xydis, Kurt Heutschi, Nathanaël Perraudin, Gonzalo Casas, Chaoyu Du, Jürgen Strauss, Kurt Eggenschwiler, Fernando Perez-Cruz, Fabio Gramazio, Matthias Kohler

Journal-ref: Building Acoustics. February 2021

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[55] arXiv:2109.12056 [pdf, other]: Title: Parameterized Channel Normalization for Far-field Deep Speaker Verification

Xuechen Liu, Md Sahidullah, Tomi Kinnunen

Comments: Accepted for publication at ASRU 2021

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[56] arXiv:2109.12058 [pdf, other]: Title: Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification

Xuechen Liu, Md Sahidullah, Tomi Kinnunen

Comments: Accepted for publication at ASRU 2021

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[57] arXiv:2109.12471 [pdf, other]: Title: Rendering Spatial Sound for Interoperable Experiences in the Audio Metaverse

Jean-Marc Jot, Rémi Audfray, Mark Hertensteiner, Brian Schmidt

Comments: International Conference on Immersive and 3D Audio (i3DA), September 2021

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[58] arXiv:2109.12475 [pdf, other]: Title: General Theory of Music by Icosahedron 3: Musical invariant and Melakarta raga

Yusuke Imai

Comments: 31 pages, 34 figures

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[59] arXiv:2109.12591 [pdf, other]: Title: Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement

Guochen Yu, Andong Li, Yutian Wang, Yinuo Guo, Hui Wang, Chengshi Zheng

Comments: Accecpted by ICASSP 2022

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[60] arXiv:2109.12690 [pdf, other]: Title: Soundata: A Python library for reproducible use of audio datasets

Magdalena Fuentes, Justin Salamon, Pablo Zinemanas, Martín Rocamora, Genís Paja, Irán R. Román, Marius Miron, Xavier Serra, Juan Pablo Bello

Subjects: Sound (cs.SD); Databases (cs.DB); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[61] arXiv:2109.13072 [pdf, other]: Title: Estimating Angle of Arrival (AoA) of multiple Echoes in a Steering Vector Space

Yu-Lin Wei, Romit Roy Choudhury

Comments: 14 pages, 20 figures

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[62] arXiv:2109.13094 [pdf, other]: Title: Inferring Facing Direction from Voice Signals

Yu-Lin Wei, Rui Li, Abhinav Mehrotra, Romit Roy Choudhury, Nic Lane

Comments: 12 pages, 16 figures

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[63] arXiv:2109.13496 [pdf, other]: Title: FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures

Li Li, Hirokazu Kameoka, Shoji Makino

Comments: submit to IEEE/ACM TASLP, under review

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[64] arXiv:2109.13675 [pdf, other]: Title: FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis

Manh Luong, Viet Anh Tran

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[65] arXiv:2109.13731 [pdf, other]: Title: VoiceFixer: Toward General Speech Restoration with Neural Vocoder

Haohe Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[66] arXiv:2109.13821 [pdf, other]: Title: Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme

Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Machine Learning (stat.ML)
[67] arXiv:2109.14508 [pdf, other]: Title: Cross-domain Semi-Supervised Audio Event Classification Using Contrastive Regularization

Donmoon Lee, Kyogu Lee

Comments: 5 pages, 3 figures, and 2 tables. Accepted paper at IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[68] arXiv:2109.14705 [pdf, other]: Title: Adaptive Approach For Sparse Representations Using The Locally Competitive Algorithm For Audio

Soufiyan Bahadi, Jean Rouat, Éric Plourde

Comments: To be published at IEEE Machine Learning for Signal Processing 2021

Journal-ref: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)

Subjects: Sound (cs.SD); Computation and Language (cs.CL)
[69] arXiv:2109.14797 [pdf, other]: Title: Emergency Vehicles Audio Detection and Localization in Autonomous Driving

Hongyi Sun, Xinyi Liu, Kecheng Xu, Jinghao Miao, Qi Luo

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
[70] arXiv:2109.15053 [pdf, other]: Title: Fine-tuning wav2vec2 for speaker recognition

Nik Vaessen, David A. van Leeuwen

Comments: accepted to ICASSP 2022

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[71] arXiv:2109.15188 [pdf, other]: Title: Assessing Algorithmic Biases for Musical Version Identification

Furkan Yesiler, Marius Miron, Joan Serrà, Emilia Gómez

Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Audio and Speech Processing (eess.AS)
[72] arXiv:2109.00281 (cross-list from cs.CR) [pdf, other]: Title: Benchmarking and challenges in security and privacy for voice biometrics

Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noe, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi

Comments: Submitted to the symposium of the ISCA Security & Privacy in Speech Communications (SPSC) special interest group

Subjects: Cryptography and Security (cs.CR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[73] arXiv:2109.00393 (cross-list from cs.NE) [pdf, other]: Title: Mean absorption estimation from room impulse responses using virtually supervised learning

Cédric Foy (UMRAE ), Antoine Deleforge (MULTISPEECH), Diego Di Carlo (PANAMA)

Journal-ref: Journal of the Acoustical Society of America, Acoustical Society of America, 2021, 150 (2), pp.1286-1299

Subjects: Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Audio and Speech Processing (eess.AS); Classical Physics (physics.class-ph)
[74] arXiv:2109.00535 (cross-list from eess.AS) [pdf, other]: Title: ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan

Héctor Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi

Comments: this http URL

Subjects: Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD)
[75] arXiv:2109.00537 (cross-list from eess.AS) [pdf, other]: Title: ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, Héctor Delgado

Comments: Accepted to the ASVspoof 2021 Workshop

Subjects: Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD)

Total of 163 entries : 1-25 26-50 51-75 76-100 101-125 126-150 ... 151-163

Showing up to 25 entries per page: fewer | more | all