Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.SD

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Sound

Authors and titles for September 2021

Total of 163 entries : 1-25 26-50 51-75 76-100 101-125 126-150 ... 151-163
Showing up to 25 entries per page: fewer | more | all
[51] arXiv:2109.11594 [pdf, other]
Title: Implementation of interactive tools for investigating fundamental frequency response of voiced sounds to auditory stimulation
Hideki Kawahara, Toshie Matsui Kohei, Yatabe Ken-Ichi Sakakibara Minoru Tsuzaki Masanori Morise Toshio Irino
Comments: Accepted for APSIPA ASC 2021
Subjects: Sound (cs.SD); Human-Computer Interaction (cs.HC); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[52] arXiv:2109.11782 [pdf, other]
Title: Causal Analysis of Carnatic Music: A Preliminary Study
Abhsihek Nandekar, Preeth Khona, Rajani M. B., Anindya Sinha, Nithin Nagaraj
Comments: 22 pages, 12 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[53] arXiv:2109.11946 [pdf, other]
Title: Evaluating X-vector-based Speaker Anonymization under White-box Assessment
Pierre Champion (Inria), Denis Jouvet (Inria), Anthony Larcher (LIUM)
Journal-ref: 23rd International Conference on Speech and Computer - SPECOM 2021, Sep 2021, Saint Petersburg, Russia
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Audio and Speech Processing (eess.AS)
[54] arXiv:2109.12014 [pdf, other]
Title: A data acquisition setup for data driven acoustic design
Romana Rust, Achilleas Xydis, Kurt Heutschi, Nathanaël Perraudin, Gonzalo Casas, Chaoyu Du, Jürgen Strauss, Kurt Eggenschwiler, Fernando Perez-Cruz, Fabio Gramazio, Matthias Kohler
Journal-ref: Building Acoustics. February 2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[55] arXiv:2109.12056 [pdf, other]
Title: Parameterized Channel Normalization for Far-field Deep Speaker Verification
Xuechen Liu, Md Sahidullah, Tomi Kinnunen
Comments: Accepted for publication at ASRU 2021
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[56] arXiv:2109.12058 [pdf, other]
Title: Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification
Xuechen Liu, Md Sahidullah, Tomi Kinnunen
Comments: Accepted for publication at ASRU 2021
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
[57] arXiv:2109.12471 [pdf, other]
Title: Rendering Spatial Sound for Interoperable Experiences in the Audio Metaverse
Jean-Marc Jot, Rémi Audfray, Mark Hertensteiner, Brian Schmidt
Comments: International Conference on Immersive and 3D Audio (i3DA), September 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[58] arXiv:2109.12475 [pdf, other]
Title: General Theory of Music by Icosahedron 3: Musical invariant and Melakarta raga
Yusuke Imai
Comments: 31 pages, 34 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[59] arXiv:2109.12591 [pdf, other]
Title: Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
Guochen Yu, Andong Li, Yutian Wang, Yinuo Guo, Hui Wang, Chengshi Zheng
Comments: Accecpted by ICASSP 2022
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[60] arXiv:2109.12690 [pdf, other]
Title: Soundata: A Python library for reproducible use of audio datasets
Magdalena Fuentes, Justin Salamon, Pablo Zinemanas, Martín Rocamora, Genís Paja, Irán R. Román, Marius Miron, Xavier Serra, Juan Pablo Bello
Subjects: Sound (cs.SD); Databases (cs.DB); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[61] arXiv:2109.13072 [pdf, other]
Title: Estimating Angle of Arrival (AoA) of multiple Echoes in a Steering Vector Space
Yu-Lin Wei, Romit Roy Choudhury
Comments: 14 pages, 20 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[62] arXiv:2109.13094 [pdf, other]
Title: Inferring Facing Direction from Voice Signals
Yu-Lin Wei, Rui Li, Abhinav Mehrotra, Romit Roy Choudhury, Nic Lane
Comments: 12 pages, 16 figures
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[63] arXiv:2109.13496 [pdf, other]
Title: FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures
Li Li, Hirokazu Kameoka, Shoji Makino
Comments: submit to IEEE/ACM TASLP, under review
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[64] arXiv:2109.13675 [pdf, other]
Title: FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis
Manh Luong, Viet Anh Tran
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[65] arXiv:2109.13731 [pdf, other]
Title: VoiceFixer: Toward General Speech Restoration with Neural Vocoder
Haohe Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[66] arXiv:2109.13821 [pdf, other]
Title: Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme
Vadim Popov, Ivan Vovk, Vladimir Gogoryan, Tasnima Sadekova, Mikhail Kudinov, Jiansheng Wei
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Machine Learning (stat.ML)
[67] arXiv:2109.14508 [pdf, other]
Title: Cross-domain Semi-Supervised Audio Event Classification Using Contrastive Regularization
Donmoon Lee, Kyogu Lee
Comments: 5 pages, 3 figures, and 2 tables. Accepted paper at IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[68] arXiv:2109.14705 [pdf, other]
Title: Adaptive Approach For Sparse Representations Using The Locally Competitive Algorithm For Audio
Soufiyan Bahadi, Jean Rouat, Éric Plourde
Comments: To be published at IEEE Machine Learning for Signal Processing 2021
Journal-ref: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP)
Subjects: Sound (cs.SD); Computation and Language (cs.CL)
[69] arXiv:2109.14797 [pdf, other]
Title: Emergency Vehicles Audio Detection and Localization in Autonomous Driving
Hongyi Sun, Xinyi Liu, Kecheng Xu, Jinghao Miao, Qi Luo
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Robotics (cs.RO); Audio and Speech Processing (eess.AS)
[70] arXiv:2109.15053 [pdf, other]
Title: Fine-tuning wav2vec2 for speaker recognition
Nik Vaessen, David A. van Leeuwen
Comments: accepted to ICASSP 2022
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[71] arXiv:2109.15188 [pdf, other]
Title: Assessing Algorithmic Biases for Musical Version Identification
Furkan Yesiler, Marius Miron, Joan Serrà, Emilia Gómez
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Audio and Speech Processing (eess.AS)
[72] arXiv:2109.00281 (cross-list from cs.CR) [pdf, other]
Title: Benchmarking and challenges in security and privacy for voice biometrics
Jean-Francois Bonastre, Hector Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Paul-Gauthier Noe, Jose Patino, Md Sahidullah, Brij Mohan Lal Srivastava, Massimiliano Todisco, Natalia Tomashenko, Emmanuel Vincent, Xin Wang, Junichi Yamagishi
Comments: Submitted to the symposium of the ISCA Security & Privacy in Speech Communications (SPSC) special interest group
Subjects: Cryptography and Security (cs.CR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[73] arXiv:2109.00393 (cross-list from cs.NE) [pdf, other]
Title: Mean absorption estimation from room impulse responses using virtually supervised learning
Cédric Foy (UMRAE ), Antoine Deleforge (MULTISPEECH), Diego Di Carlo (PANAMA)
Journal-ref: Journal of the Acoustical Society of America, Acoustical Society of America, 2021, 150 (2), pp.1286-1299
Subjects: Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Audio and Speech Processing (eess.AS); Classical Physics (physics.class-ph)
[74] arXiv:2109.00535 (cross-list from eess.AS) [pdf, other]
Title: ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan
Héctor Delgado, Nicholas Evans, Tomi Kinnunen, Kong Aik Lee, Xuechen Liu, Andreas Nautsch, Jose Patino, Md Sahidullah, Massimiliano Todisco, Xin Wang, Junichi Yamagishi
Comments: this http URL
Subjects: Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD)
[75] arXiv:2109.00537 (cross-list from eess.AS) [pdf, other]
Title: ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, Héctor Delgado
Comments: Accepted to the ASVspoof 2021 Workshop
Subjects: Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD)
Total of 163 entries : 1-25 26-50 51-75 76-100 101-125 126-150 ... 151-163
Showing up to 25 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack