Sound

Authors and titles for September 2021

Total of 163 entries : 1-50 51-100 101-150 151-163

[151] arXiv:2109.14061 (cross-list from eess.AS) [pdf, other]: Title: The impact of non-target events in synthetic soundscapes for sound event detection

Francesca Ronchini, Romain Serizel, Nicolas Turpault, Samuele Cornell

Journal-ref: Proceedings of the 6th Detection and Classification of Acoustic Scenes and Events 2021 Workshop (DCASE2021)

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[152] arXiv:2109.14200 (cross-list from eess.AS) [pdf, other]: Title: Can phones, syllables, and words emerge as side-products of cross-situational audiovisual learning? -- A computational investigation

Khazar Khorrami, Okko Räsänen

Comments: Final manuscript published in Language Development Research under CC BY-NC-SA 4.0. Pre-print redistributed through arXiv with permission. Replaces corrupted PsyArXiv pre-print repository at this https URL

Journal-ref: Language Development Research, 1(1), 123-191 (2021)

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[153] arXiv:2109.14357 (cross-list from eess.AS) [pdf, other]: Title: Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch

Jakob Poncelet, Hugo Van hamme

Comments: To be published in the 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2021)

Journal-ref: 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp. 169-176

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[154] arXiv:2109.14370 (cross-list from eess.AS) [pdf, other]: Title: Objective-oriented method for uniformation of various directivity representations

Adam Szwajcowski

Comments: Author's Accepted Manuscript from 151st AES Convention

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[155] arXiv:2109.14420 (cross-list from cs.CL) [pdf, other]: Title: FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition

Yichong Leng, Xu Tan, Rui Wang, Linchen Zhu, Jin Xu, Wenjie Liu, Linquan Liu, Tao Qin, Xiang-Yang Li, Edward Lin, Tie-Yan Liu

Comments: Findings of EMNLP 2021

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[156] arXiv:2109.14436 (cross-list from eess.AS) [pdf, other]: Title: A Universal Deep Room Acoustics Estimator

Paula Sánchez López, Paul Callens, Milos Cernak

Comments: Room acoustics, Convolutional Recurrent Neural Network, RT60, C50, DRR, STI, SNR

Journal-ref: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[157] arXiv:2109.14725 (cross-list from cs.LG) [pdf, other]: Title: Tiny-CRNN: Streaming Wakeword Detection In A Low Footprint Setting

Mohammad Omar Khursheed, Christin Jose, Rajath Kumar, Gengshen Fu, Brian Kulis, Santosh Kumar Cheekatmalla

Comments: arXiv admin note: substantial text overlap with arXiv:2011.12941

Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[158] arXiv:2109.14831 (cross-list from eess.AS) [pdf, other]: Title: USEV: Universal Speaker Extraction with Visual Cue

Zexu Pan, Meng Ge, Haizhou Li

Comments: Accepted by TASLP

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[159] arXiv:2109.14992 (cross-list from cs.HC) [pdf, other]: Title: Xenakis: Experimenting with Data, Cities, and Sounds

Victor Schetinger, Ignacio Pérez-Messina, Renan Guarese, Velitchko Filipov

Comments: This manuscript heavily links to a miro board as part of an experiment in exposition, and was presented at this http URL, a workshop co-located with IEEE VIS 2021 (held virtually)

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[160] arXiv:2109.14994 (cross-list from eess.AS) [pdf, other]: Title: An investigation of pre-upsampling generative modelling and Generative Adversarial Networks in audio super resolution

James King, Ramon Viñas Torné, Alexander Campbell, Pietro Liò

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[161] arXiv:2109.15108 (cross-list from eess.AS) [pdf, other]: Title: Federated Learning in ASR: Not as Easy as You Think

Wentao Yu, Jan Freiwald, Sören Tewes, Fabien Huennemeyer, Dorothea Kolossa

Journal-ref: ITG Conference on Speech Communication, 2021

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[162] arXiv:2109.15127 (cross-list from eess.AS) [pdf, other]: Title: Real-Time Multi-Level Neonatal Heart and Lung Sound Quality Assessment for Telehealth Applications

Ethan Grooby, Chiranjibi Sitaula, Davood Fattahi, Reza Sameni, Kenneth Tan, Lindsay Zhou, Arrabella King, Ashwin Ramanathan, Atul Malhotra, Guy A. Dumont, Faezeh Marzbanrad

Comments: 13 pages, 8 figures, 3 tables. Paper submitted and under review in IEEE Access

Journal-ref: IEEE Access, 2022

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP)
[163] arXiv:2109.15166 (cross-list from eess.AS) [pdf, other]: Title: PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Yi Ren, Jinglin Liu, Zhou Zhao

Comments: Accepted by NeurIPS 2021. Source code: this https URL

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)