Audio and Speech Processing

Authors and titles for September 2019

Total of 113 entries : 1-50 51-100 101-113


[101] arXiv:1909.12289 (cross-list from cs.LG)

Title: Attention Forcing for Sequence-to-sequence Model Training

Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales

Comments: 11 pages, 4 figures, conference

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[102] arXiv:1909.12408 (cross-list from cs.CL)

Title: Optimizing Speech Recognition For The Edge

Yuan Shangguan, Jian Li, Qiao Liang, Raziel Alvarez, Ian McGraw

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[103] arXiv:1909.12415 (cross-list from cs.CL)

Title: Improving RNN Transducer Modeling for End-to-End Speech Recognition

Jinyu Li, Rui Zhao, Hu Hu, Yifan Gong

Comments: Accepted by IEEE ASRU workshop, 2019

Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[104] arXiv:1909.12681 (cross-list from cs.CL)

Title: End-to-End Code-Switching ASR for Low-Resourced Language Pairs

Xianghu Yue, Grandee Lee, Emre Yılmaz, Fang Deng, Haizhou Li

Comments: Accepted for publication at IEEE ASRU Workshop 2019

Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[105] arXiv:1909.12699 (cross-list from cs.SD)

Title: Urban Sound Tagging using Convolutional Neural Networks

Sainath Adapa

Comments: 5 pages

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[106] arXiv:1909.12780 (cross-list from cs.CV)

Title: Learning to Have an Ear for Face Super-Resolution

Givi Meishvili, Simon Jenni, Paolo Favaro

Subjects: Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[107] arXiv:1909.13070 (cross-list from cs.SD)

Title: Emirati-Accented Speaker Identification in Stressful Talking Conditions

Ismail Shahin, Ali Bou Nassif

Comments: 6 pages, this work has been accepted in The International Conference on Electrical and Computing Technologies and Applications, 2019 (ICECTA 2019)

Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[108] arXiv:1909.13244 (cross-list from cs.SD)

Title: Speaker Verification in Emotional Talking Environments based on Third-Order Circular Suprasegmental Hidden Markov Model

Ismail Shahin, Ali Bou Nassif

Comments: 6 pages, accepted in The International Conference on Electrical and Computing Technologies and Applications, 2019 (ICECTA 2019). arXiv admin note: text overlap with arXiv:1903.09803

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[109] arXiv:1909.13287 (cross-list from cs.MM)

Title: MG-VAE: Deep Chinese Folk Songs Generation with Specific Regional Style

Jing Luo, Xinyu Yang, Shulei Ji, Juan Li

Comments: Accepted by the 7th Conference on Sound and Music Technology, 2019, Harbin, China

Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[110] arXiv:1909.13332 (cross-list from cs.CL)

Title: Recent Advances in End-to-End Spoken Language Understanding

Natalia Tomashenko, Antoine Caubriere, Yannick Esteve, Antoine Laurent, Emmanuel Morin

Journal-ref: Statistical Language and Speech Processing. SLSP 2019

Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[111] arXiv:1909.13537 (cross-list from cs.CL)

Title: Embeddings for DNN speaker adaptive training

Joanna Rownicka, Peter Bell, Steve Renals

Comments: Accepted at ASRU 2019

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[112] arXiv:1909.13775 (cross-list from cs.HC)

Title: Ephemeral instruments

Vincent Goudard (SU)

Comments: New Interfaces for Musical Expression, Jun 2019, Porto-Alegre, Brazil

Subjects: Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[113] arXiv:1909.13790 (cross-list from cs.CL)

Title: Incremental processing of noisy user utterances in the spoken language understanding task

Stefan Constantin, Jan Niehues, Alex Waibel

Comments: 10 pages, 3 figures, 7 tables, forthcoming in W-NUT 2019

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
