Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > eess.AS

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Audio and Speech Processing

Authors and titles for September 2019

Total of 113 entries : 1-50 51-100 101-113
Showing up to 50 entries per page: fewer | more | all
[101] arXiv:1909.12289 (cross-list from cs.LG) [pdf, other]
Title: Attention Forcing for Sequence-to-sequence Model Training
Qingyun Dou, Yiting Lu, Joshua Efiong, Mark J. F. Gales
Comments: 11 pages, 4 figures, conference
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[102] arXiv:1909.12408 (cross-list from cs.CL) [pdf, other]
Title: Optimizing Speech Recognition For The Edge
Yuan Shangguan, Jian Li, Qiao Liang, Raziel Alvarez, Ian McGraw
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[103] arXiv:1909.12415 (cross-list from cs.CL) [pdf, other]
Title: Improving RNN Transducer Modeling for End-to-End Speech Recognition
Jinyu Li, Rui Zhao, Hu Hu, Yifan Gong
Comments: Accepted by IEEE ASRU workshop, 2019
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[104] arXiv:1909.12681 (cross-list from cs.CL) [pdf, other]
Title: End-to-End Code-Switching ASR for Low-Resourced Language Pairs
Xianghu Yue, Grandee Lee, Emre Yılmaz, Fang Deng, Haizhou Li
Comments: Accepted for publication at IEEE ASRU Workshop 2019
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[105] arXiv:1909.12699 (cross-list from cs.SD) [pdf, other]
Title: Urban Sound Tagging using Convolutional Neural Networks
Sainath Adapa
Comments: 5 pages
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[106] arXiv:1909.12780 (cross-list from cs.CV) [pdf, other]
Title: Learning to Have an Ear for Face Super-Resolution
Givi Meishvili, Simon Jenni, Paolo Favaro
Subjects: Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[107] arXiv:1909.13070 (cross-list from cs.SD) [pdf, other]
Title: Emirati-Accented Speaker Identification in Stressful Talking Conditions
Ismail Shahin, Ali Bou Nassif
Comments: 6 pages, this work has been accepted in The International Conference on Electrical and Computing Technologies and Applications, 2019 (ICECTA 2019)
Subjects: Sound (cs.SD); Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[108] arXiv:1909.13244 (cross-list from cs.SD) [pdf, other]
Title: Speaker Verification in Emotional Talking Environments based on Third-Order Circular Suprasegmental Hidden Markov Model
Ismail Shahin, Ali Bou Nassif
Comments: 6 pages, accepted in The International Conference on Electrical and Computing Technologies and Applications, 2019 (ICECTA 2019). arXiv admin note: text overlap with arXiv:1903.09803
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
[109] arXiv:1909.13287 (cross-list from cs.MM) [pdf, other]
Title: MG-VAE: Deep Chinese Folk Songs Generation with Specific Regional Style
Jing Luo, Xinyu Yang, Shulei Ji, Juan Li
Comments: Accepted by the 7th Conference on Sound and Music Technology, 2019, Harbin, China
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[110] arXiv:1909.13332 (cross-list from cs.CL) [pdf, other]
Title: Recent Advances in End-to-End Spoken Language Understanding
Natalia Tomashenko, Antoine Caubriere, Yannick Esteve, Antoine Laurent, Emmanuel Morin
Journal-ref: Statistical Language and Speech Processing. SLSP 2019
Subjects: Computation and Language (cs.CL); Audio and Speech Processing (eess.AS)
[111] arXiv:1909.13537 (cross-list from cs.CL) [pdf, other]
Title: Embeddings for DNN speaker adaptive training
Joanna Rownicka, Peter Bell, Steve Renals
Comments: Accepted at ASRU 2019
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[112] arXiv:1909.13775 (cross-list from cs.HC) [pdf, other]
Title: Ephemeral instruments
Vincent Goudard (SU)
Comments: New Interfaces for Musical Expression, Jun 2019, Porto-Alegre, Brazil
Subjects: Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[113] arXiv:1909.13790 (cross-list from cs.CL) [pdf, other]
Title: Incremental processing of noisy user utterances in the spoken language understanding task
Stefan Constantin, Jan Niehues, Alex Waibel
Comments: 10 pages, 3 figures, 7 tables, forthcoming in W-NUT 2019
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Total of 113 entries : 1-50 51-100 101-113
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack