Audio and Speech Processing

Authors and titles for June 2020

Total of 181 entries; showing entries 26-50
[26] arXiv:2006.02814 [pdf, other]
Title: CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning
Sameer Khurana, Antoine Laurent, James Glass
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[27] arXiv:2006.02902 [pdf, other]
Title: Constrained Variational Autoencoder for improving EEG based Speech Recognition Systems
Gautam Krishna, Co Tran, Mason Carnahan, Ahmed Tewfik
Comments: Under Review. arXiv admin note: substantial text overlap with arXiv:2006.01260
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Signal Processing (eess.SP)
[28] arXiv:2006.03107 [pdf, other]
Title: Attention and Encoder-Decoder based models for transforming articulatory movements at different speaking rates
Abhayjeet Singh, Aravind Illa, Prasanta Kumar Ghosh
Comments: 5 pages, 4 figures, InterSpeech 2020
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[29] arXiv:2006.03214 [pdf, other]
Title: Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning
Haibin Wu, Andy T. Liu, Hung-yi Lee
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG)
[30] arXiv:2006.03411 [pdf, other]
Title: Contextual RNN-T For Open Domain ASR
Mahaveer Jain, Gil Keren, Jay Mahadeokar, Geoffrey Zweig, Florian Metze, Yatharth Saraf
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[31] arXiv:2006.03429 [pdf, other]
Title: Acoustic Anomaly Detection for Machine Sounds based on Image Transfer Learning
Robert Müller, Fabian Ritz, Steffen Illium, Claudia Linnhoff-Popien
Comments: ICAART 2021, 8 pages, 2 figures, 1 table
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[32] arXiv:2006.03473 [pdf, other]
Title: AP20-OLR Challenge: Three Tasks and Their Baselines
Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Liming Song, Cheng Yang
Comments: arXiv admin note: substantial text overlap with arXiv:1907.07626, arXiv:1806.00616, arXiv:1706.09742
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[33] arXiv:2006.04136 [pdf, other]
Title: Analysis and Synthesis of Hypo and Hyperarticulated Speech
Benjamin Picart, Thomas Drugman, Thierry Dutoit
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
[34] arXiv:2006.04138 [pdf, other]
Title: Maximum Phase Modeling for Sparse Linear Prediction of Speech
Thomas Drugman
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[35] arXiv:2006.04142 [pdf, other]
Title: Parametric Representation for Singing Voice Synthesis: a Comparative Evaluation
Onur Babacan, Thomas Drugman, Tuomo Raitio, Daniel Erro, Thierry Dutoit
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[36] arXiv:2006.04154 [pdf, other]
Title: VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture
Da-Yi Wu, Yen-Hao Chen, Hung-Yi Lee
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[37] arXiv:2006.04326 [pdf, other]
Title: Semi-Supervised Contrastive Learning with Generalized Contrastive Loss and Its Application to Speaker Recognition
Nakamasa Inoue, Keita Goto
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[38] arXiv:2006.04372 [pdf, other]
Title: Zero resource speech synthesis using transcripts derived from perceptual acoustic units
Karthik Pandia D S, Hema A Murthy
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[39] arXiv:2006.04469 [pdf, other]
Title: A non-causal FFTNet architecture for speech enhancement
Muhammed PV Shifas, Nagaraj Adiga, Vassilis Tsiaras, Yannis Stylianou
Comments: 5 pages
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[40] arXiv:2006.04558 [pdf, other]
Title: FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu
Comments: Accepted by ICLR 2021
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[41] arXiv:2006.04664 [pdf, other]
Title: MultiSpeech: Multi-Speaker Text to Speech with Transformer
Mingjian Chen, Xu Tan, Yi Ren, Jin Xu, Hao Sun, Sheng Zhao, Tao Qin, Tie-Yan Liu
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
[42] arXiv:2006.04928 [pdf, other]
Title: Learning to Count Words in Fluent Speech enables Online Speech Recognition
George Sterpu, Christian Saam, Naomi Harte
Comments: Accepted at the 8th IEEE Spoken Language Technology Workshop (SLT 2021)
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[43] arXiv:2006.05129 [pdf, other]
Title: On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech
Balázs Tarján, György Szaszák, Tibor Fegyó, Péter Mihajlik
Comments: 8 pages, 2 figures, accepted for publication at TSD 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[44] arXiv:2006.05174 [pdf, other]
Title: Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers
Tsung-Han Wu, Chun-Chen Hsieh, Yen-Hao Chen, Po-Han Chi, Hung-yi Lee
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[45] arXiv:2006.05233 [pdf, other]
Title: A fully recurrent feature extraction for single channel speech enhancement
Muhammed PV Shifas, Santelli Claudio, Vassilis Tsiaras, Yannis Stylianou
Comments: 5 pages
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[46] arXiv:2006.05257 [pdf, other]
Title: Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Gurunath Reddy Madhumani, Sanket Shah, Basil Abraham, Vikas Joshi, Sunayana Sitaram
Comments: 5 pages (4 pages + 1 reference), 3 tables, 2 figures
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[47] arXiv:2006.05365 [pdf, other]
Title: Vocal markers from sustained phonation in Huntington's Disease
Rachid Riad, Hadrien Titeux, Laurie Lemoine, Justine Montillot, Jennifer Hamet Bagnou, Xuan Nga Cao, Emmanuel Dupoux, Anne-Catherine Bachoud-Lévi
Comments: To appear at INTERSPEECH 2020. 1 page of supplementary material appears only in the arXiv version. Code to replicate this https URL
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[48] arXiv:2006.05474 [pdf, other]
Title: Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Changhan Wang, Juan Pino, Jiatao Gu
Comments: Accepted to INTERSPEECH 2020
Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
[49] arXiv:2006.05584 [pdf, other]
Title: Exploring Quality and Generalizability in Parameterized Neural Audio Effects
William Mitchell, Scott H. Hawley
Comments: 7 pages, 5 figures
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[50] arXiv:2006.05596 [pdf, other]
Title: Speaker Diarization: Using Recurrent Neural Networks
Vishal Sharma, Zekun Zhang, Zachary Neubert, Curtis Dyreson
Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)