Sound

Authors and titles for June 2019

Total of 132 entries : 1-25 26-50 51-75 76-100 101-125 126-132

Showing up to 25 entries per page: fewer | more | all

[51] arXiv:1906.01199 (cross-list from cs.CL) [pdf, other]: Title: Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation

Elizabeth Salesky, Matthias Sperber, Alan W Black

Comments: Accepted to ACL 2019

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[52] arXiv:1906.01454 (cross-list from eess.AS) [pdf, other]: Title: Voice Mimicry Attacks Assisted by Automatic Speaker Verification

Ville Vestman, Tomi Kinnunen, Rosa González Hautamäki, Md Sahidullah

Comments: Published in Computer Speech and Language. arXiv admin note: text overlap with arXiv:1811.03790

Subjects: Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Sound (cs.SD)
[53] arXiv:1906.02070 (cross-list from eess.SP) [pdf, other]: Title: Automated Activity Recognition of Construction Equipment Using a Data Fusion Approach

Behnam Sherafat, Abbas Rashidi, Yong-Cheol Lee, Changbum R. Ahn

Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[54] arXiv:1906.02125 (cross-list from cs.CL) [pdf, other]: Title: Strong and Simple Baselines for Multimodal Utterance Embeddings

Paul Pu Liang, Yao Chong Lim, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov, Louis-Philippe Morency

Comments: NAACL 2019 oral presentation

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[55] arXiv:1906.02246 (cross-list from cs.LG) [pdf, other]: Title: Complex Evolution Recurrent Neural Networks (ceRNNs)

Izhak Shafran, Tom Bagby, R. J. Skerry-Ryan

Journal-ref: Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 5854-5858, 2018

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[56] arXiv:1906.02572 (cross-list from eess.AS) [pdf, other]: Title: GIBBONFINDR: An R package for the detection and classification of acoustic signals

Dena J. Clink, Holger Klinck

Comments: R package

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Quantitative Methods (q-bio.QM)
[57] arXiv:1906.02812 (cross-list from eess.AS) [pdf, other]: Title: Role of non-linear data processing on speech recognition task in the framework of reservoir computing

Flavio Abreu Araujo, Mathieu Riou, Jacob Torrejon, Sumito Tsunegi, Damien Querlioz, Kay Yakushiji, Akio Fukushima, Hitoshi Kubota, Shinji Yuasa, Mark D. Stiles, Julie Grollier

Comments: 13 pages, 5 figures

Journal-ref: Scientific Reports 10, 328 (2020)

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
[58] arXiv:1906.03402 (cross-list from cs.CL) [pdf, other]: Title: Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Eric Battenberg, Soroosh Mariooryad, Daisy Stanton, RJ Skerry-Ryan, Matt Shannon, David Kao, Tom Bagby

Comments: Submitted to ICLR 2020

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[59] arXiv:1906.03450 (cross-list from cs.IR) [pdf, other]: Title: Adversarial Mahalanobis Distance-based Attentive Song Recommender for Automatic Playlist Continuation

Thanh Tran, Renee Sweeney, Kyumin Lee

Journal-ref: SIGIR 2019

Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[60] arXiv:1906.03870 (cross-list from cs.IR) [pdf, other]: Title: Deep Learning-Based Automatic Downbeat Tracking: A Brief Review

Bijue Jia, Jiancheng Lv, Dayiheng Liu

Comments: 22 pages, 7 figures. arXiv admin note: text overlap with arXiv:1605.08396 by other authors

Journal-ref: Multimedia Systems, 2019, 25(6): 617-638

Subjects: Information Retrieval (cs.IR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[61] arXiv:1906.04027 (cross-list from cs.AI) [pdf, other]: Title: "Did You Hear That?" Learning to Play Video Games from Audio Cues

Raluca D. Gaina, Matthew Stephenson

Comments: 4 pages, 2 figures, accepted at IEEE COG 2019

Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[62] arXiv:1906.04165 (cross-list from cs.CL) [pdf, other]: Title: Leveraging BERT for Extractive Text Summarization on Lectures

Derek Miller

Comments: 7 Pages, First Version

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
[63] arXiv:1906.04232 (cross-list from eess.IV) [pdf, other]: Title: BowNet: Dilated Convolution Neural Network for Ultrasound Tongue Contour Extraction

M. Hamed Mozaffari, Won-Sook Lee

Comments: 23 pages, 15 figures, 10 tables

Journal-ref: BowNet: Dilated convolutional neural network for ultrasound tongue contour extraction, 2019, The Journal of the Acoustical Society of America, pages 2940-2941, volume 146, number 4

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[64] arXiv:1906.04233 (cross-list from eess.AS) [pdf, other]: Title: Using generative modelling to produce varied intonation for speech synthesis

Zack Hodari, Oliver Watts, Simon King

Comments: Accepted for the 10th ISCA Speech Synthesis Workshop (SSW10)

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[65] arXiv:1906.04301 (cross-list from cs.LG) [pdf, other]: Title: Transfer Learning for Ultrasound Tongue Contour Extraction with Different Domains

M. Hamed Mozaffari, Won-Sook Lee

Comments: 3 figures, 9 pages, 1 table, 16 references

Journal-ref: The Journal of the Acoustical Society of America 146, 2940 (2019)

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[66] arXiv:1906.04310 (cross-list from eess.SP) [pdf, other]: Title: Estimation of 2D Velocity Model using Acoustic Signals and Convolutional Neural Networks

Marco Apolinario, Samuel Huaman Bustamante, Giorgio Morales, Joel Telles, Daniel Diaz

Comments: Submitted to IEEE XXVI International Conference on Electronics, Electrical Engineering and Computing (INTERCON 2019). Lima, Peru

Journal-ref: 2019 IEEE XXVI International Conference on Electronics, Electrical Engineering and Computing (INTERCON)

Subjects: Signal Processing (eess.SP); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[67] arXiv:1906.04323 (cross-list from cs.CL) [pdf, other]: Title: Word-level Speech Recognition with a Letter to Word Encoder

Ronan Collobert, Awni Hannun, Gabriel Synnaeve

Comments: ICML 2020

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[68] arXiv:1906.05507 (cross-list from eess.AS) [pdf, other]: Title: Adjusting Pleasure-Arousal-Dominance for Continuous Emotional Text-to-speech Synthesizer

Azam Rabiee, Tae-Ho Kim, Soo-Young Lee

Comments: Interspeech2019, Show and Tell demonstration this https URL

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
[69] arXiv:1906.05678 (cross-list from eess.AS) [pdf, other]: Title: Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise

Chris Larson, Tarek Lahlou, Diana Mingels, Zachary Kulis, Erik Mueller

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[70] arXiv:1906.05681 (cross-list from eess.AS) [pdf, other]: Title: Deep Learning based Emotion Recognition System Using Speech Features and Transcriptions

Suraj Tripathi, Abhay Kumar, Abhiram Ramesh, Chirag Singh, Promod Yenigalla

Comments: Accepted in CICLing 2019

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[71] arXiv:1906.05682 (cross-list from eess.AS) [pdf, other]: Title: Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition

Suraj Tripathi, Abhay Kumar, Abhiram Ramesh, Chirag Singh, Promod Yenigalla

Comments: Accepted in CICLing 2019

Subjects: Audio and Speech Processing (eess.AS); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
[72] arXiv:1906.05962 (cross-list from eess.AS) [pdf, other]: Title: Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments

Guan-Lin Chao, William Chan, Ian Lane

Comments: Published in INTERSPEECH 2016

Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[73] arXiv:1906.06301 (cross-list from eess.AS) [pdf, other]: Title: Video-Driven Speech Reconstruction using Generative Adversarial Networks

Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic

Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[74] arXiv:1906.06355 (cross-list from eess.AS) [pdf, other]: Title: Perceptual Based Adversarial Audio Attacks

Joseph Szurley, J. Zico Kolter

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
[75] arXiv:1906.06763 (cross-list from eess.AS) [pdf, other]: Title: Audio Transport: A Generalized Portamento via Optimal Transport

Trevor Henderson, Justin Solomon

Comments: Accepted to The 22nd International Conference on Digital Audio Effects (DAFx-19), Birmingham, UK, September 2-6, 2019

Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)

Total of 132 entries : 1-25 26-50 51-75 76-100 101-125 126-132

Showing up to 25 entries per page: fewer | more | all