Sound

Authors and titles for May 2016

Total of 22 entries

Showing up to 25 entries per page: fewer | more | all

[1] arXiv:1605.00810 [pdf, other]: Title: Diagonal Unloading Beamforming for Source Localization

Daniele Salvati, Carlo Drioli, Gian Luca Foresti

Journal-ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 25, Issue 3, Pages 609-622 (2018)

Subjects: Sound (cs.SD)
[2] arXiv:1605.01329 [pdf, other]: Title: Single Channel Speech Enhancement Using Outlier Detection

Eunjoon Cho, Bowon Lee, Ronald Schafer, Bernard Widrow

Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[3] arXiv:1605.01755 [pdf, other]: Title: DCTNet and PCANet for acoustic signal feature extraction

Yin Xian, Andrew Thompson, Xiaobai Sun, Douglas Nowacek, Loren Nolte

Comments: 22 figures

Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[4] arXiv:1605.02401 [pdf, other]: Title: Audio Event Detection using Weakly Labeled Data

Anurag Kumar, Bhiksha Raj

Comments: ACM Multimedia 2016

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[5] arXiv:1605.02427 [pdf, other]: Title: Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks

Anurag Kumar, Dinei Florencio

Subjects: Sound (cs.SD)
[6] arXiv:1605.03724 [pdf, other]: Title: Sub-vector Extraction and Cascade Post-Processing for Speaker Verification Using MLLR Super-vectors

A. K. Sarkar, C. Barras, V. B. Le, D. Matrouf

Subjects: Sound (cs.SD)
[7] arXiv:1605.06644 [pdf, other]: Title: Deep convolutional networks on the pitch spiral for musical instrument recognition

Vincent Lostanlen, Carmine-Emanuele Cella

Comments: 7 pages, 3 figures. Accepted at the International Society for Music Information Retrieval Conference (ISMIR) conference in New York City, NY, USA, August 2016

Subjects: Sound (cs.SD)
[8] arXiv:1605.07008 [pdf, other]: Title: madmom: a new Python Audio and Music Signal Processing Library

Sebastian Böck, Filip Korzeniowski, Jan Schlüter, Florian Krebs, Gerhard Widmer

Subjects: Sound (cs.SD)
[9] arXiv:1605.07466 [pdf, other]: Title: Complex NMF under phase constraints based on signal modeling: application to audio source separation

Paul Magron, Roland Badeau, Bertrand David

Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016

Subjects: Sound (cs.SD)
[10] arXiv:1605.07467 [pdf, other]: Title: Phase reconstruction of spectrograms with linear unwrapping: application to audio signal restoration

Paul Magron, Roland Badeau, Bertrand David

Comments: European Signal Processing Conference (EUSIPCO) 2015

Subjects: Sound (cs.SD)
[11] arXiv:1605.07468 [pdf, other]: Title: Phase reconstruction of spectrograms based on a model of repeated audio events

Paul Magron, Roland Badeau, Bertrand David

Comments: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2015

Subjects: Sound (cs.SD)
[12] arXiv:1605.07469 [pdf, other]: Title: Phase recovery in NMF for audio source separation: an insightful benchmark

Paul Magron, Roland Badeau, Bertrand David

Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015

Subjects: Sound (cs.SD)
[13] arXiv:1605.07809 [pdf, other]: Title: Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis

Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen

Comments: Accepted for presentation in ISCA workshop SSW9

Journal-ref: 9th ISCA Speech Synthesis Workshop, 2016, pp.221-228

Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[14] arXiv:1605.08396 [pdf, other]: Title: Robust Downbeat Tracking Using an Ensemble of Convolutional Networks

S. Durand, J. P. Bello, B. David, G. Richard

Subjects: Sound (cs.SD); Neural and Evolutionary Computing (cs.NE)
[15] arXiv:1605.08450 [pdf, other]: Title: The Implementation of Low-cost Urban Acoustic Monitoring Devices

Charlie Mydlarz, Justin Salamon, Juan Pablo Bello

Comments: Accepted into the Journal of Applied Acoustics special issue: Acoustics of Smart Cities. 26 pages, 12 figures

Subjects: Sound (cs.SD)
[16] arXiv:1605.09507 [pdf, other]: Title: Deep convolutional neural networks for predominant instrument recognition in polyphonic music

Yoonchang Han, Jaehun Kim, Kyogu Lee

Comments: 13 pages, 7 figures, accepted for publication in IEEE/ACM Transactions on Audio, Speech, and Language Processing on 16-Nov-2016. This is initial submission version. Fully edited version is available at this http URL

Journal-ref: Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing ( Volume: 25, Issue: 1, Jan. 2017 ) Page(s): 208 - 221

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[17] arXiv:1605.01635 (cross-list from cs.CL) [pdf, other]: Title: The IBM Speaker Recognition System: Recent Advances and Error Analysis

Seyed Omid Sadjadi, Jason Pelecanos, Sriram Ganapathy

Comments: submitted to INTERSPEECH 2016. arXiv admin note: substantial text overlap with arXiv:1602.07291

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Machine Learning (stat.ML)
[18] arXiv:1605.01805 (cross-list from physics.data-an) [pdf, other]: Title: Wave-shape function analysis -- when cepstrum meets time-frequency analysis

Chen-Yun Lin, Li Su, Hau-tieng Wu

Subjects: Data Analysis, Statistics and Probability (physics.data-an); Sound (cs.SD); Numerical Analysis (math.NA)
[19] arXiv:1605.05369 (cross-list from cs.IR) [pdf, other]: Title: Audio Features Affected by Music Expressiveness

Alberto Introini, Giorgio Presti, Giuseppe Boccignone

Comments: Submitted to ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2016), Pisa, Italy, July 17-21, 2016

Subjects: Information Retrieval (cs.IR); Sound (cs.SD)
[20] arXiv:1605.06238 (cross-list from cs.CY) [pdf, other]: Title: A Multi-Smartwatch System for Assessing Speech Characteristics of People with Dysarthria in Group Settings

Harishchandra Dubey, J. Cody Goldberg, Kunal Mankodiya, Leslie Mahler

Comments: 6 page, 9 figure, 1 table, 8 equations, Proceedings e-Health Networking, Applications and Services (Healthcom), 2015 IEEE 17th International Conference on, Boston, USA. 2015

Subjects: Computers and Society (cs.CY); Sound (cs.SD)
[21] arXiv:1605.07733 (cross-list from cs.HC) [pdf, other]: Title: On model architecture for a children's speech recognition interactive dialog system

Radoslava Kraleva, Velin Kralev

Comments: 6 pages, 2 figures, in proc. of conference FMNS 2009, Blagoevgrad, Bulgaria

Journal-ref: Third International Scientific Conference "Mathematics and Natural Sciences", Vol. (1), pp. 106-111, 2009

Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Sound (cs.SD)
[22] arXiv:1605.07735 (cross-list from cs.CL) [pdf, other]: Title: Design and development a children's speech database

Radoslava Kraleva

Comments: 8 pages, 2 figures, 1 table, conference FMNS 2011, Blagoevgrad, Bulgaria

Journal-ref: Fourth International Scientific Conference "Mathematics and Natural Sciences" 2011, Bulgaria, Vol. (2), pp. 41-48

Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Sound (cs.SD)

Total of 22 entries

Showing up to 25 entries per page: fewer | more | all