Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.SD

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Sound

Authors and titles for May 2016

Total of 22 entries
Showing up to 25 entries per page: fewer | more | all
[1] arXiv:1605.00810 [pdf, other]
Title: Diagonal Unloading Beamforming for Source Localization
Daniele Salvati, Carlo Drioli, Gian Luca Foresti
Journal-ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, Volume 25, Issue 3, Pages 609-622 (2018)
Subjects: Sound (cs.SD)
[2] arXiv:1605.01329 [pdf, other]
Title: Single Channel Speech Enhancement Using Outlier Detection
Eunjoon Cho, Bowon Lee, Ronald Schafer, Bernard Widrow
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[3] arXiv:1605.01755 [pdf, other]
Title: DCTNet and PCANet for acoustic signal feature extraction
Yin Xian, Andrew Thompson, Xiaobai Sun, Douglas Nowacek, Loren Nolte
Comments: 22 figures
Subjects: Sound (cs.SD); Machine Learning (cs.LG)
[4] arXiv:1605.02401 [pdf, other]
Title: Audio Event Detection using Weakly Labeled Data
Anurag Kumar, Bhiksha Raj
Comments: ACM Multimedia 2016
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[5] arXiv:1605.02427 [pdf, other]
Title: Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks
Anurag Kumar, Dinei Florencio
Subjects: Sound (cs.SD)
[6] arXiv:1605.03724 [pdf, other]
Title: Sub-vector Extraction and Cascade Post-Processing for Speaker Verification Using MLLR Super-vectors
A. K. Sarkar, C. Barras, V. B. Le, D. Matrouf
Subjects: Sound (cs.SD)
[7] arXiv:1605.06644 [pdf, other]
Title: Deep convolutional networks on the pitch spiral for musical instrument recognition
Vincent Lostanlen, Carmine-Emanuele Cella
Comments: 7 pages, 3 figures. Accepted at the International Society for Music Information Retrieval Conference (ISMIR) conference in New York City, NY, USA, August 2016
Subjects: Sound (cs.SD)
[8] arXiv:1605.07008 [pdf, other]
Title: madmom: a new Python Audio and Music Signal Processing Library
Sebastian Böck, Filip Korzeniowski, Jan Schlüter, Florian Krebs, Gerhard Widmer
Subjects: Sound (cs.SD)
[9] arXiv:1605.07466 [pdf, other]
Title: Complex NMF under phase constraints based on signal modeling: application to audio source separation
Paul Magron, Roland Badeau, Bertrand David
Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2016
Subjects: Sound (cs.SD)
[10] arXiv:1605.07467 [pdf, other]
Title: Phase reconstruction of spectrograms with linear unwrapping: application to audio signal restoration
Paul Magron, Roland Badeau, Bertrand David
Comments: European Signal Processing Conference (EUSIPCO) 2015
Subjects: Sound (cs.SD)
[11] arXiv:1605.07468 [pdf, other]
Title: Phase reconstruction of spectrograms based on a model of repeated audio events
Paul Magron, Roland Badeau, Bertrand David
Comments: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2015
Subjects: Sound (cs.SD)
[12] arXiv:1605.07469 [pdf, other]
Title: Phase recovery in NMF for audio source separation: an insightful benchmark
Paul Magron, Roland Badeau, Bertrand David
Comments: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2015
Subjects: Sound (cs.SD)
[13] arXiv:1605.07809 [pdf, other]
Title: Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis
Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen
Comments: Accepted for presentation in ISCA workshop SSW9
Journal-ref: 9th ISCA Speech Synthesis Workshop, 2016, pp.221-228
Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[14] arXiv:1605.08396 [pdf, other]
Title: Robust Downbeat Tracking Using an Ensemble of Convolutional Networks
S. Durand, J. P. Bello, B. David, G. Richard
Subjects: Sound (cs.SD); Neural and Evolutionary Computing (cs.NE)
[15] arXiv:1605.08450 [pdf, other]
Title: The Implementation of Low-cost Urban Acoustic Monitoring Devices
Charlie Mydlarz, Justin Salamon, Juan Pablo Bello
Comments: Accepted into the Journal of Applied Acoustics special issue: Acoustics of Smart Cities. 26 pages, 12 figures
Subjects: Sound (cs.SD)
[16] arXiv:1605.09507 [pdf, other]
Title: Deep convolutional neural networks for predominant instrument recognition in polyphonic music
Yoonchang Han, Jaehun Kim, Kyogu Lee
Comments: 13 pages, 7 figures, accepted for publication in IEEE/ACM Transactions on Audio, Speech, and Language Processing on 16-Nov-2016. This is initial submission version. Fully edited version is available at this http URL
Journal-ref: Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing ( Volume: 25, Issue: 1, Jan. 2017 ) Page(s): 208 - 221
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[17] arXiv:1605.01635 (cross-list from cs.CL) [pdf, other]
Title: The IBM Speaker Recognition System: Recent Advances and Error Analysis
Seyed Omid Sadjadi, Jason Pelecanos, Sriram Ganapathy
Comments: submitted to INTERSPEECH 2016. arXiv admin note: substantial text overlap with arXiv:1602.07291
Subjects: Computation and Language (cs.CL); Sound (cs.SD); Machine Learning (stat.ML)
[18] arXiv:1605.01805 (cross-list from physics.data-an) [pdf, other]
Title: Wave-shape function analysis -- when cepstrum meets time-frequency analysis
Chen-Yun Lin, Li Su, Hau-tieng Wu
Subjects: Data Analysis, Statistics and Probability (physics.data-an); Sound (cs.SD); Numerical Analysis (math.NA)
[19] arXiv:1605.05369 (cross-list from cs.IR) [pdf, other]
Title: Audio Features Affected by Music Expressiveness
Alberto Introini, Giorgio Presti, Giuseppe Boccignone
Comments: Submitted to ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2016), Pisa, Italy, July 17-21, 2016
Subjects: Information Retrieval (cs.IR); Sound (cs.SD)
[20] arXiv:1605.06238 (cross-list from cs.CY) [pdf, other]
Title: A Multi-Smartwatch System for Assessing Speech Characteristics of People with Dysarthria in Group Settings
Harishchandra Dubey, J. Cody Goldberg, Kunal Mankodiya, Leslie Mahler
Comments: 6 page, 9 figure, 1 table, 8 equations, Proceedings e-Health Networking, Applications and Services (Healthcom), 2015 IEEE 17th International Conference on, Boston, USA. 2015
Subjects: Computers and Society (cs.CY); Sound (cs.SD)
[21] arXiv:1605.07733 (cross-list from cs.HC) [pdf, other]
Title: On model architecture for a children's speech recognition interactive dialog system
Radoslava Kraleva, Velin Kralev
Comments: 6 pages, 2 figures, in proc. of conference FMNS 2009, Blagoevgrad, Bulgaria
Journal-ref: Third International Scientific Conference "Mathematics and Natural Sciences", Vol. (1), pp. 106-111, 2009
Subjects: Human-Computer Interaction (cs.HC); Computation and Language (cs.CL); Sound (cs.SD)
[22] arXiv:1605.07735 (cross-list from cs.CL) [pdf, other]
Title: Design and development a children's speech database
Radoslava Kraleva
Comments: 8 pages, 2 figures, 1 table, conference FMNS 2011, Blagoevgrad, Bulgaria
Journal-ref: Fourth International Scientific Conference "Mathematics and Natural Sciences" 2011, Bulgaria, Vol. (2), pp. 41-48
Subjects: Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Sound (cs.SD)
Total of 22 entries
Showing up to 25 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack