Multimedia

Authors and titles for February 2020

Total of 39 entries (entries 1-25 listed here)

[1] arXiv:2002.00251 [pdf, other]: Title: Multi-Modal Music Information Retrieval: Augmenting Audio-Analysis with Visual Computing for Improved Music Video Analysis

Alexander Schindler

Comments: Dissertation at TU Wien

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[2] arXiv:2002.01425 [pdf, other]: Title: Spatially Variant Laplacian Pyramids for Multi-Frame Exposure Fusion

Anmol Biswas, Green Rosh K S, Sachin Deepak Lomte

Subjects: Multimedia (cs.MM)
[3] arXiv:2002.02370 [pdf, other]: Title: Data hiding in speech signal using steganography and encryption

Hanisha Chowdary N, Karan K, Bharath K P, Rajesh Kumar M

Subjects: Multimedia (cs.MM)
[4] arXiv:2002.03156 [pdf, other]: Title: A Time-Frequency Perspective on Audio Watermarking

Haijian Zhang

Subjects: Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[5] arXiv:2002.09607 [pdf, other]: Title: Multi-Representation Knowledge Distillation For Audio Classification

Liang Gao, Kele Xu, Huaimin Wang, Yuxing Peng

Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[6] arXiv:2002.10651 [pdf, other]: Title: A Comparative Evaluation of Temporal Pooling Methods for Blind Video Quality Assessment

Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[7] arXiv:2002.11088 [pdf, other]: Title: Model Watermarking for Image Processing Networks

Jie Zhang, Dongdong Chen, Jing Liao, Han Fang, Weiming Zhang, Wenbo Zhou, Hao Cui, Nenghai Yu

Comments: AAAI 2020

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[8] arXiv:2002.12275 [pdf, other]: Title: Subjective Quality Assessment for YouTube UGC Dataset

Joong Gon Yim, Yilin Wang, Neil Birkbeck, Balu Adsumilli

Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[9] arXiv:2002.01553 (cross-list from cs.NI) [pdf, other]: Title: EdgeDASH: Exploiting Network-Assisted Adaptive Video Streaming for Edge Caching

Suzan Bayhan, Setareh Maghsudi, Anatolij Zubow

Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM)
[10] arXiv:2002.02609 (cross-list from cs.CV) [pdf, other]: Title: Image Fine-grained Inpainting

Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[11] arXiv:2002.02927 (cross-list from cs.CV) [pdf, other]: Title: SPN-CNN: Boosting Sensor-Based Source Camera Attribution With Deep Learning

Matthias Kirchner, Cameron Johnson

Comments: Presented at the IEEE International Workshop on Information Forensics and Security (WIFS) 2019

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[12] arXiv:2002.03322 (cross-list from cs.CV) [pdf, other]: Title: VIFB: A Visible and Infrared Image Fusion Benchmark

Xingchen Zhang, Ping Ye, Gang Xiao

Comments: 11 pages, 5 figures, 5 tables. Accepted to CVPRW2020. Compared to the CVPRW2020 version, this version corrects minor mistakes in Table 4 and the first paragraph of Section 4.2

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[13] arXiv:2002.03557 (cross-list from cs.CV) [pdf, other]: Title: Multitask Emotion Recognition with Incomplete Labels

Didan Deng, Zhaokang Chen, Bertram E. Shi

Comments: Accepted by FG2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[14] arXiv:2002.03773 (cross-list from cs.CV) [pdf, other]: Title: Deriving Emotions and Sentiments from Visual Content: A Disaster Analysis Use Case

Kashif Ahmad, Syed Zohaib, Nicola Conci, Ala Al-Fuqaha

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[15] arXiv:2002.03977 (cross-list from eess.AS) [pdf, other]: Title: Multimodal active speaker detection and virtual cinematography for video conferencing

Ross Cutler, Ramin Mehran, Sam Johnson, Cha Zhang, Adam Kirk, Oliver Whyte, Adarsh Kowdle

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Machine Learning (stat.ML)
[16] arXiv:2002.04537 (cross-list from eess.IV) [pdf, other]: Title: 3D Point Cloud Enhancement using Graph-Modelled Multiview Depth Measurements

Xue Zhang, Gene Cheung, Jiahao Pang, Dong Tian

Comments: 5 figures

Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[17] arXiv:2002.04780 (cross-list from cs.CV) [pdf, other]: Title: MFFW: A new dataset for multi-focus image fusion

Shuang Xu, Xiaoli Wei, Chunxia Zhang, Junmin Liu, Jiangshe Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[18] arXiv:2002.05070 (cross-list from cs.CV) [pdf, other]: Title: AlignNet: A Unifying Approach to Audio-Visual Alignment

Jianren Wang, Zhaoyuan Fang, Hang Zhao

Comments: WACV2020. Project video and code are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[19] arXiv:2002.05305 (cross-list from cs.HC) [pdf, other]: Title: Interactive Multi-User 3D Visual Analytics in Augmented Reality

Wanze Xie, Yining Liang, Janet Johnson, Andrea Mower, Samuel Burns, Colleen Chelini, Paul D Alessandro, Nadir Weibel, Jürgen P. Schulze

Comments: In Proceedings of IS&T The Engineering Reality of Virtual Reality 2020

Journal-ref: Electronic Imaging, The Engineering Reality of Virtual Reality 2020, pp. 363-1-363-6(6)

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[20] arXiv:2002.05314 (cross-list from eess.AS) [pdf, other]: Title: Self-supervised learning for audio-visual speaker diarization

Yifan Ding, Yong Xu, Shi-Xiong Zhang, Yahuan Cong, Liqiang Wang

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Machine Learning (stat.ML)
[21] arXiv:2002.05604 (cross-list from eess.AS) [pdf, other]: Title: Efficient And Scalable Neural Residual Waveform Coding With Collaborative Quantization

Kai Zhen, Mi Suk Lee, Jongmo Sung, Seungkwon Beack, Minje Kim

Comments: Accepted in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Barcelona, Spain, May 4-8, 2020

Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD); Signal Processing (eess.SP)
[22] arXiv:2002.05639 (cross-list from cs.CL) [pdf, other]: Title: Looking Enhances Listening: Recovering Missing Speech Using Images

Tejas Srinivasan, Ramon Sanabria, Florian Metze

Comments: Accepted to ICASSP 2020

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[23] arXiv:2002.06652 (cross-list from cs.CL) [pdf, other]: Title: SBERT-WK: A Sentence Embedding Method by Dissecting BERT-based Word Models

Bin Wang, C.-C. Jay Kuo

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[24] arXiv:2002.06794 (cross-list from cs.CR) [pdf, other]: Title: Computing in Covert Domain Using Data Hiding

Zhenxing Qian, Zichi Wang, Xinpeng Zhang

Comments: 5 pages, 7 figures

Subjects: Cryptography and Security (cs.CR); Multimedia (cs.MM)
[25] arXiv:2002.06817 (cross-list from cs.SD) [pdf, other]: Title: Addressing the confounds of accompaniments in singer identification

Tsung-Han Hsieh, Kai-Hsiang Cheng, Zhe-Cheng Fan, Yu-Ching Yang, Yi-Hsuan Yang

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
