Multimedia

Authors and titles for March 2020

Total of 46 entries
[1] arXiv:2003.01958 [pdf, other]
Title: ASMD: an automatic framework for compiling multimodal datasets with audio and scores
Federico Simonetta, Stavros Ntalampiras, Federico Avanzini
Comments: Accepted at the Sound and Music Computing Conference 2020
Subjects: Multimedia (cs.MM); Digital Libraries (cs.DL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[2] arXiv:2003.02526 [pdf, other]
Title: Cloud Rendering-based Volumetric Video Streaming System for Mixed Reality Services
Serhan Gül, Dimitri Podborski, Jangwoo Son, Gurdeep Singh Bhullar, Thomas Buchholz, Thomas Schierl, Cornelius Hellge
Comments: 4 pages, 2 figures
Journal-ref: 11th ACM Multimedia Systems Conference (MMSys) 2020
Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[3] arXiv:2003.03092 [pdf, other]
Title: Soft Video Multicasting Using Adaptive Compressed Sensing
Hadi Hadizadeh, Ivan V. Bajic
Subjects: Multimedia (cs.MM); Signal Processing (eess.SP)
[4] arXiv:2003.05096 [pdf, other]
Title: Exploring the Role of Visual Content in Fake News Detection
Juan Cao, Peng Qi, Qiang Sheng, Tianyun Yang, Junbo Guo, Jintao Li
Comments: This is a preprint of a chapter published in Disinformation, Misinformation, and Fake News in Social Media: Emerging Research Challenges and Opportunities, edited by Kai, S., Suhang, W., Dongwon, L., Huan, L., 2020, Springer, reproduced with permission of Springer Nature Switzerland AG. The final authenticated version is available online at: this https URL. arXiv admin note: text overlap with arXiv:2001.00623, arXiv:1808.06686, arXiv:1903.00788 by other authors
Journal-ref: Disinformation, Misinformation, and Fake News in Social Media. 2020
Subjects: Multimedia (cs.MM); Social and Information Networks (cs.SI)
[5] arXiv:2003.07505 [pdf, other]
Title: Hide Secret Information in Blocks: Minimum Distortion Embedding
Md Amiruzzaman, Rizal Mohd Nor
Comments: This paper is accepted for publication in IEEE SPIN 2020 conference
Journal-ref: 2020 7th International Conference on Signal Processing and Integrated Networks (SPIN)
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[6] arXiv:2003.07583 [pdf, other]
Title: Reinforcement Learning Driven Adaptive VR Streaming with Optical Flow Based QoE
Wei Quan, Yuxuan Pan, Bin Xiang, Lin Zhang
Subjects: Multimedia (cs.MM)
[7] arXiv:2003.08473 [pdf, other]
Title: Viewport-Aware Deep Reinforcement Learning Approach for 360$^\circ$ Video Caching
Pantelis Maniotis, Nikolaos Thomos
Subjects: Multimedia (cs.MM); Information Theory (cs.IT); Machine Learning (cs.LG); Networking and Internet Architecture (cs.NI)
[8] arXiv:2003.08574 [pdf, other]
Title: Convolutional Neural Networks for Continuous QoE Prediction in Video Streaming Services
Tho Nguyen Duc, Chanh Minh Tran, Phan Xuan Tan, Eiji Kamioka
Journal-ref: IEEE Access 2020
Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[9] arXiv:2003.08619 [pdf, other]
Title: FAURAS: A Proxy-based Framework for Ensuring the Fairness of Adaptive Video Streaming over HTTP/2 Server Push
Chanh Minh Tran, Tho Nguyen Duc, Phan Xuan Tan, Eiji Kamioka
Journal-ref: Appl. Sci. 2020, 10, 2485
Subjects: Multimedia (cs.MM)
[10] arXiv:2003.08865 [pdf, other]
Title: DRST: Deep Residual Shearlet Transform for Densely Sampled Light Field Reconstruction
Yuan Gao, Robert Bregovic, Reinhard Koch, Atanas Gotchev
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[11] arXiv:2003.09249 [pdf, other]
Title: Continuous QoE Prediction Based on WaveNet
Phan Xuan Tan, Tho Nguyen Duc, Chanh Minh Tran, Eiji Kamioka
Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[12] arXiv:2003.09580 [pdf, other]
Title: Edge-assisted Viewport Adaptive Scheme for real-time Omnidirectional Video transmission
Tao Guo, Xikang Jiang, Bin Xiang, Lin Zhang
Subjects: Multimedia (cs.MM)
[13] arXiv:2003.10082 [pdf, other]
Title: JPEG Steganography and Synchronization of DCT Coefficients for a Given Development Pipeline
Théo Taburet, Patrick Bas, Wadih Sawaya, Remi Cogranne
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[14] arXiv:2003.10546 [pdf, other]
Title: Forensic Analysis of Residual Information in Adobe PDF Files
Hyunji Chung, Jungheum Park, Sangjin Lee
Comments: 11 figures, 1 table
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[15] arXiv:2003.10820 [pdf, other]
Title: FacebookVideoLive18: A Live Video Streaming Dataset for Streams Metadata and Online Viewers Locations
Emna Baccour, Aiman Erbad, Kashif Bilal, Amr Mohamed, Mohsen Guizani, Mounir Hamdi
Comments: Manuscript accepted in ICIOT 2020
Journal-ref: ICIOT 2020
Subjects: Multimedia (cs.MM)
[16] arXiv:2003.11100 [pdf, other]
Title: How deep is your encoder: an analysis of features descriptors for an autoencoder-based audio-visual quality metric
Helard Martinez, Andrew Hines, Mylene C. Q. Farias
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[17] arXiv:2003.11300 [pdf, other]
Title: Impact of the Number of Votes on the Reliability and Validity of Subjective Speech Quality Assessment in the Crowdsourcing Approach
Babak Naderi, Tobias Hossfeld, Matthias Hirth, Florian Metzger, Sebastian Möller, Rafael Zequeira Jiménez
Comments: This paper has been accepted for publication in the 2020 Twelfth International Conference on Quality of Multimedia Experience (QoMEX)
Subjects: Multimedia (cs.MM)
[18] arXiv:2003.12265 [pdf, other]
Title: Unsupervised Cross-Modal Audio Representation Learning from Unstructured Multilingual Text
Alexander Schindler, Sergiu Gordea, Peter Knees
Comments: This is the long version of our SAC2020 poster presentation
Journal-ref: In Proceedings of the 35th ACM/SIGAPP Symposium On Applied Computing (SAC2020), March 30-April 3, 2020, Brno, Czech Republic
Subjects: Multimedia (cs.MM); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[19] arXiv:2003.12428 [pdf, other]
Title: A General Approach for Using Deep Neural Network for Digital Watermarking
Yurui Ming, Weiping Ding, Zehong Cao, Chin-Teng Lin
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Machine Learning (stat.ML)
[20] arXiv:2003.12742 [pdf, other]
Title: From QoS Distributions to QoE Distributions: a System's Perspective
Tobias Hossfeld, Poul E. Heegaard, Martin Varela, Lea Skorin-Kapov, Markus Fiedler
Comments: 4th International Workshop on Quality of Experience Management (QoE Management 2020), featured by IEEE Conference on Network Softwarization (IEEE NetSoft 2020), Ghent, Belgium
Subjects: Multimedia (cs.MM); Networking and Internet Architecture (cs.NI)
[21] arXiv:2003.13217 [pdf, other]
Title: Deep Residual Neural Networks for Image in Speech Steganography
Shivam Agarwal, Siddarth Venkatraman
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[22] arXiv:2003.13684 [pdf, other]
Title: Social-Sensor Composition for Tapestry Scenes
Tooba Aamir, Hai Dong, Athman Bouguettaya
Comments: 15 pages. IEEE Transactions on Services Computing
Subjects: Multimedia (cs.MM); Information Retrieval (cs.IR); Social and Information Networks (cs.SI)
[23] arXiv:2003.00414 (cross-list from cs.SD) [pdf, other]
Title: Harmonics Based Representation in Clarinet Tone Quality Evaluation
Yixin Wang, Xiaohong Guan, Youtian Du, Nan Nan
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[24] arXiv:2003.00418 (cross-list from cs.CV) [pdf, other]
Title: Towards Automatic Face-to-Face Translation
Prajwal K R, Rudrabha Mukhopadhyay, Jerin Philip, Abhishek Jha, Vinay Namboodiri, C.V. Jawahar
Comments: 9 pages (including references), 5 figures, Published in ACM Multimedia, 2019
Journal-ref: MM '19: Proceedings of the 27th ACM International Conference on Multimedia; October 2019; Pages 1428-1436
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)
[25] arXiv:2003.00451 (cross-list from eess.IV) [pdf, other]
Title: Weak Texture Information Map Guided Image Super-resolution with Deep Residual Networks
Bo Fu, Liyan Wang, Yuechu Wu, Yufeng Wu, Shilin Fu, Yonggong Ren
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[26] arXiv:2003.00832 (cross-list from cs.CV) [pdf, other]
Title: An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
Sicheng Zhao, Yunsheng Ma, Yang Gu, Jufeng Yang, Tengfei Xing, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer
Comments: Accepted by AAAI 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[27] arXiv:2003.01299 (cross-list from eess.IV) [pdf, other]
Title: A multiple attributes image quality database for smartphone camera photo quality assessment
Wenhan Zhu, Guangtao Zhai, Zongxi Han, Xiongkuo Min, Tao Wang, Zicheng Zhang, Xiaokang Yang
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[28] arXiv:2003.01866 (cross-list from cs.CV) [pdf, other]
Title: Region adaptive graph fourier transform for 3d point clouds
Eduardo Pavez, Benjamin Girault, Antonio Ortega, Philip A. Chou
Comments: 5 pages, 3 figures, accepted ICIP 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Signal Processing (eess.SP)
[29] arXiv:2003.03320 (cross-list from cs.LG) [pdf, other]
Title: Trends and Advancements in Deep Neural Network Communication
Felix Sattler, Thomas Wiegand, Wojciech Samek
Subjects: Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Multimedia (cs.MM); Networking and Internet Architecture (cs.NI); Machine Learning (stat.ML)
[30] arXiv:2003.03703 (cross-list from cs.CV) [pdf, other]
Title: Transferring Cross-domain Knowledge for Video Sign Language Recognition
Dongxu Li, Xin Yu, Chenchen Xu, Lars Petersson, Hongdong Li
Comments: CVPR2020 (oral) preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[31] arXiv:2003.03955 (cross-list from cs.CV) [pdf, other]
Title: Cross-Modal Food Retrieval: Learning a Joint Embedding of Food Images and Recipes with Semantic Consistency and Attention Mechanism
Hao Wang, Doyen Sahoo, Chenghao Liu, Ke Shu, Palakorn Achananuparp, Ee-peng Lim, Steven C. H. Hoi
Comments: IEEE Transactions on Multimedia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[32] arXiv:2003.04169 (cross-list from cs.CV) [pdf, other]
Title: I-ViSE: Interactive Video Surveillance as an Edge Service using Unsupervised Feature Queries
Seyed Yahya Nikouei, Yu Chen, Alexander Aved, Erik Blasch
Comments: R1 is under review by the IEEE Internet of Things Journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[33] arXiv:2003.04210 (cross-list from cs.CV) [pdf, other]
Title: Semantic Object Prediction and Spatial Sound Super-Resolution with Binaural Sounds
Arun Balajee Vasudevan, Dengxin Dai, Luc Van Gool
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[34] arXiv:2003.04358 (cross-list from cs.CV) [pdf, other]
Title: Cross modal video representations for weakly supervised active speaker localization
Rahul Sharma, Krishna Somandepalli, Shrikanth Narayanan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[35] arXiv:2003.04679 (cross-list from cs.CL) [pdf, other]
Title: Learning to Respond with Stickers: A Framework of Unifying Multi-Modality in Multi-Turn Dialog
Shen Gao, Xiuying Chen, Chang Liu, Li Liu, Dongyan Zhao, Rui Yan
Comments: Accepted by The Web Conference 2020 (WWW 2020). Equal contribution from first two authors. Dataset and code are released at this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[36] arXiv:2003.06315 (cross-list from eess.IV) [pdf, other]
Title: Estimation of Rate Control Parameters for Video Coding Using CNN
Maria Santamaria, Ebroul Izquierdo, Saverio Blasi, Marta Mrak
Comments: 5 pages, 5 figures, 4 tables
Journal-ref: IEEE International Conference on Visual Communications and Image Processing (VCIP 2018), Taichung, Taiwan, 9-12 December 2018
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Multimedia (cs.MM); Machine Learning (stat.ML)
[37] arXiv:2003.06576 (cross-list from cs.CV) [pdf, other]
Title: Counterfactual Samples Synthesizing for Robust Visual Question Answering
Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, Yueting Zhuang
Comments: Appear in CVPR 2020; Codes in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[38] arXiv:2003.07544 (cross-list from eess.AS) [pdf, other]
Title: Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method
Cunhang Fan, Jianhua Tao, Bin Liu, Jiangyan Yi, Zhengqi Wen, Xuefei Liu
Comments: ACCEPTED by IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD)
[39] arXiv:2003.07694 (cross-list from cs.CV) [pdf, other]
Title: Parameter-Free Style Projection for Arbitrary Style Transfer
Siyu Huang, Haoyi Xiong, Tianyang Wang, Bihan Wen, Qingzhong Wang, Zeyu Chen, Jun Huan, Dejing Dou
Comments: ICASSP 2022. Project page this https URL and Code this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[40] arXiv:2003.08355 (cross-list from eess.IV) [pdf, other]
Title: Dynamic Point Cloud Denoising via Manifold-to-Manifold Distance
Wei Hu, Qianjiang Hu, Zehua Wang, Xiang Gao
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[41] arXiv:2003.08769 (cross-list from cs.CV) [pdf, other]
Title: Personalized Taste and Cuisine Preference Modeling via Images
Nitish Nag, Bindu Rajanna, Ramesh Jain
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Information Retrieval (cs.IR); Multimedia (cs.MM)
[42] arXiv:2003.08897 (cross-list from cs.CV) [pdf, other]
Title: Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Longteng Guo, Jing Liu, Xinxin Zhu, Peng Yao, Shichen Lu, Hanqing Lu
Comments: Accepted by CVPR 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[43] arXiv:2003.09294 (cross-list from eess.SP) [pdf, other]
Title: Self-Supervised Light Field Reconstruction Using Shearlet Transform and Cycle Consistency
Yuan Gao, Robert Bregovic, Atanas Gotchev
Subjects: Signal Processing (eess.SP); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[44] arXiv:2003.10414 (cross-list from cs.SD) [pdf, other]
Title: Multi-channel U-Net for Music Source Separation
Venkatesh S. Kadandale, Juan F. Montesinos, Gloria Haro, Emilia Gómez
Comments: The paper has been accepted at IEEE MMSP2020. Project Page: this https URL
Subjects: Sound (cs.SD); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[45] arXiv:2003.10421 (cross-list from cs.CL) [pdf, other]
Title: Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency
Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Ralph Ewerth
Comments: Accepted for publication in: International Conference on Multimedia Retrieval (ICMR), Dublin, 2020
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR); Multimedia (cs.MM)
[46] arXiv:2003.13669 (cross-list from eess.IV) [pdf, other]
Title: A generalized Hausdorff distance based quality metric for point cloud geometry
Alireza Javaheri, Catarina Brites, Fernando Pereira, Joao Ascenso
Comments: This article is accepted to 12th International Conference on Quality of Multimedia Experience (QoMEX)
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)