Multimedia

Authors and titles for January 2020

Total of 34 entries
[1] arXiv:2001.01403 [pdf, other]
Title: Joint Communication and Computational Resource Allocation for QoE-driven Point Cloud Video Streaming
Jie Li, Cong Zhang, Zhi Liu, Wei Sun, Qiyue Li
Subjects: Multimedia (cs.MM)
[2] arXiv:2001.02002 [pdf, other]
Title: SUR-FeatNet: Predicting the Satisfied User Ratio Curve for Image Compression with Deep Feature Learning
Hanhe Lin, Vlad Hosu, Chunling Fan, Yun Zhang, Yuchen Mu, Raouf Hamzaoui, Dietmar Saupe
Journal-ref: Quality and User Experience (2020) 5:5
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[3] arXiv:2001.02653 [pdf, other]
Title: Natural Steganography in JPEG Domain with a Linear Development Pipeline
Théo Taburet, Patrick Bas, Wadih Sawaya, Jessica Fridrich
Comments: 13 pages, 14 figures
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[4] arXiv:2001.03251 [pdf, other]
Title: Adaptive Control of Embedding Strength in Image Watermarking using Neural Networks
Mahnoosh Bagheri, Majid Mohrekesh, Nader Karimi, Shadrokh Samavi
Comments: 4 pages, 5 figures
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[5] arXiv:2001.03536 [pdf, other]
Title: QoE-driven Coupled Uplink and Downlink Rate Adaptation for 360-degree Video Live Streaming
Jie Li, Ransheng Feng, Zhi Liu, Wei Sun, Qiyue Li
Subjects: Multimedia (cs.MM)
[6] arXiv:2001.03542 [pdf, other]
Title: Exploratory Study on User's Dynamic Visual Acuity and Quality Perception of Impaired Images
Jolien De Letter, Anissa All, Lieven De Marez, Vasileios Avramelos, Peter Lambert, Glenn Van Wallendael
Comments: 6 pages, 5 figures
Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[7] arXiv:2001.04580 [pdf, other]
Title: Distortion Agnostic Deep Watermarking
Xiyang Luo, Ruohan Zhan, Huiwen Chang, Feng Yang, Peyman Milanfar
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[8] arXiv:2001.06466 [pdf, other]
Title: Low-latency Cloud-based Volumetric Video Streaming Using Head Motion Prediction
Serhan Gül, Dimitri Podborski, Thomas Buchholz, Thomas Schierl, Cornelius Hellge
Comments: 7 pages, 4 figures
Journal-ref: 30th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV) 2020
Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[9] arXiv:2001.07494 [pdf, other]
Title: Evaluation of a course mediatised with Xerte
Ghalia Merzougui, Roumaissa Dehkal, Maheiddine Djoudi (TECHNÉ - EA 6316)
Journal-ref: International Conference of Computing for Engineering and Sciences (ICCES'2015), Jul 2015, Istanbul, Turkey
Subjects: Multimedia (cs.MM)
[10] arXiv:2001.07886 [pdf, other]
Title: AMP: Authentication of Media via Provenance
Paul England, Henrique S. Malvar, Eric Horvitz, Jack W. Stokes, Cédric Fournet, Rebecca Burke-Aguero, Amaury Chamayou, Sylvan Clebsch, Manuel Costa, John Deutscher, Shabnam Erfani, Matt Gaylor, Andrew Jenks, Kevin Kane, Elissa Redmiles, Alex Shamis, Isha Sharma, Sam Wenker, Anika Zaman
Comments: Add detailed manifest description, Add provenance, Improve text
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR); Systems and Control (eess.SY)
[11] arXiv:2001.10590 [pdf, other]
Title: An Effective Automatic Image Annotation Model Via Attention Model and Data Equilibrium
Amir Vatani, Milad Taleby Ahvanooey, Mostafa Rahimi
Comments: 9 pages, 3 figures
Journal-ref: Int. J. Adv. Comput. Sci. Appl, 9(3), pp.269-277 (2018)
Subjects: Multimedia (cs.MM); Computation and Language (cs.CL)
[12] arXiv:2001.11406 [pdf, other]
Title: NAViDAd: A No-Reference Audio-Visual Quality Metric Based on a Deep Autoencoder
Helard Martinez, M. C. Farias, A. Hines
Comments: 5 pages
Journal-ref: 2019 27th European Signal Processing Conference (EUSIPCO), IEEE, 2019, pp 1-5
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[13] arXiv:2001.00001 (cross-list from math.HO) [pdf, other]
Title: Quantum GestART: Identifying and Applying Correlations between Mathematics, Art, and Perceptual Organization
Maria Mannone, Federico Favali, Balandino Di Donato, Luca Turchet
Comments: Accepted for publication, Journal of Mathematics and Music. New references added in this version
Journal-ref: Journal of Mathematics and Music, 2020
Subjects: History and Overview (math.HO); Multimedia (cs.MM)
[14] arXiv:2001.00179 (cross-list from cs.CV) [pdf, other]
Title: DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection
Ruben Tolosana, Ruben Vera-Rodriguez, Julian Fierrez, Aythami Morales, Javier Ortega-Garcia
Journal-ref: Information Fusion, 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[15] arXiv:2001.00847 (cross-list from cs.IT) [pdf, other]
Title: Biometric and Physical Identifiers with Correlated Noise for Controllable Private Authentication
Onur Günlü, Rafael F. Schaefer, H. Vincent Poor
Comments: Shorter version to appear in the IEEE International Symposium on Information Theory 2020
Subjects: Information Theory (cs.IT); Cryptography and Security (cs.CR); Multimedia (cs.MM); Signal Processing (eess.SP); Probability (math.PR)
[16] arXiv:2001.01720 (cross-list from cs.SD) [pdf, other]
Title: Modeling Musical Structure with Artificial Neural Networks
Stefan Lattner
Comments: 152 pages, 28 figures, 10 tables. PhD thesis, Johannes Kepler University Linz, October 2019. Includes results from this https URL, arXiv:1612.04742, arXiv:1708.05325, arXiv:1806.08236, and arXiv:1806.08686 (see Section 1.2 for detailed information)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[17] arXiv:2001.03353 (cross-list from cs.SI) [pdf, other]
Title: Measuring Similarity between Brands using Followers' Post in Social Media
Yiwei Zhang, Xueting Wang, Yoshiaki Sakai, Toshihiko Yamasaki
Comments: Accepted to ACM Multimedia Asia 2019
Subjects: Social and Information Networks (cs.SI); Multimedia (cs.MM)
[18] arXiv:2001.04316 (cross-list from eess.AS) [pdf, other]
Title: Visually Guided Self Supervised Learning of Speech Representations
Abhinav Shukla, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic
Comments: Accepted at ICASSP 2020. v2: Updated to the ICASSP 2020 camera-ready version
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[19] arXiv:2001.04463 (cross-list from cs.CV) [pdf, other]
Title: Unsupervised Audiovisual Synthesis via Exemplar Autoencoders
Kangle Deng, Aayush Bansal, Deva Ramanan
Comments: ICLR 2021; Project page -- this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[20] arXiv:2001.04883 (cross-list from cs.HC) [pdf, other]
Title: Disseminating Research News in HCI: Perceived Hazards, How-To's, and Opportunities for Innovation
C. Estelle Smith, Eduardo Nevarez, Haiyi Zhu
Comments: 10 pages, 2 figures, accepted paper to CHI 2020 conference
Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[21] arXiv:2001.05200 (cross-list from cs.CV) [pdf, other]
Title: Evaluating image matching methods for book cover identification
Rabie Hachemi, Ikram Achar, Biasi Wiga, Mahfoud Sidi Ali Mebarek
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[22] arXiv:2001.05201 (cross-list from cs.CV) [pdf, other]
Title: Everybody's Talkin': Let Me Talk as You Want
Linsen Song, Wayne Wu, Chen Qian, Ran He, Chen Change Loy
Comments: Technical report. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[23] arXiv:2001.05864 (cross-list from cs.CV) [pdf, other]
Title: Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning
Yiyan Chen, Li Tao, Xueting Wang, Toshihiko Yamasaki
Comments: Accepted to ACM Multimedia Asia 2019
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[24] arXiv:2001.05970 (cross-list from cs.SI) [pdf, other]
Title: #MeToo on Campus: Studying College Sexual Assault at Scale Using Data Reported on Social Media
Viet Duong, Phu Pham, Ritwik Bose, Jiebo Luo
Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[25] arXiv:2001.06765 (cross-list from cs.IR) [pdf, other]
Title: Information Foraging for Enhancing Implicit Feedback in Content-based Image Recommendation
Amit Kumar Jaiswal, Haiming Liu, Ingo Frommholz
Comments: FIRE '19: Proceedings of the 11th Forum for Information Retrieval Evaluation
Subjects: Information Retrieval (cs.IR); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[26] arXiv:2001.06888 (cross-list from cs.CL) [pdf, other]
Title: A multimodal deep learning approach for named entity recognition from social media
Meysam Asgari-Chenaghlu, M.Reza Feizi-Derakhshi, Leili Farzinvash, M. A. Balafar, Cina Motamed
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM); Social and Information Networks (cs.SI)
[27] arXiv:2001.07194 (cross-list from cs.CL) [pdf, other]
Title: Recommending Themes for Ad Creative Design via Visual-Linguistic Representations
Yichao Zhou, Shaunak Mishra, Manisha Verma, Narayan Bhamidipati, Wei Wang
Comments: 7 pages, 8 figures, 2 tables, accepted by The Web Conference 2020
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[28] arXiv:2001.07295 (cross-list from cs.AI) [pdf, other]
Title: AutoMATES: Automated Model Assembly from Text, Equations, and Software
Adarsh Pyarelal, Marco A. Valenzuela-Escarcega, Rebecca Sharp, Paul D. Hein, Jon Stephens, Pratik Bhandari, HeuiChan Lim, Saumya Debray, Clayton T. Morrison
Comments: 8 pages, 6 figures, accepted to Modeling the World's Systems 2019
Subjects: Artificial Intelligence (cs.AI); Multimedia (cs.MM); Software Engineering (cs.SE)
[29] arXiv:2001.08730 (cross-list from cs.CV) [pdf, other]
Title: Robust Explanations for Visual Question Answering
Badri N. Patro, Shivansh Pate, Vinay P. Namboodiri
Comments: WACV-2020 (Accepted)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[30] arXiv:2001.08779 (cross-list from cs.CV) [pdf, other]
Title: Deep Bayesian Network for Visual Question Generation
Badri N. Patro, Vinod K. Kurmi, Sandeep Kumar, Vinay P. Namboodiri
Comments: WACV-2020 (Accepted)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[31] arXiv:2001.09545 (cross-list from cs.NE) [pdf, other]
Title: aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption
Chiranjib Sur
Comments: 11 pages
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[32] arXiv:2001.10190 (cross-list from cs.SD) [pdf, other]
Title: Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform
Tomohiko Nakamura, Hiroshi Saruwatari
Comments: 5 pages, to appear in IEEE International Conference on Acoustics, Speech, and Signal Processing 2020 (ICASSP 2020)
Journal-ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[33] arXiv:2001.10832 (cross-list from eess.AS) [pdf, other]
Title: Audio-Visual Decision Fusion for WFST-based and seq2seq Models
Rohith Aralikatti, Sharad Roy, Abhinav Thanda, Dilip Kumar Margam, Pujitha Appan Kandala, Tanay Sharma, Shankar M Venkatesan
Comments: Submitted for review to ICASSP 2020 on October 21st, 2019
Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Image and Video Processing (eess.IV)
[34] arXiv:2001.11847 (cross-list from cs.NE) [pdf, other]
Title: CNN-based fast source device identification
Sara Mandelli, Davide Cozzolino, Paolo Bestagini, Luisa Verdoliva, Stefano Tubaro
Subjects: Neural and Evolutionary Computing (cs.NE); Multimedia (cs.MM)