Multimedia

Authors and titles for January 2020

Total of 34 entries

Showing up to 50 entries per page: fewer | more | all

[1] arXiv:2001.01403 [pdf, other]: Title: Joint Communication and Computational Resource Allocation for QoE-driven Point Cloud Video Streaming

Jie Li, Cong Zhang, Zhi Liu, Wei Sun, Qiyue Li

Subjects: Multimedia (cs.MM)
[2] arXiv:2001.02002 [pdf, other]: Title: SUR-FeatNet: Predicting the Satisfied User Ratio Curvefor Image Compression with Deep Feature Learning

Hanhe Lin, Vlad Hosu, Chunling Fan, Yun Zhang, Yuchen Mu, Raouf Hamzaoui, Dietmar Saupe

Journal-ref: Quality and User Experience (2020) 5:5

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[3] arXiv:2001.02653 [pdf, other]: Title: Natural Steganography in JPEG Domain with a Linear Development Pipeline

Taburet Théo, Bas Patrick, Sawaya Wadih, Jessica Fridrich

Comments: 13 pages, 14 figures

Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[4] arXiv:2001.03251 [pdf, other]: Title: Adaptive Control of Embedding Strength in Image Watermarking using Neural Networks

Mahnoosh Bagheri, Majid Mohrekesh, Nader Karimi, Shadrokh Samavi

Comments: 4 pages 5 figures

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[5] arXiv:2001.03536 [pdf, other]: Title: QoE-driven Coupled Uplink and Downlink Rate Adaptation for 360-degree Video Live Streaming

Jie Li, Ransheng Feng, Zhi Liu, Wei Sun, Qiyue Li

Subjects: Multimedia (cs.MM)
[6] arXiv:2001.03542 [pdf, other]: Title: Exploratory Study on User's Dynamic Visual Acuity and Quality Perception of Impaired Images

Jolien De Letter, Anissa All, Lieven De Marez, Vasileios Avramelos, Peter Lambert, Glenn Van Wallendael

Comments: 6 pages, 5 figures

Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[7] arXiv:2001.04580 [pdf, other]: Title: Distortion Agnostic Deep Watermarking

Xiyang Luo, Ruohan Zhan, Huiwen Chang, Feng Yang, Peyman Milanfar

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[8] arXiv:2001.06466 [pdf, other]: Title: Low-latency Cloud-based Volumetric Video Streaming Using Head Motion Prediction

Serhan Gül, Dimitri Podborski, Thomas Buchholz, Thomas Schierl, Cornelius Hellge

Comments: 7 pages, 4 figures

Journal-ref: 30th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV) 2020

Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV)
[9] arXiv:2001.07494 [pdf, other]: Title: Evaluation of a course mediatised with Xerte

Ghalia Merzougui, Roumaissa Dehkal, Maheiddine Djoudi (TECHNÉ - EA 6316)

Journal-ref: International Conference of Computing for Engineering and Sciences (ICCES'2015), Jul 2015, Istanbul,, Turkey

Subjects: Multimedia (cs.MM)
[10] arXiv:2001.07886 [pdf, other]: Title: AMP: Authentication of Media via Provenance

Paul England, Henrique S. Malvar, Eric Horvitz, Jack W. Stokes, Cédric Fournet, Rebecca Burke-Aguero, Amaury Chamayou, Sylvan Clebsch, Manuel Costa, John Deutscher, Shabnam Erfani, Matt Gaylor, Andrew Jenks, Kevin Kane, Elissa Redmiles, Alex Shamis, Isha Sharma, Sam Wenker, Anika Zaman

Comments: Add detailed manifest description, Add provenance, Improve text

Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR); Systems and Control (eess.SY)
[11] arXiv:2001.10590 [pdf, other]: Title: An Effective Automatic Image Annotation Model Via Attention Model and Data Equilibrium

Amir Vatani, Milad Taleby Ahvanooey, Mostafa Rahimi

Comments: 9 pages, 3 figures

Journal-ref: Int. J. Adv. Comput. Sci. Appl, 9(3), pp.269-277 (2018)

Subjects: Multimedia (cs.MM); Computation and Language (cs.CL)
[12] arXiv:2001.11406 [pdf, other]: Title: NAViDAd: A No-Reference Audio-Visual Quality Metric Based on a Deep Autoencoder

Helard Martinez, M. C. Farias, A. Hines

Comments: 5 pages

Journal-ref: 2019 27th European Signal Processing Conference (EUSIPCO), IEEE, 2019, pp 1-5

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[13] arXiv:2001.00001 (cross-list from math.HO) [pdf, other]: Title: Quantum GestART: Identifying and Applying Correlations between Mathematics, Art, and Perceptual Organization

Maria Mannone, Federico Favali, Balandino Di Donato, Luca Turchet

Comments: Accepted for publication, Journal of Mathematics and Music. New references added in this version

Journal-ref: Journal of Mathematics and Music, 2020

Subjects: History and Overview (math.HO); Multimedia (cs.MM)
[14] arXiv:2001.00179 (cross-list from cs.CV) [pdf, other]: Title: DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection

Ruben Tolosana, Ruben Vera-Rodriguez, Julian Fierrez, Aythami Morales, Javier Ortega-Garcia

Journal-ref: Information Fusion, 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[15] arXiv:2001.00847 (cross-list from cs.IT) [pdf, other]: Title: Biometric and Physical Identifiers with Correlated Noise for Controllable Private Authentication

Onur Günlü, Rafael F. Schaefer, H. Vincent Poor

Comments: Shorter version to appear in the IEEE International Symposium on Information Theory 2020

Subjects: Information Theory (cs.IT); Cryptography and Security (cs.CR); Multimedia (cs.MM); Signal Processing (eess.SP); Probability (math.PR)
[16] arXiv:2001.01720 (cross-list from cs.SD) [pdf, other]: Title: Modeling Musical Structure with Artificial Neural Networks

Stefan Lattner

Comments: 152 pages, 28 figures, 10 tables. PhD thesis, Johannes Kepler University Linz, October 2019. Includes results from this https URL, arXiv:1612.04742, arXiv:1708.05325, arXiv:1806.08236, and arXiv:1806.08686 (see Section 1.2 for detailed information)

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[17] arXiv:2001.03353 (cross-list from cs.SI) [pdf, other]: Title: Measuring Similarity between Brands using Followers' Post in Social Media

Yiwei Zhang, Xueting Wang, Yoshiaki Sakai, Toshihiko Yamasaki

Comments: Accepted to ACM Multimedia Asia 2019

Subjects: Social and Information Networks (cs.SI); Multimedia (cs.MM)
[18] arXiv:2001.04316 (cross-list from eess.AS) [pdf, other]: Title: Visually Guided Self Supervised Learning of Speech Representations

Abhinav Shukla, Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis, Maja Pantic

Comments: Accepted at ICASSP 2020 v2: Updated to the ICASSP 2020 camera ready version

Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[19] arXiv:2001.04463 (cross-list from cs.CV) [pdf, other]: Title: Unsupervised Audiovisual Synthesis via Exemplar Autoencoders

Kangle Deng, Aayush Bansal, Deva Ramanan

Comments: ICLR 2021; Project page -- this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[20] arXiv:2001.04883 (cross-list from cs.HC) [pdf, other]: Title: Disseminating Research News in HCI: Perceived Hazards, How-To's, and Opportunities for Innovation

C. Estelle Smith, Eduardo Nevarez, Haiyi Zhu

Comments: 10 pages, 2 figures, accepted paper to CHI 2020 conference

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[21] arXiv:2001.05200 (cross-list from cs.CV) [pdf, other]: Title: Evaluating image matching methods for book cover identification

Rabie Hachemi, Ikram Achar, Biasi Wiga, Mahfoud Sidi Ali Mebarek

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[22] arXiv:2001.05201 (cross-list from cs.CV) [pdf, other]: Title: Everybody's Talkin': Let Me Talk as You Want

Linsen Song, Wayne Wu, Chen Qian, Ran He, Chen Change Loy

Comments: Technical report. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[23] arXiv:2001.05864 (cross-list from cs.CV) [pdf, other]: Title: Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning

Yiyan Chen, Li Tao, Xueting Wang, Toshihiko Yamasaki

Comments: mmasia 2019 accepted

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[24] arXiv:2001.05970 (cross-list from cs.SI) [pdf, other]: Title: #MeToo on Campus: Studying College Sexual Assault at Scale Using Data Reported on Social Media

Viet Duong, Phu Pham, Ritwik Bose, Jiebo Luo

Subjects: Social and Information Networks (cs.SI); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Multimedia (cs.MM)
[25] arXiv:2001.06765 (cross-list from cs.IR) [pdf, other]: Title: Information Foraging for Enhancing Implicit Feedback in Content-based Image Recommendation

Amit Kumar Jaiswal, Haiming Liu, Ingo Frommholz

Comments: FIRE '19: Proceedings of the 11th Forum for Information Retrieval Evaluation

Subjects: Information Retrieval (cs.IR); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[26] arXiv:2001.06888 (cross-list from cs.CL) [pdf, other]: Title: A multimodal deep learning approach for named entity recognition from social media

Meysam Asgari-Chenaghlu, M.Reza Feizi-Derakhshi, Leili Farzinvash, M. A. Balafar, Cina Motamed

Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM); Social and Information Networks (cs.SI)
[27] arXiv:2001.07194 (cross-list from cs.CL) [pdf, other]: Title: Recommending Themes for Ad Creative Design via Visual-Linguistic Representations

Yichao Zhou, Shaunak Mishra, Manisha Verma, Narayan Bhamidipati, Wei Wang

Comments: 7 pages, 8 figures, 2 tables, accepted by The Web Conference 2020

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[28] arXiv:2001.07295 (cross-list from cs.AI) [pdf, other]: Title: AutoMATES: Automated Model Assembly from Text, Equations, and Software

Adarsh Pyarelal, Marco A. Valenzuela-Escarcega, Rebecca Sharp, Paul D. Hein, Jon Stephens, Pratik Bhandari, HeuiChan Lim, Saumya Debray, Clayton T. Morrison

Comments: 8 pages, 6 figures, accepted to Modeling the World's Systems 2019

Subjects: Artificial Intelligence (cs.AI); Multimedia (cs.MM); Software Engineering (cs.SE)
[29] arXiv:2001.08730 (cross-list from cs.CV) [pdf, other]: Title: Robust Explanations for Visual Question Answering

Badri N. Patro, Shivansh Pate, Vinay P. Namboodiri

Comments: WACV-2020 (Accepted)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[30] arXiv:2001.08779 (cross-list from cs.CV) [pdf, other]: Title: Deep Bayesian Network for Visual Question Generation

Badri N. Patro, Vinod K. Kurmi, Sandeep Kumar, Vinay P. Namboodiri

Comments: WACV-2020 (Accepted)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG); Multimedia (cs.MM)
[31] arXiv:2001.09545 (cross-list from cs.NE) [pdf, other]: Title: aiTPR: Attribute Interaction-Tensor Product Representation for Image Caption

Chiranjib Sur

Comments: 11 pages

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[32] arXiv:2001.10190 (cross-list from cs.SD) [pdf, other]: Title: Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet Transform

Tomohiko Nakamura, Hiroshi Saruwatari

Comments: 5 pages, to appear in IEEE International Conference on Acoustics, Speech, and Signal Processing 2020 (ICASSP 2020)

Journal-ref: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[33] arXiv:2001.10832 (cross-list from eess.AS) [pdf, other]: Title: Audio-Visual Decision Fusion for WFST-based and seq2seq Models

Rohith Aralikatti, Sharad Roy, Abhinav Thanda, Dilip Kumar Margam, Pujitha Appan Kandala, Tanay Sharma, Shankar M Venkatesan

Comments: Submitted for review to ICASSP 2020 on October 21st, 2019

Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Image and Video Processing (eess.IV)
[34] arXiv:2001.11847 (cross-list from cs.NE) [pdf, other]: Title: CNN-based fast source device identification

Sara Mandelli, Davide Cozzolino, Paolo Bestagini, Luisa Verdoliva, Stefano Tubaro

Subjects: Neural and Evolutionary Computing (cs.NE); Multimedia (cs.MM)

Total of 34 entries

Showing up to 50 entries per page: fewer | more | all