Multimedia

Authors and titles for October 2020

Total of 74 entries : 1-50 51-74

[1] arXiv:2010.00302 [pdf, other]: Title: An authorship protection technology for electronic documents based on image watermarking

Anna Melman, Oleg Evsutin, Alexander Shelupanov

Comments: 21 pages, 7 figures

Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[2] arXiv:2010.02015 [pdf, other]: Title: Combined Hapto-Visual and Auditory Rendering of Cultural Heritage Objects

Praseedha Krishnan Aniyath, Sreeni Kamalalayam Gopalan, Priyadarshini K, Subhasis Chaudhuri

Comments: Accepted to ACCVw 2014

Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[3] arXiv:2010.02822 [pdf, other]: Title: Scalable Rendering of Variable Density Point Cloud Data

Priyadarshini Kumari, Sreeni K.G, Subhasis Chaudhuri

Comments: Accepted to World Haptics Conference (WHC), 2013

Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[4] arXiv:2010.03169 [pdf, other]: Title: Haptic Rendering of Cultural Heritage Objects at Different Scales

Sreeni K.G, Priyadarshini K, Praseedha A.K, Subhasis Chaudhuri

Comments: Accepted to EuroHaptics. arXiv admin note: text overlap with arXiv:2010.02015

Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[5] arXiv:2010.04645 [pdf, other]: Title: MPEG Media Enablers For Richer XR Experiences

Emmanuel Thomas, Emmanouil Potetsianakis, Thomas Stockhammer, Imed Bouazizi, Mary-Luc Champel

Journal-ref: IBC (2020)

Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[6] arXiv:2010.04676 [pdf, other]: Title: A Clustering-Based Method for Automatic Educational Video Recommendation Using Deep Face-Features of Lecturers

Paulo R. C. Mendes, Eduardo S. Vieira, Álan L. V. Guedes, Antonio J. G. Busson, Sérgio Colcher

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[7] arXiv:2010.08119 [pdf, other]: Title: Revenue and Energy Efficiency-Driven Delay Constrained Computing Task Offloading and Resource Allocation in a Vehicular Edge Computing Network: A Deep Reinforcement Learning Approach

Xinyu Huang, Lijun He, Xing Chen, Liejun Wang, Fan Li

Comments: 15 pages, 13 figures, submitted to IEEE Internet of Things

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2010.08737 [pdf, other]: Title: Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning

Pavlos Avgoustinakis, Giorgos Kordopatis-Zilos, Symeon Papadopoulos, Andreas L. Symeonidis, Ioannis Kompatsiaris

Subjects: Multimedia (cs.MM); Information Retrieval (cs.IR); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[9] arXiv:2010.09235 [pdf, other]: Title: Ensemble Chinese End-to-End Spoken Language Understanding for Abnormal Event Detection from audio stream

Haoran Wei, Fei Tao, Runze Su, Sen Yang, Ji Liu

Comments: Submitting to ICASSP 2021

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[10] arXiv:2010.09641 [pdf, other]: Title: DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models

Tony Zhao, Jaeyoung Choi, Gerald Friedland

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[11] arXiv:2010.10135 [pdf, other]: Title: INDCOR white paper 1: A shared vocabulary for IDN (Interactive Digital Narratives)

Hartmut Koenitz, Mirjam Palosaari Eladhari, Sandy Louchart, Frank Nack

Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[12] arXiv:2010.10144 [pdf, other]: Title: Keystroke Dynamics as Part of Lifelogging

Alan F. Smeaton, Naveen Garaga Krishnamurthy, Amruth Hebbasuru Suryanarayana

Comments: Accepted to 27th International Conference on Multimedia Modeling, Prague, Czech Republic, June 2021

Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[13] arXiv:2010.10721 [pdf, other]: Title: ComboLoss for Facial Attractiveness Analysis with Squeeze-and-Excitation Networks

Lu Xu, Jinhai Xiang

Comments: Tech Report

Subjects: Multimedia (cs.MM)
[14] arXiv:2010.12662 [pdf, other]: Title: Short Video-based Advertisements Evaluation System: Self-Organizing Learning Approach

Yunjie Zhang, Fei Tao, Xudong Liu, Runze Su, Xiaorong Mei, Weicong Ding, Zhichen Zhao, Lei Yuan, Ji Liu

Comments: Submitting to ICASSP 2021

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[15] arXiv:2010.13260 [pdf, other]: Title: Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms

Babak Naderi, Gabriel Mittag, Rafael Zequeira Jiménez, Sebastian Möller

Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[16] arXiv:2010.13715 [pdf, other]: Title: ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction

Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik

Journal-ref: IEEE Transactions on Image Processing. 30 (2021) 7446 - 7457

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[17] arXiv:2010.00246 (cross-list from cs.CV) [pdf, other]: Title: CariMe: Unpaired Caricature Generation with Multiple Exaggerations

Zheng Gu, Chuanqi Dong, Jing Huo, Wenbin Li, Yang Gao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[18] arXiv:2010.00400 (cross-list from cs.CV) [pdf, other]: Title: DeepFakesON-Phys: DeepFakes Detection based on Heart Rate Estimation

Javier Hernandez-Ortega, Ruben Tolosana, Julian Fierrez, Aythami Morales

Journal-ref: Proc. 35th AAAI Conference on Artificial Intelligence Workshops, 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[19] arXiv:2010.00984 (cross-list from cs.IR) [pdf, other]: Title: An Empirical Study of DNNs Robustification Inefficacy in Protecting Visual Recommenders

Vito Walter Anelli, Tommaso Di Noia, Daniele Malitesta, Felice Antonio Merra

Comments: 9 pages, 1 figure

Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[20] arXiv:2010.01158 (cross-list from cs.CV) [pdf, other]: Title: MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis

Zhenyu Wu, Duc Hoang, Shih-Yao Lin, Yusheng Xie, Liangjian Chen, Yen-Yu Lin, Zhangyang Wang, Wei Fan

Comments: Accepted by ACM Multimedia 2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[21] arXiv:2010.01323 (cross-list from cs.HC) [pdf, other]: Title: The Design of Tangible Digital Musical Instruments

Gareth W. Young, Katie Crowley

Comments: MusTWork 2016

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[22] arXiv:2010.01326 (cross-list from cs.HC) [pdf, other]: Title: Digital Musical Instrument Analysis: The Haptic Bowl

Gareth W. Young, Dave Murphy

Comments: CMMR 2015

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[23] arXiv:2010.01328 (cross-list from cs.HC) [pdf, other]: Title: HCI Models for Digital Musical Instruments: Methodologies for Rigorous Testing of Digital Musical Instruments

Gareth W. Young, Dave Murphy

Comments: CMMR 2015

Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[24] arXiv:2010.01424 (cross-list from cs.CV) [pdf, other]: Title: MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network

Yi Wei, Zhe Gan, Wenbo Li, Siwei Lyu, Ming-Ching Chang, Lei Zhang, Jianfeng Gao, Pengchuan Zhang

Comments: published at ACCV2020

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[25] arXiv:2010.01679 (cross-list from cs.CV) [pdf, other]: Title: Learning Complete 3D Morphable Face Models from Images and Videos

Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt

Comments: Project Page - this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[26] arXiv:2010.01944 (cross-list from cs.HC) [pdf, other]: Title: Actors in VR storytelling

Selma Rizvic, Dusanka Boskovic, Fabio Bruno, Barbara Davidde Petriaggi, Sanda Sljivo, Marco Cozza

Comments: Pre-print version

Subjects: Human-Computer Interaction (cs.HC); Graphics (cs.GR); Multimedia (cs.MM)
[27] arXiv:2010.02748 (cross-list from eess.IV) [pdf, other]: Title: Neural Generation of Blocks for Video Coding

Jonah Probell

Comments: 12 pages, 4 figures

Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Multimedia (cs.MM)
[28] arXiv:2010.02959 (cross-list from cs.CV) [pdf, other]: Title: Using Sentences as Semantic Representations in Large Scale Zero-Shot Learning

Yannick Le Cacheux, Hervé Le Borgne, Michel Crucianu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[29] arXiv:2010.03183 (cross-list from cs.NI) [pdf, other]: Title: Network-aware Recommendations in the Wild: Methodology, Realistic Evaluations, Experiments

Savvas Kastanakis, Pavlos Sermpezis, Vasileios Kotronis, Daniel Menasché, Thrasyvoulos Spyropoulos

Comments: arXiv admin note: text overlap with arXiv:1806.02704

Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM)
[30] arXiv:2010.04862 (cross-list from cs.LG) [pdf, other]: Title: Remarks on Optimal Scores for Speaker Recognition

Dong Wang

Comments: 17 pages, 8 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD); Machine Learning (stat.ML)
[31] arXiv:2010.05466 (cross-list from cs.CV) [pdf, other]: Title: Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching

Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou

Comments: To appear in NeurIPS 2020. Previous Title: Learning to Discriminatively Localize Sounding Objects in a Cocktail-party Scenario

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[32] arXiv:2010.05468 (cross-list from cs.CV) [pdf, other]: Title: TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation

Dongxu Li, Chenchen Xu, Xin Yu, Kaihao Zhang, Ben Swift, Hanna Suominen, Hongdong Li

Comments: NeurIPS 2020 preprint

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[33] arXiv:2010.05760 (cross-list from eess.IV) [pdf, other]: Title: Video Quality Enhancement Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients in MPEG I-frames

Antonio J G Busson, Paulo R C Mendes, Daniel de S Moraes, Álvaro M da Veiga, Álan L V Guedes, Sérgio Colcher

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[34] arXiv:2010.07637 (cross-list from cs.CL) [pdf, other]: Title: DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation

Yuzhao Mao, Qi Sun, Guang Liu, Xiaojie Wang, Weiguo Gao, Xuan Li, Jianping Shen

Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[35] arXiv:2010.07739 (cross-list from cs.SD) [pdf, other]: Title: Music Classification in MIDI Format based on LSTM Model

Yiting Xia, Yiwei Jiang, Tao Ye

Comments: in Chinese

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[36] arXiv:2010.07775 (cross-list from eess.AS) [pdf, other]: Title: Muse: Multi-modal target speaker extraction with visual cues

Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li

Comments: Accepted by ICASSP2021

Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD); Image and Video Processing (eess.IV)
[37] arXiv:2010.08021 (cross-list from cs.CL) [pdf, other]: Title: MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention

Aman Khullar, Udit Arora

Comments: To appear in the first EMNLP Workshop on NLP Beyond Text, 2020. Aman Khullar and Udit Arora have equal contribution

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[38] arXiv:2010.08091 (cross-list from cs.SD) [pdf, other]: Title: PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

Hongru Liang, Wenqiang Lei, Paul Yaozhu Chan, Zhenglu Yang, Maosong Sun, Tat-Seng Chua

Comments: ACM Multimedia 2020 -- best paper

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[39] arXiv:2010.08123 (cross-list from cs.SD) [pdf, other]: Title: Melody Classifier with Stacked-LSTM

You Li, Zhuowen Lin

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[40] arXiv:2010.08919 (cross-list from cs.CV) [pdf, other]: Title: Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution

Xiaoyu Xiang, Qian Lin, Jan P. Allebach

Comments: 8 pages, 6 figures, 5 tables. Accepted by the 25th ICPR (2020)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[41] arXiv:2010.09290 (cross-list from cs.CV) [pdf, other]: Title: Frame Aggregation and Multi-Modal Fusion Framework for Video-Based Person Recognition

Fangtao Li, Wenzhe Wang, Zihe Liu, Haoran Wang, Chenghao Yan, Bin Wu

Comments: Accepted by MMM 2021

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[42] arXiv:2010.09489 (cross-list from cs.SD) [pdf, other]: Title: Hit Song Prediction Based on Early Adopter Data and Audio Features

Dorien Herremans, Tom Bergmans

Journal-ref: The 18th International Society for Music Information Retrieval Conference (ISMIR) 2018 - LBD

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM)
[43] arXiv:2010.09907 (cross-list from cs.CV) [pdf, other]: Title: Color Image Segmentation Metrics

Majid Harouni, Hadi Yazdani Baghmaleki

Comments: 19 pages, 11 figures, 6 tables, 29 equations, book chapter, 2 authors

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[44] arXiv:2010.09925 (cross-list from cs.CV) [pdf, other]: Title: Hierarchical Paired Channel Fusion Network for Street Scene Change Detection

Yinjie Lei, Duo Peng, Pingping Zhang, Qiuhong Ke, Haifeng Li

Comments: To appear in Transactions on Image Processing, including 13 pages, 13 figures, 9 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[45] arXiv:2010.10637 (cross-list from cs.CV) [pdf, other]: Title: Mutual Information Regularized Identity-aware Facial Expression Recognition in Compressed Video

Xiaofeng Liu, Linghao Jin, Xu Han, Jane You

Comments: Published in Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[46] arXiv:2010.10658 (cross-list from cs.HC) [pdf, other]: Title: Display object alignment may influence location recall in unexpected ways

Peter Zelchenko, Xiaohan Fu, Xiangqian Li, Alex Ivanov, Zhenyu Gu

Comments: superseded by arXiv:2308.12201

Subjects: Human-Computer Interaction (cs.HC); Computers and Society (cs.CY); Multimedia (cs.MM)
[47] arXiv:2010.10706 (cross-list from cs.RO) [pdf, other]: Title: Can We Enable the Drone to be a Filmmaker?

Yuanjie Dang

Comments: 7 pages, 14 figures

Subjects: Robotics (cs.RO); Multimedia (cs.MM)
[48] arXiv:2010.11098 (cross-list from cs.SD) [pdf, other]: Title: WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information

An Tran, Konstantinos Drossos, Tuomas Virtanen

Comments: Submitted for review at ICASSP2021

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[49] arXiv:2010.11550 (cross-list from cs.CV) [pdf, other]: Title: Learning Dual Semantic Relations with Graph Attention for Image-Text Matching

Keyu Wen, Xiaodong Gu, Qingrong Cheng

Comments: 14 pages, 9 figures. Accepted at IEEE Transactions on Circuits and Systems for Video Technology (Early Access Print). Code available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[50] arXiv:2010.11732 (cross-list from cs.CV) [pdf, other]: Title: A Cluster-Matching-Based Method for Video Face Recognition

Paulo R C Mendes, Antonio J G Busson, Sérgio Colcher, Daniel Schwabe, Álan L V Guedes, Carlos Laufer

Comments: 13 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
