Multimedia

Authors and titles for October 2020

Total of 74 entries : 1-50 51-74
[1] arXiv:2010.00302 [pdf, other]
Title: An authorship protection technology for electronic documents based on image watermarking
Anna Melman, Oleg Evsutin, Alexander Shelupanov
Comments: 21 pages, 7 figures
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR)
[2] arXiv:2010.02015 [pdf, other]
Title: Combined Hapto-Visual and Auditory Rendering of Cultural Heritage Objects
Praseedha Krishnan Aniyath, Sreeni Kamalalayam Gopalan, Priyadarshini K, Subhasis Chaudhuri
Comments: Accepted to ACCVw 2014
Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[3] arXiv:2010.02822 [pdf, other]
Title: Scalable Rendering of Variable Density Point Cloud Data
Priyadarshini Kumari, Sreeni K.G, Subhasis Chaudhuri
Comments: Accepted to World Haptics Conference (WHC), 2013
Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[4] arXiv:2010.03169 [pdf, other]
Title: Haptic Rendering of Cultural Heritage Objects at Different Scales
Sreeni K.G, Priyadarshini K, Praseedha A.K, Subhasis Chaudhuri
Comments: Accepted to EuroHaptics. arXiv admin note: text overlap with arXiv:2010.02015
Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[5] arXiv:2010.04645 [pdf, other]
Title: MPEG Media Enablers For Richer XR Experiences
Emmanuel Thomas, Emmanouil Potetsianakis, Thomas Stockhammer, Imed Bouazizi, Mary-Luc Champel
Journal-ref: IBC (2020)
Subjects: Multimedia (cs.MM); Graphics (cs.GR)
[6] arXiv:2010.04676 [pdf, other]
Title: A Clustering-Based Method for Automatic Educational Video Recommendation Using Deep Face-Features of Lecturers
Paulo R. C. Mendes, Eduardo S. Vieira, Álan L. V. Guedes, Antonio J. G. Busson, Sérgio Colcher
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[7] arXiv:2010.08119 [pdf, other]
Title: Revenue and Energy Efficiency-Driven Delay Constrained Computing Task Offloading and Resource Allocation in a Vehicular Edge Computing Network: A Deep Reinforcement Learning Approach
Xinyu Huang, Lijun He, Xing Chen, Liejun Wang, Fan Li
Comments: 15 pages, 13 figures, submitted to IEEE Internet of Things
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[8] arXiv:2010.08737 [pdf, other]
Title: Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning
Pavlos Avgoustinakis, Giorgos Kordopatis-Zilos, Symeon Papadopoulos, Andreas L. Symeonidis, Ioannis Kompatsiaris
Subjects: Multimedia (cs.MM); Information Retrieval (cs.IR); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[9] arXiv:2010.09235 [pdf, other]
Title: Ensemble Chinese End-to-End Spoken Language Understanding for Abnormal Event Detection from audio stream
Haoran Wei, Fei Tao, Runze Su, Sen Yang, Ji Liu
Comments: Submitting to ICASSP 2021
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[10] arXiv:2010.09641 [pdf, other]
Title: DIME: An Online Tool for the Visual Comparison of Cross-Modal Retrieval Models
Tony Zhao, Jaeyoung Choi, Gerald Friedland
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[11] arXiv:2010.10135 [pdf, other]
Title: INDCOR white paper 1: A shared vocabulary for IDN (Interactive Digital Narratives)
Hartmut Koenitz, Mirjam Palosaari Eladhari, Sandy Louchart, Frank Nack
Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[12] arXiv:2010.10144 [pdf, other]
Title: Keystroke Dynamics as Part of Lifelogging
Alan F. Smeaton, Naveen Garaga Krishnamurthy, Amruth Hebbasuru Suryanarayana
Comments: Accepted to 27th International Conference on Multimedia Modeling, Prague, Czech Republic, June 2021
Subjects: Multimedia (cs.MM); Human-Computer Interaction (cs.HC)
[13] arXiv:2010.10721 [pdf, other]
Title: ComboLoss for Facial Attractiveness Analysis with Squeeze-and-Excitation Networks
Lu Xu, Jinhai Xiang
Comments: Tech Report
Subjects: Multimedia (cs.MM)
[14] arXiv:2010.12662 [pdf, other]
Title: Short Video-based Advertisements Evaluation System: Self-Organizing Learning Approach
Yunjie Zhang, Fei Tao, Xudong Liu, Runze Su, Xiaorong Mei, Weicong Ding, Zhichen Zhao, Lei Yuan, Ji Liu
Comments: Submitting to ICASSP 2021
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG)
[15] arXiv:2010.13260 [pdf, other]
Title: Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms
Babak Naderi, Gabriel Mittag, Rafael Zequeira Jiménez, Sebastian Möller
Subjects: Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[16] arXiv:2010.13715 [pdf, other]
Title: ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction
Pavan C. Madhusudana, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
Journal-ref: IEEE Transactions on Image Processing. 30 (2021) 7446 - 7457
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[17] arXiv:2010.00246 (cross-list from cs.CV) [pdf, other]
Title: CariMe: Unpaired Caricature Generation with Multiple Exaggerations
Zheng Gu, Chuanqi Dong, Jing Huo, Wenbin Li, Yang Gao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[18] arXiv:2010.00400 (cross-list from cs.CV) [pdf, other]
Title: DeepFakesON-Phys: DeepFakes Detection based on Heart Rate Estimation
Javier Hernandez-Ortega, Ruben Tolosana, Julian Fierrez, Aythami Morales
Journal-ref: Proc. 35th AAAI Conference on Artificial Intelligence Workshops, 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[19] arXiv:2010.00984 (cross-list from cs.IR) [pdf, other]
Title: An Empirical Study of DNNs Robustification Inefficacy in Protecting Visual Recommenders
Vito Walter Anelli, Tommaso Di Noia, Daniele Malitesta, Felice Antonio Merra
Comments: 9 pages, 1 figure
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[20] arXiv:2010.01158 (cross-list from cs.CV) [pdf, other]
Title: MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis
Zhenyu Wu, Duc Hoang, Shih-Yao Lin, Yusheng Xie, Liangjian Chen, Yen-Yu Lin, Zhangyang Wang, Wei Fan
Comments: Accepted by ACM Multimedia 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[21] arXiv:2010.01323 (cross-list from cs.HC) [pdf, other]
Title: The Design of Tangible Digital Musical Instruments
Gareth W. Young, Katie Crowley
Comments: MusTWork 2016
Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[22] arXiv:2010.01326 (cross-list from cs.HC) [pdf, other]
Title: Digital Musical Instrument Analysis: The Haptic Bowl
Gareth W. Young, Dave Murphy
Comments: CMMR 2015
Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[23] arXiv:2010.01328 (cross-list from cs.HC) [pdf, other]
Title: HCI Models for Digital Musical Instruments: Methodologies for Rigorous Testing of Digital Musical Instruments
Gareth W. Young, Dave Murphy
Comments: CMMR 2015
Subjects: Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[24] arXiv:2010.01424 (cross-list from cs.CV) [pdf, other]
Title: MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network
Yi Wei, Zhe Gan, Wenbo Li, Siwei Lyu, Ming-Ching Chang, Lei Zhang, Jianfeng Gao, Pengchuan Zhang
Comments: published at ACCV2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[25] arXiv:2010.01679 (cross-list from cs.CV) [pdf, other]
Title: Learning Complete 3D Morphable Face Models from Images and Videos
Mallikarjun B R, Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt
Comments: Project Page - this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[26] arXiv:2010.01944 (cross-list from cs.HC) [pdf, other]
Title: Actors in VR storytelling
Selma Rizvic, Dusanka Boskovic, Fabio Bruno, Barbara Davidde Petriaggi, Sanda Sljivo, Marco Cozza
Comments: Pre-print version
Subjects: Human-Computer Interaction (cs.HC); Graphics (cs.GR); Multimedia (cs.MM)
[27] arXiv:2010.02748 (cross-list from eess.IV) [pdf, other]
Title: Neural Generation of Blocks for Video Coding
Jonah Probell
Comments: 12 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Multimedia (cs.MM)
[28] arXiv:2010.02959 (cross-list from cs.CV) [pdf, other]
Title: Using Sentences as Semantic Representations in Large Scale Zero-Shot Learning
Yannick Le Cacheux, Hervé Le Borgne, Michel Crucianu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[29] arXiv:2010.03183 (cross-list from cs.NI) [pdf, other]
Title: Network-aware Recommendations in the Wild: Methodology, Realistic Evaluations, Experiments
Savvas Kastanakis, Pavlos Sermpezis, Vasileios Kotronis, Daniel Menasché, Thrasyvoulos Spyropoulos
Comments: arXiv admin note: text overlap with arXiv:1806.02704
Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM)
[30] arXiv:2010.04862 (cross-list from cs.LG) [pdf, other]
Title: Remarks on Optimal Scores for Speaker Recognition
Dong Wang
Comments: 17 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD); Machine Learning (stat.ML)
[31] arXiv:2010.05466 (cross-list from cs.CV) [pdf, other]
Title: Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching
Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou
Comments: To appear in NeurIPS 2020. Previous Title: Learning to Discriminatively Localize Sounding Objects in a Cocktail-party Scenario
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[32] arXiv:2010.05468 (cross-list from cs.CV) [pdf, other]
Title: TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation
Dongxu Li, Chenchen Xu, Xin Yu, Kaihao Zhang, Ben Swift, Hanna Suominen, Hongdong Li
Comments: NeurIPS 2020 preprint
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[33] arXiv:2010.05760 (cross-list from eess.IV) [pdf, other]
Title: Video Quality Enhancement Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients in MPEG I-frames
Antonio J G Busson, Paulo R C Mendes, Daniel de S Moraes, Álvaro M da Veiga, Álan L V Guedes, Sérgio Colcher
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[34] arXiv:2010.07637 (cross-list from cs.CL) [pdf, other]
Title: DialogueTRM: Exploring the Intra- and Inter-Modal Emotional Behaviors in the Conversation
Yuzhao Mao, Qi Sun, Guang Liu, Xiaojie Wang, Weiguo Gao, Xuan Li, Jianping Shen
Subjects: Computation and Language (cs.CL); Multimedia (cs.MM)
[35] arXiv:2010.07739 (cross-list from cs.SD) [pdf, other]
Title: Music Classification in MIDI Format based on LSTM Model
Yiting Xia, Yiwei Jiang, Tao Ye
Comments: in Chinese
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[36] arXiv:2010.07775 (cross-list from eess.AS) [pdf, other]
Title: Muse: Multi-modal target speaker extraction with visual cues
Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li
Comments: Accepted by ICASSP2021
Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD); Image and Video Processing (eess.IV)
[37] arXiv:2010.08021 (cross-list from cs.CL) [pdf, other]
Title: MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention
Aman Khullar, Udit Arora
Comments: To appear in the first EMNLP Workshop on NLP Beyond Text, 2020. Aman Khullar and Udit Arora have equal contribution
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[38] arXiv:2010.08091 (cross-list from cs.SD) [pdf, other]
Title: PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music
Hongru Liang, Wenqiang Lei, Paul Yaozhu Chan, Zhenglu Yang, Maosong Sun, Tat-Seng Chua
Comments: ACM Multimedia 2020 -- best paper
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[39] arXiv:2010.08123 (cross-list from cs.SD) [pdf, other]
Title: Melody Classifier with Stacked-LSTM
You Li, Zhuowen Lin
Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[40] arXiv:2010.08919 (cross-list from cs.CV) [pdf, other]
Title: Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution
Xiaoyu Xiang, Qian Lin, Jan P. Allebach
Comments: 8 pages, 6 figures, 5 tables. Accepted by the 25th ICPR (2020)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[41] arXiv:2010.09290 (cross-list from cs.CV) [pdf, other]
Title: Frame Aggregation and Multi-Modal Fusion Framework for Video-Based Person Recognition
Fangtao Li, Wenzhe Wang, Zihe Liu, Haoran Wang, Chenghao Yan, Bin Wu
Comments: Accepted by MMM 2021
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[42] arXiv:2010.09489 (cross-list from cs.SD) [pdf, other]
Title: Hit Song Prediction Based on Early Adopter Data and Audio Features
Dorien Herremans, Tom Bergmans
Journal-ref: The 18th International Society for Music Information Retrieval Conference (ISMIR) 2018 - LBD
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM)
[43] arXiv:2010.09907 (cross-list from cs.CV) [pdf, other]
Title: Color Image Segmentation Metrics
Majid Harouni, Hadi Yazdani Baghmaleki
Comments: 19 pages, 11 figures, 6 tables, 29 equations, book chapter, 2 authors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[44] arXiv:2010.09925 (cross-list from cs.CV) [pdf, other]
Title: Hierarchical Paired Channel Fusion Network for Street Scene Change Detection
Yinjie Lei, Duo Peng, Pingping Zhang, Qiuhong Ke, Haifeng Li
Comments: To appear in Transactions on Image Processing, including 13 pages, 13 figures, 9 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[45] arXiv:2010.10637 (cross-list from cs.CV) [pdf, other]
Title: Mutual Information Regularized Identity-aware Facial Expression Recognition in Compressed Video
Xiaofeng Liu, Linghao Jin, Xu Han, Jane You
Comments: Published in Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[46] arXiv:2010.10658 (cross-list from cs.HC) [pdf, other]
Title: Display object alignment may influence location recall in unexpected ways
Peter Zelchenko, Xiaohan Fu, Xiangqian Li, Alex Ivanov, Zhenyu Gu
Comments: superseded by arXiv:2308.12201
Subjects: Human-Computer Interaction (cs.HC); Computers and Society (cs.CY); Multimedia (cs.MM)
[47] arXiv:2010.10706 (cross-list from cs.RO) [pdf, other]
Title: Can We Enable the Drone to be a Filmmaker?
Yuanjie Dang
Comments: 7 pages, 14 figures
Subjects: Robotics (cs.RO); Multimedia (cs.MM)
[48] arXiv:2010.11098 (cross-list from cs.SD) [pdf, other]
Title: WaveTransformer: A Novel Architecture for Audio Captioning Based on Learning Temporal and Time-Frequency Information
An Tran, Konstantinos Drossos, Tuomas Virtanen
Comments: Submitted for review at ICASSP2021
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[49] arXiv:2010.11550 (cross-list from cs.CV) [pdf, other]
Title: Learning Dual Semantic Relations with Graph Attention for Image-Text Matching
Keyu Wen, Xiaodong Gu, Qingrong Cheng
Comments: 14 pages, 9 figures. Accepted at: IEEE Transactions on Circuits and Systems for Video Technology (Early Access Print). Code available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[50] arXiv:2010.11732 (cross-list from cs.CV) [pdf, other]
Title: A Cluster-Matching-Based Method for Video Face Recognition
Paulo R C Mendes, Antonio J G Busson, Sérgio Colcher, Daniel Schwabe, Álan L V Guedes, Carlos Laufer
Comments: 13 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)