Multimedia

Authors and titles for August 2022

Total of 87 entries : 1-25 26-50 51-75 76-87

Showing up to 25 entries per page: fewer | more | all

[26] arXiv:2208.02337 (cross-list from cs.CV) [pdf, other]: Title: Estimating Visual Information From Audio Through Manifold Learning

Fabrizio Pedersoli, Dryden Wiebe, Amin Banitalebi, Yong Zhang, George Tzanetakis, Kwang Moo Yi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[27] arXiv:2208.02490 (cross-list from cs.SE) [pdf, other]: Title: Smartphone Apps for Tracking Food Consumption and Recommendations: Evaluating Artificial Intelligence-based Functionalities, Features and Quality of Current Apps

Sabiha Samad, Fahmida Ahmed, Samsun Naher, Muhammad Ashad Kabir, Anik Das, Sumaiya Amin, Sheikh Mohammed Shariful Islam

Journal-ref: Intelligent Systems with Applications, Volume 15, September 2022, 200103

Subjects: Software Engineering (cs.SE); Multimedia (cs.MM)
[28] arXiv:2208.02519 (cross-list from cs.CV) [pdf, other]: Title: IPDAE: Improved Patch-Based Deep Autoencoder for Lossy Point Cloud Geometry Compression

Kang You, Pan Gao, Qing Li

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[29] arXiv:2208.03479 (cross-list from cs.CV) [pdf, other]: Title: Analysing the Memorability of a Procedural Crime-Drama TV Series, CSI

Sean Cummins, Lorin Sweeney, Alan F. Smeaton

Comments: 7 pages, accepted to CBMI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[30] arXiv:2208.03497 (cross-list from cs.CV) [pdf, other]: Title: Contrastive Positive Mining for Unsupervised 3D Action Representation Learning

Haoyuan Zhang, Yonghong Hou, Wenjing Zhang, Wanqing Li

Comments: Accepted by ECCV 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[31] arXiv:2208.04303 (cross-list from eess.IV) [pdf, other]: Title: Boosting neural video codecs by exploiting hierarchical redundancy

Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautiere, Auke Wiggers

Comments: WACV 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[32] arXiv:2208.04998 (cross-list from cs.NI) [pdf, other]: Title: Towards Enabling Next Generation Societal Virtual Reality Applications for Virtual Human Teleportation

Jacob Chakareski, Mahmudur Khan, Murat Yuksel

Comments: This is an extended version (with more details) of a tutorial feature article that will appear in the IEEE Signal Processing Magazine in September 2022

Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM); Image and Video Processing (eess.IV); Systems and Control (eess.SY); Applications (stat.AP)
[33] arXiv:2208.05057 (cross-list from cs.SD) [pdf, other]: Title: Subjective Evaluation of Deep Neural Network Based Speech Enhancement Systems in Real-World Conditions

Gaurav Naithani, Kirsi Pietilä, Riitta Niemistö, Erkki Paajanen, Tero Takala, Tuomas Virtanen

Comments: Accepted for publication in IEEE MMSP 2022

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[34] arXiv:2208.05162 (cross-list from cs.SD) [pdf, other]: Title: Controlling Perceived Emotion in Symbolic Music Generation with Monte Carlo Tree Search

Lucas N. Ferreira, Lili Mou, Jim Whitehead, Levi H. S. Lelis

Comments: Accepted for publication at the 18th AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment (AIIDE-22)

Subjects: Sound (cs.SD); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[35] arXiv:2208.05190 (cross-list from cs.IR) [pdf, other]: Title: DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias

Yu Zheng, Chen Gao, Jingtao Ding, Lingling Yi, Depeng Jin, Yong Li, Meng Wang

Comments: Accepted by MM'22

Subjects: Information Retrieval (cs.IR); Multimedia (cs.MM)
[36] arXiv:2208.05213 (cross-list from cs.CV) [pdf, other]: Title: Automatic Camera Control and Directing with an Ultra-High-Definition Collaborative Recording System

Bram Vanherle, Tim Vervoort, Nick Michiels, Philippe Bekaert

Journal-ref: CVMP, 2021, 1-10

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[37] arXiv:2208.05251 (cross-list from cs.CV) [pdf, other]: Title: Consistency-based Self-supervised Learning for Temporal Anomaly Localization

Aniello Panariello, Angelo Porrello, Simone Calderara, Rita Cucchiara

Comments: Accepted to the WCPA Workshop at ECCV 2022 (1st International Workshop and Challenge on People Analysis: From Face, Body and Fashion to 3D Virtual Avatars). 13 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[38] arXiv:2208.05318 (cross-list from cs.CV) [pdf, other]: Title: Generative Action Description Prompts for Skeleton-based Action Recognition

Wangmeng Xiang, Chao Li, Yuxuan Zhou, Biao Wang, Lei Zhang

Comments: Accepted by ICCV23

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[39] arXiv:2208.05605 (cross-list from cs.SD) [pdf, other]: Title: Symbolic Music Loop Generation with Neural Discrete Representations

Sangjun Han, Hyeongrae Ihm, Moontae Lee, Woohyung Lim

Comments: Accepted at ISMIR 2022

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[40] arXiv:2208.05647 (cross-list from cs.CV) [pdf, other]: Title: PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding

Zihan Ding, Zi-han Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Si Liu

Comments: Accepted by ACM MM 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[41] arXiv:2208.05697 (cross-list from cs.SD) [pdf, other]: Title: Re-creation of Creations: A New Paradigm for Lyric-to-Melody Generation

Ang Lv, Xu Tan, Tao Qin, Tie-Yan Liu, Rui Yan

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[42] arXiv:2208.05701 (cross-list from cs.HC) [pdf, other]: Title: Cine-AI: Generating Video Game Cutscenes in the Style of Human Directors

Inan Evin, Perttu Hämäläinen, Christian Guckelsberger

Comments: 23 pages, 6 figures, 4 tables. In Proceedings ACM Human-Computer Interaction, Vol. 6, CHIPLAY, Article 223. Publication date: October 2022

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[43] arXiv:2208.05775 (cross-list from cs.CV) [pdf, other]: Title: PSUMNet: Unified Modality Part Streams are All You Need for Efficient Pose-based Action Recognition

Neel Trivedi, Ravi Kiran Sarvadevabhatla

Comments: Accepted at ECCV 2022 WCPA (this https URL) . Code and models at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[44] arXiv:2208.05814 (cross-list from cs.CV) [pdf, other]: Title: Seeing your sleep stage: cross-modal distillation from EEG to infrared video

Jianan Han, Shaoxing Zhang, Aidong Men, Yang Liu, Ziming Yao, Yan Yan, Qingchao Chen

Comments: We have submitted this paper to an academic journal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[45] arXiv:2208.06678 (cross-list from cs.CV) [pdf, other]: Title: A new way of video compression via forward-referencing using deep learning

S.M.A.K. Rajin, M. Murshed, M. Paul, S.W. Teng, J. Ma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[46] arXiv:2208.06773 (cross-list from cs.CV) [pdf, other]: Title: TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency

Medhini Narasimhan, Arsha Nagrani, Chen Sun, Michael Rubinstein, Trevor Darrell, Anna Rohrbach, Cordelia Schmid

Comments: Accepted to ECCV 2022. Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[47] arXiv:2208.07110 (cross-list from cs.CV) [pdf, other]: Title: A Unified Image Preprocessing Framework For Image Compression

Moqi Zhang, Weihui Deng, Xiaocheng Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[48] arXiv:2208.07589 (cross-list from cs.LG) [pdf, other]: Title: Efficient Multimodal Transformer with Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis

Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao

Comments: Accepted by TAC. The code is available at this https URL

Journal-ref: IEEE Transactions on Affective Computing, 2023

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[49] arXiv:2208.07825 (cross-list from cs.CR) [pdf, other]: Title: An Adaptive Image Encryption Scheme Guided by Fuzzy Models

Mahdi Shariatzadeh, Mohammad Javad Rostami, Mahdi Eftekhari

Comments: Iranian Journal of Fuzzy Systems (2023)

Subjects: Cryptography and Security (cs.CR); Multimedia (cs.MM)
[50] arXiv:2208.07994 (cross-list from cs.SD) [pdf, other]: Title: Enhancing Audio Perception of Music By AI Picked Room Acoustics

Prateek Verma, Jonathan Berger

Comments: 24th International Congress on Acoustics, Gyeongju, South Korea

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)

Total of 87 entries : 1-25 26-50 51-75 76-87

Showing up to 25 entries per page: fewer | more | all