Multimedia

Authors and titles for February 2024

Total of 86 entries : 1-25 26-50 51-75 76-86

Showing up to 25 entries per page: fewer | more | all

[1] arXiv:2402.00045 [pdf, html, other]: Title: Detecting Multimedia Generated by Large AI Models: A Survey

Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2] arXiv:2402.00622 [pdf, other]: Title: Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations

Vignesh V Menon, Adam Wieckowski, Jens Brandenburg, Benjamin Bross, Thomas Schierl, Detlev Marpe

Comments: 2024 Mile High Video (MHV)

Subjects: Multimedia (cs.MM)
[3] arXiv:2402.03413 [pdf, html, other]: Title: Perceptual Video Quality Assessment: A Survey

Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[4] arXiv:2402.03513 [pdf, html, other]: Title: Video Super-Resolution for Optimized Bitrate and Green Online Streaming

Vignesh V Menon, Prajit T Rajendran, Amritha Premkumar, Benjamin Bross, Detlev Marpe

Comments: 2024 Picture Coding Symposium (PCS)

Subjects: Multimedia (cs.MM)
[5] arXiv:2402.03946 [pdf, other]: Title: BioNet-XR: Biological Network Visualization Framework for Virtual Reality and Mixed Reality Environments

Busra Senderin, Nurcan Tuncbag, Elif Surer

Subjects: Multimedia (cs.MM)
[6] arXiv:2402.05508 [pdf, html, other]: Title: Performance Evaluation of Associative Watermarking Using Statistical Neurodynamics

Ryoto Kanegae, Masaki Kawamura

Comments: 8 pages, 6 figures

Journal-ref: J. Phys. Soc. Jpn., Vol.93, No.11, 2024, Article ID: 114004

Subjects: Multimedia (cs.MM); Statistical Mechanics (cond-mat.stat-mech)
[7] arXiv:2402.06424 [pdf, other]: Title: Reducing Latency for Multimedia Broadcast Services Over Mobile Networks

C. M. Lentisco, L. Bellido, A. Cárdenas, R. F. Moyano, D. Fernández

Comments: 10 pages

Journal-ref: IEEE Transactions on Multimedia, vol. 19, no. 1, pp. 173-182, Jan. 2017

Subjects: Multimedia (cs.MM)
[8] arXiv:2402.06437 [pdf, other]: Title: Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery

C. M. Lentisco, L. Bellido, A. Cárdenas, R. F. Moyano, D. Fernández

Comments: 14 pages, 10 figures

Journal-ref: in IEEE Transactions on Multimedia, vol. 25, pp. 378-388, 2023

Subjects: Multimedia (cs.MM)
[9] arXiv:2402.06945 [pdf, html, other]: Title: Evaluation Metrics for Automated Typographic Poster Generation

Sérgio M. Rebelo, J. J. Merelo, João Bicker, Penousal Machado

Comments: Paper accepted be presented in the 13th International Conference Artificial Intelligence in Music, Sound, Art and Design -- EvoMUSART 2024, Held as Part of EvoStar 2024, Aberystwyth, Wales, United Kingdom, April 3\textendash{}5, 2024

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[10] arXiv:2402.07402 [pdf, other]: Title: BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind

Yuanyuan Mao, Xin Lin, Qin Ni, Liang He

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[11] arXiv:2402.07640 [pdf, html, other]: Title: Synthesizing Sentiment-Controlled Feedback For Multimodal Text and Image Data

Puneet Kumar, Sarthak Malik, Balasubramanian Raman, Xiaobai Li

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[12] arXiv:2402.09062 [pdf, other]: Title: Blind Deep-Learning-Based Image Watermarking Robust Against Geometric Transformations

Hannes Mareen, Lucas Antchougov, Glenn Van Wallendael, Peter Lambert

Comments: Accepted and presented at IEEE International Conference on Consumer Electronics (ICCE) 2024

Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[13] arXiv:2402.09392 [pdf, other]: Title: LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning

Adithya Raman, Bekir Turkkan, Tevfik Kosar

Comments: 10 pages, 3 figures, 3 Tables

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[14] arXiv:2402.09720 [pdf, html, other]: Title: SpaceMeta: Global-Scale Massive Multi-User Virtual Interaction over LEO Satellite Constellations

Jiahe Huang, Yifei Zhu

Comments: Accepted by IEEE Satellite'23

Subjects: Multimedia (cs.MM); Networking and Internet Architecture (cs.NI)
[15] arXiv:2402.10805 [pdf, other]: Title: Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond

Yongqi Li, Wenjie Wang, Leigang Qu, Liqiang Nie, Wenjie Li, Tat-Seng Chua

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[16] arXiv:2402.12629 [pdf, html, other]: Title: Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale

Anmol Agarwal, Pratyush Priyadarshi, Shiven Sinha, Shrey Gupta, Hitkul Jangra, Ponnurangam Kumaraguru, Kiran Garimella

Comments: KDD 2024 [Updates for Camera Ready version]

Subjects: Multimedia (cs.MM); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[17] arXiv:2402.12760 [pdf, html, other]: Title: A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis

Nailei Hei, Qianyu Guo, Zihao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang

Comments: Accepted by The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2402.14326 [pdf, html, other]: Title: Think before You Leap: Content-Aware Low-Cost Edge-Assisted Video Semantic Segmentation

Mingxuan Yan, Yi Wang, Xuedou Xiao, Zhiqing Luo, Jianhua He, Wei Wang

Comments: Accepted by ACM Multimedia 2023

Subjects: Multimedia (cs.MM)
[19] arXiv:2402.15513 [pdf, html, other]: Title: Investigating the Generalizability of Physiological Characteristics of Anxiety

Emily Zhou, Mohammad Soleymani, Maja J. Matarić

Journal-ref: 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023, pp. 4848-4855

Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Signal Processing (eess.SP); Medical Physics (physics.med-ph)
[20] arXiv:2402.18107 [pdf, html, other]: Title: Multimodal Interaction Modeling via Self-Supervised Multi-Task Learning for Review Helpfulness Prediction

HongLin Gong, Mengzhao Jia, Liqiang Jing

Comments: 10 pages,4 figures, 4 tables

Subjects: Multimedia (cs.MM)
[21] arXiv:2402.18400 [pdf, html, other]: Title: Towards Alleviating Text-to-Image Retrieval Hallucination for CLIP in Zero-shot Learning

Hanyao Wang, Yibing Zhan, Liu Liu, Liang Ding, Yan Yang, Jun Yu

Comments: This work has been submitted to the lEEE for possible publication. Copyright may betransferred without notice, after which this version may no longer be accessible

Subjects: Multimedia (cs.MM)
[22] arXiv:2402.18702 [pdf, other]: Title: Characterizing Multimedia Information Environment through Multi-modal Clustering of YouTube Videos

Niloofar Yousefi, Mainuddin Shaik, Nitin Agarwal

Comments: 14 pages, In the 4th International Conference on SMART MULTIMEDIA, 2024

Subjects: Multimedia (cs.MM)
[23] arXiv:2402.01180 (cross-list from cs.NI) [pdf, html, other]: Title: Real-time Extended Reality Video Transmission Optimization Based on Frame-priority Scheduling

Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen, Yanzan Sun

Comments: 6 pages, 7 figures

Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM); Signal Processing (eess.SP)
[24] arXiv:2402.02210 (cross-list from cs.CV) [pdf, other]: Title: Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition

Haochen Chang, Jing Chen, Yilin Li, Jixiang Chen, Xiaofeng Zhang

Comments: Accepted by ICASSP 2024

Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2024, Seoul (Korea), South Korea

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[25] arXiv:2402.02369 (cross-list from cs.CV) [pdf, other]: Title: M$^3$Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing

Mohammadreza Mofayezi, Reza Alipour, Mohammad Ali Kakavand, Ehsaneddin Asgari

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)

Total of 86 entries : 1-25 26-50 51-75 76-86

Showing up to 25 entries per page: fewer | more | all