Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.MM

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Multimedia

Authors and titles for February 2024

Total of 86 entries : 1-25 26-50 51-75 76-86
Showing up to 25 entries per page: fewer | more | all
[1] arXiv:2402.00045 [pdf, html, other]
Title: Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin, Neeraj Gupta, Yue Zhang, Hainan Ren, Chun-Hao Liu, Feng Ding, Xin Wang, Xin Li, Luisa Verdoliva, Shu Hu
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2] arXiv:2402.00622 [pdf, other]
Title: Gain of Grain: A Film Grain Handling Toolchain for VVC-based Open Implementations
Vignesh V Menon, Adam Wieckowski, Jens Brandenburg, Benjamin Bross, Thomas Schierl, Detlev Marpe
Comments: 2024 Mile High Video (MHV)
Subjects: Multimedia (cs.MM)
[3] arXiv:2402.03413 [pdf, html, other]
Title: Perceptual Video Quality Assessment: A Survey
Xiongkuo Min, Huiyu Duan, Wei Sun, Yucheng Zhu, Guangtao Zhai
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[4] arXiv:2402.03513 [pdf, html, other]
Title: Video Super-Resolution for Optimized Bitrate and Green Online Streaming
Vignesh V Menon, Prajit T Rajendran, Amritha Premkumar, Benjamin Bross, Detlev Marpe
Comments: 2024 Picture Coding Symposium (PCS)
Subjects: Multimedia (cs.MM)
[5] arXiv:2402.03946 [pdf, other]
Title: BioNet-XR: Biological Network Visualization Framework for Virtual Reality and Mixed Reality Environments
Busra Senderin, Nurcan Tuncbag, Elif Surer
Subjects: Multimedia (cs.MM)
[6] arXiv:2402.05508 [pdf, html, other]
Title: Performance Evaluation of Associative Watermarking Using Statistical Neurodynamics
Ryoto Kanegae, Masaki Kawamura
Comments: 8 pages, 6 figures
Journal-ref: J. Phys. Soc. Jpn., Vol.93, No.11, 2024, Article ID: 114004
Subjects: Multimedia (cs.MM); Statistical Mechanics (cond-mat.stat-mech)
[7] arXiv:2402.06424 [pdf, other]
Title: Reducing Latency for Multimedia Broadcast Services Over Mobile Networks
C. M. Lentisco, L. Bellido, A. Cárdenas, R. F. Moyano, D. Fernández
Comments: 10 pages
Journal-ref: IEEE Transactions on Multimedia, vol. 19, no. 1, pp. 173-182, Jan. 2017
Subjects: Multimedia (cs.MM)
[8] arXiv:2402.06437 [pdf, other]
Title: Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery
C. M. Lentisco, L. Bellido, A. Cárdenas, R. F. Moyano, D. Fernández
Comments: 14 pages, 10 figures
Journal-ref: in IEEE Transactions on Multimedia, vol. 25, pp. 378-388, 2023
Subjects: Multimedia (cs.MM)
[9] arXiv:2402.06945 [pdf, html, other]
Title: Evaluation Metrics for Automated Typographic Poster Generation
Sérgio M. Rebelo, J. J. Merelo, João Bicker, Penousal Machado
Comments: Paper accepted be presented in the 13th International Conference Artificial Intelligence in Music, Sound, Art and Design -- EvoMUSART 2024, Held as Part of EvoStar 2024, Aberystwyth, Wales, United Kingdom, April 3\textendash{}5, 2024
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[10] arXiv:2402.07402 [pdf, other]
Title: BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind
Yuanyuan Mao, Xin Lin, Qin Ni, Liang He
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[11] arXiv:2402.07640 [pdf, html, other]
Title: Synthesizing Sentiment-Controlled Feedback For Multimodal Text and Image Data
Puneet Kumar, Sarthak Malik, Balasubramanian Raman, Xiaobai Li
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[12] arXiv:2402.09062 [pdf, other]
Title: Blind Deep-Learning-Based Image Watermarking Robust Against Geometric Transformations
Hannes Mareen, Lucas Antchougov, Glenn Van Wallendael, Peter Lambert
Comments: Accepted and presented at IEEE International Conference on Consumer Electronics (ICCE) 2024
Subjects: Multimedia (cs.MM); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[13] arXiv:2402.09392 [pdf, other]
Title: LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning
Adithya Raman, Bekir Turkkan, Tevfik Kosar
Comments: 10 pages, 3 figures, 3 Tables
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI)
[14] arXiv:2402.09720 [pdf, html, other]
Title: SpaceMeta: Global-Scale Massive Multi-User Virtual Interaction over LEO Satellite Constellations
Jiahe Huang, Yifei Zhu
Comments: Accepted by IEEE Satellite'23
Subjects: Multimedia (cs.MM); Networking and Internet Architecture (cs.NI)
[15] arXiv:2402.10805 [pdf, other]
Title: Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond
Yongqi Li, Wenjie Wang, Leigang Qu, Liqiang Nie, Wenjie Li, Tat-Seng Chua
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[16] arXiv:2402.12629 [pdf, html, other]
Title: Television Discourse Decoded: Comprehensive Multimodal Analytics at Scale
Anmol Agarwal, Pratyush Priyadarshi, Shiven Sinha, Shrey Gupta, Hitkul Jangra, Ponnurangam Kumaraguru, Kiran Garimella
Comments: KDD 2024 [Updates for Camera Ready version]
Subjects: Multimedia (cs.MM); Computers and Society (cs.CY); Social and Information Networks (cs.SI)
[17] arXiv:2402.12760 [pdf, html, other]
Title: A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis
Nailei Hei, Qianyu Guo, Zihao Wang, Yan Wang, Haofen Wang, Wenqiang Zhang
Comments: Accepted by The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2402.14326 [pdf, html, other]
Title: Think before You Leap: Content-Aware Low-Cost Edge-Assisted Video Semantic Segmentation
Mingxuan Yan, Yi Wang, Xuedou Xiao, Zhiqing Luo, Jianhua He, Wei Wang
Comments: Accepted by ACM Multimedia 2023
Subjects: Multimedia (cs.MM)
[19] arXiv:2402.15513 [pdf, html, other]
Title: Investigating the Generalizability of Physiological Characteristics of Anxiety
Emily Zhou, Mohammad Soleymani, Maja J. Matarić
Journal-ref: 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023, pp. 4848-4855
Subjects: Multimedia (cs.MM); Machine Learning (cs.LG); Signal Processing (eess.SP); Medical Physics (physics.med-ph)
[20] arXiv:2402.18107 [pdf, html, other]
Title: Multimodal Interaction Modeling via Self-Supervised Multi-Task Learning for Review Helpfulness Prediction
HongLin Gong, Mengzhao Jia, Liqiang Jing
Comments: 10 pages,4 figures, 4 tables
Subjects: Multimedia (cs.MM)
[21] arXiv:2402.18400 [pdf, html, other]
Title: Towards Alleviating Text-to-Image Retrieval Hallucination for CLIP in Zero-shot Learning
Hanyao Wang, Yibing Zhan, Liu Liu, Liang Ding, Yan Yang, Jun Yu
Comments: This work has been submitted to the lEEE for possible publication. Copyright may betransferred without notice, after which this version may no longer be accessible
Subjects: Multimedia (cs.MM)
[22] arXiv:2402.18702 [pdf, other]
Title: Characterizing Multimedia Information Environment through Multi-modal Clustering of YouTube Videos
Niloofar Yousefi, Mainuddin Shaik, Nitin Agarwal
Comments: 14 pages, In the 4th International Conference on SMART MULTIMEDIA, 2024
Subjects: Multimedia (cs.MM)
[23] arXiv:2402.01180 (cross-list from cs.NI) [pdf, html, other]
Title: Real-time Extended Reality Video Transmission Optimization Based on Frame-priority Scheduling
Guangjin Pan, Shugong Xu, Shunqing Zhang, Xiaojing Chen, Yanzan Sun
Comments: 6 pages, 7 figures
Subjects: Networking and Internet Architecture (cs.NI); Multimedia (cs.MM); Signal Processing (eess.SP)
[24] arXiv:2402.02210 (cross-list from cs.CV) [pdf, other]
Title: Wavelet-Decoupling Contrastive Enhancement Network for Fine-Grained Skeleton-Based Action Recognition
Haochen Chang, Jing Chen, Yilin Li, Jixiang Chen, Xiaofeng Zhang
Comments: Accepted by ICASSP 2024
Journal-ref: IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2024, Seoul (Korea), South Korea
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[25] arXiv:2402.02369 (cross-list from cs.CV) [pdf, other]
Title: M$^3$Face: A Unified Multi-Modal Multilingual Framework for Human Face Generation and Editing
Mohammadreza Mofayezi, Reza Alipour, Mohammad Ali Kakavand, Ehsaneddin Asgari
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
Total of 86 entries : 1-25 26-50 51-75 76-86
Showing up to 25 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack