Multimedia

Authors and titles for December 2021

Total of 67 entries : 1-25 26-50 51-67

Showing up to 25 entries per page: fewer | more | all

[51] arXiv:2112.10358 (cross-list from eess.AS) [pdf, other]: Title: Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus

Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao

Comments: Accepted by ACM Multimedia 2021

Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD)
[52] arXiv:2112.10936 (cross-list from cs.CV) [pdf, other]: Title: Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion

Shruti Agarwal, Liwen Hu, Evonne Ng, Trevor Darrell, Hao Li, Anna Rohrbach

Comments: Accepted in WACV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Multimedia (cs.MM)
[53] arXiv:2112.10948 (cross-list from cs.CV) [pdf, other]: Title: Task-Oriented Image Transmission for Scene Classification in Unmanned Aerial Systems

Xu Kang, Bin Song, Jie Guo, Zhijin Qin, F. Richard Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[54] arXiv:2112.11294 (cross-list from cs.IR) [pdf, other]: Title: Extending CLIP for Category-to-image Retrieval in E-commerce

Mariya Hendriksen, Maurits Bleeker, Svitlana Vakulenko, Nanne van Noord, Ernst Kuiper, Maarten de Rijke

Comments: 15 pages, accepted as a full paper at ECIR 2022

Subjects: Information Retrieval (cs.IR); Machine Learning (cs.LG); Multimedia (cs.MM)
[55] arXiv:2112.11384 (cross-list from cs.CV) [pdf, other]: Title: Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021

Pierre-Etienne Martin (LaBRI, MPI-EVA, UB), Jordan Calandre (MIA), Boris Mansencal (LaBRI), Jenny Benois-Pineau (LaBRI), Renaud Péteri (MIA), Laurent Mascarilla (MIA), Julien Morlier (IMS)

Comments: MediaEval 2021, Dec 2021, Online, Germany

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[56] arXiv:2112.11547 (cross-list from cs.CV) [pdf, other]: Title: Decompose the Sounds and Pixels, Recompose the Events

Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu

Comments: Accepted at AAAI 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[57] arXiv:2112.11749 (cross-list from cs.CV) [pdf, other]: Title: Class-aware Sounding Objects Localization via Audiovisual Correspondence

Di Hu, Yake Wei, Rui Qian, Weiyao Lin, Ruihua Song, Ji-Rong Wen

Comments: accepted by TPAMI 2021. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[58] arXiv:2112.12073 (cross-list from cs.CV) [pdf, other]: Title: Two Stream Network for Stroke Detection in Table Tennis

Anam Zahra (MPI-EVA), Pierre-Etienne Martin (LaBRI, MPI-EVA, UB)

Comments: MediaEval 2021, Dec 2021, Online, Germany

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[59] arXiv:2112.12074 (cross-list from cs.CV) [pdf, other]: Title: Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark

Pierre-Etienne Martin (LaBRI, MPI-EVA, UB)

Journal-ref: MediaEval 2021, Dec 2021, Online, Germany

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[60] arXiv:2112.12786 (cross-list from cs.CV) [pdf, other]: Title: ELSA: Enhanced Local Self-Attention for Vision Transformer

Jingkai Zhou, Pichao Wang, Fan Wang, Qiong Liu, Hao Li, Rong Jin

Comments: Project at \url{this https URL}

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[61] arXiv:2112.12792 (cross-list from cs.LG) [pdf, other]: Title: Understanding and Measuring Robustness of Multimodal Learning

Nishant Vishwamitra, Hongxin Hu, Ziming Zhao, Long Cheng, Feng Luo

Subjects: Machine Learning (cs.LG); Multimedia (cs.MM)
[62] arXiv:2112.13156 (cross-list from cs.SD) [pdf, other]: Title: Enabling Real-time On-chip Audio Super Resolution for Bone Conduction Microphones

Yuang Li, Yuntao Wang, Xin Liu, Yuanchun Shi, Shao-fu Shih

Subjects: Sound (cs.SD); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[63] arXiv:2112.13384 (cross-list from cs.LG) [pdf, other]: Title: Will You Dance To The Challenge? Predicting User Participation of TikTok Challenges

Lynnette Hui Xian Ng, John Yeh Han Tan, Darryl Jing Heng Tan, Roy Ka-Wei Lee

Comments: Accepted at ASONAM 2021

Subjects: Machine Learning (cs.LG); Multimedia (cs.MM); Social and Information Networks (cs.SI)
[64] arXiv:2112.13416 (cross-list from cs.CR) [pdf, other]: Title: Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings

Tiantian Feng, Hanieh Hashemi, Rajat Hebbar, Murali Annavaram, Shrikanth S. Narayanan

Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Multimedia (cs.MM)
[65] arXiv:2112.14108 (cross-list from cs.CR) [pdf, other]: Title: Fostering the Robustness of White-Box Deep Neural Network Watermarks by Neuron Alignment

Fang-Qi Li, Shi-Lin Wang, Yun Zhu

Comments: 5 pages

Subjects: Cryptography and Security (cs.CR); Machine Learning (cs.LG); Multimedia (cs.MM)
[66] arXiv:2112.15320 (cross-list from cs.LG) [pdf, other]: Title: InverseMV: Composing Piano Scores with a Convolutional Video-Music Transformer

Chin-Tung Lin, Mu Yang

Comments: Rejected by ISMIR 2020

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[67] arXiv:2112.15386 (cross-list from eess.IV) [pdf, other]: Title: Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning

Bin-Cheng Yang, Gangshan Wu

Comments: 21 pages, 9 figures, 5 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)

Total of 67 entries : 1-25 26-50 51-67

Showing up to 25 entries per page: fewer | more | all