Computer Vision and Pattern Recognition

Authors and titles for February 2024

Total of 1842 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 1801-1842

Showing up to 100 entries per page: fewer | more | all

[201] arXiv:2402.02941 [pdf, other]: Title: Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey

Haruna Yunusa, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Abdulganiyu Abdu Yusuf, Isah Bello, Adamu Lawan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[202] arXiv:2402.02946 [pdf, other]: Title: HoughToRadon Transform: New Neural Network Layer for Features Improvement in Projection Space

Alexandra Zhabitskaya, Alexander Sheshkus, Vladimir L. Arlazarov

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[203] arXiv:2402.02956 [pdf, html, other]: Title: AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image

Hamed Amini Amirkolaee, Miaojing Shi, Lianghua He, Mark Mulligan

Comments: Accepted in ISPRS Journal of Photogrammetry and Remote Sensing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[204] arXiv:2402.02968 [pdf, html, other]: Title: Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

Sheng Luo, Wei Chen, Wanxin Tian, Rui Liu, Luanxuan Hou, Xiubao Zhang, Haifeng Shen, Ruiqi Wu, Shuyi Geng, Yi Zhou, Ling Shao, Yi Yang, Bojun Gao, Qun Li, Guobin Wu

Comments: Accepted to IEEE Transactions on Intelligent Vehicles(T-IV). 24 pages, 9 figures, 1 table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[205] arXiv:2402.02972 [pdf, html, other]: Title: Retrieval-Augmented Score Distillation for Text-to-3D Generation

Junyoung Seo, Susung Hong, Wooseok Jang, Inès Hyeonsu Kim, Minseop Kwak, Doyup Lee, Seungryong Kim

Comments: Accepted to ICML 2024 / Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[206] arXiv:2402.02985 [pdf, html, other]: Title: Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing

Zihan Ma, Yongshang Li, Ronggui Ma, Chen Liang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[207] arXiv:2402.03003 [pdf, html, other]: Title: [Citation needed] Data usage and citation practices in medical imaging conferences

Théo Sourget, Ahmet Akkoç, Stinna Winther, Christine Lyngbye Galsgaard, Amelia Jiménez-Sánchez, Dovile Juodelyte, Caroline Petitjean, Veronika Cheplygina

Comments: Accepted at MIDL conference Updated with the revised version after MIDL rebuttal

Subjects: Computer Vision and Pattern Recognition (cs.CV); Digital Libraries (cs.DL)
[208] arXiv:2402.03019 [pdf, html, other]: Title: Taylor Videos for Action Recognition

Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng

Comments: Published at the International Conference on Machine Learning (ICML 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[209] arXiv:2402.03040 [pdf, other]: Title: InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Yiyuan Zhang, Yuhao Kang, Zhixin Zhang, Xiaohan Ding, Sanyuan Zhao, Xiangyu Yue

Comments: Code, models, and demo are available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[210] arXiv:2402.03047 [pdf, other]: Title: PFDM: Parser-Free Virtual Try-on via Diffusion Model

Yunfang Niu, Dong Yi, Lingxiang Wu, Zhiwei Liu, Pengxiang Cai, Jinqiao Wang

Comments: Accepted by IEEE ICASSP 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[211] arXiv:2402.03082 [pdf, other]: Title: Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing

Yan Shu, Weichao Zeng, Zhenhang Li, Fangmin Zhao, Yu Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[212] arXiv:2402.03093 [pdf, other]: Title: AI-Enhanced Virtual Reality in Medicine: A Comprehensive Survey

Yixuan Wu, Kaiyuan Hu, Danny Z. Chen, Jian Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[213] arXiv:2402.03094 [pdf, other]: Title: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector

Yuqian Fu, Yu Wang, Yixuan Pan, Lian Huai, Xingyu Qiu, Zeyu Shangguan, Tong Liu, Yanwei Fu, Luc Van Gool, Xingqun Jiang

Comments: Accepted by ECCV2024 (project website: this http URL)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[214] arXiv:2402.03095 [pdf, html, other]: Title: Transcending Adversarial Perturbations: Manifold-Aided Adversarial Examples with Legitimate Semantics

Shuai Li, Xiaoyu Jiang, Xiaoguang Ma

Comments: 12 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[215] arXiv:2402.03119 [pdf, html, other]: Title: Good Teachers Explain: Explanation-Enhanced Knowledge Distillation

Amin Parchami-Araghi, Moritz Böhle, Sukrut Rao, Bernt Schiele

Comments: 32 pages, 11 figures, European Conference on Computer Vision (ECCV) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[216] arXiv:2402.03161 [pdf, html, other]: Title: Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization

Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[217] arXiv:2402.03162 [pdf, html, other]: Title: Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion

Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[218] arXiv:2402.03188 [pdf, other]: Title: Towards mitigating uncann(eye)ness in face swaps via gaze-centric loss terms

Ethan Wilson, Frederick Shic, Sophie Jörg, Eakta Jain

Comments: Accepted to Computers and Graphics Special Issue: Eye Gaze Visualization, Interaction, Synthesis, and Analysis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219] arXiv:2402.03214 [pdf, html, other]: Title: Organic or Diffused: Can We Distinguish Human Art from AI-generated Images?

Anna Yoo Jeong Ha, Josephine Passananti, Ronik Bhaskar, Shawn Shan, Reid Southen, Haitao Zheng, Ben Y. Zhao

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[220] arXiv:2402.03227 [pdf, html, other]: Title: IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images

Vincent Roca, Grégory Kuchcinski, Jean-Pierre Pruvo, Dorian Manouvriez, Renaud Lopes

Comments: 29 pages, 14 figures

Journal-ref: Medical Image Analysis 99 (2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[221] arXiv:2402.03235 [pdf, html, other]: Title: ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object Detection

Ahmed Ghita, Bjørk Antoniussen, Walter Zimmer, Ross Greer, Christian Creß, Andreas Møgelmose, Mohan M. Trivedi, Alois C. Knoll

Comments: 2024 Proceedings of the IEEE Intelligent Vehicles Symposium 2024 (IV'24)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[222] arXiv:2402.03241 [pdf, other]: Title: FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition

Xiaohu Huang, Hao Zhou, Kun Yao, Kai Han

Comments: Accepted by ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[223] arXiv:2402.03246 [pdf, html, other]: Title: SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM

Mingrui Li, Shuhong Liu, Heng Zhou, Guohao Zhu, Na Cheng, Tianchen Deng, Hongyu Wang

Journal-ref: European Conference on Computer Vision (ECCV) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[224] arXiv:2402.03251 [pdf, other]: Title: CLIP Can Understand Depth

Dunam Kim, Seokju Lee

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[225] arXiv:2402.03286 [pdf, other]: Title: Training-Free Consistent Text-to-Image Generation

Yoad Tewel, Omri Kaduri, Rinon Gal, Yoni Kasten, Lior Wolf, Gal Chechik, Yuval Atzmon

Comments: Accepted to journal track of SIGGRAPH 2024 (TOG). Project page is at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[226] arXiv:2402.03290 [pdf, other]: Title: InstanceDiffusion: Instance-level Control for Image Generation

Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra

Comments: Preprint; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[227] arXiv:2402.03307 [pdf, html, other]: Title: 4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic Scenes

Yuanxing Duan, Fangyin Wei, Qiyu Dai, Yuhang He, Wenzheng Chen, Baoquan Chen

Comments: Proc. SIGGRAPH, 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228] arXiv:2402.03309 [pdf, html, other]: Title: AONeuS: A Neural Rendering Framework for Acoustic-Optical Sensor Fusion

Mohamad Qadri, Kevin Zhang, Akshay Hinduja, Michael Kaess, Adithya Pediredla, Christopher A. Metzler

Comments: SIGGRAPH 2024 (conference track full paper). First two authors contributed equally. Paper website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[229] arXiv:2402.03311 [pdf, other]: Title: HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang

Comments: NeurIPS 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[230] arXiv:2402.03312 [pdf, html, other]: Title: Test-Time Adaptation for Depth Completion

Hyoungseob Park, Anjali Gupta, Alex Wong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[231] arXiv:2402.03315 [pdf, html, other]: Title: RTHDet: Rotate Table Area and Head Detection in images

Wenxing Hu, Minglei Tong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232] arXiv:2402.03317 [pdf, html, other]: Title: SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization

Xixu Hu, Runkai Zheng, Jindong Wang, Cheuk Hang Leung, Qi Wu, Xing Xie

Comments: Accepted by ECCV 2024; 27 pages; code is at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[233] arXiv:2402.03325 [pdf, html, other]: Title: Connect Later: Improving Fine-tuning for Robustness with Targeted Augmentations

Helen Qu, Sang Michael Xie

Comments: ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[234] arXiv:2402.03326 [pdf, html, other]: Title: Slot Structured World Models

Jonathan Collu, Riccardo Majellaro, Aske Plaat, Thomas M. Moerland

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[235] arXiv:2402.03327 [pdf, html, other]: Title: Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models

Dingning Liu, Xiaoshui Huang, Yuenan Hou, Zhihui Wang, Zhenfei Yin, Yongshun Gong, Peng Gao, Wanli Ouyang

Comments: 10 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[236] arXiv:2402.03328 [pdf, html, other]: Title: Visual Enumeration Remains Challenging for Multimodal Generative AI

Alberto Testolin, Kuinan Hou, Marco Zorzi

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[237] arXiv:2402.03329 [pdf, html, other]: Title: Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning

Zhaohui Jiang, Paul Weng

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[238] arXiv:2402.03347 [pdf, other]: Title: Transfer Learning With Densenet201 Architecture Model For Potato Leaf Disease Classification

Rifqi Alfinnur Charisma, Faisal Dharma Adhinata

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[239] arXiv:2402.03348 [pdf, html, other]: Title: Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition

Sangyu Han, Yearim Kim, Nojun Kwak

Comments: To be published in ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[240] arXiv:2402.03384 [pdf, other]: Title: Survival and grade of the glioma prediction using transfer learning

Santiago Valbuena Rubio, María Teresa García-Ordás, Oscar García-Olalla Olivera, Héctor Alaiz-Moretón, Maria-Inmaculada González-Alonso, José Alberto Benítez-Andrades

Journal-ref: PeerJ Computer Science, Volume 9, December 2023, ID e1723

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[241] arXiv:2402.03417 [pdf, html, other]: Title: A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model

Murad Hasan, Shahriar Iqbal, Md. Billal Hossain Faisal, Md. Musnad Hossin Neloy, Md. Tonmoy Kabir, Md. Tanzim Reza, Md. Golam Rabiul Alam, Md Zia Uddin

Comments: Under review for publication in the PLOS ONE journal, 17 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242] arXiv:2402.03445 [pdf, html, other]: Title: Denoising Diffusion via Image-Based Rendering

Titas Anciukevičius, Fabian Manhardt, Federico Tombari, Paul Henderson

Comments: Accepted at ICLR 2024. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[243] arXiv:2402.03456 [pdf, html, other]: Title: Constrained Multiview Representation for Self-supervised Contrastive Learning

Siyuan Dai, Kai Ye, Kun Zhao, Ge Cui, Haoteng Tang, Liang Zhan

Comments: 11 pages, 9 figures, 2 algorithms

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244] arXiv:2402.03466 [pdf, html, other]: Title: Physics-Encoded Graph Neural Networks for Deformation Prediction under Contact

Mahdi Saleh, Michael Sommersperger, Nassir Navab, Federico Tombari

Comments: Accepted at 2024 IEEE International Conference on Robotics and Automation (ICRA2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Robotics (cs.RO)
[245] arXiv:2402.03501 [pdf, html, other]: Title: An Inpainting-Infused Pipeline for Attire and Background Replacement

Felipe Rodrigues Perche-Mahlow, André Felipe-Zanella, William Alberto Cruz-Castañeda, Marcellus Amadeus

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[246] arXiv:2402.03526 [pdf, html, other]: Title: nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model

Haifan Gong, Luoyao Kang, Yitao Wang, Xiang Wan, Haofeng Li

Comments: Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2402.03549 [pdf, other]: Title: AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising

Maham Tanveer, Yizhi Wang, Ruiqi Wang, Nanxuan Zhao, Ali Mahdavi-Amiri, Hao Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[248] arXiv:2402.03553 [pdf, other]: Title: One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space

Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos

Comments: Preprint version, accepted for publication in International Journal of Computer Vision (IJCV)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249] arXiv:2402.03557 [pdf, html, other]: Title: Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement

Dayou Mao, Yuhao Chen, Yifan Wu, Maximilian Gilles, Alexander Wong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250] arXiv:2402.03561 [pdf, other]: Title: VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation

Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme, Mohit Bansal

Comments: AAAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[251] arXiv:2402.03585 [pdf, html, other]: Title: Decoder-Only Image Registration

Xi Jia, Wenqi Lu, Xinxing Cheng, Jinming Duan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[252] arXiv:2402.03592 [pdf, html, other]: Title: GRASP: GRAph-Structured Pyramidal Whole Slide Image Representation

Ali Khajegili Mirabadi, Graham Archibald, Amirali Darbandsari, Alberto Contreras-Sanz, Ramin Ebrahim Nakhli, Maryam Asadi, Allen Zhang, C. Blake Gilks, Peter Black, Gang Wang, Hossein Farahani, Ali Bashashati

Comments: Accepted in Learning Meaningful Representations of Life (LMRL) Workshop at ICLR 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2402.03631 [pdf, html, other]: Title: CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model

Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao, Shijian Lu

Comments: ECCV 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254] arXiv:2402.03634 [pdf, html, other]: Title: Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection

Feng Liu, Tengteng Huang, Qianjing Zhang, Haotian Yao, Chi Zhang, Fang Wan, Qixiang Ye, Yanzhao Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255] arXiv:2402.03654 [pdf, other]: Title: Reviewing FID and SID Metrics on Generative Adversarial Networks

Ricardo de Deijn, Aishwarya Batra, Brandon Koch, Naseef Mansoor, Hema Makkena

Comments: 14 pages 9 figures 1 table Included in IOTBS, NLTM, AIMLA, DBDM - 2024 Conference Proceedings Editor: David C. Wyld et al

Journal-ref: CS & IT - CSCP (2024) 111-124

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[256] arXiv:2402.03666 [pdf, html, other]: Title: QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, Junchi Yan, Yan Yan

Comments: ICCV 2025. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2402.03690 [pdf, html, other]: Title: 3Doodle: Compact Abstraction of Objects with 3D Strokes

Changwoon Choi, Jaeah Lee, Jaesik Park, Young Min Kim

Comments: SIGGRAPH 2024 (Transactions on Graphics)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2402.03697 [pdf, html, other]: Title: SHMC-Net: A Mask-guided Feature Fusion Network for Sperm Head Morphology Classification

Nishchal Sapkota, Yejia Zhang, Sirui Li, Peixian Liang, Zhuo Zhao, Jingjing Zhang, Xiaomin Zha, Yiru Zhou, Yunxia Cao, Danny Z Chen

Comments: Published on ISBI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2402.03705 [pdf, html, other]: Title: FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution

Qi Zhou, Dongxia Wang, Tianlin Li, Zhihong Xu, Yang Liu, Kui Ren, Wenhai Wang, Qing Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[260] arXiv:2402.03708 [pdf, other]: Title: SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite Images

Pengming Feng, Mingjie Xie, Hongning Liu, Xuanjia Zhao, Guangjun He, Xueliang Zhang, Jian Guan

Comments: 14 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[261] arXiv:2402.03716 [pdf, other]: Title: Attention-based Shape and Gait Representations Learning for Video-based Cloth-Changing Person Re-Identification

Vuong D. Nguyen, Samiha Mirza, Pranav Mantini, Shishir K. Shah

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[262] arXiv:2402.03723 [pdf, other]: Title: Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos

Alfredo Rivero, ShahRukh Athar, Zhixin Shu, Dimitris Samaras

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263] arXiv:2402.03738 [pdf, other]: Title: AoSRNet: All-in-One Scene Recovery Networks via Multi-knowledge Integration

Yuxu Lu, Dong Yang, Yuan Gao, Ryan Wen Liu, Jun Liu, Yu Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2402.03746 [pdf, html, other]: Title: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Daechul Ahn, Yura Choi, Youngjae Yu, Dongyeop Kang, Jonghyun Choi

Comments: ACL 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265] arXiv:2402.03749 [pdf, other]: Title: Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Jianyuan Guo, Hanting Chen, Chengcheng Wang, Kai Han, Chang Xu, Yunhe Wang

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[266] arXiv:2402.03752 [pdf, other]: Title: Pre-training of Lightweight Vision Transformers on Small Datasets with Minimally Scaled Images

Jen Hong Tan

Comments: 7 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[267] arXiv:2402.03754 [pdf, other]: Title: Intensive Vision-guided Network for Radiology Report Generation

Fudan Zheng, Mengfei Li, Ying Wang, Weijiang Yu, Ruixuan Wang, Zhiguang Chen, Nong Xiao, Yutong Lu

Comments: Accepted by Physics in Medicine & Biology

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2402.03757 [pdf, html, other]: Title: The Instinctive Bias: Spurious Images lead to Illusion in MLLMs

Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[269] arXiv:2402.03758 [pdf, other]: Title: Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting

Mingyue Guo, Binghui Chen, Zhaoyi Yan, Yaowei Wang, Qixiang Ye

Comments: Multidomain learning; Domain-guided virtual classifier; Instance-specific batch normalization

Journal-ref: IEEE Transactions on Neural Networks and Learning Systems,2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2402.03762 [pdf, html, other]: Title: MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction

Heng Zhou, Zhetao Guo, Shuhong Liu, Lechen Zhang, Qihao Wang, Yuxiang Ren, Mingrui Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[271] arXiv:2402.03766 [pdf, other]: Title: MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[272] arXiv:2402.03769 [pdf, html, other]: Title: AttackNet: Enhancing Biometric Security via Tailored Convolutional Neural Network Architectures for Liveness Detection

Oleksandr Kuznetsov, Dmytro Zakharov, Emanuele Frontoni, Andrea Maranesi

Journal-ref: Computers & Security (2024), 103828

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[273] arXiv:2402.03783 [pdf, other]: Title: Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning

Fudan Zheng, Jindong Cao, Weijiang Yu, Zhiguang Chen, Nong Xiao, Yutong Lu

Comments: Accepted by Pattern Recognition

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[274] arXiv:2402.03795 [pdf, other]: Title: Energy-based Domain-Adaptive Segmentation with Depth Guidance

Jinjing Zhu, Zhedong Hu, Tae-Kyun Kim, Lin Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275] arXiv:2402.03796 [pdf, other]: Title: Face Detection: Present State and Research Directions

Purnendu Prabhat, Himanshu Gupta, Ajeet Kumar Vishwakarma

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[276] arXiv:2402.03830 [pdf, other]: Title: OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

Guohang Yan, Jiahao Pi, Jianfei Guo, Zhaotong Luo, Min Dou, Nianchen Deng, Qiusheng Huang, Daocheng Fu, Licheng Wen, Pinlong Cai, Xing Gao, Xinyu Cai, Bo Zhang, Xuemeng Yang, Yeqi Bai, Hongbin Zhou, Botian Shi

Comments: 10 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[277] arXiv:2402.03833 [pdf, html, other]: Title: A Lightweight Randomized Nonlinear Dictionary Learning Method using Random Vector Functional Link

G.Madhuri, Atul Negi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2402.03843 [pdf, html, other]: Title: A new method for optical steel rope non-destructive damage detection

Yunqing Bao, Bin Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[279] arXiv:2402.03896 [pdf, html, other]: Title: Multimodal Rationales for Explainable Visual Question Answering

Kun Li, George Vosselman, Michael Ying Yang

Comments: Accepted to CVPR workshops 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280] arXiv:2402.03904 [pdf, html, other]: Title: Deep Frequency-Aware Functional Maps for Robust Shape Matching

Feifan Luo, Qinsong Li, Ling Hu, Haibo Wang, Xinru Liu, Shengjun Liu, Hongyang Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2402.03908 [pdf, other]: Title: EscherNet: A Generative Model for Scalable View Synthesis

Xin Kong, Shikun Liu, Xiaoyang Lyu, Marwan Taher, Xiaojuan Qi, Andrew J. Davison

Comments: CVPR2024 Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[282] arXiv:2402.03917 [pdf, html, other]: Title: Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning

Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov

Comments: Accepted at Twelfth International Conference on Learning Representations (ICLR 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[283] arXiv:2402.03944 [pdf, html, other]: Title: Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units

Youjia Wang, Yiwen Wu, Hengan Zhou, Hongyang Lin, Xingyue Peng, Jingyan Zhang, Yingsheng Zhu, Yingwenqi Jiang, Yatu Zhang, Lan Xu, Jingya Wang, Jingyi Yu

Comments: Go to CAPUS project page this https URL and watch our video this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2402.03951 [pdf, other]: Title: Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping

Qinliang Lin, Cheng Luo, Zenghao Niu, Xilin He, Weicheng Xie, Yuanbo Hou, Linlin Shen, Siyang Song

Comments: AAAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[285] arXiv:2402.03973 [pdf, html, other]: Title: A comparison between humans and AI at recognizing objects in unusual poses

Netta Ollikka, Amro Abbas, Andrea Perin, Markku Kilpeläinen, Stéphane Deny

Comments: version accepted at TMLR

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[286] arXiv:2402.03981 [pdf, other]: Title: Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting

Yiming Xu, Hao Cheng, Monika Sester

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2402.03989 [pdf, other]: Title: YOLOPoint Joint Keypoint and Object Detection

Anton Backhaus, Thorsten Luettel, Hans-Joachim Wuensche

Comments: 12 pages, 5 figures

Journal-ref: Proceedings of Advanced Concepts for Intelligent Vision Systems, 14124, 112-123 (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2402.04009 [pdf, other]: Title: Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning

Ningyuan Tang, Minghao Fu, Ke Zhu, Jianxin Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[289] arXiv:2402.04013 [pdf, html, other]: Title: Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses

Hao Fang, Yixiang Qiu, Hongyao Yu, Wenbo Yu, Jiawei Kong, Baoli Chong, Bin Chen, Xuan Wang, Shu-Tao Xia, Ke Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290] arXiv:2402.04031 [pdf, other]: Title: Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation

Zolnamar Dorjsembe, Hsing-Kuo Pao, Furen Xiao

Comments: This preprint has been accepted for publication in the proceedings of the IEEE Engineering in Medicine and Biology Society (EMBC 2024). The final published version is available at this https URL. The copyright for this work has been transferred to IEEE

Journal-ref: Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[291] arXiv:2402.04064 [pdf, html, other]: Title: Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing

Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo

Comments: Accepted to the ICRA 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[292] arXiv:2402.04087 [pdf, other]: Title: A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation

Zhengbo Wang, Jian Liang, Lijun Sheng, Ran He, Zilei Wang, Tieniu Tan

Comments: Accepted by ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[293] arXiv:2402.04097 [pdf, html, other]: Title: Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction

Shijun Liang, Evan Bell, Qing Qu, Rongrong Wang, Saiprasad Ravishankar

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[294] arXiv:2402.04101 [pdf, html, other]: Title: VRMM: A Volumetric Relightable Morphable Head Model

Haotian Yang, Mingwu Zheng, Chongyang Ma, Yu-Kun Lai, Pengfei Wan, Haibin Huang

Comments: Accepted to SIGGRAPH 2024 (Conference); Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[295] arXiv:2402.04139 [pdf, html, other]: Title: U-shaped Vision Mamba for Single Image Dehazing

Zhuoran Zheng, Chen Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296] arXiv:2402.04178 [pdf, html, other]: Title: SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models

Yichen Shi, Yuhao Gao, Yingxin Lai, Hongyang Wang, Jun Feng, Lei He, Jun Wan, Changsheng Chen, Zitong Yu, Xiaochun Cao

Comments: Accepted by Visual Intelligence

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2402.04195 [pdf, other]: Title: Instance by Instance: An Iterative Framework for Multi-instance 3D Registration

Xinyue Cao, Xiyu Zhang, Yuxin Cheng, Zhaoshuai Qi, Yanning Zhang, Jiaqi Yang

Comments: 14 pages, 12 figures, 10 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298] arXiv:2402.04236 [pdf, html, other]: Title: CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning

Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, Jie Tang

Comments: 21 pages, 10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[299] arXiv:2402.04252 [pdf, other]: Title: EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Quan Sun, Jinsheng Wang, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Xinlong Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[300] arXiv:2402.04273 [pdf, html, other]: Title: Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources

Jinlong Li, Baolu Li, Xinyu Liu, Runsheng Xu, Jiaqi Ma, Hongkai Yu

Comments: Accepted by the 2024 IEEE International Conference on Robotics and Automation (ICRA)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Total of 1842 entries : 1-100 101-200 201-300 301-400 401-500 501-600 ... 1801-1842

Showing up to 100 entries per page: fewer | more | all