Computer Vision and Pattern Recognition

Authors and titles for February 2024

Total of 1842 entries; showing entries 201-300.
[201] arXiv:2402.02941 [pdf, other]
Title: Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey
Haruna Yunusa, Shiyin Qin, Abdulrahman Hamman Adama Chukkol, Abdulganiyu Abdu Yusuf, Isah Bello, Adamu Lawan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[202] arXiv:2402.02946 [pdf, other]
Title: HoughToRadon Transform: New Neural Network Layer for Features Improvement in Projection Space
Alexandra Zhabitskaya, Alexander Sheshkus, Vladimir L. Arlazarov
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[203] arXiv:2402.02956 [pdf, html, other]
Title: AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image
Hamed Amini Amirkolaee, Miaojing Shi, Lianghua He, Mark Mulligan
Comments: Accepted in ISPRS Journal of Photogrammetry and Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[204] arXiv:2402.02968 [pdf, html, other]
Title: Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives
Sheng Luo, Wei Chen, Wanxin Tian, Rui Liu, Luanxuan Hou, Xiubao Zhang, Haifeng Shen, Ruiqi Wu, Shuyi Geng, Yi Zhou, Ling Shao, Yi Yang, Bojun Gao, Qun Li, Guobin Wu
Comments: Accepted to IEEE Transactions on Intelligent Vehicles (T-IV). 24 pages, 9 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[205] arXiv:2402.02972 [pdf, html, other]
Title: Retrieval-Augmented Score Distillation for Text-to-3D Generation
Junyoung Seo, Susung Hong, Wooseok Jang, Inès Hyeonsu Kim, Minseop Kwak, Doyup Lee, Seungryong Kim
Comments: Accepted to ICML 2024 / Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[206] arXiv:2402.02985 [pdf, html, other]
Title: Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing
Zihan Ma, Yongshang Li, Ronggui Ma, Chen Liang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[207] arXiv:2402.03003 [pdf, html, other]
Title: [Citation needed] Data usage and citation practices in medical imaging conferences
Théo Sourget, Ahmet Akkoç, Stinna Winther, Christine Lyngbye Galsgaard, Amelia Jiménez-Sánchez, Dovile Juodelyte, Caroline Petitjean, Veronika Cheplygina
Comments: Accepted at MIDL conference. Updated with the revised version after MIDL rebuttal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Digital Libraries (cs.DL)
[208] arXiv:2402.03019 [pdf, html, other]
Title: Taylor Videos for Action Recognition
Lei Wang, Xiuyuan Yuan, Tom Gedeon, Liang Zheng
Comments: Published at the International Conference on Machine Learning (ICML 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[209] arXiv:2402.03040 [pdf, other]
Title: InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Yiyuan Zhang, Yuhao Kang, Zhixin Zhang, Xiaohan Ding, Sanyuan Zhao, Xiangyu Yue
Comments: Code, models, and demo are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[210] arXiv:2402.03047 [pdf, other]
Title: PFDM: Parser-Free Virtual Try-on via Diffusion Model
Yunfang Niu, Dong Yi, Lingxiang Wu, Zhiwei Liu, Pengxiang Cai, Jinqiao Wang
Comments: Accepted by IEEE ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[211] arXiv:2402.03082 [pdf, other]
Title: Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing
Yan Shu, Weichao Zeng, Zhenhang Li, Fangmin Zhao, Yu Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[212] arXiv:2402.03093 [pdf, other]
Title: AI-Enhanced Virtual Reality in Medicine: A Comprehensive Survey
Yixuan Wu, Kaiyuan Hu, Danny Z. Chen, Jian Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[213] arXiv:2402.03094 [pdf, other]
Title: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector
Yuqian Fu, Yu Wang, Yixuan Pan, Lian Huai, Xingyu Qiu, Zeyu Shangguan, Tong Liu, Yanwei Fu, Luc Van Gool, Xingqun Jiang
Comments: Accepted by ECCV2024 (project website: this http URL)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[214] arXiv:2402.03095 [pdf, html, other]
Title: Transcending Adversarial Perturbations: Manifold-Aided Adversarial Examples with Legitimate Semantics
Shuai Li, Xiaoyu Jiang, Xiaoguang Ma
Comments: 12 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[215] arXiv:2402.03119 [pdf, html, other]
Title: Good Teachers Explain: Explanation-Enhanced Knowledge Distillation
Amin Parchami-Araghi, Moritz Böhle, Sukrut Rao, Bernt Schiele
Comments: 32 pages, 11 figures, European Conference on Computer Vision (ECCV) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[216] arXiv:2402.03161 [pdf, html, other]
Title: Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization
Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[217] arXiv:2402.03162 [pdf, html, other]
Title: Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[218] arXiv:2402.03188 [pdf, other]
Title: Towards mitigating uncann(eye)ness in face swaps via gaze-centric loss terms
Ethan Wilson, Frederick Shic, Sophie Jörg, Eakta Jain
Comments: Accepted to Computers and Graphics Special Issue: Eye Gaze Visualization, Interaction, Synthesis, and Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[219] arXiv:2402.03214 [pdf, html, other]
Title: Organic or Diffused: Can We Distinguish Human Art from AI-generated Images?
Anna Yoo Jeong Ha, Josephine Passananti, Ronik Bhaskar, Shawn Shan, Reid Southen, Haitao Zheng, Ben Y. Zhao
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[220] arXiv:2402.03227 [pdf, html, other]
Title: IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images
Vincent Roca, Grégory Kuchcinski, Jean-Pierre Pruvo, Dorian Manouvriez, Renaud Lopes
Comments: 29 pages, 14 figures
Journal-ref: Medical Image Analysis 99 (2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[221] arXiv:2402.03235 [pdf, html, other]
Title: ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object Detection
Ahmed Ghita, Bjørk Antoniussen, Walter Zimmer, Ross Greer, Christian Creß, Andreas Møgelmose, Mohan M. Trivedi, Alois C. Knoll
Comments: 2024 Proceedings of the IEEE Intelligent Vehicles Symposium 2024 (IV'24)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[222] arXiv:2402.03241 [pdf, other]
Title: FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition
Xiaohu Huang, Hao Zhou, Kun Yao, Kai Han
Comments: Accepted by ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[223] arXiv:2402.03246 [pdf, html, other]
Title: SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM
Mingrui Li, Shuhong Liu, Heng Zhou, Guohao Zhu, Na Cheng, Tianchen Deng, Hongyu Wang
Journal-ref: European Conference on Computer Vision (ECCV) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[224] arXiv:2402.03251 [pdf, other]
Title: CLIP Can Understand Depth
Dunam Kim, Seokju Lee
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[225] arXiv:2402.03286 [pdf, other]
Title: Training-Free Consistent Text-to-Image Generation
Yoad Tewel, Omri Kaduri, Rinon Gal, Yoni Kasten, Lior Wolf, Gal Chechik, Yuval Atzmon
Comments: Accepted to journal track of SIGGRAPH 2024 (TOG). Project page is at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[226] arXiv:2402.03290 [pdf, other]
Title: InstanceDiffusion: Instance-level Control for Image Generation
Xudong Wang, Trevor Darrell, Sai Saketh Rambhatla, Rohit Girdhar, Ishan Misra
Comments: Preprint; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[227] arXiv:2402.03307 [pdf, html, other]
Title: 4D-Rotor Gaussian Splatting: Towards Efficient Novel View Synthesis for Dynamic Scenes
Yuanxing Duan, Fangyin Wei, Qiyu Dai, Yuhang He, Wenzheng Chen, Baoquan Chen
Comments: Proc. SIGGRAPH, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[228] arXiv:2402.03309 [pdf, html, other]
Title: AONeuS: A Neural Rendering Framework for Acoustic-Optical Sensor Fusion
Mohamad Qadri, Kevin Zhang, Akshay Hinduja, Michael Kaess, Adithya Pediredla, Christopher A. Metzler
Comments: SIGGRAPH 2024 (conference track full paper). First two authors contributed equally. Paper website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[229] arXiv:2402.03311 [pdf, other]
Title: HASSOD: Hierarchical Adaptive Self-Supervised Object Detection
Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang
Comments: NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[230] arXiv:2402.03312 [pdf, html, other]
Title: Test-Time Adaptation for Depth Completion
Hyoungseob Park, Anjali Gupta, Alex Wong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[231] arXiv:2402.03315 [pdf, html, other]
Title: RTHDet: Rotate Table Area and Head Detection in images
Wenxing Hu, Minglei Tong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[232] arXiv:2402.03317 [pdf, html, other]
Title: SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value Penalization
Xixu Hu, Runkai Zheng, Jindong Wang, Cheuk Hang Leung, Qi Wu, Xing Xie
Comments: Accepted by ECCV 2024; 27 pages; code is at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[233] arXiv:2402.03325 [pdf, html, other]
Title: Connect Later: Improving Fine-tuning for Robustness with Targeted Augmentations
Helen Qu, Sang Michael Xie
Comments: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[234] arXiv:2402.03326 [pdf, html, other]
Title: Slot Structured World Models
Jonathan Collu, Riccardo Majellaro, Aske Plaat, Thomas M. Moerland
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[235] arXiv:2402.03327 [pdf, html, other]
Title: Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models
Dingning Liu, Xiaoshui Huang, Yuenan Hou, Zhihui Wang, Zhenfei Yin, Yongshun Gong, Peng Gao, Wanli Ouyang
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[236] arXiv:2402.03328 [pdf, html, other]
Title: Visual Enumeration Remains Challenging for Multimodal Generative AI
Alberto Testolin, Kuinan Hou, Marco Zorzi
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
[237] arXiv:2402.03329 [pdf, html, other]
Title: Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning
Zhaohui Jiang, Paul Weng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[238] arXiv:2402.03347 [pdf, other]
Title: Transfer Learning With Densenet201 Architecture Model For Potato Leaf Disease Classification
Rifqi Alfinnur Charisma, Faisal Dharma Adhinata
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[239] arXiv:2402.03348 [pdf, html, other]
Title: Respect the model: Fine-grained and Robust Explanation with Sharing Ratio Decomposition
Sangyu Han, Yearim Kim, Nojun Kwak
Comments: To be published in ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[240] arXiv:2402.03384 [pdf, other]
Title: Survival and grade of the glioma prediction using transfer learning
Santiago Valbuena Rubio, María Teresa García-Ordás, Oscar García-Olalla Olivera, Héctor Alaiz-Moretón, Maria-Inmaculada González-Alonso, José Alberto Benítez-Andrades
Journal-ref: PeerJ Computer Science, Volume 9, December 2023, ID e1723
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[241] arXiv:2402.03417 [pdf, html, other]
Title: A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model
Murad Hasan, Shahriar Iqbal, Md. Billal Hossain Faisal, Md. Musnad Hossin Neloy, Md. Tonmoy Kabir, Md. Tanzim Reza, Md. Golam Rabiul Alam, Md Zia Uddin
Comments: Under review for publication in the PLOS ONE journal, 17 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[242] arXiv:2402.03445 [pdf, html, other]
Title: Denoising Diffusion via Image-Based Rendering
Titas Anciukevičius, Fabian Manhardt, Federico Tombari, Paul Henderson
Comments: Accepted at ICLR 2024. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[243] arXiv:2402.03456 [pdf, html, other]
Title: Constrained Multiview Representation for Self-supervised Contrastive Learning
Siyuan Dai, Kai Ye, Kun Zhao, Ge Cui, Haoteng Tang, Liang Zhan
Comments: 11 pages, 9 figures, 2 algorithms
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[244] arXiv:2402.03466 [pdf, html, other]
Title: Physics-Encoded Graph Neural Networks for Deformation Prediction under Contact
Mahdi Saleh, Michael Sommersperger, Nassir Navab, Federico Tombari
Comments: Accepted at 2024 IEEE International Conference on Robotics and Automation (ICRA2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Geometry (cs.CG); Robotics (cs.RO)
[245] arXiv:2402.03501 [pdf, html, other]
Title: An Inpainting-Infused Pipeline for Attire and Background Replacement
Felipe Rodrigues Perche-Mahlow, André Felipe-Zanella, William Alberto Cruz-Castañeda, Marcellus Amadeus
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[246] arXiv:2402.03526 [pdf, html, other]
Title: nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model
Haifan Gong, Luoyao Kang, Yitao Wang, Xiang Wan, Haofeng Li
Comments: Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[247] arXiv:2402.03549 [pdf, other]
Title: AnaMoDiff: 2D Analogical Motion Diffusion via Disentangled Denoising
Maham Tanveer, Yizhi Wang, Ruiqi Wang, Nanxuan Zhao, Ali Mahdavi-Amiri, Hao Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[248] arXiv:2402.03553 [pdf, other]
Title: One-shot Neural Face Reenactment via Finding Directions in GAN's Latent Space
Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos
Comments: Preprint version, accepted for publication in International Journal of Computer Vision (IJCV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[249] arXiv:2402.03557 [pdf, html, other]
Title: Robust Analysis of Multi-Task Learning Efficiency: New Benchmarks on Light-Weighed Backbones and Effective Measurement of Multi-Task Learning Challenges by Feature Disentanglement
Dayou Mao, Yuhao Chen, Yifan Wu, Maximilian Gilles, Alexander Wong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[250] arXiv:2402.03561 [pdf, other]
Title: VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
Jialu Li, Aishwarya Padmakumar, Gaurav Sukhatme, Mohit Bansal
Comments: AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[251] arXiv:2402.03585 [pdf, html, other]
Title: Decoder-Only Image Registration
Xi Jia, Wenqi Lu, Xinxing Cheng, Jinming Duan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[252] arXiv:2402.03592 [pdf, html, other]
Title: GRASP: GRAph-Structured Pyramidal Whole Slide Image Representation
Ali Khajegili Mirabadi, Graham Archibald, Amirali Darbandsari, Alberto Contreras-Sanz, Ramin Ebrahim Nakhli, Maryam Asadi, Allen Zhang, C. Blake Gilks, Peter Black, Gang Wang, Hossein Farahani, Ali Bashashati
Comments: Accepted in Learning Meaningful Representations of Life (LMRL) Workshop at ICLR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2402.03631 [pdf, html, other]
Title: CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model
Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao, Shijian Lu
Comments: ECCV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[254] arXiv:2402.03634 [pdf, html, other]
Title: Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection
Feng Liu, Tengteng Huang, Qianjing Zhang, Haotian Yao, Chi Zhang, Fang Wan, Qixiang Ye, Yanzhao Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[255] arXiv:2402.03654 [pdf, other]
Title: Reviewing FID and SID Metrics on Generative Adversarial Networks
Ricardo de Deijn, Aishwarya Batra, Brandon Koch, Naseef Mansoor, Hema Makkena
Comments: 14 pages, 9 figures, 1 table. Included in IOTBS, NLTM, AIMLA, DBDM - 2024 Conference Proceedings. Editor: David C. Wyld et al.
Journal-ref: CS & IT - CSCP (2024) 111-124
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[256] arXiv:2402.03666 [pdf, html, other]
Title: QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, Junchi Yan, Yan Yan
Comments: ICCV 2025. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2402.03690 [pdf, html, other]
Title: 3Doodle: Compact Abstraction of Objects with 3D Strokes
Changwoon Choi, Jaeah Lee, Jaesik Park, Young Min Kim
Comments: SIGGRAPH 2024 (Transactions on Graphics)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2402.03697 [pdf, html, other]
Title: SHMC-Net: A Mask-guided Feature Fusion Network for Sperm Head Morphology Classification
Nishchal Sapkota, Yejia Zhang, Sirui Li, Peixian Liang, Zhuo Zhao, Jingjing Zhang, Xiaomin Zha, Yiru Zhou, Yunxia Cao, Danny Z Chen
Comments: Published at ISBI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2402.03705 [pdf, html, other]
Title: FoolSDEdit: Deceptively Steering Your Edits Towards Targeted Attribute-aware Distribution
Qi Zhou, Dongxia Wang, Tianlin Li, Zhihong Xu, Yang Liu, Kui Ren, Wenhai Wang, Qing Guo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[260] arXiv:2402.03708 [pdf, other]
Title: SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite Images
Pengming Feng, Mingjie Xie, Hongning Liu, Xuanjia Zhao, Guangjun He, Xueliang Zhang, Jian Guan
Comments: 14 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[261] arXiv:2402.03716 [pdf, other]
Title: Attention-based Shape and Gait Representations Learning for Video-based Cloth-Changing Person Re-Identification
Vuong D. Nguyen, Samiha Mirza, Pranav Mantini, Shishir K. Shah
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[262] arXiv:2402.03723 [pdf, other]
Title: Rig3DGS: Creating Controllable Portraits from Casual Monocular Videos
Alfredo Rivero, ShahRukh Athar, Zhixin Shu, Dimitris Samaras
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[263] arXiv:2402.03738 [pdf, other]
Title: AoSRNet: All-in-One Scene Recovery Networks via Multi-knowledge Integration
Yuxu Lu, Dong Yang, Yuan Gao, Ryan Wen Liu, Jun Liu, Yu Guo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[264] arXiv:2402.03746 [pdf, html, other]
Title: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Daechul Ahn, Yura Choi, Youngjae Yu, Dongyeop Kang, Jonghyun Choi
Comments: ACL 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[265] arXiv:2402.03749 [pdf, other]
Title: Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Jianyuan Guo, Hanting Chen, Chengcheng Wang, Kai Han, Chang Xu, Yunhe Wang
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[266] arXiv:2402.03752 [pdf, other]
Title: Pre-training of Lightweight Vision Transformers on Small Datasets with Minimally Scaled Images
Jen Hong Tan
Comments: 7 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[267] arXiv:2402.03754 [pdf, other]
Title: Intensive Vision-guided Network for Radiology Report Generation
Fudan Zheng, Mengfei Li, Ying Wang, Weijiang Yu, Ruixuan Wang, Zhiguang Chen, Nong Xiao, Yutong Lu
Comments: Accepted by Physics in Medicine & Biology
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2402.03757 [pdf, html, other]
Title: The Instinctive Bias: Spurious Images lead to Illusion in MLLMs
Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
[269] arXiv:2402.03758 [pdf, other]
Title: Virtual Classification: Modulating Domain-Specific Knowledge for Multidomain Crowd Counting
Mingyue Guo, Binghui Chen, Zhaoyi Yan, Yaowei Wang, Qixiang Ye
Comments: Multidomain learning; Domain-guided virtual classifier; Instance-specific batch normalization
Journal-ref: IEEE Transactions on Neural Networks and Learning Systems,2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2402.03762 [pdf, html, other]
Title: MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction
Heng Zhou, Zhetao Guo, Shuhong Liu, Lechen Zhang, Qihao Wang, Yuxiang Ren, Mingrui Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[271] arXiv:2402.03766 [pdf, other]
Title: MobileVLM V2: Faster and Stronger Baseline for Vision Language Model
Xiangxiang Chu, Limeng Qiao, Xinyu Zhang, Shuang Xu, Fei Wei, Yang Yang, Xiaofei Sun, Yiming Hu, Xinyang Lin, Bo Zhang, Chunhua Shen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[272] arXiv:2402.03769 [pdf, html, other]
Title: AttackNet: Enhancing Biometric Security via Tailored Convolutional Neural Network Architectures for Liveness Detection
Oleksandr Kuznetsov, Dmytro Zakharov, Emanuele Frontoni, Andrea Maranesi
Journal-ref: Computers & Security (2024), 103828
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[273] arXiv:2402.03783 [pdf, other]
Title: Exploring Low-Resource Medical Image Classification with Weakly Supervised Prompt Learning
Fudan Zheng, Jindong Cao, Weijiang Yu, Zhiguang Chen, Nong Xiao, Yutong Lu
Comments: Accepted by Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[274] arXiv:2402.03795 [pdf, other]
Title: Energy-based Domain-Adaptive Segmentation with Depth Guidance
Jinjing Zhu, Zhedong Hu, Tae-Kyun Kim, Lin Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275] arXiv:2402.03796 [pdf, other]
Title: Face Detection: Present State and Research Directions
Purnendu Prabhat, Himanshu Gupta, Ajeet Kumar Vishwakarma
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[276] arXiv:2402.03830 [pdf, other]
Title: OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving
Guohang Yan, Jiahao Pi, Jianfei Guo, Zhaotong Luo, Min Dou, Nianchen Deng, Qiusheng Huang, Daocheng Fu, Licheng Wen, Pinlong Cai, Xing Gao, Xinyu Cai, Bo Zhang, Xuemeng Yang, Yeqi Bai, Hongbin Zhou, Botian Shi
Comments: 10 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[277] arXiv:2402.03833 [pdf, html, other]
Title: A Lightweight Randomized Nonlinear Dictionary Learning Method using Random Vector Functional Link
G.Madhuri, Atul Negi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2402.03843 [pdf, html, other]
Title: A new method for optical steel rope non-destructive damage detection
Yunqing Bao, Bin Hu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[279] arXiv:2402.03896 [pdf, html, other]
Title: Multimodal Rationales for Explainable Visual Question Answering
Kun Li, George Vosselman, Michael Ying Yang
Comments: Accepted to CVPR workshops 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280] arXiv:2402.03904 [pdf, html, other]
Title: Deep Frequency-Aware Functional Maps for Robust Shape Matching
Feifan Luo, Qinsong Li, Ling Hu, Haibo Wang, Xinru Liu, Shengjun Liu, Hongyang Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2402.03908 [pdf, other]
Title: EscherNet: A Generative Model for Scalable View Synthesis
Xin Kong, Shikun Liu, Xiaoyang Lyu, Marwan Taher, Xiaojuan Qi, Andrew J. Davison
Comments: CVPR2024 Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[282] arXiv:2402.03917 [pdf, html, other]
Title: Elastic Feature Consolidation for Cold Start Exemplar-Free Incremental Learning
Simone Magistri, Tomaso Trinci, Albin Soutif-Cormerais, Joost van de Weijer, Andrew D. Bagdanov
Comments: Accepted at Twelfth International Conference on Learning Representations (ICLR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[283] arXiv:2402.03944 [pdf, html, other]
Title: Capturing the Unseen: Vision-Free Facial Motion Capture Using Inertial Measurement Units
Youjia Wang, Yiwen Wu, Hengan Zhou, Hongyang Lin, Xingyue Peng, Jingyan Zhang, Yingsheng Zhu, Yingwenqi Jiang, Yatu Zhang, Lan Xu, Jingya Wang, Jingyi Yu
Comments: Go to CAPUS project page this https URL and watch our video this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2402.03951 [pdf, other]
Title: Boosting Adversarial Transferability across Model Genus by Deformation-Constrained Warping
Qinliang Lin, Cheng Luo, Zenghao Niu, Xilin He, Weicheng Xie, Yuanbo Hou, Linlin Shen, Siyang Song
Comments: AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[285] arXiv:2402.03973 [pdf, html, other]
Title: A comparison between humans and AI at recognizing objects in unusual poses
Netta Ollikka, Amro Abbas, Andrea Perin, Markku Kilpeläinen, Stéphane Deny
Comments: version accepted at TMLR
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[286] arXiv:2402.03981 [pdf, other]
Title: Controllable Diverse Sampling for Diffusion Based Motion Behavior Forecasting
Yiming Xu, Hao Cheng, Monika Sester
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2402.03989 [pdf, other]
Title: YOLOPoint Joint Keypoint and Object Detection
Anton Backhaus, Thorsten Luettel, Hans-Joachim Wuensche
Comments: 12 pages, 5 figures
Journal-ref: Proceedings of Advanced Concepts for Intelligent Vision Systems, 14124, 112-123 (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2402.04009 [pdf, other]
Title: Low-rank Attention Side-Tuning for Parameter-Efficient Fine-Tuning
Ningyuan Tang, Minghao Fu, Ke Zhu, Jianxin Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[289] arXiv:2402.04013 [pdf, html, other]
Title: Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses
Hao Fang, Yixiang Qiu, Hongyao Yu, Wenbo Yu, Jiawei Kong, Baoli Chong, Bin Chen, Xuan Wang, Shu-Tao Xia, Ke Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290] arXiv:2402.04031 [pdf, other]
Title: Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation
Zolnamar Dorjsembe, Hsing-Kuo Pao, Furen Xiao
Comments: This preprint has been accepted for publication in the proceedings of the IEEE Engineering in Medicine and Biology Society (EMBC 2024). The final published version is available at this https URL. The copyright for this work has been transferred to IEEE
Journal-ref: Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[291] arXiv:2402.04064 [pdf, html, other]
Title: Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing
Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo
Comments: Accepted to ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[292] arXiv:2402.04087 [pdf, other]
Title: A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation
Zhengbo Wang, Jian Liang, Lijun Sheng, Ran He, Zilei Wang, Tieniu Tan
Comments: Accepted by ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[293] arXiv:2402.04097 [pdf, html, other]
Title: Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction
Shijun Liang, Evan Bell, Qing Qu, Rongrong Wang, Saiprasad Ravishankar
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[294] arXiv:2402.04101 [pdf, html, other]
Title: VRMM: A Volumetric Relightable Morphable Head Model
Haotian Yang, Mingwu Zheng, Chongyang Ma, Yu-Kun Lai, Pengfei Wan, Haibin Huang
Comments: Accepted to SIGGRAPH 2024 (Conference); Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[295] arXiv:2402.04139 [pdf, html, other]
Title: U-shaped Vision Mamba for Single Image Dehazing
Zhuoran Zheng, Chen Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[296] arXiv:2402.04178 [pdf, html, other]
Title: SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models
Yichen Shi, Yuhao Gao, Yingxin Lai, Hongyang Wang, Jun Feng, Lei He, Jun Wan, Changsheng Chen, Zitong Yu, Xiaochun Cao
Comments: Accepted by Visual Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2402.04195 [pdf, other]
Title: Instance by Instance: An Iterative Framework for Multi-instance 3D Registration
Xinyue Cao, Xiyu Zhang, Yuxin Cheng, Zhaoshuai Qi, Yanning Zhang, Jiaqi Yang
Comments: 14 pages, 12 figures, 10 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298] arXiv:2402.04236 [pdf, html, other]
Title: CogCoM: A Visual Language Model with Chain-of-Manipulations Reasoning
Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, Jie Tang
Comments: 21 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[299] arXiv:2402.04252 [pdf, other]
Title: EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Quan Sun, Jinsheng Wang, Qiying Yu, Yufeng Cui, Fan Zhang, Xiaosong Zhang, Xinlong Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[300] arXiv:2402.04273 [pdf, html, other]
Title: Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private Sources
Jinlong Li, Baolu Li, Xinyu Liu, Runsheng Xu, Jiaqi Ma, Hongkai Yu
Comments: Accepted by the 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)