Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for February 2024

Total of 1842 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-700 ... 1801-1842
Showing up to 100 entries per page: fewer | more | all
[301] arXiv:2402.04297 [pdf, html, other]
Title: Road Surface Defect Detection -- From Image-based to Non-image-based: A Survey
Jongmin Yu, Jiaqi Jiang, Sebastiano Fichera, Paolo Paoletti, Lisa Layzell, Devansh Mehta, Shan Luo
Comments: Survey papers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[302] arXiv:2402.04324 [pdf, html, other]
Title: ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
Weiming Ren, Huan Yang, Ge Zhang, Cong Wei, Xinrun Du, Wenhao Huang, Wenhu Chen
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303] arXiv:2402.04408 [pdf, other]
Title: Detection Transformer for Teeth Detection, Segmentation, and Numbering in Oral Rare Diseases: Focus on Data Augmentation and Inpainting Techniques
Hocine Kadi, Théo Sourget, Marzena Kawczynski, Sara Bendjama, Bruno Grollemund, Agnès Bloch-Zupan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2402.04416 [pdf, html, other]
Title: Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap
Christopher Liao, Christian So, Theodoros Tsiligkaridis, Brian Kulis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[305] arXiv:2402.04465 [pdf, html, other]
Title: BAdaCost: Multi-class Boosting with Costs
Antonio Fernández-Baldera, José M. Buenaposada, Luis Baumela
Journal-ref: Pattern Recognition. Volume 79, July 2018, Pages 467-479
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[306] arXiv:2402.04476 [pdf, html, other]
Title: Dual-View Visual Contextualization for Web Navigation
Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao
Comments: Accepted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[307] arXiv:2402.04482 [pdf, html, other]
Title: BEBLID: Boosted efficient binary local image descriptor
Iago Suárez, Ghesn Sfeir, José M. Buenaposada, Luis Baumela
Journal-ref: Pattern Recognition Letters. Volume 133, May 2020, Pages 366-372
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[308] arXiv:2402.04492 [pdf, html, other]
Title: ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation
Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush
Comments: ACL Findings 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[309] arXiv:2402.04504 [pdf, html, other]
Title: Text2Street: Controllable Text-to-image Generation for Street Views
Jinming Su, Songen Gu, Yiting Duan, Xingyue Chen, Junfeng Luo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310] arXiv:2402.04507 [pdf, other]
Title: A Review on Digital Pixel Sensors
Md Rahatul Islam Udoy, Shamiul Alam, Md Mazharul Islam, Akhilesh Jaiswal, Ahmedullah Aziz
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[311] arXiv:2402.04519 [pdf, html, other]
Title: BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision
Xin Zhao, Shiyu Hu, Yipei Wang, Jing Zhang, Yimin Hu, Rongshuai Liu, Haibin Ling, Yin Li, Renshu Li, Kun Liu, Jiadong Li
Comments: This paper is published in IJCV (refer to DOI). Please cite the published IJCV
Journal-ref: Int J Comput Vis (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[312] arXiv:2402.04541 [pdf, other]
Title: BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception
Aniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313] arXiv:2402.04554 [pdf, html, other]
Title: BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery
Huiqing Zhang, Yifei Xue, Ming Liao, Yizhen Lao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[314] arXiv:2402.04555 [pdf, html, other]
Title: FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models
Chuhao Liu, Ke Wang, Jieqi Shi, Zhijian Qiao, Shaojie Shen
Comments: Published in IEEE RAL
Journal-ref: vol. 9, no. 3, pp. 2232-2239, March 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[315] arXiv:2402.04558 [pdf, other]
Title: DMAT: A Dynamic Mask-Aware Transformer for Human De-occlusion
Guoqiang Liang, Jiahao Hu, Qingyue Wang, Shizhou Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316] arXiv:2402.04563 [pdf, other]
Title: Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention
Saebom Leem, Hyunseok Seo
Comments: AAAI2024. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[317] arXiv:2402.04573 [pdf, html, other]
Title: Progressive Conservative Adaptation for Evolving Target Domains
Gangming Zhao, Chaoqi Chen, Wenhao He, Chengwei Pan, Chaowei Fang, Jinpeng Li, Xilin Chen, Yizhou Yu
Comments: 7 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2402.04583 [pdf, other]
Title: A Psychological Study: Importance of Contrast and Luminance in Color to Grayscale Mapping
Prasoon Ambalathankandy, Yafei Ou, Sae Kaneko, Masayuki Ikebe
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2402.04587 [pdf, html, other]
Title: Sparse Anatomical Prompt Semi-Supervised Learning with Masked Image Modeling for CBCT Tooth Segmentation
Pengyu Dai, Yafei Ou, Yuqiao Yang, Yang Liu, Yue Zhao
Comments: accepted by ISBI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320] arXiv:2402.04599 [pdf, html, other]
Title: Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment
Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon, Piotr Koniusz
Comments: Accepted by the International Journal of Computer Vision (IJCV). An extension of our ACCV'22 paper [arXiv:arXiv:2210.16820] which was distinguished by the Sang Uk Lee Best Student Paper Award
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[321] arXiv:2402.04615 [pdf, html, other]
Title: ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Gilles Baechler, Srinivas Sunkara, Maria Wang, Fedir Zubach, Hassan Mansoor, Vincent Etter, Victor Cărbune, Jason Lin, Jindong Chen, Abhanshu Sharma
Comments: Accepted to International Joint Conference on Artificial Intelligence (IJCAI), 2024. Revision Notes: full version of the paper, including 1) Camera-ready version for IJCAI-24; 2) Appendices that are mentioned, but not included in 1)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[322] arXiv:2402.04618 [pdf, other]
Title: Multi-Scale Semantic Segmentation with Modified MBConv Blocks
Xi Chen, Yang Cai, Yuan Wu, Bo Xiong, Taesung Park
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323] arXiv:2402.04625 [pdf, other]
Title: Noise Map Guidance: Inversion with Spatial Context for Real Image Editing
Hansam Cho, Jonghyun Lee, Seoung Bum Kim, Tae-Hyun Oh, Yonghyun Jeong
Comments: ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[324] arXiv:2402.04630 [pdf, other]
Title: LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors
Sheng Jin, Xueying Jiang, Jiaxing Huang, Lewei Lu, Shijian Lu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2402.04632 [pdf, html, other]
Title: GSN: Generalisable Segmentation in Neural Radiance Field
Vinayak Gupta, Rahul Goel, Sirikonda Dhawal, P. J. Narayanan
Comments: Accepted at the Main Technical Track of AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[326] arXiv:2402.04648 [pdf, html, other]
Title: OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding
Guibiao Liao, Kaichen Zhou, Zhenyu Bao, Kanglin Liu, Qing Li
Comments: IEEE TCSVT 2024: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[327] arXiv:2402.04671 [pdf, other]
Title: V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication
Yuanfang Zhang, Junxuan Li, Kaiqing Luo, Yiying Yang, Jiayi Han, Nian Liu, Denghui Qin, Peng Han, Chengpei Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2402.04672 [pdf, other]
Title: G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection
Fan Wu, Jinling Gao, Lanqing Hong, Xinbing Wang, Chenghu Zhou, Nanyang Ye
Comments: Accepted by AAAI24
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[329] arXiv:2402.04686 [pdf, other]
Title: The Influence of Autofocus Lenses in the Camera Calibration Process
Carlos Ricolfe-Viala, Alicia Esparza
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2402.04699 [pdf, html, other]
Title: Breaking Free: How to Hack Safety Guardrails in Black-Box Diffusion Models!
Shashank Kotyan, Po-Yuan Mao, Pin-Yu Chen, Danilo Vasconcellos Vargas
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[331] arXiv:2402.04717 [pdf, other]
Title: InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin, Yadong Mu
Comments: Accepted by ICLR 2024 for spotlight presentation; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332] arXiv:2402.04754 [pdf, html, other]
Title: Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
Jian Chen, Ruiyi Zhang, Yufan Zhou, Rajiv Jain, Zhiqiang Xu, Ryan Rossi, Changyou Chen
Comments: Accepted by ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[333] arXiv:2402.04756 [pdf, html, other]
Title: Boundary-aware Contrastive Learning for Semi-supervised Nuclei Instance Segmentation
Ye Zhang, Ziyue Wang, Yifeng Wang, Hao Bian, Linghan Cai, Hengrui Li, Lingbo Zhang, Yongbing Zhang
Comments: 12 pages, 3 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2402.04762 [pdf, html, other]
Title: Color Recognition in Challenging Lighting Environments: CNN Approach
Nizamuddin Maitlo, Nooruddin Noonari, Sajid Ahmed Ghanghro, Sathishkumar Duraisamy, Fayaz Ahmed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[335] arXiv:2402.04798 [pdf, html, other]
Title: Spiking-PhysFormer: Camera-Based Remote Photoplethysmography with Parallel Spike-driven Transformer
Mingxuan Liu, Jiankai Tang, Yongli Chen, Haoxiang Li, Jiahao Qi, Siwei Li, Kegang Wang, Jie Gan, Yuntao Wang, Hong Chen
Comments: Mingxuan Liu and Jiankai Tang are co-first authors of the article. Accepted by Neural Networks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2402.04829 [pdf, html, other]
Title: NeRF as a Non-Distant Environment Emitter in Physics-based Inverse Rendering
Jingwang Ling, Ruihan Yu, Feng Xu, Chun Du, Shuang Zhao
Comments: SIGGRAPH 2024. Project page and video: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[337] arXiv:2402.04835 [pdf, html, other]
Title: Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning
Darshana Saravanan, Naresh Manwani, Vineet Gandhi
Comments: Best Paper Award at The 12th Workshop on Fine-Grained Visual Categorization (CVPRW 2025)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[338] arXiv:2402.04841 [pdf, other]
Title: Data-efficient Large Vision Models through Sequential Autoregression
Jianyuan Guo, Zhiwei Hao, Chengcheng Wang, Yehui Tang, Han Wu, Han Hu, Kai Han, Chang Xu
Comments: 15 pages
Journal-ref: ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2402.04855 [pdf, other]
Title: Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction
Yuhong He, Aiwen Jiang, Lingfang Jiang, Zhifeng Wang, Lu Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340] arXiv:2402.04857 [pdf, html, other]
Title: Advancing Video Anomaly Detection: A Concise Review and a New Dataset
Liyun Zhu, Lei Wang, Arjun Raj, Tom Gedeon, Chen Chen
Comments: Accepted at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2402.04878 [pdf, html, other]
Title: Shape-biased Texture Agnostic Representations for Improved Textureless and Metallic Object Detection and 6D Pose Estimation
Peter Hönig, Stefan Thalhammer, Jean-Baptiste Weibel, Matthias Hirschmanner, Markus Vincze
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2402.04883 [pdf, other]
Title: Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration
Chaoqun Wang, Yiran Qin, Zijian Kang, Ningning Ma, Ruimao Zhang
Comments: Accepted to ICRA2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2402.04929 [pdf, html, other]
Title: Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha
Comments: arXiv admin note: substantial text overlap with arXiv:2310.01701
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[344] arXiv:2402.04930 [pdf, html, other]
Title: Blue noise for diffusion models
Xingchang Huang, Corentin Salaün, Cristina Vasconcelos, Christian Theobalt, Cengiz Öztireli, Gurprit Singh
Comments: SIGGRAPH 2024 Conference Proceedings; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[345] arXiv:2402.04953 [pdf, other]
Title: 4-Dimensional deformation part model for pose estimation using Kalman filter constraints
Enrique Martinez-Berti, Antonio-Jose Sanchez-Salmeron, Carlos Ricolfe-Viala
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[346] arXiv:2402.04958 [pdf, html, other]
Title: Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation
Pedro Vianna, Muawiz Chaudhary, Paria Mehrbod, An Tang, Guy Cloutier, Guy Wolf, Michael Eickenberg, Eugene Belilovsky
Comments: Accepted at the Conference on Lifelong Learning Agents (CoLLAs) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347] arXiv:2402.04964 [pdf, other]
Title: ConvLoRA and AdaBN based Domain Adaptation via Self-Training
Sidra Aleem, Julia Dietlmeier, Eric Arazo, Suzanne Little
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2402.04979 [pdf, html, other]
Title: Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training
Thomas Pöllabauer, Fabian Rücker, Andreas Franek, Felix Gorschlüter
Comments: Scandinavian Conference on Image Analysis 2023
Journal-ref: In Scandinavian Conference on Image Analysis 2023 (pp. 569-585). Cham: Springer Nature Switzerland
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[349] arXiv:2402.05008 [pdf, html, other]
Title: EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss
Zhuoyang Zhang, Han Cai, Song Han
Comments: CVPR 2024 Workshop (Efficient Large Vision Models)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[350] arXiv:2402.05035 [pdf, other]
Title: A Survey on Domain Generalization for Medical Image Analysis
Ziwei Niu, Shuyi Ouyang, Shiao Xie, Yen-wei Chen, Lanfen Lin
Comments: This is a withdrawn submission and will be considered invalid. Due to some errors and overlap with published papers, we have chosen to withdraw it
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2402.05045 [pdf, other]
Title: Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty
Hersh Vakharia, Xiaoxiao Du
Comments: 4 pages, 3 figures, 2 tables; Accepted to International Geoscience and Remote Sensing Symposium (IGARSS) 2023; Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2402.05054 [pdf, other]
Title: LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2402.05106 [pdf, other]
Title: Image captioning for Brazilian Portuguese using GRIT model
Rafael Silva de Alencar, William Alberto Cruz Castañeda, Marcellus Amadeus
Comments: arXiv admin note: text overlap with arXiv:2207.09666 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[354] arXiv:2402.05158 [pdf, html, other]
Title: Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types
AKM Shahariar Azad Rabby, Hasmot Ali, Md. Majedul Islam, Sheikh Abujar, Fuad Rahman
Comments: 8 pages, 7 figures, 4 table Link of the paper this https URL
Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2024, pp. 1102-1109
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[355] arXiv:2402.05195 [pdf, html, other]
Title: $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space
Maitreya Patel, Sangmin Jung, Chitta Baral, Yezhou Yang
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[356] arXiv:2402.05235 [pdf, html, other]
Title: SPAD : Spatially Aware Multiview Diffusers
Yash Kant, Ziyi Wu, Michael Vasilkovsky, Guocheng Qian, Jian Ren, Riza Alp Guler, Bernard Ghanem, Sergey Tulyakov, Igor Gilitschenski, Aliaksandr Siarohin
Comments: Webpage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2402.05248 [pdf, other]
Title: Comparative Analysis of Kinect-Based and Oculus-Based Gaze Region Estimation Methods in a Driving Simulator
David González-Ortega, Francisco Javier Díaz-Perna, Mario Martínez-Zarzuela, Míriam Antón-Rodríguez
Comments: 25 pages
Journal-ref: Sensors 2021, 21, 26
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2402.05281 [pdf, html, other]
Title: Physics Informed and Data Driven Simulation of Underwater Images via Residual Learning
Tanmoy Mondal, Ricardo Mendoza, Lucas Drumetz
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[359] arXiv:2402.05301 [pdf, html, other]
Title: BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD Designs
Lyle Regenwetter, Yazan Abu Obaideh, Amin Heyrani Nobari, Faez Ahmed
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[360] arXiv:2402.05305 [pdf, html, other]
Title: Knowledge Distillation for Road Detection based on cross-model Semi-Supervised Learning
Wanli Ma, Oktay Karakus, Paul L. Rosin
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361] arXiv:2402.05310 [pdf, html, other]
Title: Dual-disentangled Deep Multiple Clustering
Jiawei Yao, Juhua Hu
Comments: Accepted by SDM'24. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362] arXiv:2402.05349 [pdf, html, other]
Title: Scrapping The Web For Early Wildfire Detection: A New Annotated Dataset of Images and Videos of Smoke Plumes In-the-wild
Mateo Lostanlen, Nicolas Isla, Jose Guillen, Felix Veith, Cristian Buc, Valentin Barriere
Comments: Preprint of ongoing work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363] arXiv:2402.05350 [pdf, html, other]
Title: Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model
Junghun Cha, Ali Haider, Seoyun Yang, Hoeyeong Jin, Subin Yang, A. F. M. Shahab Uddin, Jaehyoung Kim, Soo Ye Kim, Sung-Ho Bae
Comments: Accepted to AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[364] arXiv:2402.05374 [pdf, html, other]
Title: CIC: A Framework for Culturally-Aware Image Captioning
Youngsik Yun, Jihie Kim
Comments: Accepted by IJCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[365] arXiv:2402.05375 [pdf, html, other]
Title: Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models
Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang
Comments: ICLR 2024. Our code is available in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2402.05382 [pdf, html, other]
Title: Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts
Zhili Liu, Kai Chen, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, James T. Kwok
Comments: Accepted by ICLR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367] arXiv:2402.05394 [pdf, html, other]
Title: Enhancing Zero-shot Counting via Language-guided Exemplar Learning
Mingjie Wang, Jun Zhou, Yong Dai, Eric Buys, Minglun Gong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2402.05398 [pdf, html, other]
Title: On the Effect of Image Resolution on Semantic Segmentation
Ritambhara Singh, Abhishek Jain, Pietro Perona, Shivani Agarwal, Junfeng Yang
Comments: arXiv admin note: text overlap with arXiv:2209.08667 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2402.05408 [pdf, html, other]
Title: MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
Dewei Zhou, You Li, Fan Ma, Xiaoting Zhang, Yi Yang
Comments: Accepted for publication in CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2402.05410 [pdf, html, other]
Title: SpirDet: Towards Efficient, Accurate and Lightweight Infrared Small Target Detector
Qianchen Mao, Qiang Li, Bingshu Wang, Yongjun Zhang, Tao Dai, C.L. Philip Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2402.05417 [pdf, other]
Title: Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification
Vaibhav Khatavkar, Makarand Velankar, Sneha Petkar
Comments: 17 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[372] arXiv:2402.05423 [pdf, html, other]
Title: MTSA-SNN: A Multi-modal Time Series Analysis Model Based on Spiking Neural Network
Chengzhi Liu, Zheng Tao, Zihong Luo, Chenghao Liu
Comments: 6 pages, 6 figures, published to International Conference on Computer Supported Cooperative Work in Design
Journal-ref: International Conference on Pattern Recognition 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2402.05441 [pdf, other]
Title: Spiking Neural Network Enhanced Hand Gesture Recognition Using Low-Cost Single-photon Avalanche Diode Array
Zhenya Zang, Xingda Li, David Day Uei Li
Comments: 9 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[374] arXiv:2402.05448 [pdf, html, other]
Title: Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application
Bumsoo Kim, Sanghyun Byun, Yonghoon Jung, Wonseop Shin, Sareer UI Amin, Sanghyun Seo
Comments: 2 pages, 2 figures. Accepted as Spotlight to NeurIPS 2023 Workshop on Machine Learning for Creativity and Design
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[375] arXiv:2402.05472 [pdf, other]
Title: Question Aware Vision Transformer for Multimodal Reasoning
Roy Ganz, Yair Kittenplon, Aviad Aberdam, Elad Ben Avraham, Oren Nuriel, Shai Mazor, Ron Litman
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2402.05532 [pdf, html, other]
Title: NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction
Zhongqun Zhang, Jifei Song, Eduardo Pérez-Pellitero, Yiren Zhou, Hyung Jin Chang, Aleš Leonardis
Comments: Accepted by 3DV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377] arXiv:2402.05548 [pdf, other]
Title: Efficient Expression Neutrality Estimation with Application to Face Recognition Utility Prediction
Marcel Grimmer, Raymond N. J. Veldhuis, Christoph Busch
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[378] arXiv:2402.05557 [pdf, other]
Title: On Convolutional Vision Transformers for Yield Prediction
Alvin Inderka, Florian Huber, Volker Steinhage
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2402.05589 [pdf, html, other]
Title: RESMatch: Referring Expression Segmentation in a Semi-Supervised Manner
Ying Zang, Chenglong Fu, Runlong Cao, Didi Zhu, Min Zhang, Wenjun Hu, Lanyun Zhu, Tianrun Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2402.05593 [pdf, other]
Title: A Concept for Reconstructing Stucco Statues from historic Sketches using synthetic Data only
Thomas Pöllabauer, Julius Kühn
Journal-ref: Eurographics Workshop on Graphics and Cultural Heritage 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[381] arXiv:2402.05608 [pdf, html, other]
Title: Scalable Diffusion Models with State Space Backbone
Zhengcong Fei, Mingyuan Fan, Changqian Yu, Junshi Huang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[382] arXiv:2402.05610 [pdf, html, other]
Title: Extending 6D Object Pose Estimators for Stereo Vision
Thomas Pöllabauer, Jan Emrich, Volker Knauthe, Arjan Kuijper
Comments: 4th International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[383] arXiv:2402.05615 [pdf, other]
Title: DAPlankton: Benchmark Dataset for Multi-instrument Plankton Recognition via Fine-grained Domain Adaptation
Daniel Batrakhanov, Tuomas Eerola, Kaisa Kraft, Lumi Haraguchi, Lasse Lensu, Sanna Suikkanen, María Teresa Camarena-Gómez, Jukka Seppälä, Heikki Kälviäinen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[384] arXiv:2402.05637 [pdf, html, other]
Title: Learning pseudo-contractive denoisers for inverse problems
Deliang Wei, Peng Chen, Fang Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385] arXiv:2402.05655 [pdf, html, other]
Title: Real-time Holistic Robot Pose Estimation with Unknown States
Shikun Ban, Juling Fan, Xiaoxuan Ma, Wentao Zhu, Yu Qiao, Yizhou Wang
Comments: Accepted by ECCV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[386] arXiv:2402.05685 [pdf, html, other]
Title: An Ordinal Regression Framework for a Deep Learning Based Severity Assessment for Chest Radiographs
Patrick Wienholt, Alexander Hermans, Firas Khader, Behrus Puladi, Bastian Leibe, Christiane Kuhl, Sven Nebelung, Daniel Truhn
Comments: 17 pages, 3 figures, the code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2402.05712 [pdf, html, other]
Title: DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer
Zhiyuan Ma, Xiangyu Zhu, Guojun Qi, Chen Qian, Zhaoxiang Zhang, Zhen Lei
Comments: 9 pages, 5 figures. Code is avalable at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[388] arXiv:2402.05728 [pdf, html, other]
Title: CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes
Yi-Ting Pan, Chai-Rong Lee, Shu-Ho Fan, Jheng-Wei Su, Jia-Bin Huang, Yung-Yu Chuang, Hung-Kuo Chu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2402.05746 [pdf, html, other]
Title: Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, Yanfeng Wang
Comments: CVPR 2024(Highlight)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2402.05747 [pdf, other]
Title: Jacquard V2: Refining Datasets using the Human In the Loop Data Correction Method
Qiuhao Li, Shenghai Yuan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[391] arXiv:2402.05773 [pdf, html, other]
Title: UAV-Rain1k: A Benchmark for Raindrop Removal from UAV Aerial Imagery
Wenhui Chang, Hongming Chen, Xin He, Xiang Chen, Liangduo Shen
Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2402.05797 [pdf, html, other]
Title: TaE: Task-aware Expandable Representation for Long Tail Class Incremental Learning
Linjie Li, Zhenyu Wu, Jiaming Liu, Yang Ji
Comments: Accepted to ACCV2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393] arXiv:2402.05803 [pdf, other]
Title: AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
Wamiq Reyaz Para, Abdelrahman Eldesokey, Zhenyu Li, Pradyumna Reddy, Jiankang Deng, Peter Wonka
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[394] arXiv:2402.05804 [pdf, html, other]
Title: InkSight: Offline-to-Online Handwriting Conversion by Teaching Vision-Language Models to Read and Write
Blagoj Mitrevski, Arina Rak, Julian Schnitzler, Chengkun Li, Andrii Maksai, Jesse Berent, Claudiu Musat
Comments: Accepted by Transactions on Machine Learning Research
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[395] arXiv:2402.05809 [pdf, html, other]
Title: You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement
Qingsen Yan, Yixu Feng, Cheng Zhang, Pei Wang, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang
Comments: Qingsen Yan, Yixu Feng, Cheng Zhang contributed equally to this work. Corresponding author: Yanning Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[396] arXiv:2402.05860 [pdf, other]
Title: Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery
Mengya Xu, Mobarakol Islam, Long Bai, Hongliang Ren
Comments: 12 pages, 8 figures, IEEE Transactions on Medical Image (accepted)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2402.05861 [pdf, html, other]
Title: Memory Consolidation Enables Long-Context Video Understanding
Ivana Balažević, Yuge Shi, Pinelopi Papalampidi, Rahma Chaabouni, Skanda Koppula, Olivier J. Hénaff
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2402.05869 [pdf, html, other]
Title: Adaptive Surface Normal Constraint for Geometric Estimation from Monocular Images
Xiaoxiao Long, Yuhang Zheng, Yupeng Zheng, Beiwen Tian, Cheng Lin, Lingjie Liu, Hao Zhao, Guyue Zhou, Wenping Wang
Comments: Accepted by TPAMI. arXiv admin note: substantial text overlap with arXiv:2103.15483
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2402.05889 [pdf, html, other]
Title: CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
Shoubin Yu, Jaehong Yoon, Mohit Bansal
Comments: ICLR 2025; first two authors contributed equally. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[400] arXiv:2402.05892 [pdf, html, other]
Title: Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data
Shufan Li, Harkanwar Singh, Aditya Grover
Comments: 24 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Total of 1842 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-700 ... 1801-1842
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack