Computer Vision and Pattern Recognition

Authors and titles for February 2024

Total of 1842 entries : 1-100 101-200 201-300 301-400 401-500 501-600 601-700 ... 1801-1842

[301] arXiv:2402.04297 [pdf, html, other]: Title: Road Surface Defect Detection -- From Image-based to Non-image-based: A Survey

Jongmin Yu, Jiaqi Jiang, Sebastiano Fichera, Paolo Paoletti, Lisa Layzell, Devansh Mehta, Shan Luo

Comments: Survey papers

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[302] arXiv:2402.04324 [pdf, html, other]: Title: ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

Weiming Ren, Huan Yang, Ge Zhang, Cong Wei, Xinrun Du, Wenhao Huang, Wenhu Chen

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303] arXiv:2402.04408 [pdf, other]: Title: Detection Transformer for Teeth Detection, Segmentation, and Numbering in Oral Rare Diseases: Focus on Data Augmentation and Inpainting Techniques

Hocine Kadi, Théo Sourget, Marzena Kawczynski, Sara Bendjama, Bruno Grollemund, Agnès Bloch-Zupan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2402.04416 [pdf, html, other]: Title: Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap

Christopher Liao, Christian So, Theodoros Tsiligkaridis, Brian Kulis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[305] arXiv:2402.04465 [pdf, html, other]: Title: BAdaCost: Multi-class Boosting with Costs

Antonio Fernández-Baldera, José M. Buenaposada, Luis Baumela

Journal-ref: Pattern Recognition. Volume 79, July 2018, Pages 467-479

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[306] arXiv:2402.04476 [pdf, html, other]: Title: Dual-View Visual Contextualization for Web Navigation

Jihyung Kil, Chan Hee Song, Boyuan Zheng, Xiang Deng, Yu Su, Wei-Lun Chao

Comments: Accepted to CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[307] arXiv:2402.04482 [pdf, html, other]: Title: BEBLID: Boosted efficient binary local image descriptor

Iago Suárez, Ghesn Sfeir, José M. Buenaposada, Luis Baumela

Journal-ref: Pattern Recognition Letters. Volume 133, May 2020, Pages 366-372

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[308] arXiv:2402.04492 [pdf, html, other]: Title: ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation

Jirayu Burapacheep, Ishan Gaur, Agam Bhatia, Tristan Thrush

Comments: ACL Findings 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[309] arXiv:2402.04504 [pdf, html, other]: Title: Text2Street: Controllable Text-to-image Generation for Street Views

Jinming Su, Songen Gu, Yiting Duan, Xingyue Chen, Junfeng Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[310] arXiv:2402.04507 [pdf, other]: Title: A Review on Digital Pixel Sensors

Md Rahatul Islam Udoy, Shamiul Alam, Md Mazharul Islam, Akhilesh Jaiswal, Ahmedullah Aziz

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[311] arXiv:2402.04519 [pdf, html, other]: Title: BioDrone: A Bionic Drone-based Single Object Tracking Benchmark for Robust Vision

Xin Zhao, Shiyu Hu, Yipei Wang, Jing Zhang, Yimin Hu, Rongshuai Liu, Haibin Ling, Yin Li, Renshu Li, Kun Liu, Jiadong Li

Comments: This paper is published in IJCV (refer to DOI). Please cite the published IJCV

Journal-ref: Int J Comput Vis (2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[312] arXiv:2402.04541 [pdf, other]: Title: BRI3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception

Aniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[313] arXiv:2402.04554 [pdf, html, other]: Title: BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery

Huiqing Zhang, Yifei Xue, Ming Liao, Yizhen Lao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[314] arXiv:2402.04555 [pdf, html, other]: Title: FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models

Chuhao Liu, Ke Wang, Jieqi Shi, Zhijian Qiao, Shaojie Shen

Comments: Published in IEEE RAL

Journal-ref: vol. 9, no. 3, pp. 2232-2239, March 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[315] arXiv:2402.04558 [pdf, other]: Title: DMAT: A Dynamic Mask-Aware Transformer for Human De-occlusion

Guoqiang Liang, Jiahao Hu, Qingyue Wang, Shizhou Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316] arXiv:2402.04563 [pdf, other]: Title: Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention

Saebom Leem, Hyunseok Seo

Comments: AAAI2024. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[317] arXiv:2402.04573 [pdf, html, other]: Title: Progressive Conservative Adaptation for Evolving Target Domains

Gangming Zhao, Chaoqi Chen, Wenhao He, Chengwei Pan, Chaowei Fang, Jinpeng Li, Xilin Chen, Yizhou Yu

Comments: 7 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2402.04583 [pdf, other]: Title: A Psychological Study: Importance of Contrast and Luminance in Color to Grayscale Mapping

Prasoon Ambalathankandy, Yafei Ou, Sae Kaneko, Masayuki Ikebe

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2402.04587 [pdf, html, other]: Title: Sparse Anatomical Prompt Semi-Supervised Learning with Masked Image Modeling for CBCT Tooth Segmentation

Pengyu Dai, Yafei Ou, Yuqiao Yang, Yang Liu, Yue Zhao

Comments: accepted by ISBI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[320] arXiv:2402.04599 [pdf, html, other]: Title: Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment

Lei Wang, Jun Liu, Liang Zheng, Tom Gedeon, Piotr Koniusz

Comments: Accepted by the International Journal of Computer Vision (IJCV). An extension of our ACCV'22 paper [arXiv:2210.16820] which was distinguished by the Sang Uk Lee Best Student Paper Award

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[321] arXiv:2402.04615 [pdf, html, other]: Title: ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Gilles Baechler, Srinivas Sunkara, Maria Wang, Fedir Zubach, Hassan Mansoor, Vincent Etter, Victor Cărbune, Jason Lin, Jindong Chen, Abhanshu Sharma

Comments: Accepted to International Joint Conference on Artificial Intelligence (IJCAI), 2024. Revision Notes: full version of the paper, including 1) Camera-ready version for IJCAI-24; 2) Appendices that are mentioned, but not included in 1)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[322] arXiv:2402.04618 [pdf, other]: Title: Multi-Scale Semantic Segmentation with Modified MBConv Blocks

Xi Chen, Yang Cai, Yuan Wu, Bo Xiong, Taesung Park

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[323] arXiv:2402.04625 [pdf, other]: Title: Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

Hansam Cho, Jonghyun Lee, Seoung Bum Kim, Tae-Hyun Oh, Yonghyun Jeong

Comments: ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[324] arXiv:2402.04630 [pdf, other]: Title: LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors

Sheng Jin, Xueying Jiang, Jiaxing Huang, Lewei Lu, Shijian Lu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2402.04632 [pdf, html, other]: Title: GSN: Generalisable Segmentation in Neural Radiance Field

Vinayak Gupta, Rahul Goel, Sirikonda Dhawal, P. J. Narayanan

Comments: Accepted at the Main Technical Track of AAAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[326] arXiv:2402.04648 [pdf, html, other]: Title: OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding

Guibiao Liao, Kaichen Zhou, Zhenyu Bao, Kanglin Liu, Qing Li

Comments: IEEE TCSVT 2024: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[327] arXiv:2402.04671 [pdf, other]: Title: V2VSSC: A 3D Semantic Scene Completion Benchmark for Perception with Vehicle to Vehicle Communication

Yuanfang Zhang, Junxuan Li, Kaiqing Luo, Yiying Yang, Jiayi Han, Nian Liu, Denghui Qin, Peng Han, Chengpei Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2402.04672 [pdf, other]: Title: G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection

Fan Wu, Jinling Gao, Lanqing Hong, Xinbing Wang, Chenghu Zhou, Nanyang Ye

Comments: Accepted by AAAI24

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[329] arXiv:2402.04686 [pdf, other]: Title: The Influence of Autofocus Lenses in the Camera Calibration Process

Carlos Ricolfe-Viala, Alicia Esparza

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2402.04699 [pdf, html, other]: Title: Breaking Free: How to Hack Safety Guardrails in Black-Box Diffusion Models!

Shashank Kotyan, Po-Yuan Mao, Pin-Yu Chen, Danilo Vasconcellos Vargas

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[331] arXiv:2402.04717 [pdf, other]: Title: InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior

Chenguo Lin, Yadong Mu

Comments: Accepted by ICLR 2024 for spotlight presentation; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[332] arXiv:2402.04754 [pdf, html, other]: Title: Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

Jian Chen, Ruiyi Zhang, Yufan Zhou, Rajiv Jain, Zhiqiang Xu, Ryan Rossi, Changyou Chen

Comments: Accepted by ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[333] arXiv:2402.04756 [pdf, html, other]: Title: Boundary-aware Contrastive Learning for Semi-supervised Nuclei Instance Segmentation

Ye Zhang, Ziyue Wang, Yifeng Wang, Hao Bian, Linghan Cai, Hengrui Li, Lingbo Zhang, Yongbing Zhang

Comments: 12 pages, 3 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2402.04762 [pdf, html, other]: Title: Color Recognition in Challenging Lighting Environments: CNN Approach

Nizamuddin Maitlo, Nooruddin Noonari, Sajid Ahmed Ghanghro, Sathishkumar Duraisamy, Fayaz Ahmed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[335] arXiv:2402.04798 [pdf, html, other]: Title: Spiking-PhysFormer: Camera-Based Remote Photoplethysmography with Parallel Spike-driven Transformer

Mingxuan Liu, Jiankai Tang, Yongli Chen, Haoxiang Li, Jiahao Qi, Siwei Li, Kegang Wang, Jie Gan, Yuntao Wang, Hong Chen

Comments: Mingxuan Liu and Jiankai Tang are co-first authors of the article. Accepted by Neural Networks

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2402.04829 [pdf, html, other]: Title: NeRF as a Non-Distant Environment Emitter in Physics-based Inverse Rendering

Jingwang Ling, Ruihan Yu, Feng Xu, Chun Du, Shuang Zhao

Comments: SIGGRAPH 2024. Project page and video: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[337] arXiv:2402.04835 [pdf, html, other]: Title: Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning

Darshana Saravanan, Naresh Manwani, Vineet Gandhi

Comments: Best Paper Award at The 12th Workshop on Fine-Grained Visual Categorization (CVPRW 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[338] arXiv:2402.04841 [pdf, other]: Title: Data-efficient Large Vision Models through Sequential Autoregression

Jianyuan Guo, Zhiwei Hao, Chengcheng Wang, Yehui Tang, Han Wu, Han Hu, Kai Han, Chang Xu

Comments: 15 pages

Journal-ref: ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2402.04855 [pdf, other]: Title: Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction

Yuhong He, Aiwen Jiang, Lingfang Jiang, Zhifeng Wang, Lu Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340] arXiv:2402.04857 [pdf, html, other]: Title: Advancing Video Anomaly Detection: A Concise Review and a New Dataset

Liyun Zhu, Lei Wang, Arjun Raj, Tom Gedeon, Chen Chen

Comments: Accepted at the 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2402.04878 [pdf, html, other]: Title: Shape-biased Texture Agnostic Representations for Improved Textureless and Metallic Object Detection and 6D Pose Estimation

Peter Hönig, Stefan Thalhammer, Jean-Baptiste Weibel, Matthias Hirschmanner, Markus Vincze

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2402.04883 [pdf, other]: Title: Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration

Chaoqun Wang, Yiran Qin, Zijian Kang, Ningning Ma, Ruimao Zhang

Comments: Accepted to ICRA2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2402.04929 [pdf, html, other]: Title: Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

Comments: arXiv admin note: substantial text overlap with arXiv:2310.01701

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[344] arXiv:2402.04930 [pdf, html, other]: Title: Blue noise for diffusion models

Xingchang Huang, Corentin Salaün, Cristina Vasconcelos, Christian Theobalt, Cengiz Öztireli, Gurprit Singh

Comments: SIGGRAPH 2024 Conference Proceedings; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[345] arXiv:2402.04953 [pdf, other]: Title: 4-Dimensional deformation part model for pose estimation using Kalman filter constraints

Enrique Martinez-Berti, Antonio-Jose Sanchez-Salmeron, Carlos Ricolfe-Viala

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[346] arXiv:2402.04958 [pdf, html, other]: Title: Channel-Selective Normalization for Label-Shift Robust Test-Time Adaptation

Pedro Vianna, Muawiz Chaudhary, Paria Mehrbod, An Tang, Guy Cloutier, Guy Wolf, Michael Eickenberg, Eugene Belilovsky

Comments: Accepted at the Conference on Lifelong Learning Agents (CoLLAs) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[347] arXiv:2402.04964 [pdf, other]: Title: ConvLoRA and AdaBN based Domain Adaptation via Self-Training

Sidra Aleem, Julia Dietlmeier, Eric Arazo, Suzanne Little

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2402.04979 [pdf, html, other]: Title: Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training

Thomas Pöllabauer, Fabian Rücker, Andreas Franek, Felix Gorschlüter

Comments: Scandinavian Conference on Image Analysis 2023

Journal-ref: In Scandinavian Conference on Image Analysis 2023 (pp. 569-585). Cham: Springer Nature Switzerland

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[349] arXiv:2402.05008 [pdf, html, other]: Title: EfficientViT-SAM: Accelerated Segment Anything Model Without Accuracy Loss

Zhuoyang Zhang, Han Cai, Song Han

Comments: CVPR 2024 Workshop (Efficient Large Vision Models)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[350] arXiv:2402.05035 [pdf, other]: Title: A Survey on Domain Generalization for Medical Image Analysis

Ziwei Niu, Shuyi Ouyang, Shiao Xie, Yen-wei Chen, Lanfen Lin

Comments: This is a withdrawn submission and will be considered invalid. Due to some errors and overlap with published papers, we have chosen to withdraw it

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2402.05045 [pdf, other]: Title: Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty

Hersh Vakharia, Xiaoxiao Du

Comments: 4 pages, 3 figures, 2 tables; Accepted to International Geoscience and Remote Sensing Symposium (IGARSS) 2023; Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2402.05054 [pdf, other]: Title: LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2402.05106 [pdf, other]: Title: Image captioning for Brazilian Portuguese using GRIT model

Rafael Silva de Alencar, William Alberto Cruz Castañeda, Marcellus Amadeus

Comments: arXiv admin note: text overlap with arXiv:2207.09666 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[354] arXiv:2402.05158 [pdf, html, other]: Title: Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types

AKM Shahariar Azad Rabby, Hasmot Ali, Md. Majedul Islam, Sheikh Abujar, Fuad Rahman

Comments: 8 pages, 7 figures, 4 tables. Link to the paper: this https URL

Journal-ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2024, pp. 1102-1109

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[355] arXiv:2402.05195 [pdf, html, other]: Title: $λ$-ECLIPSE: Multi-Concept Personalized Text-to-Image Diffusion Models by Leveraging CLIP Latent Space

Maitreya Patel, Sangmin Jung, Chitta Baral, Yezhou Yang

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[356] arXiv:2402.05235 [pdf, html, other]: Title: SPAD : Spatially Aware Multiview Diffusers

Yash Kant, Ziyi Wu, Michael Vasilkovsky, Guocheng Qian, Jian Ren, Riza Alp Guler, Bernard Ghanem, Sergey Tulyakov, Igor Gilitschenski, Aliaksandr Siarohin

Comments: Webpage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2402.05248 [pdf, other]: Title: Comparative Analysis of Kinect-Based and Oculus-Based Gaze Region Estimation Methods in a Driving Simulator

David González-Ortega, Francisco Javier Díaz-Perna, Mario Martínez-Zarzuela, Míriam Antón-Rodríguez

Comments: 25 pages

Journal-ref: Sensors 2021, 21, 26

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2402.05281 [pdf, html, other]: Title: Physics Informed and Data Driven Simulation of Underwater Images via Residual Learning

Tanmoy Mondal, Ricardo Mendoza, Lucas Drumetz

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[359] arXiv:2402.05301 [pdf, html, other]: Title: BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD Designs

Lyle Regenwetter, Yazan Abu Obaideh, Amin Heyrani Nobari, Faez Ahmed

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[360] arXiv:2402.05305 [pdf, html, other]: Title: Knowledge Distillation for Road Detection based on cross-model Semi-Supervised Learning

Wanli Ma, Oktay Karakus, Paul L. Rosin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[361] arXiv:2402.05310 [pdf, html, other]: Title: Dual-disentangled Deep Multiple Clustering

Jiawei Yao, Juhua Hu

Comments: Accepted by SDM'24. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[362] arXiv:2402.05349 [pdf, html, other]: Title: Scrapping The Web For Early Wildfire Detection: A New Annotated Dataset of Images and Videos of Smoke Plumes In-the-wild

Mateo Lostanlen, Nicolas Isla, Jose Guillen, Felix Veith, Cristian Buc, Valentin Barriere

Comments: Preprint of ongoing work

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363] arXiv:2402.05350 [pdf, html, other]: Title: Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model

Junghun Cha, Ali Haider, Seoyun Yang, Hoeyeong Jin, Subin Yang, A. F. M. Shahab Uddin, Jaehyoung Kim, Soo Ye Kim, Sung-Ho Bae

Comments: Accepted to AAAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[364] arXiv:2402.05374 [pdf, html, other]: Title: CIC: A Framework for Culturally-Aware Image Captioning

Youngsik Yun, Jihie Kim

Comments: Accepted by IJCAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[365] arXiv:2402.05375 [pdf, html, other]: Title: Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models

Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang

Comments: ICLR 2024. Our code is available in this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2402.05382 [pdf, html, other]: Title: Task-customized Masked AutoEncoder via Mixture of Cluster-conditional Experts

Zhili Liu, Kai Chen, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, James T. Kwok

Comments: Accepted by ICLR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[367] arXiv:2402.05394 [pdf, html, other]: Title: Enhancing Zero-shot Counting via Language-guided Exemplar Learning

Mingjie Wang, Jun Zhou, Yong Dai, Eric Buys, Minglun Gong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2402.05398 [pdf, html, other]: Title: On the Effect of Image Resolution on Semantic Segmentation

Ritambhara Singh, Abhishek Jain, Pietro Perona, Shivani Agarwal, Junfeng Yang

Comments: arXiv admin note: text overlap with arXiv:2209.08667 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2402.05408 [pdf, html, other]: Title: MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis

Dewei Zhou, You Li, Fan Ma, Xiaoting Zhang, Yi Yang

Comments: Accepted for publication in CVPR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2402.05410 [pdf, html, other]: Title: SpirDet: Towards Efficient, Accurate and Lightweight Infrared Small Target Detector

Qianchen Mao, Qiang Li, Bingshu Wang, Yongjun Zhang, Tao Dai, C.L. Philip Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2402.05417 [pdf, other]: Title: Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification

Vaibhav Khatavkar, Makarand Velankar, Sneha Petkar

Comments: 17 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[372] arXiv:2402.05423 [pdf, html, other]: Title: MTSA-SNN: A Multi-modal Time Series Analysis Model Based on Spiking Neural Network

Chengzhi Liu, Zheng Tao, Zihong Luo, Chenghao Liu

Comments: 6 pages, 6 figures, published to International Conference on Computer Supported Cooperative Work in Design

Journal-ref: International Conference on Pattern Recognition 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2402.05441 [pdf, other]: Title: Spiking Neural Network Enhanced Hand Gesture Recognition Using Low-Cost Single-photon Avalanche Diode Array

Zhenya Zang, Xingda Li, David Day Uei Li

Comments: 9 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[374] arXiv:2402.05448 [pdf, html, other]: Title: Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

Bumsoo Kim, Sanghyun Byun, Yonghoon Jung, Wonseop Shin, Sareer UI Amin, Sanghyun Seo

Comments: 2 pages, 2 figures. Accepted as Spotlight to NeurIPS 2023 Workshop on Machine Learning for Creativity and Design

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM)
[375] arXiv:2402.05472 [pdf, other]: Title: Question Aware Vision Transformer for Multimodal Reasoning

Roy Ganz, Yair Kittenplon, Aviad Aberdam, Elad Ben Avraham, Oren Nuriel, Shai Mazor, Ron Litman

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2402.05532 [pdf, html, other]: Title: NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction

Zhongqun Zhang, Jifei Song, Eduardo Pérez-Pellitero, Yiren Zhou, Hyung Jin Chang, Aleš Leonardis

Comments: Accepted by 3DV 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[377] arXiv:2402.05548 [pdf, other]: Title: Efficient Expression Neutrality Estimation with Application to Face Recognition Utility Prediction

Marcel Grimmer, Raymond N. J. Veldhuis, Christoph Busch

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[378] arXiv:2402.05557 [pdf, other]: Title: On Convolutional Vision Transformers for Yield Prediction

Alvin Inderka, Florian Huber, Volker Steinhage

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2402.05589 [pdf, html, other]: Title: RESMatch: Referring Expression Segmentation in a Semi-Supervised Manner

Ying Zang, Chenglong Fu, Runlong Cao, Didi Zhu, Min Zhang, Wenjun Hu, Lanyun Zhu, Tianrun Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2402.05593 [pdf, other]: Title: A Concept for Reconstructing Stucco Statues from historic Sketches using synthetic Data only

Thomas Pöllabauer, Julius Kühn

Journal-ref: Eurographics Workshop on Graphics and Cultural Heritage 2022

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[381] arXiv:2402.05608 [pdf, html, other]: Title: Scalable Diffusion Models with State Space Backbone

Zhengcong Fei, Mingyuan Fan, Changqian Yu, Junshi Huang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[382] arXiv:2402.05610 [pdf, html, other]: Title: Extending 6D Object Pose Estimators for Stereo Vision

Thomas Pöllabauer, Jan Emrich, Volker Knauthe, Arjan Kuijper

Comments: 4th International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[383] arXiv:2402.05615 [pdf, other]: Title: DAPlankton: Benchmark Dataset for Multi-instrument Plankton Recognition via Fine-grained Domain Adaptation

Daniel Batrakhanov, Tuomas Eerola, Kaisa Kraft, Lumi Haraguchi, Lasse Lensu, Sanna Suikkanen, María Teresa Camarena-Gómez, Jukka Seppälä, Heikki Kälviäinen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[384] arXiv:2402.05637 [pdf, html, other]: Title: Learning pseudo-contractive denoisers for inverse problems

Deliang Wei, Peng Chen, Fang Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[385] arXiv:2402.05655 [pdf, html, other]: Title: Real-time Holistic Robot Pose Estimation with Unknown States

Shikun Ban, Juling Fan, Xiaoxuan Ma, Wentao Zhu, Yu Qiao, Yizhou Wang

Comments: Accepted by ECCV 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[386] arXiv:2402.05685 [pdf, html, other]: Title: An Ordinal Regression Framework for a Deep Learning Based Severity Assessment for Chest Radiographs

Patrick Wienholt, Alexander Hermans, Firas Khader, Behrus Puladi, Bastian Leibe, Christiane Kuhl, Sven Nebelung, Daniel Truhn

Comments: 17 pages, 3 figures, the code is available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2402.05712 [pdf, html, other]: Title: DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

Zhiyuan Ma, Xiangyu Zhu, Guojun Qi, Chen Qian, Zhaoxiang Zhang, Zhen Lei

Comments: 9 pages, 5 figures. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[388] arXiv:2402.05728 [pdf, html, other]: Title: CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes

Yi-Ting Pan, Chai-Rong Lee, Shu-Ho Fan, Jheng-Wei Su, Jia-Bin Huang, Yung-Yu Chuang, Hung-Kuo Chu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2402.05746 [pdf, html, other]: Title: Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, Yanfeng Wang

Comments: CVPR 2024 (Highlight)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[390] arXiv:2402.05747 [pdf, other]: Title: Jacquard V2: Refining Datasets using the Human In the Loop Data Correction Method

Qiuhao Li, Shenghai Yuan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[391] arXiv:2402.05773 [pdf, html, other]: Title: UAV-Rain1k: A Benchmark for Raindrop Removal from UAV Aerial Imagery

Wenhui Chang, Hongming Chen, Xin He, Xiang Chen, Liangduo Shen

Comments: Accepted by IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2402.05797 [pdf, html, other]: Title: TaE: Task-aware Expandable Representation for Long Tail Class Incremental Learning

Linjie Li, Zhenyu Wu, Jiaming Liu, Yang Ji

Comments: Accepted to ACCV2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[393] arXiv:2402.05803 [pdf, other]: Title: AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning

Wamiq Reyaz Para, Abdelrahman Eldesokey, Zhenyu Li, Pradyumna Reddy, Jiankang Deng, Peter Wonka

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[394] arXiv:2402.05804 [pdf, html, other]: Title: InkSight: Offline-to-Online Handwriting Conversion by Teaching Vision-Language Models to Read and Write

Blagoj Mitrevski, Arina Rak, Julian Schnitzler, Chengkun Li, Andrii Maksai, Jesse Berent, Claudiu Musat

Comments: Accepted by Transactions on Machine Learning Research

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[395] arXiv:2402.05809 [pdf, html, other]: Title: You Only Need One Color Space: An Efficient Network for Low-light Image Enhancement

Qingsen Yan, Yixu Feng, Cheng Zhang, Pei Wang, Peng Wu, Wei Dong, Jinqiu Sun, Yanning Zhang

Comments: Qingsen Yan, Yixu Feng, Cheng Zhang contributed equally to this work. Corresponding author: Yanning Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[396] arXiv:2402.05860 [pdf, other]: Title: Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery

Mengya Xu, Mobarakol Islam, Long Bai, Hongliang Ren

Comments: 12 pages, 8 figures, IEEE Transactions on Medical Image (accepted)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2402.05861 [pdf, html, other]: Title: Memory Consolidation Enables Long-Context Video Understanding

Ivana Balažević, Yuge Shi, Pinelopi Papalampidi, Rahma Chaabouni, Skanda Koppula, Olivier J. Hénaff

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2402.05869 [pdf, html, other]: Title: Adaptive Surface Normal Constraint for Geometric Estimation from Monocular Images

Xiaoxiao Long, Yuhang Zheng, Yupeng Zheng, Beiwen Tian, Cheng Lin, Lingjie Liu, Hao Zhao, Guyue Zhou, Wenping Wang

Comments: Accepted by TPAMI. arXiv admin note: substantial text overlap with arXiv:2103.15483

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2402.05889 [pdf, html, other]: Title: CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion

Shoubin Yu, Jaehong Yoon, Mohit Bansal

Comments: ICLR 2025; first two authors contributed equally. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[400] arXiv:2402.05892 [pdf, html, other]: Title: Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

Shufan Li, Harkanwar Singh, Aditya Grover

Comments: 24 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
