Computer Vision and Pattern Recognition

Authors and titles for recent submissions

Total of 513 entries : 1-50 ... 251-300 301-350 351-400 401-450 451-500 501-513

[401] arXiv:2509.04687 [pdf, html, other]: Title: Guideline-Consistent Segmentation via Multi-Agent Refinement

Vanshika Vats, Ashwani Rathee, James Davis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2509.04669 [pdf, html, other]: Title: VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation

Mustafa Munir, Alex Zhang, Radu Marculescu

Comments: Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision (ICCV) Workshops

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[403] arXiv:2509.04624 [pdf, html, other]: Title: UAV-Based Intelligent Traffic Surveillance System: Real-Time Vehicle Detection, Classification, Tracking, and Behavioral Analysis

Ali Khanpour, Tianyi Wang, Afra Vahidi-Shams, Wim Ectors, Farzam Nakhaie, Amirhossein Taheri, Christian Claudel

Comments: 15 pages, 8 figures, 2 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Robotics (cs.RO); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
[404] arXiv:2509.04602 [pdf, html, other]: Title: Sali4Vid: Saliency-Aware Video Reweighting and Adaptive Caption Retrieval for Dense Video Captioning

MinJu Jeon, Si-Woo Kim, Ye-Chan Kim, HyunGee Kim, Dong-Jin Kim

Comments: Accepted in EMNLP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2509.04600 [pdf, html, other]: Title: WATCH: World-aware Allied Trajectory and pose reconstruction for Camera and Human

Qijun Ying, Zhongyuan Hu, Rui Zhang, Ronghui Li, Yu Lu, Zijiao Zeng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[406] arXiv:2509.04597 [pdf, html, other]: Title: DisPatch: Disarming Adversarial Patches in Object Detection with Diffusion Models

Jin Ma, Mohammed Aldeen, Christopher Salas, Feng Luo, Mashrur Chowdhury, Mert Pesé, Long Cheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2509.04582 [pdf, html, other]: Title: Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping

Jingyi Lu, Kai Han

Comments: Accepted to ICCV 2025. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2509.04548 [pdf, html, other]: Title: Skywork UniPic 2.0: Building Kontext Model with Online RL for Unified Multimodal Model

Hongyang Wei, Baixin Xu, Hongbo Liu, Cyrus Wu, Jie Liu, Yi Peng, Peiyu Wang, Zexiang Liu, Jingwen He, Yidan Xietian, Chuanxin Tang, Zidong Wang, Yichen Wei, Liang Hu, Boyi Jiang, William Li, Ying He, Yang Liu, Xuchen Song, Eric Li, Yahui Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[409] arXiv:2509.04545 [pdf, html, other]: Title: PromptEnhancer: A Simple Approach to Enhance Text-to-Image Models via Chain-of-Thought Prompt Rewriting

Linqing Wang, Ximing Xing, Yiji Cheng, Zhiyuan Zhao, Jiale Tao, Qixun Wang, Ruihuang Li, Comi Chen, Xin Li, Mingrui Wu, Xinchi Deng, Chunyu Wang, Qinglin Lu

Comments: technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[410] arXiv:2509.04490 [pdf, html, other]: Title: Facial Emotion Recognition does not detect feeling unsafe in automated driving

Abel van Elburg, Konstantinos Gkentsidis, Mathieu Sarrazin, Sarah Barendswaard, Varun Kotian, Riender Happee

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2509.05285 (cross-list from cs.GR) [pdf, html, other]: Title: Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control

Haruo Fujiwara, Yusuke Mukuta, Tatsuya Harada

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[412] arXiv:2509.05263 (cross-list from cs.AI) [pdf, html, other]: Title: LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation

Yinglin Duan, Zhengxia Zou, Tongwei Gu, Wei Jia, Zhan Zhao, Luyi Xu, Xinzhu Liu, Yenan Lin, Hao Jiang, Kang Chen, Shuang Qiu

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[413] arXiv:2509.05201 (cross-list from cs.RO) [pdf, html, other]: Title: Robust Model Predictive Control Design for Autonomous Vehicles with Perception-based Observers

Nariman Niknejad, Gokul S. Sankar, Bahare Kiumarsi, Hamidreza Modares

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[414] arXiv:2509.05154 (cross-list from eess.IV) [pdf, html, other]: Title: VLSM-Ensemble: Ensembling CLIP-based Vision-Language Models for Enhanced Medical Image Segmentation

Julia Dietlmeier, Oluwabukola Grace Adegboro, Vayangi Ganepola, Claudia Mazo, Noel E. O'Connor

Comments: Medical Imaging with Deep Learning (MIDL 2025) short paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[415] arXiv:2509.05146 (cross-list from cs.CL) [pdf, html, other]: Title: PRIM: Towards Practical In-Image Multilingual Machine Translation

Yanzhi Tian, Zeming Liu, Zhengyang Liu, Chong Feng, Xin Li, Heyan Huang, Yuhang Guo

Comments: Accepted to EMNLP 2025 Main Conference

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2509.05031 (cross-list from cs.RO) [pdf, html, other]: Title: Pointing-Guided Target Estimation via Transformer-Based Attention

Luca Müller, Hassan Ali, Philipp Allgeuer, Lukáš Gajdošech, Stefan Wermter

Comments: Accepted at the 34th International Conference on Artificial Neural Networks (ICANN) 2025, 12 pages, 4 figures, 1 table; work was co-funded by Horizon Europe project TERAIS under Grant agreement number 101079338

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2509.04948 (cross-list from cs.RO) [pdf, html, other]: Title: Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)

Emanuela Boros

Comments: Master's thesis

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2509.04908 (cross-list from cs.AI) [pdf, html, other]: Title: SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing

Hongyi Jing, Jiafu Chen, Chen Rao, Ziqiang Dang, Jiajie Teng, Tianyi Chu, Juncheng Mo, Shuo Fang, Huaizhong Lin, Rui Lv, Chenguang Ma, Lei Zhao

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[419] arXiv:2509.04870 (cross-list from eess.IV) [pdf, html, other]: Title: Multi-modal Uncertainty Robust Tree Cover Segmentation For High-Resolution Remote Sensing Images

Yuanyuan Gui, Wei Li, Yinjian Wang, Xiang-Gen Xia, Mauro Marty, Christian Ginzler, Zuyuan Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[420] arXiv:2509.04849 (cross-list from quant-ph) [pdf, other]: Title: Histogram Driven Amplitude Embedding for Qubit Efficient Quantum Image Compression

Sahil Tomar, Sandeep Kumar

Comments: 7 pages

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Information Theory (cs.IT)
[421] arXiv:2509.04819 (cross-list from eess.IV) [pdf, other]: Title: AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations

Shuhan Ding, Jingjing Fu, Yu Gu, Naiteek Sangani, Mu Wei, Paul Vozila, Nan Liu, Jiang Bian, Hoifung Poon

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2509.04745 (cross-list from cs.CL) [pdf, html, other]: Title: Phonological Representation Learning for Isolated Signs Improves Out-of-Vocabulary Generalization

Lee Kezar, Zed Sehyr, Jesse Thomason

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[423] arXiv:2509.04734 (cross-list from cs.LG) [pdf, html, other]: Title: Beyond I-Con: Exploring New Dimension of Distance Measures in Representation Learning

Jasmine Shone, Shaden Alshammari, Mark Hamilton, Zhening Li, William Freeman

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[424] arXiv:2509.04719 (cross-list from cs.DC) [pdf, html, other]: Title: STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs

Han Liang, Jiahui Zhou, Zicheng Zhou, Xiaoxi Zhang, Xu Chen

Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2509.04682 (cross-list from cs.SD) [pdf, html, other]: Title: Ecologically Valid Benchmarking and Adaptive Attention: Scalable Marine Bioacoustic Monitoring

Nicholas R. Rasmussen, Rodrigue Rizk, Longwei Wang, KC Santosh

Comments: Under review as an anonymous submission to IEEETAI - We are allowed an archive submission. Final formatting is yet to be determined

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[426] arXiv:2509.04677 (cross-list from eess.IV) [pdf, html, other]: Title: Inferring the Graph Structure of Images for Graph Neural Networks

Mayur S Gowda, John Shi, Augusto Santos, José M. F. Moura

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[427] arXiv:2509.04606 (cross-list from cs.CL) [pdf, html, other]: Title: Sample-efficient Integration of New Modalities into Large Language Models

Osman Batur İnce, André F. T. Martins, Oisin Mac Aodha, Edoardo M. Ponti

Comments: Pre-print

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

[428] arXiv:2509.04450 [pdf, html, other]: Title: Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image -- Technical Preview

Jun-Kun Chen, Aayush Bansal, Minh Phuoc Vo, Yu-Xiong Wang

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[429] arXiv:2509.04448 [pdf, html, other]: Title: TRUST-VL: An Explainable News Assistant for General Multimodal Misinformation Detection

Zehong Yan, Peng Qi, Wynne Hsu, Mong Li Lee

Comments: EMNLP 2025; Project Homepage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[430] arXiv:2509.04446 [pdf, html, other]: Title: Plot'n Polish: Zero-shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models

Kiymet Akdemir, Jing Shi, Kushal Kafle, Brian Price, Pinar Yanardag

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2509.04444 [pdf, other]: Title: One Flight Over the Gap: A Survey from Perspective to Panoramic Vision

Xin Lin, Xian Ge, Dizhe Zhang, Zhaoliang Wan, Xianshun Wang, Xiangtai Li, Wenjie Jiang, Bo Du, Dacheng Tao, Ming-Hsuan Yang, Lu Qi

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432] arXiv:2509.04438 [pdf, html, other]: Title: The Telephone Game: Evaluating Semantic Drift in Unified Models

Sabbir Mollah, Rohit Gupta, Sirnam Swetha, Qingyang Liu, Ahnaf Munir, Mubarak Shah

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[433] arXiv:2509.04437 [pdf, html, other]: Title: From Lines to Shapes: Geometric-Constrained Segmentation of X-Ray Collimators via Hough Transform

Benjamin El-Zein, Dominik Eckert, Andreas Fieselmann, Christopher Syben, Ludwig Ritschl, Steffen Kappler, Sebastian Stober

Subjects: Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[434] arXiv:2509.04434 [pdf, html, other]: Title: Durian: Dual Reference-guided Portrait Animation with Attribute Transfer

Hyunsoo Cha, Byungjun Kim, Hanbyul Joo

Comments: Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2509.04406 [pdf, html, other]: Title: Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

Zanwei Zhou, Taoran Yi, Jiemin Fang, Chen Yang, Lingxi Xie, Xinggang Wang, Wei Shen, Qi Tian

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2509.04403 [pdf, html, other]: Title: Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios

Jingen Qu, Lijun Li, Bo Zhang, Yichen Yan, Jing Shao

Comments: Accepted at EMNLP 2025 Findings

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Cryptography and Security (cs.CR)
[437] arXiv:2509.04402 [pdf, html, other]: Title: Learning neural representations for X-ray ptychography reconstruction with unknown probes

Tingyou Li, Zixin Xu, Zirui Gao, Hanfei Yan, Xiaojing Huang, Jizhou Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2509.04379 [pdf, html, other]: Title: SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

Jimin Xu, Bosheng Qin, Tao Jin, Zhou Zhao, Zhenhui Ye, Jun Yu, Fei Wu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[439] arXiv:2509.04378 [pdf, other]: Title: Aesthetic Image Captioning with Saliency Enhanced MLLMs

Yilin Tao, Jiashui Huang, Huaze Xu, Ling Shao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[440] arXiv:2509.04376 [pdf, html, other]: Title: AnomalyLMM: Bridging Generative Knowledge and Discriminative Retrieval for Text-Based Person Anomaly Search

Hao Ju, Hu Zhang, Zhedong Zheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441] arXiv:2509.04370 [pdf, other]: Title: Stitching the Story: Creating Panoramic Incident Summaries from Body-Worn Footage

Dor Cohen, Inga Efrosman, Yehudit Aperstein, Alexander Apartsin

Comments: 5 pages, 3 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2509.04344 [pdf, html, other]: Title: MICACL: Multi-Instance Category-Aware Contrastive Learning for Long-Tailed Dynamic Facial Expression Recognition

Feng-Qi Cui, Zhen Lin, Xinlong Rao, Anyang Tong, Shiyao Li, Fei Wang, Changlin Chen, Bin Liu

Comments: Accepted by IEEE ISPA 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[443] arXiv:2509.04338 [pdf, html, other]: Title: From Editor to Dense Geometry Estimator

JiYuan Wang, Chunyu Lin, Lei Sun, Rongying Liu, Lang Nie, Mingxing Li, Kang Liao, Xiangxiang Chu, Yao Zhao

Comments: 20 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[444] arXiv:2509.04334 [pdf, html, other]: Title: GeoArena: An Open Platform for Benchmarking Large Vision-language Models on WorldWide Image Geolocalization

Pengyue Jia, Yingyi Zhang, Xiangyu Zhao, Yixuan Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[445] arXiv:2509.04326 [pdf, html, other]: Title: Efficient Odd-One-Out Anomaly Detection

Silvio Chito, Paolo Rabino, Tatiana Tommasi

Comments: Accepted at ICIAP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2509.04298 [pdf, html, other]: Title: Noisy Label Refinement with Semantically Reliable Synthetic Images

Yingxuan Li, Jiafeng Mao, Yusuke Matsui

Comments: Accepted to ICIP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447] arXiv:2509.04276 [pdf, html, other]: Title: PAOLI: Pose-free Articulated Object Learning from Sparse-view Images

Jianning Deng, Kartic Subr, Hakan Bilen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2509.04273 [pdf, html, other]: Title: Dual-Scale Volume Priors with Wasserstein-Based Consistency for Semi-Supervised Medical Image Segmentation

Junying Meng, Gangxuan Zhou, Jun Liu, Weihong Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449] arXiv:2509.04269 [pdf, html, other]: Title: TauGenNet: Plasma-Driven Tau PET Image Synthesis via Text-Guided 3D Diffusion Models

Yuxin Gong, Se-in Jang, Wei Shao, Yi Su, Kuang Gong (for the Alzheimer's Disease Neuroimaging Initiative (ADNI))

Comments: 9 pages, 4 figures, submitted to IEEE Transactions on Radiation and Plasma Medical Sciences

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2509.04268 [pdf, html, other]: Title: Differential Morphological Profile Neural Networks for Semantic Segmentation

David Huangal, J. Alex Hurt

Comments: 14 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Mon, 8 Sep 2025 (continued, showing last 27 of 69 entries)

Fri, 5 Sep 2025 (showing first 23 of 86 entries)