Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for recent submissions

  • Mon, 22 Sep 2025
  • Fri, 19 Sep 2025
  • Thu, 18 Sep 2025
  • Wed, 17 Sep 2025
  • Tue, 16 Sep 2025

See today's new changes

Total of 629 entries : 1-50 ... 301-350 351-400 401-450 436-485 451-500 501-550 551-600 ... 601-629
Showing up to 50 entries per page: fewer | more | all

Wed, 17 Sep 2025 (continued, showing last 10 of 132 entries )

[436] arXiv:2509.12534 (cross-list from eess.IV) [pdf, html, other]
Title: DeepEyeNet: Generating Medical Report for Retinal Images
Jia-Hong Huang
Comments: The paper is accepted by the Conference on Information and Knowledge Management (CIKM), 2025
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2509.12512 (cross-list from eess.IV) [pdf, html, other]
Title: DinoAtten3D: Slice-Level Attention Aggregation of DinoV2 for 3D Brain MRI Anomaly Classification
Fazle Rafsani, Jay Shah, Catherine D. Chong, Todd J. Schwedt, Teresa Wu
Comments: ACCEPTED at the ICCV 2025 Workshop on Anomaly Detection with Foundation Models
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2509.12458 (cross-list from cs.RO) [pdf, html, other]
Title: Neural 3D Object Reconstruction with Small-Scale Unmanned Aerial Vehicles
Àlmos Veres-Vitàlyos, Genis Castillo Gomez-Raya, Filip Lemic, Daniel Johannes Bugelnig, Bernhard Rinner, Sergi Abadal, Xavier Costa-Pérez
Comments: 13 pages, 16 figures, 3 tables, 45 references
Subjects: Robotics (cs.RO); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Systems and Control (eess.SY)
[439] arXiv:2509.12376 (cross-list from math.AC) [pdf, html, other]
Title: Universal Gröbner Bases of (Universal) Multiview Ideals
Timothy Duff, Jack Kendrick, Rekha R. Thomas
Subjects: Commutative Algebra (math.AC); Computer Vision and Pattern Recognition (cs.CV); Algebraic Geometry (math.AG)
[440] arXiv:2509.12287 (cross-list from eess.IV) [pdf, other]
Title: Enhancing Radiographic Disease Detection with MetaCheX, a Context-Aware Multimodal Model
Nathan He, Cody Chen
Comments: All authors contributed equally, 5 pages, 2 figures, 1 table
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[441] arXiv:2509.12274 (cross-list from cs.AI) [pdf, other]
Title: Developing an aeroponic smart experimental greenhouse for controlling irrigation and plant disease detection using deep learning and IoT
Mohammadreza Narimani, Ali Hajiahmad, Ali Moghimi, Reza Alimardani, Shahin Rafiee, Amir Hossein Mirzabe
Comments: Author-accepted version. Presented at ASABE Annual International Meeting (AIM) 2021 (virtual), Paper 2101252. Please cite the published meeting paper: doi:https://doi.org/10.13031/aim.202101252. Minor wording and formatting updates in this preprint
Journal-ref: ASABE Annual International Meeting (AIM), July 12-16, 2021, Virtual. Paper 2101252
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[442] arXiv:2509.12251 (cross-list from cs.AI) [pdf, other]
Title: V-Math: An Agentic Approach to the Vietnamese National High School Graduation Mathematics Exams
Duong Q. Nguyen, Quy P. Nguyen, Nguyen Van Nhon, Quang-Thinh Bui, H. Nguyen-Xuan
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[443] arXiv:2509.12239 (cross-list from cs.LG) [pdf, other]
Title: InJecteD: Analyzing Trajectories and Drift Dynamics in Denoising Diffusion Probabilistic Models for 2D Point Cloud Generation
Sanyam Jain, Khuram Naveed, Illia Oleksiienko, Alexandros Iosifidis, Ruben Pauwels
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[444] arXiv:2509.12237 (cross-list from cs.LG) [pdf, other]
Title: Neural Diffeomorphic-Neural Operator for Residual Stress-Induced Deformation Prediction
Changqing Liu, Kaining Dai, Zhiwei Zhao, Tianyi Wu, Yingguang Li
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[445] arXiv:2509.12234 (cross-list from cs.LG) [pdf, html, other]
Title: Flexible Multimodal Neuroimaging Fusion for Alzheimer's Disease Progression Prediction
Benjamin Burns, Yuan Xue, Douglas W. Scharre, Xia Ning
Comments: Accepted at Applications of Medical AI 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)

Tue, 16 Sep 2025 (showing first 40 of 184 entries )

[446] arXiv:2509.12204 [pdf, html, other]
Title: Character-Centric Understanding of Animated Movies
Zhongrui Gui, Junyu Xie, Tengda Han, Weidi Xie, Andrew Zisserman
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[447] arXiv:2509.12203 [pdf, html, other]
Title: LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence
Zixin Yin, Xili Dai, Duomin Wang, Xianfang Zeng, Lionel M. Ni, Gang Yu, Heung-Yeung Shum
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2509.12201 [pdf, html, other]
Title: OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
Yang Zhou, Yifan Wang, Jianjun Zhou, Wenzheng Chang, Haoyu Guo, Zizun Li, Kaijing Ma, Xinyue Li, Yating Wang, Haoyi Zhu, Mingyu Liu, Dingning Liu, Jiange Yang, Zhoujie Fu, Junyi Chen, Chunhua Shen, Jiangmiao Pang, Kaipeng Zhang, Tong He
Comments: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[449] arXiv:2509.12197 [pdf, html, other]
Title: 3D Human Pose and Shape Estimation from LiDAR Point Clouds: A Review
Salma Galaaoui, Eduardo Valle, David Picard, Nermin Samet
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2509.12193 [pdf, html, other]
Title: Domain-Adaptive Pretraining Improves Primate Behavior Recognition
Felix B. Mueller, Timo Lueddecke, Richard Vogg, Alexander S. Ecker
Comments: Oral at the CVPR 2025 Workshop CV4Animals
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2509.12187 [pdf, html, other]
Title: HoloGarment: 360° Novel View Synthesis of In-the-Wild Garments
Johanna Karras, Yingwei Li, Yasamin Jafarian, Ira Kemelmacher-Shlizerman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
[452] arXiv:2509.12155 [pdf, other]
Title: LoRA-fine-tuned Large Vision Models for Automated Assessment of Post-SBRT Lung Injury
M. Bolhassani, B. Veasey, E. Daugherty, S. Keltner, N. Kumar, N. Dunlap, A. Amini
Comments: 5 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[453] arXiv:2509.12146 [pdf, html, other]
Title: Multi Anatomy X-Ray Foundation Model
Nishank Singla, Krisztian Koos, Farzin Haddadpour, Amin Honarmandi Shandiz, Lovish Chum, Xiaojian Xu, Qing Jin, Erhan Bas
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[454] arXiv:2509.12145 [pdf, html, other]
Title: Open-ended Hierarchical Streaming Video Understanding with Vision Language Models
Hyolim Kang, Yunsu Park, Youngbeom Yoo, Yeeun Choi, Seon Joo Kim
Comments: 17 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[455] arXiv:2509.12143 [pdf, html, other]
Title: 3DViT-GAT: A Unified Atlas-Based 3D Vision Transformer and Graph Learning Framework for Major Depressive Disorder Detection Using Structural MRI Data
Nojod M. Alotaibi, Areej M. Alhothali, Manar S. Ali
Comments: 14 pages, 1 figure, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[456] arXiv:2509.12132 [pdf, other]
Title: Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models
Pu Jian, Junhong Wu, Wei Sun, Chen Wang, Shuo Ren, Jiajun Zhang
Comments: EMNLP2025 Main
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[457] arXiv:2509.12125 [pdf, html, other]
Title: RailSafeNet: Visual Scene Understanding for Tram Safety
Ondřej Valach, Ivan Gruber
Comments: 11 pages, 5 figures, EPIA2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458] arXiv:2509.12105 [pdf, html, other]
Title: FS-SAM2: Adapting Segment Anything Model 2 for Few-Shot Semantic Segmentation via Low-Rank Adaptation
Bernardo Forni, Gabriele Lombardi, Federico Pozzi, Mirco Planamente
Comments: Accepted at ICIAP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[459] arXiv:2509.12090 [pdf, html, other]
Title: End-to-End 4D Heart Mesh Recovery Across Full-Stack and Sparse Cardiac MRI
Yihong Chen, Jiancheng Yang, Deniz Sayin Mercadier, Hieu Le, Juerg Schwitter, Pascal Fua
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460] arXiv:2509.12079 [pdf, html, other]
Title: Progressive Flow-inspired Unfolding for Spectral Compressive Imaging
Xiaodong Wang, Ping Wang, Zijun He, Mengjie Qin, Xin Yuan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461] arXiv:2509.12069 [pdf, html, other]
Title: U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT
Zhi Qin Tan, Xiatian Zhu, Owen Addison, Yunpeng Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[462] arXiv:2509.12068 [pdf, other]
Title: End-to-End Learning of Multi-Organ Implicit Surfaces from 3D Medical Imaging Data
Farahdiba Zarin, Nicolas Padoy, Jérémy Dana, Vinkle Srivastav
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2509.12062 [pdf, html, other]
Title: Robust Fetal Pose Estimation across Gestational Ages via Cross-Population Augmentation
Sebastian Diaz, Benjamin Billot, Neel Dey, Molin Zhang, Esra Abaci Turk, P. Ellen Grant, Polina Golland, Elfar Adalsteinsson
Comments: Accepted MICCAI 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[464] arXiv:2509.12052 [pdf, html, other]
Title: AvatarSync: Rethinking Talking-Head Animation through Autoregressive Perspective
Yuchen Deng, Xiuyang Wu, Hai-Tao Zheng, Suiyang Zhang, Yi He, Yuxing Han
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[465] arXiv:2509.12047 [pdf, other]
Title: A Computer Vision Pipeline for Individual-Level Behavior Analysis: Benchmarking on the Edinburgh Pig Dataset
Haiyu Yang, Enhong Liu, Jennifer Sun, Sumit Sharma, Meike van Leerdam, Sebastien Franceschini, Puchun Niu, Miel Hostens
Comments: 9 figures, Submitted to Computers and Electronics in Agriculture
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[466] arXiv:2509.12046 [pdf, html, other]
Title: Layout-Conditioned Autoregressive Text-to-Image Generation via Structured Masking
Zirui Zheng, Takashi Isobe, Tong Shen, Xu Jia, Jianbin Zhao, Xiaomin Li, Mengmeng Ge, Baolu Li, Qinghe Wang, Dong Li, Dong Zhou, Yunzhi Zhuge, Huchuan Lu, Emad Barsoum
Comments: 10 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[467] arXiv:2509.12040 [pdf, html, other]
Title: Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
Bingyu Li, Haocheng Dong, Da Zhang, Zhiyuan Zhao, Junyu Gao, Xuelong Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[468] arXiv:2509.12039 [pdf, html, other]
Title: RAM++: Robust Representation Learning via Adaptive Mask for All-in-One Image Restoration
Zilong Zhang, Chujie Qin, Chunle Guo, Yong Zhang, Chao Xue, Ming-Ming Cheng, Chongyi Li
Comments: 18 pages, 22 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[469] arXiv:2509.12024 [pdf, html, other]
Title: Robust Concept Erasure in Diffusion Models: A Theoretical Perspective on Security and Robustness
Zixuan Fu, Yan Ren, Finn Carter, Chenyue Wen, Le Ku, Daheng Yu, Emily Davis, Bo Zhang
Comments: Camera ready version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470] arXiv:2509.11986 [pdf, html, other]
Title: Lost in Embeddings: Information Loss in Vision-Language Models
Wenyan Li, Raphael Tang, Chengzu Li, Caiqi Zhang, Ivan Vulić, Anders Søgaard
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[471] arXiv:2509.11959 [pdf, html, other]
Title: Learning to Generate 4D LiDAR Sequences
Ao Liang, Youquan Liu, Yu Yang, Dongyue Lu, Linfeng Li, Lingdong Kong, Huaici Zhao, Wei Tsang Ooi
Comments: Abstract Paper (Non-Archival) @ ICCV 2025 Wild3D Workshop; GitHub Repo at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[472] arXiv:2509.11952 [pdf, html, other]
Title: CLAIRE: A Dual Encoder Network with RIFT Loss and Phi-3 Small Language Model Based Interpretability for Cross-Modality Synthetic Aperture Radar and Optical Land Cover Segmentation
Debopom Sutradhar, Arefin Ittesafun Abian, Mohaimenul Azam Khan Raiaan, Reem E. Mohamed, Sheikh Izzal Azid, Sami Azam
Comments: 23 pages, 6 figures, 10 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[473] arXiv:2509.11948 [pdf, html, other]
Title: Sphere-GAN: a GAN-based Approach for Saliency Estimation in 360° Videos
Mahmoud Z. A. Wahba, Sara Baldoni, Federica Battisti
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[474] arXiv:2509.11926 [pdf, html, other]
Title: Graph Algorithm Unrolling with Douglas-Rachford Iterations for Image Interpolation with Guaranteed Initialization
Xue Zhang, Bingshuo Hu, Gene Cheung
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[475] arXiv:2509.11924 [pdf, html, other]
Title: Enriched text-guided variational multimodal knowledge distillation network (VMD) for automated diagnosis of plaque vulnerability in 3D carotid artery MRI
Bo Cao, Fan Yu, Mengmeng Feng, SenHao Zhang, Xin Meng, Yue Zhang, Zhen Qian, Jie Lu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476] arXiv:2509.11916 [pdf, html, other]
Title: NeuroGaze-Distill: Brain-informed Distillation and Depression-Inspired Geometric Priors for Robust Facial Emotion Recognition
Zilin Li, Weiwei Xu, Xuanqi Zhao, Yiran Zhu
Comments: Preprint. Vision-only deployment; EEG used only to form static prototypes. Includes appendix, 7 figures and 3 tables. Considering submission to the International Conference on Learning Representations (ICLR) 2026, Rio de Janeiro, Brazil
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477] arXiv:2509.11895 [pdf, html, other]
Title: Integrating Prior Observations for Incremental 3D Scene Graph Prediction
Marian Renz, Felix Igelbrink, Martin Atzmueller
Comments: Accepted at 24th International Conference on Machine Learning and Applications (ICMLA'25)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[478] arXiv:2509.11892 [pdf, html, other]
Title: Logit Mixture Outlier Exposure for Fine-grained Out-of-Distribution Detection
Akito Shinohara, Kohei Fukuda, Hiroaki Aizawa
Comments: Accepted to DICTA2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2509.11885 [pdf, html, other]
Title: BREA-Depth: Bronchoscopy Realistic Airway-geometric Depth Estimation
Francis Xiatian Zhang, Emile Mackute, Mohammadreza Kasaei, Kevin Dhaliwal, Robert Thomson, Mohsen Khadem
Comments: The paper has been accepted to MICCAI 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[480] arXiv:2509.11884 [pdf, html, other]
Title: SAM-TTT: Segment Anything Model via Reverse Parameter Configuration and Test-Time Training for Camouflaged Object Detection
Zhenni Yu, Li Zhao, Guobao Xiao, Xiaoqin Zhang
Comments: accepted by ACM MM 25
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2509.11878 [pdf, html, other]
Title: Do It Yourself (DIY): Modifying Images for Poems in a Zero-Shot Setting Using Weighted Prompt Manipulation
Sofia Jamil, Kotla Sai Charan, Sriparna Saha, Koustava Goswami, K J Joseph
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482] arXiv:2509.11873 [pdf, html, other]
Title: Multi-animal tracking in Transition: Comparative Insights into Established and Emerging Methods
Anne Marthe Sophie Ngo Bibinbe, Patrick Gagnon, Jamie Ahloy-Dallaire, Eric R. Paquet
Comments: 21 pages, 3 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[483] arXiv:2509.11866 [pdf, other]
Title: Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding
Meng Luo, Shengqiong Wu, Liqiang Jing, Tianjie Ju, Li Zheng, Jinxiang Lai, Tianlong Wu, Xinya Du, Jian Li, Siyuan Yan, Jiebo Luo, William Yang Wang, Hao Fei, Mong-Li Lee, Wynne Hsu
Comments: 25 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2509.11862 [pdf, html, other]
Title: Bridging Vision Language Models and Symbolic Grounding for Video Question Answering
Haodi Ma, Vyom Pathak, Daisy Zhe Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[485] arXiv:2509.11853 [pdf, html, other]
Title: Segmentation-Driven Initialization for Sparse-view 3D Gaussian Splatting
Yi-Hsin Li, Thomas Sikora, Sebastian Knorr, Måarten Sjöström
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Total of 629 entries : 1-50 ... 301-350 351-400 401-450 436-485 451-500 501-550 551-600 ... 601-629
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack