Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for recent submissions

  • Fri, 12 Sep 2025
  • Thu, 11 Sep 2025
  • Wed, 10 Sep 2025
  • Tue, 9 Sep 2025
  • Mon, 8 Sep 2025

See today's new changes

Total of 522 entries : 1-100 301-400 401-500 501-522 507-522
Showing up to 100 entries per page: fewer | more | all

Mon, 8 Sep 2025 (continued, showing last 16 of 69 entries )

[507] arXiv:2509.05263 (cross-list from cs.AI) [pdf, html, other]
Title: LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation
Yinglin Duan, Zhengxia Zou, Tongwei Gu, Wei Jia, Zhan Zhao, Luyi Xu, Xinzhu Liu, Yenan Lin, Hao Jiang, Kang Chen, Shuang Qiu
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[508] arXiv:2509.05201 (cross-list from cs.RO) [pdf, html, other]
Title: Robust Model Predictive Control Design for Autonomous Vehicles with Perception-based Observers
Nariman Niknejad, Gokul S. Sankar, Bahare Kiumarsi, Hamidreza Modares
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[509] arXiv:2509.05154 (cross-list from eess.IV) [pdf, html, other]
Title: VLSM-Ensemble: Ensembling CLIP-based Vision-Language Models for Enhanced Medical Image Segmentation
Julia Dietlmeier, Oluwabukola Grace Adegboro, Vayangi Ganepola, Claudia Mazo, Noel E. O'Connor
Comments: Medical Imaging with Deep Learning (MIDL 2025) short paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[510] arXiv:2509.05146 (cross-list from cs.CL) [pdf, html, other]
Title: PRIM: Towards Practical In-Image Multilingual Machine Translation
Yanzhi Tian, Zeming Liu, Zhengyang Liu, Chong Feng, Xin Li, Heyan Huang, Yuhang Guo
Comments: Accepted to EMNLP 2025 Main Conference
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[511] arXiv:2509.05031 (cross-list from cs.RO) [pdf, html, other]
Title: Pointing-Guided Target Estimation via Transformer-Based Attention
Luca Müller, Hassan Ali, Philipp Allgeuer, Lukáš Gajdošech, Stefan Wermter
Comments: Accepted at the 34th International Conference on Artificial Neural Networks (ICANN) 2025,12 pages,4 figures,1 table; work was co-funded by Horizon Europe project TERAIS under Grant agreement number 101079338
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[512] arXiv:2509.04948 (cross-list from cs.RO) [pdf, html, other]
Title: Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)
Emanuela Boros
Comments: Master's thesis
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[513] arXiv:2509.04908 (cross-list from cs.AI) [pdf, html, other]
Title: SparkUI-Parser: Enhancing GUI Perception with Robust Grounding and Parsing
Hongyi Jing, Jiafu Chen, Chen Rao, Ziqiang Dang, Jiajie Teng, Tianyi Chu, Juncheng Mo, Shuo Fang, Huaizhong Lin, Rui Lv, Chenguang Ma, Lei Zhao
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[514] arXiv:2509.04870 (cross-list from eess.IV) [pdf, html, other]
Title: Multi-modal Uncertainty Robust Tree Cover Segmentation For High-Resolution Remote Sensing Images
Yuanyuan Gui, Wei Li, Yinjian Wang, Xiang-Gen Xia, Mauro Marty, Christian Ginzler, Zuyuan Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[515] arXiv:2509.04849 (cross-list from quant-ph) [pdf, other]
Title: Histogram Driven Amplitude Embedding for Qubit Efficient Quantum Image Compression
Sahil Tomar, Sandeep Kumar
Comments: 7 pages
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Information Theory (cs.IT)
[516] arXiv:2509.04819 (cross-list from eess.IV) [pdf, other]
Title: AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations
Shuhan Ding, Jingjing Fu, Yu Gu, Naiteek Sangani, Mu Wei, Paul Vozila, Nan Liu, Jiang Bian, Hoifung Poon
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[517] arXiv:2509.04745 (cross-list from cs.CL) [pdf, html, other]
Title: Phonological Representation Learning for Isolated Signs Improves Out-of-Vocabulary Generalization
Lee Kezar, Zed Sehyr, Jesse Thomason
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[518] arXiv:2509.04734 (cross-list from cs.LG) [pdf, html, other]
Title: Beyond I-Con: Exploring New Dimension of Distance Measures in Representation Learning
Jasmine Shone, Shaden Alshammari, Mark Hamilton, Zhening Li, William Freeman
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[519] arXiv:2509.04719 (cross-list from cs.DC) [pdf, html, other]
Title: STADI: Fine-Grained Step-Patch Diffusion Parallelism for Heterogeneous GPUs
Han Liang, Jiahui Zhou, Zicheng Zhou, Xiaoxi Zhang, Xu Chen
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Computer Vision and Pattern Recognition (cs.CV)
[520] arXiv:2509.04682 (cross-list from cs.SD) [pdf, html, other]
Title: Ecologically Valid Benchmarking and Adaptive Attention: Scalable Marine Bioacoustic Monitoring
Nicholas R. Rasmussen, Rodrigue Rizk, Longwei Wang, KC Santosh
Comments: Under review as an anonymous submission to IEEETAI - We are allowed an archive submission. Final formatting is yet to be determined
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[521] arXiv:2509.04677 (cross-list from eess.IV) [pdf, html, other]
Title: Inferring the Graph Structure of Images for Graph Neural Networks
Mayur S Gowda, John Shi, Augusto Santos, José M. F. Moura
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[522] arXiv:2509.04606 (cross-list from cs.CL) [pdf, html, other]
Title: Sample-efficient Integration of New Modalities into Large Language Models
Osman Batur İnce, André F. T. Martins, Oisin Mac Aodha, Edoardo M. Ponti
Comments: Pre-print
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Total of 522 entries : 1-100 301-400 401-500 501-522 507-522
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack