Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for June 2025

Total of 3130 entries : 1-100 ... 2701-2800 2801-2900 2901-3000 2951-3050 3001-3100 3101-3130
Showing up to 100 entries per page: fewer | more | all
[2951] arXiv:2506.18371 (cross-list from eess.IV) [pdf, html, other]
Title: Transforming H&E images into IHC: A Variance-Penalized GAN for Precision Oncology
Sara Rehmat, Hafeez Ur Rehman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2952] arXiv:2506.18378 (cross-list from eess.IV) [pdf, html, other]
Title: Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review
Haoneng Lin, Cheng Xu, Jing Qin
Comments: 34 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2953] arXiv:2506.18407 (cross-list from cs.GR) [pdf, html, other]
Title: What You Think Is What You Get: Bridge User Intent and Transfer Function Design through Multimodal Large Language Models
Yiyao Wang, Bo Pan, Ke Wang, Han Liu, Jinyuan Mao, Yuxin Liu, Minfeng Zhu, Bo Zhang, Weifeng Chen, Xiuqi Huang, Wei Chen
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[2954] arXiv:2506.18443 (cross-list from cs.RO) [pdf, html, other]
Title: Radar and Event Camera Fusion for Agile Robot Ego-Motion Estimation
Yang Lyu, Zhenghao Zou, Yanfeng Li, Chunhui Zhao, Quan Pan
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2955] arXiv:2506.18474 (cross-list from eess.IV) [pdf, html, other]
Title: A Deep Convolutional Neural Network-Based Novel Class Balancing for Imbalance Data Segmentation
Atifa Kalsoom, M.A. Iftikhar, Amjad Ali, Zubair Shah, Shidin Balakrishnan, Hazrat Ali
Comments: This is preprint of the paper submitted to Scientific Reports journal
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2956] arXiv:2506.18484 (cross-list from eess.IV) [pdf, html, other]
Title: GANs vs. Diffusion Models for virtual staining with the HER2match dataset
Pascal Klöckner, José Teixeira, Diana Montezuma, Jaime S. Cardoso, Hugo M. Horlings, Sara P. Oliveira
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2957] arXiv:2506.18512 (cross-list from eess.IV) [pdf, html, other]
Title: MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis
Yuting Zhang, Kaishen Yuan, Hao Lu, Yutao Yue, Jintai Chen, Kaishun Wu
Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[2958] arXiv:2506.18598 (cross-list from cs.LG) [pdf, html, other]
Title: No Training Wheels: Steering Vectors for Bias Correction at Inference Time
Aviral Gupta, Armaan Sethi, Ameesh Sethi
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2959] arXiv:2506.18601 (cross-list from cs.GR) [pdf, html, other]
Title: BulletGen: Improving 4D Reconstruction with Bullet-Time Generation
Denys Rozumnyi, Jonathon Luiten, Numair Khan, Johannes Schönberger, Peter Kontschieder
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2960] arXiv:2506.18671 (cross-list from cs.SD) [pdf, html, other]
Title: TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography
Yuqin Dai, Wanlu Zhu, Ronghui Li, Xiu Li, Zhenyu Zhang, Jun Li, Jian Yang
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
[2961] arXiv:2506.18680 (cross-list from cs.GR) [pdf, html, other]
Title: DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling
Anindita Ghosh, Bing Zhou, Rishabh Dabral, Jian Wang, Vladislav Golyanik, Christian Theobalt, Philipp Slusallek, Chuan Guo
Comments: 11 pages, 7 figures, 2 tables, accepted in ACM Siggraph 2025 conference track
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[2962] arXiv:2506.18720 (cross-list from eess.IV) [pdf, html, other]
Title: Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI
Daniel M. Lang, Richard Osuala, Veronika Spieker, Karim Lekadir, Rickmer Braren, Julia A. Schnabel
Comments: MICCAI 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2963] arXiv:2506.18725 (cross-list from cs.RO) [pdf, html, other]
Title: TopoRec: Point Cloud Recognition Using Topological Data Analysis
Anirban Ghosh, Iliya Kulbaka, Ian Dahlin, Ayan Dutta
Subjects: Robotics (cs.RO); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV)
[2964] arXiv:2506.18810 (cross-list from cs.AI) [pdf, html, other]
Title: ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
Siao Tang, Xinyin Ma, Gongfan Fang, Xinchao Wang
Comments: Codes are available at this https URL
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2965] arXiv:2506.18842 (cross-list from cs.DB) [pdf, html, other]
Title: LIGHTHOUSE: Fast and precise distance to shoreline calculations from anywhere on earth
Patrick Beukema, Henry Herzog, Yawen Zhang, Hunter Pitelka, Favyen Bastani
Comments: 8 pages, 7 figures, 1 table, ICML 2025 ML4RS
Subjects: Databases (cs.DB); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2966] arXiv:2506.18844 (cross-list from cs.RO) [pdf, other]
Title: Reproducible Evaluation of Camera Auto-Exposure Methods in the Field: Platform, Benchmark and Lessons Learned
Olivier Gamache, Jean-Michel Fortin, Matěj Boxan, François Pomerleau, Philippe Giguère
Comments: 19 pages, 11 figures, pre-print version of the accepted paper for IEEE Transactions on Field Robotics (T-FR)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2967] arXiv:2506.18885 (cross-list from cs.RO) [pdf, html, other]
Title: GRAND-SLAM: Local Optimization for Globally Consistent Large-Scale Multi-Agent Gaussian SLAM
Annika Thomas, Aneesa Sonawalla, Alex Rose, Jonathan P. How
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2968] arXiv:2506.18919 (cross-list from cs.CL) [pdf, html, other]
Title: MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection
Hexiang Gu, Qifan Yu, Saihui Hou, Zhiqin Fang, Huijia Wu, Zhaofeng He
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2969] arXiv:2506.19051 (cross-list from eess.IV) [pdf, html, other]
Title: NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis
Georgii Bychkov, Khaled Abud, Egor Kovalev, Alexander Gushchin, Dmitriy Vatolin, Anastasia Antsiferova
Comments: arXiv admin note: text overlap with arXiv:2411.11795
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[2970] arXiv:2506.19055 (cross-list from eess.IV) [pdf, html, other]
Title: Xray2Xray: World Model from Chest X-rays with Volumetric Context
Zefan Yang, Xinrui Song, Xuanang Xu, Yongyi Shi, Ge Wang, Mannudeep K. Kalra, Pingkun Yan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2971] arXiv:2506.19106 (cross-list from eess.IV) [pdf, html, other]
Title: Staining normalization in histopathology: Method benchmarking using multicenter dataset
Umair Khan, Jouni Härkönen, Marjukka Friman, Leena Latonen, Teijo Kuopio, Pekka Ruusuvuori
Comments: 18 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[2972] arXiv:2506.19139 (cross-list from cs.GR) [pdf, html, other]
Title: SOF: Sorted Opacity Fields for Fast Unbounded Surface Reconstruction
Lukas Radl, Felix Windisch, Thomas Deixelberger, Jozef Hladky, Michael Steiner, Dieter Schmalstieg, Markus Steinberger
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[2973] arXiv:2506.19167 (cross-list from eess.IV) [pdf, other]
Title: A Deep Learning Based Method for Fast Registration of Cardiac Magnetic Resonance Images
Benjamin Graham
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2974] arXiv:2506.19222 (cross-list from eess.IV) [pdf, html, other]
Title: Deformable Medical Image Registration with Effective Anatomical Structure Representation and Divide-and-Conquer Network
Xinke Ma, Yongsheng Pan, Qingjie Zeng, Mengkang Lu, Bolysbek Murat Yerzhanuly, Bazargul Matkerim, Yong Xia
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2975] arXiv:2506.19234 (cross-list from eess.IV) [pdf, html, other]
Title: Quantitative Benchmarking of Anomaly Detection Methods in Digital Pathology
Can Cui, Xindong Zheng, Ruining Deng, Quan Liu, Tianyuan Yao, Keith T Wilson, Lori A Coburn, Bennett A Landman, Haichun Yang, Yaohong Wang, Yuankai Huo
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2976] arXiv:2506.19266 (cross-list from q-bio.NC) [pdf, other]
Title: Convergent and divergent connectivity patterns of the arcuate fasciculus in macaques and humans
Jiahao Huang, Ruifeng Li, Wenwen Yu, Anan Li, Xiangning Li, Mingchao Yan, Lei Xie, Qingrun Zeng, Xueyan Jia, Shuxin Wang, Ronghui Ju, Feng Chen, Qingming Luo, Hui Gong, Andrew Zalesky, Xiaoquan Yang, Yuanjing Feng, Zheng Wang
Comments: 34 pages, 6 figures
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[2977] arXiv:2506.19297 (cross-list from eess.IV) [pdf, html, other]
Title: Explicit Residual-Based Scalable Image Coding for Humans and Machines
Yui Tatsumi, Ziyue Zeng, Hiroshi Watanabe
Comments: Accepted to IEEE 27th International Workshop on Multimedia Signal Processing (MMSP 2025)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2978] arXiv:2506.19360 (cross-list from cs.CR) [pdf, html, other]
Title: SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation
Yunsung Chung, Yunbei Zhang, Nassir Marrouche, Jihun Hamm
Comments: Accepted at the 34th USENIX Security Symposium (USENIX Security '25). 21 pages, plus a 6-page appendix
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[2979] arXiv:2506.19363 (cross-list from eess.IV) [pdf, html, other]
Title: Reconsidering Explicit Longitudinal Mammography Alignment for Enhanced Breast Cancer Risk Prediction
Solveig Thrun, Stine Hansen, Zijun Sun, Nele Blum, Suaiba A. Salahuddin, Kristoffer Wickstrøm, Elisabeth Wetzer, Robert Jenssen, Maik Stille, Michael Kampffmeyer
Comments: MICCAI 2025, early accepted
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2980] arXiv:2506.19387 (cross-list from eess.IV) [pdf, other]
Title: NAADA: A Noise-Aware Attention Denoising Autoencoder for Dental Panoramic Radiographs
Khuram Naveed, Bruna Neves de Freitas, Ruben Pauwels
Comments: 10 pages, 8 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2981] arXiv:2506.19415 (cross-list from cs.GR) [pdf, html, other]
Title: Virtual Memory for 3D Gaussian Splatting
Jonathan Haberl, Philipp Fleck, Clemens Arth
Comments: Based on the Master Thesis from Jonathan Haberl from 2024, Submitted to TVCG in Feb. 2025;
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[2982] arXiv:2506.19455 (cross-list from eess.IV) [pdf, html, other]
Title: Angio-Diff: Learning a Self-Supervised Adversarial Diffusion Model for Angiographic Geometry Generation
Zhifeng Wang, Renjiao Yi, Xin Wen, Chenyang Zhu, Kai Xu, Kunlun He
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2983] arXiv:2506.19464 (cross-list from eess.IV) [pdf, html, other]
Title: Assessing Risk of Stealing Proprietary Models for Medical Imaging Tasks
Ankita Raj, Harsh Swaika, Deepankar Varma, Chetan Arora
Comments: Accepted to MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[2984] arXiv:2506.19491 (cross-list from cs.ET) [pdf, html, other]
Title: Experimental Assessment of Neural 3D Reconstruction for Small UAV-based Applications
Genís Castillo Gómez-Raya, Álmos Veres-Vitályos, Filip Lemic, Pablo Royo, Mario Montagud, Sergi Fernández, Sergi Abadal, Xavier Costa-Pérez
Comments: 6 pages, 7 figures, 2 tables, accepted at IEEE International Symposium on Personal, Indoor and Mobile Radio Communications 2025
Subjects: Emerging Technologies (cs.ET); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI); Image and Video Processing (eess.IV)
[2985] arXiv:2506.19558 (cross-list from cs.LG) [pdf, html, other]
Title: ConCM: Consistency-Driven Calibration and Matching for Few-Shot Class-Incremental Learning
QinZhe Wang, Zixuan Chen, Keke Huang, Xiu Su, Chunhua Yang, Chang Xu
Comments: 9 pages, 5 figures(Excluding the appendix)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2986] arXiv:2506.19579 (cross-list from cs.RO) [pdf, html, other]
Title: Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects
Federico Tavella, Kathryn Mearns, Angelo Cangelosi
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2987] arXiv:2506.19590 (cross-list from eess.IV) [pdf, html, other]
Title: Learning from Anatomy: Supervised Anatomical Pretraining (SAP) for Improved Metastatic Bone Disease Segmentation in Whole-Body MRI
Joris Wuts, Jakub Ceranka, Nicolas Michoux, Frédéric Lecouvet, Jef Vandemeulebroucke
Comments: This preprint is currently under review at *Computers in Biology and Medicine* (Elsevier). This version has not been peer-reviewed
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2988] arXiv:2506.19600 (cross-list from eess.IV) [pdf, html, other]
Title: Filling of incomplete sinograms from sparse PET detector configurations using a residual U-Net
Klara Leffler, Luigi Tommaso Luppino, Samuel Kuttner, Karin Söderkvist, Jan Axelsson
Comments: 15 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[2989] arXiv:2506.19687 (cross-list from eess.IV) [pdf, html, other]
Title: ReCoGNet: Recurrent Context-Guided Network for 3D MRI Prostate Segmentation
Ahmad Mustafa, Reza Rastegar, Ghassan AlRegib
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2990] arXiv:2506.19708 (cross-list from cs.GR) [pdf, html, other]
Title: Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders
Matyas Bohacek, Thomas Fel, Maneesh Agrawala, Ekdeep Singh Lubana
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2991] arXiv:2506.19741 (cross-list from cs.LG) [pdf, html, other]
Title: Noise Consistency Training: A Native Approach for One-Step Generator in Learning Additional Controls
Yihong Luo, Shuchen Xue, Tianyang Hu, Jing Tang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[2992] arXiv:2506.19742 (cross-list from eess.IV) [pdf, html, other]
Title: NeRF-based CBCT Reconstruction needs Normalization and Initialization
Zhuowei Xu, Han Li, Dai Sun, Zhicheng Li, Yujia Li, Qingpeng Kong, Zhiwei Cheng, Nassir Navab, S. Kevin Zhou
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2993] arXiv:2506.19797 (cross-list from eess.IV) [pdf, html, other]
Title: Systematic Review of Pituitary Gland and Pituitary Adenoma Automatic Segmentation Techniques in Magnetic Resonance Imaging
Mubaraq Yakubu, Navodini Wijethilake, Jonathan Shapey, Andrew King, Alexander Hammers
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2994] arXiv:2506.19807 (cross-list from cs.AI) [pdf, other]
Title: KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
Baochang Ren, Shuofei Qiao, Wenhao Yu, Huajun Chen, Ningyu Zhang
Comments: Work in progress
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[2995] arXiv:2506.19816 (cross-list from cs.RO) [pdf, html, other]
Title: CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation
Hao Li, Shuai Yang, Yilun Chen, Yang Tian, Xiaoda Yang, Xinyi Chen, Hanqing Wang, Tai Wang, Feng Zhao, Dahua Lin, Jiangmiao Pang
Comments: 36 pages, 21 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2996] arXiv:2506.19827 (cross-list from cs.RO) [pdf, html, other]
Title: Look to Locate: Vision-Based Multisensory Navigation with 3-D Digital Maps for GNSS-Challenged Environments
Ola Elmaghraby, Eslam Mounier, Paulo Ricardo Marques de Araujo, Aboelmagd Noureldin
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2997] arXiv:2506.19847 (cross-list from cs.LG) [pdf, html, other]
Title: Orthogonal Finetuning Made Scalable
Zeju Qiu, Weiyang Liu, Adrian Weller, Bernhard Schölkopf
Comments: Technical report (17 pages, 7 figures, project page: this https URL)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2998] arXiv:2506.19860 (cross-list from eess.SP) [pdf, html, other]
Title: A Multi-Modal Spatial Risk Framework for EV Charging Infrastructure Using Remote Sensing
Oktay Karakuş, Padraig Corcoran
Comments: 11 pages, 4 figures, 2 tables
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV)
[2999] arXiv:2506.19935 (cross-list from cs.LG) [pdf, html, other]
Title: Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture
Shuchen Xue, Tianyu Xie, Tianyang Hu, Zijin Feng, Jiacheng Sun, Kenji Kawaguchi, Zhenguo Li, Zhi-Ming Ma
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[3000] arXiv:2506.19975 (cross-list from eess.IV) [pdf, html, other]
Title: VoxelOpt: Voxel-Adaptive Message Passing for Discrete Optimization in Deformable Abdominal CT Registration
Hang Zhang, Yuxi Zhang, Jiazheng Wang, Xiang Chen, Renjiu Hu, Xin Tian, Gaolei Li, Min Liu
Comments: Accepted for publication at MICCAI 2025
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[3001] arXiv:2506.20045 (cross-list from cs.RO) [pdf, html, other]
Title: Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception
Eric C. Joyce, Qianwen Zhao, Nathaniel Burgdorfer, Long Wang, Philippos Mordohai
Comments: Accepted to IROS 2025
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[3002] arXiv:2506.20100 (cross-list from cs.LG) [pdf, html, other]
Title: MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided Conversations
Vardhan Dongre, Chi Gui, Shubham Garg, Hooshang Nayyeri, Gokhan Tur, Dilek Hakkani-Tür, Vikram S. Adve
Comments: 66 pages, 32 figures, 23 tables
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[3003] arXiv:2506.20200 (cross-list from eess.IV) [pdf, html, other]
Title: MS-IQA: A Multi-Scale Feature Fusion Network for PET/CT Image Quality Assessment
Siqiao Li, Chen Hui, Wei Zhang, Rui Liang, Chenyue Song, Feng Jiang, Haiqi Zhu, Zhixuan Li, Hong Huang, Xiang Li
Comments: Accepted to MICCAI 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3004] arXiv:2506.20245 (cross-list from cs.LG) [pdf, html, other]
Title: FedBKD: Distilled Federated Learning to Embrace Gerneralization and Personalization on Non-IID Data
Yushan Zhao, Jinyuan He, Donglai Chen, Weijie Luo, Chong Xie, Ri Zhang, Yonghong Chen, Yan Xu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3005] arXiv:2506.20267 (cross-list from cs.GR) [pdf, html, other]
Title: X-SiT: Inherently Interpretable Surface Vision Transformers for Dementia Diagnosis
Fabian Bongratz, Tom Nuno Wolf, Jaume Gual Ramon, Christian Wachinger
Comments: MICCAI 2025
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[3006] arXiv:2506.20282 (cross-list from eess.IV) [pdf, html, other]
Title: Opportunistic Osteoporosis Diagnosis via Texture-Preserving Self-Supervision, Mixture of Experts and Multi-Task Integration
Jiaxing Huang, Heng Guo, Le Lu, Fan Yang, Minfeng Xu, Ge Yang, Wei Luo
Comments: Accepted by MICCAI 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3007] arXiv:2506.20303 (cross-list from eess.IV) [pdf, other]
Title: FundaQ-8: A Clinically-Inspired Scoring Framework for Automated Fundus Image Quality Assessment
Lee Qi Zun, Oscar Wong Jin Hao, Nor Anita Binti Che Omar, Zalifa Zakiah Binti Asnir, Mohamad Sabri bin Sinal Zainal, Goh Man Fye
Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[3008] arXiv:2506.20305 (cross-list from cs.LG) [pdf, html, other]
Title: Learning Moderately Input-Sensitive Functions: A Case Study in QR Code Decoding
Kazuki Yoda, Kazuhiko Kawamoto, Hiroshi Kera
Comments: 17 pages, 13 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[3009] arXiv:2506.20333 (cross-list from eess.IV) [pdf, html, other]
Title: EAGLE: An Efficient Global Attention Lesion Segmentation Model for Hepatic Echinococcosis
Jiayan Chen, Kai Li, Yulu Zhao, Jianqiang Huang, Zhan Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3010] arXiv:2506.20355 (cross-list from quant-ph) [pdf, html, other]
Title: Practical insights on the effect of different encodings, ansätze and measurements in quantum and hybrid convolutional neural networks
Jesús Lozano-Cruz, Albert Nieto-Morales, Oriol Balló-Gimbernat, Adan Garriga, Antón Rodríguez-Otero, Alejandro Borrallo-Rentero
Comments: 20 pages, 22 figures
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[3011] arXiv:2506.20367 (cross-list from cs.GR) [pdf, html, other]
Title: DreamAnywhere: Object-Centric Panoramic 3D Scene Generation
Edoardo Alberto Dominici, Jozef Hladky, Floor Verhoeven, Lukas Radl, Thomas Deixelberger, Stefan Ainetter, Philipp Drescher, Stefan Hauswiesner, Arno Coomans, Giacomo Nazzaro, Konstantinos Vardis, Markus Steinberger
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[3012] arXiv:2506.20407 (cross-list from eess.IV) [pdf, html, other]
Title: Fusing Radiomic Features with Deep Representations for Gestational Age Estimation in Fetal Ultrasound Images
Fangyijie Wang, Yuan Liang, Sourav Bhattacharjee, Abey Campbell, Kathleen M. Curran, Guénolé Silvestre
Comments: Accepted at MICCAI 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3013] arXiv:2506.20430 (cross-list from cs.CL) [pdf, html, other]
Title: An Agentic System for Rare Disease Diagnosis with Traceable Reasoning
Weike Zhao, Chaoyi Wu, Yanjie Fan, Xiaoman Zhang, Pengcheng Qiu, Yuze Sun, Xiao Zhou, Yanfeng Wang, Xin Sun, Ya Zhang, Yongguo Yu, Kun Sun, Weidi Xie
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[3014] arXiv:2506.20566 (cross-list from cs.RO) [pdf, html, other]
Title: HRIBench: Benchmarking Vision-Language Models for Real-Time Human Perception in Human-Robot Interaction
Zhonghao Shi, Enyu Zhao, Nathaniel Dennler, Jingzhen Wang, Xinyang Xu, Kaleen Shrestha, Mengxue Fu, Daniel Seita, Maja Matarić
Comments: Accepted to the 19th International Symposium on Experimental Robotics (ISER 2025)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[3015] arXiv:2506.20614 (cross-list from eess.IV) [pdf, html, other]
Title: Weighted Mean Frequencies: a handcraft Fourier feature for 4D Flow MRI segmentation
Simon Perrin, Sébastien Levilly, Huajun Sun, Harold Mouchère, Jean-Michel Serfaty
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3016] arXiv:2506.20652 (cross-list from cs.GR) [pdf, html, other]
Title: EditP23: 3D Editing via Propagation of Image Prompts to Multi-View
Roi Bar-On, Dana Cohen-Bar, Daniel Cohen-Or
Comments: Code, supplementary videos, interactive 3D visualizations, and additional results are available at this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[3017] arXiv:2506.20683 (cross-list from eess.IV) [pdf, html, other]
Title: Global and Local Contrastive Learning for Joint Representations from Cardiac MRI and ECG
Alexander Selivanov, Philip Müller, Özgün Turgut, Nil Stolt-Ansó, Daniel Rückert
Comments: accepted to MICCAI 2025 (Springer LNCS)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[3018] arXiv:2506.20689 (cross-list from eess.IV) [pdf, other]
Title: U-R-VEDA: Integrating UNET, Residual Links, Edge and Dual Attention, and Vision Transformer for Accurate Semantic Segmentation of CMRs
Racheal Mukisa, Arvind K. Bansal
Comments: 15 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[3019] arXiv:2506.20703 (cross-list from cs.GR) [pdf, html, other]
Title: Generative Blocks World: Moving Things Around in Pictures
Vaibhav Vavilala, Seemandhar Jain, Rahul Vasanth, D.A. Forsyth, Anand Bhattad
Comments: 23 pages, 16 figures, 2 tables
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[3020] arXiv:2506.20812 (cross-list from cs.RO) [pdf, html, other]
Title: Model-Based Real-Time Pose and Sag Estimation of Overhead Power Lines Using LiDAR for Drone Inspection
Alexandre Girard, Steven A. Parkison, Philippe Hamelin
Comments: Submitted to IEEE case 2025
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[3021] arXiv:2506.20816 (cross-list from cs.LG) [pdf, html, other]
Title: Universal and Efficient Detection of Adversarial Data through Nonuniform Impact on Network Layers
Furkan Mumcu, Yasin Yilmaz
Comments: arXiv admin note: substantial text overlap with arXiv:2410.17442
Journal-ref: Transactions on Machine Learning Research, June 2025
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[3022] arXiv:2506.20875 (cross-list from cs.GR) [pdf, html, other]
Title: 3DGH: 3D Head Generation with Composable Hair and Face
Chengan He, Junxuan Li, Tobias Kirschstein, Artem Sevastopolsky, Shunsuke Saito, Qingyang Tan, Javier Romero, Chen Cao, Holly Rushmeier, Giljoo Nam
Comments: Accepted to SIGGRAPH 2025. Project page: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[3023] arXiv:2506.20897 (cross-list from eess.IV) [pdf, html, other]
Title: Development of MR spectral analysis method robust against static magnetic field inhomogeneity
Shuki Maruyama, Hidenori Takeshima
Comments: 11 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3024] arXiv:2506.20946 (cross-list from cs.GR) [pdf, html, other]
Title: Consistent Zero-shot 3D Texture Synthesis Using Geometry-aware Diffusion and Temporal Video Models
Donggoo Kang, Jangyeong Kim, Dasol Jeong, Junyoung Choi, Jeonga Wi, Hyunmin Lee, Joonho Gwon, Joonki Paik
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3025] arXiv:2506.20969 (cross-list from cs.RO) [pdf, html, other]
Title: ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation
Shruti Bansal, Wenshan Wang, Yifei Liu, Parv Maheshwari
Comments: Accepted at Thermal Infrared in Robotics (TIRO) Workshop, ICRA 2025
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[3026] arXiv:2506.20990 (cross-list from cs.LG) [pdf, html, other]
Title: SharpZO: Hybrid Sharpness-Aware Vision Language Model Prompt Tuning via Forward-Only Passes
Yifan Yang, Zhen Zhang, Rupak Vignesh Swaminathan, Jing Liu, Nathan Susanj, Zheng Zhang
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[3027] arXiv:2506.21037 (cross-list from cs.LG) [pdf, html, other]
Title: RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment
Suorong Yang, Peijia Li, Furao Shen, Jian Zhao
Comments: ICCV 2025
Journal-ref: ICCV 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[3028] arXiv:2506.21041 (cross-list from cs.RO) [pdf, html, other]
Title: SEAL: Vision-Language Model-Based Safe End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling
Junwei You, Pei Li, Zhuoyu Jiang, Zilin Huang, Rui Gan, Haotian Shi, Bin Ran
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3029] arXiv:2506.21144 (cross-list from cs.LG) [pdf, html, other]
Title: Personalized Federated Learning via Dual-Prompt Optimization and Cross Fusion
Yuguang Zhang, Kuangpu Guo, Zhihe Lu, Yunbo Wang, Jian Liang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[3030] arXiv:2506.21171 (cross-list from eess.IV) [pdf, other]
Title: Uncover Treasures in DCT: Advancing JPEG Quality Enhancement by Exploiting Latent Correlations
Jing Yang, Qunliang Xing, Mai Xu, Minglang Qiao
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3031] arXiv:2506.21245 (cross-list from eess.IV) [pdf, html, other]
Title: GANet-Seg: Adversarial Learning for Brain Tumor Segmentation with Hybrid Generative Models
Qifei Cui, Xinyu Lu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3032] arXiv:2506.21272 (cross-list from cs.GR) [pdf, html, other]
Title: FairyGen: Storied Cartoon Video from a Single Child-Drawn Character
Jiayi Zheng, Xiaodong Cun
Comments: Project Page: this https URL ; Code: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[3033] arXiv:2506.21319 (cross-list from cs.HC) [pdf, html, other]
Title: SimVecVis: A Dataset for Enhancing MLLMs in Visualization Understanding
Can Liu, Chunlin Da, Xiaoxiao Long, Yuxiao Yang, Yu Zhang, Yong Wang
Subjects: Human-Computer Interaction (cs.HC); Computer Vision and Pattern Recognition (cs.CV)
[3034] arXiv:2506.21331 (cross-list from cs.DL) [pdf, html, other]
Title: Automatic Reviewers Assignment to a Research Paper Based on Allied References and Publications Weight
Tamim Al Mahmud, B M Mainul Hossain, Dilshad Ara
Comments: IEEE Conference Proceedings (5 Pages)
Journal-ref: 2018 4th International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 2018, pp. 1-5
Subjects: Digital Libraries (cs.DL); Computer Vision and Pattern Recognition (cs.CV)
[3035] arXiv:2506.21448 (cross-list from eess.AS) [pdf, html, other]
Title: ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing
Huadai Liu, Jialei Wang, Kaicheng Luo, Wen Wang, Qian Chen, Zhou Zhao, Wei Xue
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD)
[3036] arXiv:2506.21458 (cross-list from cs.AI) [pdf, other]
Title: Spatial Mental Modeling from Limited Views
Baiqiao Yin, Qineng Wang, Pingyue Zhang, Jianshu Zhang, Kangrui Wang, Zihan Wang, Jieyu Zhang, Keshigeyan Chandrasegaran, Han Liu, Ranjay Krishna, Saining Xie, Manling Li, Jiajun Wu, Li Fei-Fei
Comments: Preprint version
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[3037] arXiv:2506.21499 (cross-list from eess.IV) [pdf, html, other]
Title: Lightweight Physics-Informed Zero-Shot Ultrasound Plane Wave Denoising
Hojat Asgariandehkordi, Mostafa Sharifzadeh, Hassan Rivaz
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3038] arXiv:2506.21535 (cross-list from eess.IV) [pdf, html, other]
Title: Exploring the Design Space of 3D MLLMs for CT Report Generation
Mohammed Baharoon, Jun Ma, Congyu Fang, Augustin Toma, Bo Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[3039] arXiv:2506.21537 (cross-list from quant-ph) [pdf, html, other]
Title: ResQ: A Novel Framework to Implement Residual Neural Networks on Analog Rydberg Atom Quantum Computers
Nicholas S. DiBrita, Jason Han, Tirthak Patel
Comments: ResQ will appear in the Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2025
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
[3040] arXiv:2506.21586 (cross-list from cs.CL) [pdf, html, other]
Title: Can Vision Language Models Understand Mimed Actions?
Hyundong Cho, Spencer Lin, Tejas Srinivasan, Michael Saxon, Deuksin Kwon, Natali T. Chavez, Jonathan May
Comments: ACL 2025 Findings
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3041] arXiv:2506.21592 (cross-list from cs.CL) [pdf, html, other]
Title: SignBart -- New approach with the skeleton sequence for Isolated Sign language Recognition
Tinh Nguyen, Minh Khue Phan Tran
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[3042] arXiv:2506.21601 (cross-list from cs.IR) [pdf, html, other]
Title: Hierarchical Patch Compression for ColPali: Efficient Multi-Vector Document Retrieval with Dynamic Pruning and Quantization
Duong Bach
Comments: 9 pages
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV)
[3043] arXiv:2506.21604 (cross-list from cs.IR) [pdf, html, other]
Title: Evaluating VisualRAG: Quantifying Cross-Modal Performance in Enterprise Document Understanding
Varun Mannam, Fang Wang, Xin Chen
Comments: Conference: KDD conference workshop: this https URL
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[3044] arXiv:2506.21629 (cross-list from cs.GR) [pdf, html, other]
Title: ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes
Chenhao Zhang, Yezhi Shen, Fengqing Zhu
Comments: 6 pages, Source code is available at this https URL. To appear at ICIP 2025
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[3045] arXiv:2506.21630 (cross-list from cs.RO) [pdf, html, other]
Title: TOMD: A Trail-based Off-road Multimodal Dataset for Traversable Pathway Segmentation under Challenging Illumination Conditions
Yixin Sun, Li Li, Wenke E, Amir Atapour-Abarghouei, Toby P. Breckon
Comments: 8 pages, 9 figures, 2025 IJCNN
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[3046] arXiv:2506.21635 (cross-list from cs.RO) [pdf, html, other]
Title: AeroLite-MDNet: Lightweight Multi-task Deviation Detection Network for UAV Landing
Haiping Yang, Huaxing Liu, Wei Wu, Zuohui Chen, Ning Wu
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3047] arXiv:2506.21655 (cross-list from cs.LG) [pdf, html, other]
Title: APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization
Minjie Hong, Zirun Guo, Yan Xia, Zehan Wang, Ziang Zhang, Tao Jin, Zhou Zhao
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[3048] arXiv:2506.21680 (cross-list from eess.IV) [pdf, html, other]
Title: PhotonSplat: 3D Scene Reconstruction and Colorization from SPAD Sensors
Sai Sri Teja, Sreevidya Chintalapati, Vinayak Gupta, Mukund Varma T, Haejoon Lee, Aswin Sankaranarayanan, Kaushik Mitra
Comments: Accepted at the International Conference on Computational Photography(ICCP) 2025
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[3049] arXiv:2506.21714 (cross-list from cs.LG) [pdf, html, other]
Title: ODE$_t$(ODE$_l$): Shortcutting the Time and Length in Diffusion and Flow Models for Faster Sampling
Denis Gudovskiy, Wenzhao Zheng, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer
Comments: Preprint. Github page: this http URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[3050] arXiv:2506.21732 (cross-list from cs.RO) [pdf, html, other]
Title: Experimental investigation of pose informed reinforcement learning for skid-steered visual navigation
Ameya Salvi, Venkat Krovi
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
Total of 3130 entries : 1-100 ... 2701-2800 2801-2900 2901-3000 2951-3050 3001-3100 3101-3130
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack