Computer Vision and Pattern Recognition

Authors and titles for June 2025

Total of 3130 entries : 1-50 ... 2551-2600 2601-2650 2651-2700 2701-2750 2751-2800 2801-2850 2851-2900 ... 3101-3130

Showing up to 50 entries per page: fewer | more | all

[2701] arXiv:2506.10028 (cross-list from cs.CR) [pdf, other]: Title: Secure Data Access in Cloud Environments Using Quantum Cryptography

S. Vasavi Venkata Lakshmi, Ziaul Haque Choudhury

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[2702] arXiv:2506.10054 (cross-list from cs.LG) [pdf, html, other]: Title: Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Shangpin Peng, Weinong Wang, Zhuotao Tian, Senqiao Yang, Xing Wu, Haotian Xu, Chengquan Zhang, Takashi Isobe, Baotian Hu, Min Zhang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2703] arXiv:2506.10142 (cross-list from eess.IV) [pdf, html, other]: Title: Rethinking Brain Tumor Segmentation from the Frequency Domain Perspective

Minye Shao, Zeyu Wang, Haoran Duan, Yawen Huang, Bing Zhai, Shizheng Wang, Yang Long, Yefeng Zheng

Comments: Accepted by IEEE Transactions on Medical Imaging

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2704] arXiv:2506.10146 (cross-list from cs.LG) [pdf, html, other]: Title: Balanced Hyperbolic Embeddings Are Natural Out-of-Distribution Detectors

Tejaswi Kasarla, Max van Spengler, Pascal Mettes

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2705] arXiv:2506.10172 (cross-list from cs.RO) [pdf, html, other]: Title: A Navigation Framework Utilizing Vision-Language Models

Yicheng Duan, Kaiyu tang

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2706] arXiv:2506.10177 (cross-list from cs.LG) [pdf, html, other]: Title: Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models

Defang Chen, Zhenyu Zhou, Can Wang, Siwei Lyu

Comments: 50 pages. The short version appeared in ICML 2024. arXiv admin note: substantial text overlap with arXiv:2405.11326

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[2707] arXiv:2506.10230 (cross-list from eess.IV) [pdf, html, other]: Title: Prompt-Guided Latent Diffusion with Predictive Class Conditioning for 3D Prostate MRI Generation

Emerson P. Grabke, Masoom A. Haider, Babak Taati

Comments: MAH and BT are co-senior authors on the work. This work has been submitted to the IEEE for possible publication

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2708] arXiv:2506.10233 (cross-list from eess.IV) [pdf, html, other]: Title: Conditional diffusion models for guided anomaly detection in brain images using fluid-driven anomaly randomization

Ana Lawry Aguila, Peirong Liu, Oula Puonti, Juan Eugenio Iglesias

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2709] arXiv:2506.10251 (cross-list from eess.SY) [pdf, other]: Title: Energy Aware Camera Location Search Algorithm for Increasing Precision of Observation in Automated Manufacturing

Rongfei Li, Francis Assadian

Comments: 35 pages, 24 figures, Journal, Published in: Applied Sciences, 2024, vol. 14, article 9140. For published version, see this http URL: this https URL

Journal-ref: Appl. Sci. 2024, 14, 9140

Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[2710] arXiv:2506.10265 (cross-list from eess.SP) [pdf, html, other]: Title: Ground Reaction Force Estimation via Time-aware Knowledge Distillation

Eun Som Jeon, Sinjini Mitra, Jisoo Lee, Omik M. Save, Ankita Shukla, Hyunglae Lee, Pavan Turaga

Journal-ref: IEEE Internet of Things Journal, 2025

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[2711] arXiv:2506.10309 (cross-list from eess.IV) [pdf, html, other]: Title: DUN-SRE: Deep Unrolling Network with Spatiotemporal Rotation Equivariance for Dynamic MRI Reconstruction

Yuliang Zhu, Jing Cheng, Qi Xie, Zhuo-Xu Cui, Qingyong Zhu, Yuanyuan Liu, Xin Liu, Jianfeng Ren, Chengbo Wang, Dong Liang

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2712] arXiv:2506.10325 (cross-list from eess.IV) [pdf, html, other]: Title: SWDL: Stratum-Wise Difference Learning with Deep Laplacian Pyramid for Semi-Supervised 3D Intracranial Hemorrhage Segmentation

Cheng Wang, Siqi Chen, Donghua Mi, Yang Chen, Yudong Zhang, Yinsheng Li

Comments: 11 pages, 4 figures, 6 Tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2713] arXiv:2506.10407 (cross-list from eess.SY) [pdf, html, other]: Title: Semi-Tensor-Product Based Convolutional Neural Networks

Daizhan Cheng

Subjects: Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2714] arXiv:2506.10415 (cross-list from cs.CL) [pdf, html, other]: Title: Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?

Yingjin Song, Yupei Du, Denis Paperno, Albert Gatt

Comments: 27 pages, 14 figures. Accepted to ACL 2025

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2715] arXiv:2506.10468 (cross-list from cs.GR) [pdf, html, other]: Title: Low-Barrier Dataset Collection with Real Human Body for Interactive Per-Garment Virtual Try-On

Zaiqiang Wu, Yechen Li, Jingyuan Liu, Yuki Shibata, Takayuki Hori, I-Chao Shen, Takeo Igarashi

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[2716] arXiv:2506.10507 (cross-list from cs.GR) [pdf, html, other]: Title: Edit360: 2D Image Edits to 3D Assets from Any Angle

Junchao Huang, Xinting Hu, Shaoshuai Shi, Zhuotao Tian, Li Jiang

Comments: 11 pages, 9 figures

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[2717] arXiv:2506.10540 (cross-list from cs.MA) [pdf, html, other]: Title: AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

Haoyuan Shi, Yunxin Li, Xinyu Chen, Longyue Wang, Baotian Hu, Min Zhang

Subjects: Multiagent Systems (cs.MA); Computer Vision and Pattern Recognition (cs.CV)
[2718] arXiv:2506.10580 (cross-list from cs.GR) [pdf, html, other]: Title: Transformer IMU Calibrator: Dynamic On-body IMU Calibration for Inertial Motion Capture

Chengxu Zuo, Jiawei Huang, Xiao Jiang, Yuan Yao, Xiangren Shi, Rui Cao, Xinyu Yi, Feng Xu, Shihui Guo, Yipeng Qin

Comments: Accepted by SIGGRAPH 2025 (TOG)

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[2719] arXiv:2506.10600 (cross-list from cs.RO) [pdf, html, other]: Title: EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

Xinjie Wang, Liu Liu, Yu Cao, Ruiqi Wu, Wenkang Qin, Dehui Wang, Wei Sui, Zhizhong Su

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2720] arXiv:2506.10617 (cross-list from cs.LG) [pdf, html, other]: Title: Deep Learning-Based Digitization of Overlapping ECG Images with Open-Source Python Code

Reza Karbasi, Masoud Rahimi, Abdol-Hossein Vahabie, Hadi Moradi

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2721] arXiv:2506.10632 (cross-list from cs.LG) [pdf, html, other]: Title: Hessian Geometry of Latent Space in Generative Models

Alexander Lobashev, Dmitry Guskov, Maria Larchenko, Mikhail Tamm

Comments: ICML 2025

Subjects: Machine Learning (cs.LG); Statistical Mechanics (cond-mat.stat-mech); Computer Vision and Pattern Recognition (cs.CV); Differential Geometry (math.DG); Statistics Theory (math.ST)
[2722] arXiv:2506.10675 (cross-list from eess.IV) [pdf, html, other]: Title: ConStyX: Content Style Augmentation for Generalizable Medical Image Segmentation

Xi Chen, Zhiqiang Shen, Peng Cao, Jinzhu Yang, Osmar R. Zaiane

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2723] arXiv:2506.10797 (cross-list from physics.med-ph) [pdf, other]: Title: Modality-AGnostic Image Cascade (MAGIC) for Multi-Modality Cardiac Substructure Segmentation

Nicholas Summerfield, Qisheng He, Alex Kuo, Ahmed I. Ghanem, Simeng Zhu, Chase Ruff, Joshua Pan, Anudeep Kumar, Prashant Nagpal, Jiwei Zhao, Ming Dong, Carri K. Glide-Hurst

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[2724] arXiv:2506.10825 (cross-list from eess.IV) [pdf, other]: Title: Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches

Andrea Moglia (1), Matteo Leccardi (1), Matteo Cavicchioli (1), Alice Maccarini (2), Marco Marcon (1), Luca Mainardi (1), Pietro Cerveri (1 and 2) ((1) Politecnico di Milano, (2) Università di Pavia)

Comments: 132 pages, 26 figures, 23 tables. Andrea Moglia and Matteo Leccardi are equally contributing authors

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2725] arXiv:2506.10858 (cross-list from eess.IV) [pdf, html, other]: Title: Med-URWKV: Pure RWKV With ImageNet Pre-training For Medical Image Segmentation

Zhenhuan Zhou

Comments: Preprint Draft, 5 pages. This paper will be updated with a formal version in the future, Copyright: College of Computer Science, Nankai University. All rights reserved

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2726] arXiv:2506.10916 (cross-list from eess.IV) [pdf, other]: Title: Semi-Automated Quality Assurance in Digital Pathology: Tile Classification Approach

Meredith VandeHaar, M. Clinch, I. Yilmaz, M.A. Rahman, Y. Xiao, F. Dogany, H.M. Alazab, A. Nassar, Z. Akkus, B. Dangott

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2727] arXiv:2506.10955 (cross-list from cs.LG) [pdf, html, other]: Title: ReGuidance: A Simple Diffusion Wrapper for Boosting Sample Quality on Hard Inverse Problems

Aayush Karan, Kulin Shah, Sitan Chen

Comments: 38 pages, 14 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2728] arXiv:2506.10968 (cross-list from cs.RO) [pdf, other]: Title: Eye, Robot: Learning to Look to Act with a BC-RL Perception-Action Loop

Justin Kerr, Kush Hari, Ethan Weber, Chung Min Kim, Brent Yi, Tyler Bonnen, Ken Goldberg, Angjoo Kanazawa

Comments: Project page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2729] arXiv:2506.11004 (cross-list from cs.LG) [pdf, html, other]: Title: Developing a Dyslexia Indicator Using Eye Tracking

Kevin Cogan, Vuong M. Ngo, Mark Roantree

Comments: The 23rd International Conference on Artificial Intelligence in Medicine (AIME 2025), LNAI, Springer, 11 pages

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[2730] arXiv:2506.11025 (cross-list from cs.LG) [pdf, html, other]: Title: When Algorithms Play Favorites: Lookism in the Generation and Perception of Faces

Miriam Doh, Aditya Gulati, Matei Mancas, Nuria Oliver

Comments: Accepted as an extended abstract at the Fourth European Workshop on Algorithmic Fairness (EWAF) (URL: this https URL)

Journal-ref: Proceedings of Machine Learning Research 294 (2025) 474-480

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2731] arXiv:2506.11035 (cross-list from cs.LG) [pdf, html, other]: Title: Tversky Neural Networks: Psychologically Plausible Deep Learning with Differentiable Tversky Similarity

Moussa Koulako Bala Doumbouya, Dan Jurafsky, Christopher D. Manning

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2732] arXiv:2506.11073 (cross-list from cs.CL) [pdf, html, other]: Title: CLAIM: Mitigating Multilingual Object Hallucination in Large Vision-Language Models with Cross-Lingual Attention Intervention

Zekai Ye, Qiming Li, Xiaocheng Feng, Libo Qin, Yichong Huang, Baohang Li, Kui Jiang, Yang Xiang, Zhirui Zhang, Yunfei Lu, Duyu Tang, Dandan Tu, Bing Qin

Comments: ACL2025 Main

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2733] arXiv:2506.11123 (cross-list from q-bio.NC) [pdf, html, other]: Title: Sparse Autoencoders Bridge The Deep Learning Model and The Brain

Ziming Mao, Jia Xu, Zeqi Zheng, Haofang Zheng, Dabing Sheng, Yaochu Jin, Guoyuan Yang

Comments: 54 pages, 41 figures

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV)
[2734] arXiv:2506.11139 (cross-list from eess.IV) [pdf, html, other]: Title: Grids Often Outperform Implicit Neural Representations

Namhoon Kim, Sara Fridovich-Keil

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2735] arXiv:2506.11146 (cross-list from quant-ph) [pdf, html, other]: Title: HQFNN: A Compact Quantum-Fuzzy Neural Network for Accurate Image Classification

Jianhong Yao, Yangming Guo

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2736] arXiv:2506.11150 (cross-list from eess.IV) [pdf, html, other]: Title: ADAgent: LLM Agent for Alzheimer's Disease Analysis with Collaborative Coordinator

Wenlong Hou, Guangqian Yang, Ye Du, Yeung Lau, Lihao Liu, Junjun He, Ling Long, Shujun Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2737] arXiv:2506.11163 (cross-list from eess.IV) [pdf, html, other]: Title: Vector Representations of Vessel Trees

James Batten, Michiel Schaap, Matthew Sinclair, Ying Bai, Ben Glocker

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[2738] arXiv:2506.11183 (cross-list from eess.IV) [pdf, html, other]: Title: DiffPR: Diffusion-Based Phase Reconstruction via Frequency-Decoupled Learning

Yi Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2739] arXiv:2506.11234 (cross-list from cs.RO) [pdf, other]: Title: Poutine: Vision-Language-Trajectory Pre-Training and Reinforcement Learning Post-Training Enable Robust End-to-End Autonomous Driving

Luke Rowe, Rodrigue de Schaetzen, Roger Girgis, Christopher Pal, Liam Paull

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2740] arXiv:2506.11252 (cross-list from cs.GR) [pdf, other]: Title: Anti-Aliased 2D Gaussian Splatting

Mae Younes, Adnane Boukhayma

Comments: Code will be available at this https URL

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[2741] arXiv:2506.11261 (cross-list from cs.RO) [pdf, other]: Title: Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation

Shizhe Chen, Ricardo Garcia, Paul Pacaud, Cordelia Schmid

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2742] arXiv:2506.11283 (cross-list from eess.IV) [pdf, html, other]: Title: Joint Denoising of Cryo-EM Projection Images using Polar Transformers

Joakim Andén, Justus Sagemüller

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2743] arXiv:2506.11387 (cross-list from cs.RO) [pdf, other]: Title: Control Architecture and Design for a Multi-robotic Visual Servoing System in Automated Manufacturing Environment

Rongfei Li

Comments: 272 pages, 171 figures, PhD dissertation, University of California, Davis, 2025. To be published in ProQuest ETD

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[2744] arXiv:2506.11444 (cross-list from cs.CR) [pdf, html, other]: Title: GaussMarker: Robust Dual-Domain Watermark for Diffusion Models

Kecen Li, Zhicong Huang, Xinwen Hou, Cheng Hong

Comments: Accepted at ICML 2025

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[2745] arXiv:2506.11454 (cross-list from eess.IV) [pdf, other]: Title: FAD-Net: Frequency-Domain Attention-Guided Diffusion Network for Coronary Artery Segmentation using Invasive Coronary Angiography

Nan Mu, Ruiqi Song, Xiaoning Li, Zhihui Xu, Jingfeng Jiang, Chen Zhao

Comments: 35 pages, 12 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2746] arXiv:2506.11455 (cross-list from q-bio.NC) [pdf, other]: Title: Voxel-Level Brain States Prediction Using Swin Transformer

Yifei Sun, Daniel Chahine, Qinghao Wen, Tianming Liu, Xiang Li, Yixuan Yuan, Fernando Calamante, Jinglei Lv

Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2747] arXiv:2506.11465 (cross-list from cs.LG) [pdf, html, other]: Title: RollingQ: Reviving the Cooperation Dynamics in Multimodal Transformer

Haotian Ni, Yake Wei, Hang Liu, Gong Chen, Chong Peng, Hao Lin, Di Hu

Comments: Accepted by ICML 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2748] arXiv:2506.11475 (cross-list from cs.MA) [pdf, other]: Title: AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction

Syeda Kisaa Fatima, Tehreem Zubair, Noman Ahmed, Asifullah Khan

Subjects: Multiagent Systems (cs.MA); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2749] arXiv:2506.11496 (cross-list from eess.IV) [pdf, html, other]: Title: Taming Stable Diffusion for Computed Tomography Blind Super-Resolution

Chunlei Li, Yilei Shi, Haoxi Hu, Jingliang Hu, Xiao Xiang Zhu, Lichao Mou

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2750] arXiv:2506.11545 (cross-list from eess.IV) [pdf, html, other]: Title: FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution

Zhaoyang Wang, Jie Li, Wen Lu, Lihuo He, Maoguo Gong, Xinbo Gao

Comments: This work has been submitted to the IEEE TMM for possible publication

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)

Total of 3130 entries : 1-50 ... 2551-2600 2601-2650 2651-2700 2701-2750 2751-2800 2801-2850 2851-2900 ... 3101-3130

Showing up to 50 entries per page: fewer | more | all