Computer Vision and Pattern Recognition

Authors and titles for July 2025

Total of 2877 entries : 1-50 ... 2701-2750 2751-2800 2801-2850 2851-2877

Showing up to 50 entries per page: fewer | more | all

[2851] arXiv:2507.23000 (cross-list from cs.LG) [pdf, html, other]: Title: Planning for Cooler Cities: A Multimodal AI Framework for Predicting and Mitigating Urban Heat Stress through Urban Landscape Transformation

Shengao Yi, Xiaojiang Li, Wei Tu, Tianhong Zhao

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2852] arXiv:2507.23001 (cross-list from eess.IV) [pdf, html, other]: Title: LesionGen: A Concept-Guided Diffusion Model for Dermatology Image Synthesis

Jamil Fayyad, Nourhan Bayasi, Ziyang Yu, Homayoun Najjaran

Comments: Accepted at the MICCAI 2025 ISIC Workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2853] arXiv:2507.23002 (cross-list from cs.GR) [pdf, html, other]: Title: Noise-Coded Illumination for Forensic and Photometric Video Analysis

Peter F. Michael, Zekun Hao, Serge Belongie, Abe Davis

Comments: ACM Transactions on Graphics (2025), presented at SIGGRAPH 2025

Journal-ref: ACM Trans. Graph. 44, 5, Article 165 (October 2025), 16 pages

Subjects: Graphics (cs.GR); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[2854] arXiv:2507.23010 (cross-list from cs.LG) [pdf, html, other]: Title: Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods

Siwoo Park

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[2855] arXiv:2507.23110 (cross-list from eess.IV) [pdf, html, other]: Title: Rethink Domain Generalization in Heterogeneous Sequence MRI Segmentation

Zheyuan Zhang, Linkai Peng, Wanying Dou, Cuiling Sun, Halil Ertugrul Aktas, Andrea M. Bejar, Elif Keles, Gorkem Durak, Ulas Bagci

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2856] arXiv:2507.23129 (cross-list from eess.IV) [pdf, html, other]: Title: MRpro - open PyTorch-based MR reconstruction and processing package

Felix Frederik Zimmermann, Patrick Schuenke, Christoph S. Aigner, Bill A. Bernhardt, Mara Guastini, Johannes Hammacher, Noah Jaitner, Andreas Kofler, Leonid Lunin, Stefan Martin, Catarina Redshaw Kranich, Jakob Schattenfroh, David Schote, Yanglei Wu, Christoph Kolbitsch

Comments: Submitted to Magnetic Resonance in Medicine

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[2857] arXiv:2507.23150 (cross-list from eess.IV) [pdf, html, other]: Title: Towards High-Resolution Alignment and Super-Resolution of Multi-Sensor Satellite Imagery

Philip Wootaek Shin, Vishal Gaur, Rahul Ramachandran, Manil Maskey, Jack Sampson, Vijaykrishnan Narayanan, Sujit Roy

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2858] arXiv:2507.23154 (cross-list from cs.LG) [pdf, html, other]: Title: FuseTen: A Generative Model for Daily 10 m Land Surface Temperature Estimation from Spatio-Temporal Satellite Observations

Sofiane Bouaziz, Adel Hafiane, Raphael Canals, Rachid Nedjai

Comments: Accepted in the 2025 International Conference on Machine Intelligence for GeoAnalytics and Remote Sensing (MIGARS)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2859] arXiv:2507.23190 (cross-list from cs.HC) [pdf, html, other]: Title: Accessibility Scout: Personalized Accessibility Scans of Built Environments

William Huang, Xia Su, Jon E. Froehlich, Yang Zhang

Comments: 18 pages, 16 figures. Presented at ACM UIST 2025

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multiagent Systems (cs.MA)
[2860] arXiv:2507.23219 (cross-list from eess.IV) [pdf, html, other]: Title: Learning Arbitrary-Scale RAW Image Downscaling with Wavelet-based Recurrent Reconstruction

Yang Ren, Hai Jiang, Wei Li, Menglong Yang, Heng Zhang, Zehua Sheng, Qingsheng Ye, Shuaicheng Liu

Comments: Accepted by ACM MM 2025

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2861] arXiv:2507.23256 (cross-list from eess.IV) [pdf, html, other]: Title: EMedNeXt: An Enhanced Brain Tumor Segmentation Framework for Sub-Saharan Africa using MedNeXt V2 with Deep Supervision

Ahmed Jaheen, Abdelrahman Elsayed, Damir Kim, Daniil Tikhonov, Matheus Scatolin, Mohor Banerjee, Qiankun Ji, Mostafa Salem, Hu Wang, Sarim Hashmi, Mohammad Yaqub

Comments: Submitted to the BraTS-Lighthouse 2025 Challenge (MICCAI 2025)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2862] arXiv:2507.23273 (cross-list from cs.RO) [pdf, html, other]: Title: GSFusion:Globally Optimized LiDAR-Inertial-Visual Mapping for Gaussian Splatting

Jaeseok Park, Chanoh Park, Minsu Kim, Soohwan Kim

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2863] arXiv:2507.23359 (cross-list from eess.IV) [pdf, html, other]: Title: Pixel Embedding Method for Tubular Neurite Segmentation

Huayu Fu, Jiamin Li, Haozhi Qu, Xiaolin Hu, Zengcai Guo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[2864] arXiv:2507.23382 (cross-list from cs.CL) [pdf, html, other]: Title: MPCC: A Novel Benchmark for Multimodal Planning with Complex Constraints in Multimodal Large Language Models

Yiyan Ji, Haoran Chen, Qiguang Chen, Chengyue Wu, Libo Qin, Wanxiang Che

Comments: Accepted to ACM Multimedia 2025

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2865] arXiv:2507.23398 (cross-list from eess.IV) [pdf, html, other]: Title: Smart Video Capsule Endoscopy: Raw Image-Based Localization for Enhanced GI Tract Investigation

Oliver Bause, Julia Werner, Paul Palomero Bernardo, Oliver Bringmann

Comments: Accepted at the 32nd International Conference on Neural Information Processing - ICONIP 2025

Subjects: Image and Video Processing (eess.IV); Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV)
[2866] arXiv:2507.23497 (cross-list from cs.AI) [pdf, other]: Title: Causal Identification of Sufficient, Contrastive and Complete Feature Sets in Image Classification

David A Kelly, Hana Chockler

Comments: 13 pages, 13 figures, appendix included

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2867] arXiv:2507.23521 (cross-list from eess.IV) [pdf, html, other]: Title: JPEG Processing Neural Operator for Backward-Compatible Coding

Woo Kyoung Han, Yongjun Lee, Byeonghun Lee, Sang Hyun Park, Sunghoon Im, Kyong Hwan Jin

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2868] arXiv:2507.23523 (cross-list from cs.RO) [pdf, html, other]: Title: H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation

Hongzhe Bi, Lingxuan Wu, Tianwei Lin, Hengkai Tan, Zhizhong Su, Hang Su, Jun Zhu

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2869] arXiv:2507.23534 (cross-list from cs.LG) [pdf, html, other]: Title: Continual Learning with Synthetic Boundary Experience Blending

Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2870] arXiv:2507.23540 (cross-list from cs.RO) [pdf, html, other]: Title: A Unified Perception-Language-Action Framework for Adaptive Autonomous Driving

Yi Zhang, Erik Leo Haß, Kuo-Yi Chao, Nenad Petrovic, Yinglei Song, Chengdong Wu, Alois Knoll

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2871] arXiv:2507.23544 (cross-list from cs.RO) [pdf, html, other]: Title: User Experience Estimation in Human-Robot Interaction Via Multi-Instance Learning of Multimodal Social Signals

Ryo Miyoshi, Yuki Okafuji, Takuya Iwamoto, Junya Nakanishi, Jun Baba

Comments: This paper has been accepted for presentation at IEEE/RSJ International Conference on Intelligent Robots and Systems 2025 (IROS 2025)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[2872] arXiv:2507.23611 (cross-list from cs.CR) [pdf, html, other]: Title: LLM-Based Identification of Infostealer Infection Vectors from Screenshots: The Case of Aurora

Estelle Ruellan, Eric Clay, Nicholas Ascoli

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2873] arXiv:2507.23648 (cross-list from eess.IV) [pdf, html, other]: Title: Towards Field-Ready AI-based Malaria Diagnosis: A Continual Learning Approach

Louise Guillon, Soheib Biga, Yendoube E. Kantchire, Mouhamadou Lamine Sane, Grégoire Pasquier, Kossi Yakpa, Stéphane E. Sossou, Marc Thellier, Laurent Bonnardot, Laurence Lachaud, Renaud Piarroux, Ameyo M. Dorkenoo

Comments: MICCAI 2025 AMAI Workshop, Accepted, Submitted Manuscript Version

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2874] arXiv:2507.23676 (cross-list from cs.LG) [pdf, html, other]: Title: DepMicroDiff: Diffusion-Based Dependency-Aware Multimodal Imputation for Microbiome Data

Rabeya Tus Sadia, Qiang Cheng

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2875] arXiv:2507.23763 (cross-list from eess.IV) [pdf, html, other]: Title: Topology Optimization in Medical Image Segmentation with Fast Euler Characteristic

Liu Li, Qiang Ma, Cheng Ouyang, Johannes C. Paetzold, Daniel Rueckert, Bernhard Kainz

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2876] arXiv:2507.23771 (cross-list from cs.LG) [pdf, html, other]: Title: Consensus-Driven Active Model Selection

Justin Kay, Grant Van Horn, Subhransu Maji, Daniel Sheldon, Sara Beery

Comments: ICCV 2025 Highlight. 16 pages, 8 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2877] arXiv:2507.23777 (cross-list from cs.GR) [pdf, html, other]: Title: XSpecMesh: Quality-Preserving Auto-Regressive Mesh Generation Acceleration via Multi-Head Speculative Decoding

Dian Chen, Yansong Qu, Xinyang Li, Ming Li, Shengchuan Zhang

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)

Total of 2877 entries : 1-50 ... 2701-2750 2751-2800 2801-2850 2851-2877

Showing up to 50 entries per page: fewer | more | all