Computer Vision and Pattern Recognition

Authors and titles for February 2025

Total of 2199 entries

[26] arXiv:2502.00392 [pdf, html, other]: Title: RefDrone: A Challenging Benchmark for Referring Expression Comprehension in Drone Scenes

Zhichao Sun, Yepeng Liu, Huachao Zhu, Yuliang Gu, Yuda Zou, Zelong Liu, Gui-Song Xia, Bo Du, Yongchao Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[27] arXiv:2502.00397 [pdf, html, other]: Title: Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues

Rohit Girmaji, Siddharth Jain, Bhav Beri, Sarthak Bansal, Vineet Gandhi

Comments: Accepted at 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[28] arXiv:2502.00402 [pdf, html, other]: Title: Enhancing Highway Safety: Accident Detection on the A9 Test Stretch Using Roadside Sensors

Walter Zimmer, Ross Greer, Xingcheng Zhou, Rui Song, Marc Pavel, Daniel Lehmberg, Ahmed Ghita, Akshay Gopalkrishnan, Mohan Trivedi, Alois Knoll

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[29] arXiv:2502.00404 [pdf, html, other]: Title: Exploring Linear Attention Alternative for Single Image Super-Resolution

Rongchang Lu, Changyu Li, Donghang Li, Guojing Zhang, Jianqiang Huang, Xilai Li

Comments: This paper has been published to IEEE International Joint Conference on Neural Networks 2025 as the final camera ready version.

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[30] arXiv:2502.00412 [pdf, html, other]: Title: TROI: Cross-Subject Pretraining with Sparse Voxel Selection for Enhanced fMRI Visual Decoding

Ziyu Wang, Tengyu Pan, Zhenyu Li, Ji Wu, Xiuxing Li, Jianyong Wang

Comments: ICASSP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2502.00418 [pdf, html, other]: Title: Parameter Efficient Fine-Tuning of Segment Anything Model for Biomedical Imaging

Carolin Teuber, Anwai Archit, Constantin Pape

Comments: Published in MIDL 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[32] arXiv:2502.00425 [pdf, html, other]: Title: MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization

JiangYong Yu, Sifan Zhou, Dawei Yang, Shuo Wang, Shuoyu Li, Xing Hu, Chen Xu, Zukang Xu, Changyong Shu, Zhihang Yuan

Comments: Accepted by ACM MM 2025. First PTQ solution for Multimodal large language models applicable to 5 mainstream MLLMs

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[33] arXiv:2502.00426 [pdf, html, other]: Title: TEST-V: TEst-time Support-set Tuning for Zero-shot Video Classification

Rui Yan, Jin Wang, Hongyu Qu, Xiaoyu Du, Dong Zhang, Jinhui Tang, Tieniu Tan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[34] arXiv:2502.00433 [pdf, html, other]: Title: CAT Pruning: Cluster-Aware Token Pruning For Text-to-Image Diffusion Models

Xinle Cheng, Zhuoming Chen, Zhihao Jia

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[35] arXiv:2502.00435 [pdf, html, other]: Title: SatMamba: Development of Foundation Models for Remote Sensing Imagery Using State Space Models

Chuc Man Duc, Hiromichi Fukui

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2502.00462 [pdf, html, other]: Title: MambaGlue: Fast and Robust Local Feature Matching With Mamba

Kihwan Ryoo, Hyungtae Lim, Hyun Myung

Comments: Proc. IEEE Int'l Conf. Robotics and Automation (ICRA) 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[37] arXiv:2502.00464 [pdf, other]: Title: Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions

David Gimeno-Gómez, Carlos-D. Martínez-Hinarejos

Comments: Accepted in the "Language Resources and Evaluation" journal, Springer Nature

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[38] arXiv:2502.00474 [pdf, other]: Title: A framework for river connectivity classification using temporal image processing and attention based neural networks

Timothy James Becker, Derin Gezgin, Jun Yi He Wu, Mary Becker

Comments: 15 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[39] arXiv:2502.00500 [pdf, html, other]: Title: Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation

Yang Cao, Zhao Song, Chiwun Yang

Comments: 39 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[40] arXiv:2502.00528 [pdf, html, other]: Title: Vision-Language Modeling in PET/CT for Visual Grounding of Positive Findings

Zachary Huemann, Samuel Church, Joshua D. Warner, Daniel Tran, Xin Tie, Alan B McMillan, Junjie Hu, Steve Y. Cho, Meghan Lubner, Tyler J. Bradshaw

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[41] arXiv:2502.00535 [pdf, html, other]: Title: Work-Efficient Parallel Non-Maximum Suppression Kernels

David Oro, Carles Fernández, Xavier Martorell, Javier Hernando

Journal-ref: The Computer Journal, Volume 65, Issue 4, April 2022, Pages 773-787

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[42] arXiv:2502.00536 [pdf, html, other]: Title: CAD: Confidence-Aware Adaptive Displacement for Semi-Supervised Medical Image Segmentation

Wenbo Xiao, Zhihao Xu, Guiping Liang, Yangjun Deng, Yi Xiao

Comments: 9 pages, 3 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[43] arXiv:2502.00547 [pdf, html, other]: Title: Milmer: a Framework for Multiple Instance Learning based Multimodal Emotion Recognition

Zaitian Wang, Jian He, Yu Liang, Xiyuan Hu, Tianhao Peng, Kaixin Wang, Jiakai Wang, Chenlong Zhang, Weili Zhang, Shuang Niu, Xiaoyang Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[44] arXiv:2502.00563 [pdf, html, other]: Title: Complex Wavelet Mutual Information Loss: A Multi-Scale Loss Function for Semantic Segmentation

Renhao Lu

Comments: Accepted at ICML 2025. This version corresponds to the official camera-ready submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[45] arXiv:2502.00568 [pdf, html, other]: Title: Generating crossmodal gene expression from cancer histopathology improves multimodal AI predictions

Samiran Dey, Christopher R.S. Banerji, Partha Basuchowdhuri, Sanjoy K. Saha, Deepak Parashar, Tapabrata Chakraborti

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[46] arXiv:2502.00571 [pdf, html, other]: Title: Contrastive Forward-Forward: A Training Algorithm of Vision Transformer

Hossein Aghagolzadeh, Mehdi Ezoji

Comments: 22 pages, 8 figures, under review

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[47] arXiv:2502.00594 [pdf, html, other]: Title: Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing

Saarthak Kapse, Robin Betz, Srinivasan Sivanandan

Comments: 20 pages, 15 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[48] arXiv:2502.00618 [pdf, html, other]: Title: DesCLIP: Robust Continual Learning via General Attribute Descriptions for VLM-Based Visual Recognition

Chiyuan He, Zihuan Qiu, Fanman Meng, Linfeng Xu, Qingbo Wu, Hongliang Li

Comments: Accepted by IEEE Transactions on Multimedia

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[49] arXiv:2502.00630 [pdf, html, other]: Title: Self-Prompt SAM: Medical Image Segmentation via Automatic Prompt SAM Adaptation

Bin Xie, Hao Tang, Dawen Cai, Yan Yan, Gady Agam

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2502.00631 [pdf, html, other]: Title: MedConv: Convolutions Beat Transformers on Long-Tailed Bone Density Prediction

Xuyin Qi, Zeyu Zhang, Huazhan Zheng, Mingxi Chen, Numan Kutaiba, Ruth Lim, Cherie Chiang, Zi En Tham, Xuan Ren, Wenxin Zhang, Lei Zhang, Hao Zhang, Wenbing Lv, Guangzhen Yao, Renda Han, Kangsheng Wang, Mingyuan Li, Hongtao Mao, Yu Li, Zhibin Liao, Yang Zhao, Minh-Son To

Comments: Accepted to IJCNN 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)