Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for November 2024

Total of 2614 entries : 1-25 26-50 51-75 76-100 ... 2601-2614
Showing up to 25 entries per page: fewer | more | all
[1] arXiv:2411.00078 [pdf, html, other]
Title: How Good Are We? Evaluating Cell AI Foundation Models in Kidney Pathology with Human-in-the-Loop Enrichment
Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[2] arXiv:2411.00128 [pdf, html, other]
Title: Muscles in Time: Learning to Understand Human Motion by Simulating Muscle Activations
David Schneider, Simon Reiß, Marco Kugler, Alexander Jaus, Kunyu Peng, Susanne Sutschet, M. Saquib Sarfraz, Sven Matthiesen, Rainer Stiefelhagen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[3] arXiv:2411.00144 [pdf, html, other]
Title: Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis
Chen Zhao, Xuan Wang, Tong Zhang, Saqib Javed, Mathieu Salzmann
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[4] arXiv:2411.00151 [pdf, html, other]
Title: NIMBA: Towards Robust and Principled Processing of Point Clouds With SSMs
Nursena Köprücü, Destiny Okpekpe, Antonio Orvieto
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[5] arXiv:2411.00158 [pdf, html, other]
Title: Using Deep Neural Networks to Quantify Parking Dwell Time
Marcelo Eduardo Marques Ribas (1), Heloisa Benedet Mendes (1), Luiz Eduardo Soares de Oliveira (1), Luiz Antonio Zanlorensi (2), Paulo Ricardo Lisboa de Almeida (1) ((1) Department of Informatics - Federal University of Paraná, (2) DeepNeuronic)
Comments: Paper accepted to the 2024 International Conference on Machine Learning and Applications
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[6] arXiv:2411.00164 [pdf, html, other]
Title: A Recipe for Geometry-Aware 3D Mesh Transformers
Mohammad Farazi, Yalin Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[7] arXiv:2411.00169 [pdf, html, other]
Title: Aerial Flood Scene Classification Using Fine-Tuned Attention-based Architecture for Flood-Prone Countries in South Asia
Ibne Hassan, Aman Mujahid, Abdullah Al Hasib, Andalib Rahman Shagoto, Joyanta Jyoti Mondal, Meem Arafat Manab, Jannatun Noor
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[8] arXiv:2411.00172 [pdf, other]
Title: SeafloorAI: A Large-scale Vision-Language Dataset for Seafloor Geological Survey
Kien X. Nguyen, Fengchun Qiao, Arthur Trembanis, Xi Peng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[9] arXiv:2411.00174 [pdf, html, other]
Title: Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking
Pranav Singh Chib, Pravendra Singh
Comments: Accepted at NeurIPS 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[10] arXiv:2411.00178 [pdf, other]
Title: Clinical Evaluation of Medical Image Synthesis: A Case Study in Wireless Capsule Endoscopy
Panagiota Gatoula, Dimitrios E. Diamantis, Anastasios Koulaouzidis, Cristina Carretero, Stefania Chetcuti-Zammit, Pablo Cortegoso Valdivia, Begoña González-Suárez, Alessandro Mussetto, John Plevris, Alexander Robertson, Bruno Rosa, Ervin Toth, Dimitris K. Iakovidis
Comments: This work has been submitted for possible journal publication
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[11] arXiv:2411.00192 [pdf, html, other]
Title: Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving
Ce Zhou (1), Qiben Yan (1), Daniel Kent (1), Guangjing Wang (2), Weikang Ding (1), Ziqi Zhang (3), Hayder Radha (1) ((1) Michigan State University, (2) University of South Florida, (3) Peking University)
Comments: 28 pages. arXiv admin note: substantial text overlap with arXiv:2409.17376
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[12] arXiv:2411.00196 [pdf, html, other]
Title: Whole-Herd Elephant Pose Estimation from Drone Data for Collective Behavior Analysis
Brody McNutt, Libby Zhang, Angus Carey-Douglas, Fritz Vollrath, Frank Pope, Leandra Brickson
Comments: Accepted to CV4Animals: Computer Vision for Animal Behavior Tracking and Modeling Workshop in conjunction with Computer Vision and Pattern Recognition 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[13] arXiv:2411.00201 [pdf, html, other]
Title: YOLO Evolution: A Comprehensive Benchmark and Architectural Review of YOLOv12, YOLO11, and Their Previous Versions
Nidhal Jegham, Chan Young Koh, Marwan Abdelatti, Abdeltawab Hendawi
Comments: 20 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[14] arXiv:2411.00209 [pdf, html, other]
Title: Semantic Knowledge Distillation for Onboard Satellite Earth Observation Image Classification
Thanh-Dung Le, Vu Nguyen Ha, Ti Ti Nguyen, Geoffrey Eappen, Prabhu Thiruvasagam, Hong-fu Chou, Duc-Dung Tran, Luis M. Garces-Socarras, Jorge L. Gonzalez-Rios, Juan Carlos Merlano-Duncan, Symeon Chatzinotas
Comments: Under revisions
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[15] arXiv:2411.00210 [pdf, html, other]
Title: Scale-Aware Recognition in Satellite Images under Resource Constraints
Shreelekha Revankar, Cheng Perng Phoo, Utkarsh Mall, Bharath Hariharan, Kavita Bala
Comments: 16 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:2411.00225 [pdf, html, other]
Title: Fashion-VDM: Video Diffusion Model for Virtual Try-On
Johanna Karras, Yingwei Li, Nan Liu, Luyang Zhu, Innfarn Yoo, Andreas Lugmayr, Chris Lee, Ira Kemelmacher-Shlizerman
Comments: Accepted to SIGGRAPH Asia 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2411.00239 [pdf, html, other]
Title: Aquatic-GS: A Hybrid 3D Representation for Underwater Scenes
Shaohua Liu, Junzhe Lu, Zuoya Gu, Jiajun Li, Yue Deng
Comments: 13 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[18] arXiv:2411.00246 [pdf, html, other]
Title: ResiDual Transformer Alignment with Spectral Decomposition
Lorenzo Basile, Valentino Maiorca, Luca Bortolussi, Emanuele Rodolà, Francesco Locatello
Comments: Published in Transactions on Machine Learning Research (TMLR)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[19] arXiv:2411.00252 [pdf, html, other]
Title: IO Transformer: Evaluating SwinV2-Based Reward Models for Computer Vision
Maxwell Meyer, Jack Spruyt
Comments: 15 pages, 3 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[20] arXiv:2411.00274 [pdf, html, other]
Title: Adaptive Residual Transformation for Enhanced Feature-Based OOD Detection in SAR Imagery
Kyung-hwan Lee, Kyung-tae Kim
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[21] arXiv:2411.00281 [pdf, html, other]
Title: Detection and tracking of gas plumes in LWIR hyperspectral video sequence data
Torin Gerhart, Justin Sunu, Ekaterina Merkurjev, Jen-Mei Chang, Jerome Gilles, Andrea L. Bertozzi
Journal-ref: SPIE Defense, Security, and Sensing, 2013, Baltimore, Proceedings Volume 8743, Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XIX; 87430J (2013)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[22] arXiv:2411.00299 [pdf, html, other]
Title: RadFlag: A Black-Box Hallucination Detection Method for Medical Vision Language Models
Serena Zhang, Sraavya Sambara, Oishi Banerjee, Julian Acosta, L. John Fahrner, Pranav Rajpurkar
Comments: 17 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[23] arXiv:2411.00304 [pdf, html, other]
Title: Unified Generative and Discriminative Training for Multi-modal Large Language Models
Wei Chow, Juncheng Li, Qifan Yu, Kaihang Pan, Hao Fei, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[24] arXiv:2411.00330 [pdf, html, other]
Title: Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification
Shengxun Wei, Zan Gao, Chunjie Ma, Yibo Zhao, Weili Guan, Shengyong Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[25] arXiv:2411.00335 [pdf, html, other]
Title: NCST: Neural-based Color Style Transfer for Video Retouching
Xintao Jiang, Yaosen Chen, Siqin Zhang, Wei Wang, Xuming Wen
Comments: 10 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Image and Video Processing (eess.IV)
Total of 2614 entries : 1-25 26-50 51-75 76-100 ... 2601-2614
Showing up to 25 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status