Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries : 251-750 501-1000 1001-1500 1501-2000 ... 2001-2022
Showing up to 500 entries per page: fewer | more | all
[251] arXiv:2309.02420 [pdf, other]
Title: Doppelgangers: Learning to Disambiguate Images of Similar Structures
Ruojin Cai, Joseph Tung, Qianqian Wang, Hadar Averbuch-Elor, Bharath Hariharan, Noah Snavely
Comments: Published in ICCV 2023 (Oral); Project page: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252] arXiv:2309.02423 [pdf, other]
Title: EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu, Yong-Lu Li, Zhemin Huang, Michael Xu Liu, Cewu Lu, Yu-Wing Tai, Chi-Keung Tang
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2309.02429 [pdf, other]
Title: Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach
Vimal K B, Saketh Bachu, Tanmay Garg, Niveditha Lakshmi Narasimhan, Raghavan Konuru, Vineeth N Balasubramanian
Comments: To appear at ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[254] arXiv:2309.02434 [pdf, other]
Title: ReliTalk: Relightable Talking Portrait Generation from a Single Video
Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, Ziwei Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[255] arXiv:2309.02436 [pdf, other]
Title: GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction
Youmin Zhang, Fabio Tosi, Stefano Mattoccia, Matteo Poggi
Comments: ICCV 2023. Code: this https URL - Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[256] arXiv:2309.02450 [pdf, other]
Title: Self-Supervised Video Transformers for Isolated Sign Language Recognition
Marcelo Sandoval-Castaneda, Yanhong Li, Diane Brentari, Karen Livescu, Gregory Shakhnarovich
Comments: 14 pages. Submitted to WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2309.02455 [pdf, html, other]
Title: RSDiff: Remote Sensing Image Generation from Text Using Diffusion Model
Ahmad Sebaq, Mohamed ElHelw
Journal-ref: Neural Comput & Applic (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2309.02527 [pdf, other]
Title: A skeletonization algorithm for gradient-based optimization
Martin J. Menten, Johannes C. Paetzold, Veronika A. Zimmer, Suprosanna Shit, Ivan Ezhov, Robbie Holland, Monika Probst, Julia A. Schnabel, Daniel Rueckert
Comments: Accepted at ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2309.02556 [pdf, other]
Title: Domain Adaptation for Efficiently Fine-tuning Vision Transformer with Encrypted Images
Teru Nagamori, Sayaka Shiota, Hitoshi Kiya
Comments: Accepted by APSIPA 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[260] arXiv:2309.02562 [pdf, other]
Title: Recurrence-Free Survival Prediction for Anal Squamous Cell Carcinoma Chemoradiotherapy using Planning CT-based Radiomics Model
Shanshan Tang, Kai Wang, David Hein, Gloria Lin, Nina N. Sanford, Jing Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[261] arXiv:2309.02578 [pdf, other]
Title: Anatomy-Driven Pathology Detection on Chest X-rays
Philip Müller, Felix Meissen, Johannes Brandt, Georgios Kaissis, Daniel Rueckert
Comments: Accepted at MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[262] arXiv:2309.02596 [pdf, other]
Title: Self-Supervised Pretraining Improves Performance and Inference Efficiency in Multiple Lung Ultrasound Interpretation Tasks
Blake VanBerlo, Brian Li, Jesse Hoey, Alexander Wong
Comments: 10 pages, 5 figures, submitted to IEEE Access
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[263] arXiv:2309.02617 [pdf, other]
Title: Compressing Vision Transformers for Low-Resource Visual Learning
Eric Youn, Sai Mitheran J, Sanjana Prabhu, Siyuan Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[264] arXiv:2309.02636 [pdf, other]
Title: Multiclass Alignment of Confidence and Certainty for Network Calibration
Vinith Kugathasan, Muhammad Haris Khan
Comments: Accepted at GCPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[265] arXiv:2309.02666 [pdf, other]
Title: Fast and Resource-Efficient Object Tracking on Edge Devices: A Measurement Study
Sanjana Vijay Ganesh, Yanzhao Wu, Gaowen Liu, Ramana Kompella, Ling Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[266] arXiv:2309.02676 [pdf, other]
Title: Efficient Training for Visual Tracking with Deformable Transformer
Qingmao Wei, Guotian Zeng, Bi Zeng
Comments: arXiv admin note: text overlap with arXiv:2303.16580 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[267] arXiv:2309.02702 [pdf, other]
Title: Gene-induced Multimodal Pre-training for Image-omic Classification
Ting Jin, Xingran Xie, Renjie Wan, Qingli Li, Yan Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2309.02713 [pdf, other]
Title: SlAction: Non-intrusive, Lightweight Obstructive Sleep Apnea Detection using Infrared Video
You Rim Choi, Gyeongseon Eo, Wonhyuck Youn, Hyojin Lee, Haemin Jang, Dongyoon Kim, Hyunwoo Shin, Hyung-Sin Kim
Comments: Accepted to ICCV CVAMD 2023, poster
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[269] arXiv:2309.02719 [pdf, other]
Title: DMKD: Improving Feature-based Knowledge Distillation for Object Detection Via Dual Masking Augmentation
Guang Yang, Yin Tang, Zhijian Wu, Jun Li, Jianhua Xu, Xili Wan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2309.02742 [pdf, html, other]
Title: MLN-net: A multi-source medical image segmentation method for clustered microcalcifications using multiple layer normalization
Ke Wang, Zanting Ye, Xiang Xie, Haidong Cui, Tao Chen, Banteng Liu
Comments: 17 pages, 9 figures, 3 tables
Journal-ref: Knowledge-Based Systems, 2024, 283: 111127
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[271] arXiv:2309.02773 [pdf, html, other]
Title: Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[272] arXiv:2309.02777 [pdf, html, other]
Title: LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline
Víctor M. Batlle, José M. M. Montiel, Pascal Fua, Juan D. Tardós
Comments: 13 pages, 7 figures, 1 table
Journal-ref: MICCAI 2023. Lecture Notes in Computer Science, vol 14229 (2023) pp 502-512
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273] arXiv:2309.02801 [pdf, other]
Title: 3D Trajectory Reconstruction of Drones using a Single Camera
Seobin Hwang, Hanyoung Kim, Chaeyeon Heo, Youkyoung Na, Cheongeun Lee, Yeongjun Cho
Comments: 10 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[274] arXiv:2309.02833 [pdf, html, other]
Title: Image-Object-Specific Prompt Learning for Few-Shot Class-Incremental Learning
In-Ug Yoon, Tae-Min Choi, Sun-Kyung Lee, Young-Min Kim, Jong-Hwan Kim
Comments: 8 pages, 4 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275] arXiv:2309.02843 [pdf, other]
Title: Knowledge Distillation Layer that Lets the Student Decide
Ada Gorgun, Yeti Z. Gurbuz, A. Aydin Alatan
Comments: Accepted at the British Machine Vision Conference 2023 (BMVC 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[276] arXiv:2309.02855 [pdf, other]
Title: Bandwidth-efficient Inference for Neural Image Compression
Shanzhi Yin, Tongda Xu, Yongsheng Liang, Yuanyuan Wang, Yanghao Li, Yan Wang, Jingjing Liu
Comments: 9 pages, 6 figures, submitted to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[277] arXiv:2309.02861 [pdf, other]
Title: Image Aesthetics Assessment via Learnable Queries
Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2309.02875 [pdf, other]
Title: MAD: Modality Agnostic Distance Measure for Image Registration
Vasiliki Sideri-Lampretsa, Veronika A. Zimmer, Huaqi Qiu, Georgios Kaissis, Daniel Rueckert
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[279] arXiv:2309.02903 [pdf, other]
Title: Towards Efficient Training with Negative Samples in Visual Tracking
Qingmao Wei, Bi Zeng, Guotian Zeng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280] arXiv:2309.02923 [pdf, other]
Title: Patched Line Segment Learning for Vector Road Mapping
Jiakun Xu, Bowen Xu, Gui-Song Xia, Liang Dong, Nan Xue
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2309.02954 [pdf, other]
Title: M3D-NCA: Robust 3D Segmentation with Built-in Quality Control
John Kalkhof, Anirban Mukhopadhyay
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[282] arXiv:2309.02964 [pdf, other]
Title: Hierarchical-level rain image generative model based on GAN
Zhenyuan Liu, Tong Jia, Xingyu Xing, Jianfeng Wu, Junyi Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[283] arXiv:2309.02965 [pdf, other]
Title: Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction
Zhiying Leng, Shun-Cheng Wu, Mahdi Saleh, Antonio Montanaro, Hao Yu, Yin Wang, Nassir Navab, Xiaohui Liang, Federico Tombari
Comments: Accpeted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2309.02975 [pdf, other]
Title: FishMOT: A Simple and Effective Method for Fish Tracking Based on IoU Matching
Shuo Liu, Lulu Han, Xiaoyang Liu, Junli Ren, Fang Wang, YingLiu, Yuanshan Lin
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[285] arXiv:2309.02995 [pdf, other]
Title: Continual Evidential Deep Learning for Out-of-Distribution Detection
Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost Van de Weijer
Comments: Accepted at Visual Continual Learning workshop (ICCV2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[286] arXiv:2309.02999 [pdf, other]
Title: Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2309.03008 [pdf, html, other]
Title: Sparse 3D Reconstruction via Object-Centric Ray Sampling
Llukman Cerkezi, Paolo Favaro
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2309.03020 [pdf, html, other]
Title: SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution
Wenlong Zhang, Xiaohui Li, Xiangyu Chen, Yu Qiao, Xiao-Ming Wu, Chao Dong
Comments: ICLR 2024, Spotlight. The source code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289] arXiv:2309.03031 [pdf, other]
Title: MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling, Bo Han, Yongkang Wong, Mohan Kangkanhalli, Weidong Geng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290] arXiv:2309.03047 [pdf, other]
Title: Combining pre-trained Vision Transformers and CIDER for Out Of Domain Detection
Grégor Jouet, Clément Duhart, Francis Rousseaux, Julio Laborde, Cyril de Runz
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[291] arXiv:2309.03048 [pdf, html, other]
Title: Exploring Semantic Consistency in Unpaired Image Translation to Generate Data for Surgical Applications
Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Fiona Kolbinger, Marius Distler, Jürgen Weitz, Stefanie Speidel
Comments: Accepted at IPCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292] arXiv:2309.03049 [pdf, other]
Title: Adaptive Growth: Real-time CNN Layer Expansion
Yunjie Zhu, Yunhao Chen
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293] arXiv:2309.03063 [pdf, other]
Title: Prompt-based Ingredient-Oriented All-in-One Image Restoration
Hu Gao, Depeng Dang
Comments: IEEE Transactions on Circuits and Systems for Video Technology (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[294] arXiv:2309.03072 [pdf, other]
Title: Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation
Michael Jungo, Beat Wolf, Andrii Maksai, Claudiu Musat, Andreas Fischer
Comments: ICDAR 2023 Best Student Paper Award. Code available at this https URL
Journal-ref: International Conference on Document Analysis and Recognition - ICDAR 2023, pp. 98-114. Cham: Springer Nature Switzerland
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[295] arXiv:2309.03100 [pdf, other]
Title: FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests
Ali Abdari, Alex Falcon, Giuseppe Serra
Comments: accepted for presentation at the ICCV2023 CV4Metaverse workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[296] arXiv:2309.03110 [pdf, other]
Title: Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration
Johannes Gilg, Torben Teepe, Fabian Herzog, Philipp Wolters, Gerhard Rigoll
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2309.03160 [pdf, html, other]
Title: ResFields: Residual Neural Fields for Spatiotemporal Signals
Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys, Siyu Tang
Comments: [ICLR 2024 Spotlight] Project and code at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298] arXiv:2309.03173 [pdf, other]
Title: PDiscoNet: Semantically consistent part discovery for fine-grained recognition
Robert van der Klis, Stephan Alaniz, Massimiliano Mancini, Cassio F. Dantas, Dino Ienco, Zeynep Akata, Diego Marcos
Comments: 9 pages, 8 figures, ICCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[299] arXiv:2309.03179 [pdf, html, other]
Title: SLiMe: Segment Like Me
Aliasghar Khani, Saeid Asgari Taghanaki, Aditya Sanghi, Ali Mahdavi Amiri, Ghassan Hamarneh
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300] arXiv:2309.03185 [pdf, other]
Title: Bayes' Rays: Uncertainty Quantification for Neural Radiance Fields
Lily Goli, Cody Reading, Silvia Sellán, Alec Jacobson, Andrea Tagliasacchi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[301] arXiv:2309.03198 [pdf, other]
Title: My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes, Ram Bhagat, Umur Aybars Ciftci, Ilke Demir
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[302] arXiv:2309.03216 [pdf, other]
Title: A Multisensor Hyperspectral Benchmark Dataset For Unmixing of Intimate Mixtures
Bikram Koirala, Behnood Rasti, Zakaria Bnoulkacem, Andrea de Lima Ribeiro, Yuleika Madriz, Erik Herrmann, Arthur Gestels, Thomas De Kerf, Sandra Lorenz, Margret Fuchs, Koen Janssens, Gunther Steenackers, Richard Gloaguen, Paul Scheunders
Comments: Currently, this paper is under review in IEEE
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303] arXiv:2309.03240 [pdf, other]
Title: RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation
Hengyue Liu, Bir Bhanu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2309.03247 [pdf, other]
Title: Robust Visual Tracking by Motion Analyzing
Mohammed Leo, Kurban Ubul, ShengJie Cheng, Michael Ma
Comments: found some key point that is missed,considering that it will take a lot of time to reproduce the results and revise our mistakes,we would like to withdraw the manuscript to avoid further mislead
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[305] arXiv:2309.03295 [pdf, other]
Title: Comparative Analysis of Deep-Fake Algorithms
Nikhil Sontakke, Sejal Utekar, Shivansh Rastogi, Shriraj Sonawane
Comments: 7 pages, 4 figures, 2 tables, Published with International Journal of Computer Science Trends and Technology (IJCST)
Journal-ref: International Journal of Computer Science Trends and Technology (IJCST) V11(4): Page(109-115) Jul - Aug 2023. ISSN: 2347-8578
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[306] arXiv:2309.03329 [pdf, other]
Title: MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation
Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le
Comments: Accepted at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[307] arXiv:2309.03331 [pdf, other]
Title: Expert Uncertainty and Severity Aware Chest X-Ray Classification by Multi-Relationship Graph Learning
Mengliang Zhang, Xinyue Hu, Lin Gu, Liangchen Liu, Kazuma Kobayashi, Tatsuya Harada, Ronald M. Summers, Yingying Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[308] arXiv:2309.03335 [pdf, other]
Title: SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction
Nivetha Jayakumar, Tonmoy Hossain, Miaomiao Zhang
Comments: ShapeMI MICCAI 2023: Workshop on Shape in Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[309] arXiv:2309.03350 [pdf, other]
Title: Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[310] arXiv:2309.03351 [pdf, other]
Title: Using Neural Networks for Fast SAR Roughness Estimation of High Resolution Images
Li Fan, Jeova Farias Sales Rocha Neto
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Applications (stat.AP)
[311] arXiv:2309.03353 [pdf, other]
Title: Source Camera Identification and Detection in Digital Videos through Blind Forensics
Venkata Udaya Sameer, Shilpa Mukhopadhyay, Ruchira Naskar, Ishaan Dali
Comments: Submitted to IEEE for inclusion in Xplore- Digital Library. Paper presented at the International Conference on Recent Trends in Computational Engineering & Technologies (ICRTCET 18)with Paper Id: ICRTCET-227
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[312] arXiv:2309.03360 [pdf, other]
Title: ViewMix: Augmentation for Robust Representation in Self-Supervised Learning
Arjon Das, Xin Zhong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[313] arXiv:2309.03367 [pdf, other]
Title: Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks
Priyam Mazumdar, Aiman Soliman, Volodymyr Kindratenko, Luigi Marini, Kenton McHenry
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[314] arXiv:2309.03381 [pdf, other]
Title: Active shooter detection and robust tracking utilizing supplemental synthetic data
Joshua R. Waite, Jiale Feng, Riley Tavassoli, Laura Harris, Sin Yong Tan, Subhadeep Chakraborty, Soumik Sarkar
Comments: 11 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315] arXiv:2309.03390 [pdf, other]
Title: A novel method for iris recognition using BP neural network and parallel computing by the aid of GPUs (Graphics Processing Units)
Farahnaz Hosseini, Hossein Ebrahimpour, Samaneh Askari
Comments: 8 pages,
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316] arXiv:2309.03401 [pdf, other]
Title: Reasonable Anomaly Detection in Long Sequences
Yalong Jiang, Changkang Li
Comments: 8 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317] arXiv:2309.03406 [pdf, other]
Title: Distribution-Aware Prompt Tuning for Vision-Language Models
Eulrang Cho, Jooyeon Kim, Hyunwoo J. Kim
Comments: Accepted to ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2309.03445 [pdf, other]
Title: Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Yi Tang, Takafumi Iwaguchi, Hiroshi Kawasaki
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2309.03452 [pdf, html, other]
Title: Multimodal Guidance Network for Missing-Modality Inference in Content Moderation
Zhuokai Zhao, Harish Palani, Tianyi Liu, Lena Evans, Ruth Toner
Comments: ICME 2024 Camera Ready. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[320] arXiv:2309.03453 [pdf, html, other]
Title: SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang
Comments: ICLR 2024 Spotlight. Project page: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[321] arXiv:2309.03467 [pdf, html, other]
Title: Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation
Zhuqiang Lu, Kun Hu, Chaoyue Wang, Lei Bai, Zhiyong Wang
Comments: Accepted by AAAI 24
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[322] arXiv:2309.03468 [pdf, html, other]
Title: Support-Set Context Matters for Bongard Problems
Nikhil Raghuraman, Adam W. Harley, Leonidas Guibas
Comments: TMLR October 2024. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[323] arXiv:2309.03472 [pdf, other]
Title: Perceptual Quality Assessment of 360$^\circ$ Images Based on Generative Scanpath Representation
Xiangjie Sui, Hanwei Zhu, Xuelin Liu, Yuming Fang, Shiqi Wang, Zhou Wang
Comments: 12 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[324] arXiv:2309.03473 [pdf, other]
Title: Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang, Ge Zheng, Sibei Yang
Comments: Accepted by ICCV 2023; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2309.03483 [pdf, other]
Title: DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing using Determiners
Clarence Lee, M Ganesh Kumar, Cheston Tan
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2309.03499 [pdf, other]
Title: Instance Segmentation of Dislocations in TEM Images
Karina Ruzaeva, Kishan Govind, Marc Legros, Stefan Sandfeld
Journal-ref: IEEE 23rd International Conference on Nanotechnology (2023) 1-6
Subjects: Computer Vision and Pattern Recognition (cs.CV); Materials Science (cond-mat.mtrl-sci)
[327] arXiv:2309.03504 [pdf, other]
Title: Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region
Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma
Comments: ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2309.03506 [pdf, other]
Title: Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis
Thanh-Huy Nguyen, Quang Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen Quoc Khanh Le
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[329] arXiv:2309.03508 [pdf, other]
Title: Dynamic Frame Interpolation in Wavelet Domain
Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Ying Tai, Chengjie Wang, Jie Yang
Comments: Accepted by IEEE TIP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2309.03509 [pdf, other]
Title: BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications
Jiatai Lin, Guoqiang Han, Xuemiao Xu, Changhong Liang, Tien-Tsin Wong, C. L. Philip Chen, Zaiyi Liu, Chu Han
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331] arXiv:2309.03530 [pdf, other]
Title: Efficient Single Object Detection on Image Patches with Early Exit Enhanced High-Precision CNNs
Arne Moos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[332] arXiv:2309.03531 [pdf, other]
Title: A Robust Negative Learning Approach to Partial Domain Adaptation Using Source Prototypes
Sandipan Choudhuri, Suli Adeniye, Arunabha Sen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[333] arXiv:2309.03539 [pdf, other]
Title: YOLO series target detection algorithms for underwater environments
Chenjie Zhang, Pengcheng Jiao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2309.03542 [pdf, other]
Title: Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction
Jiankai Li, Yunhong Wang, Weixin Li
Comments: Accept in TOMM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[335] arXiv:2309.03548 [pdf, other]
Title: Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation
Xiaohan Cui, Long Ma, Tengyu Ma, Jinyuan Liu, Xin Fan, Risheng Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2309.03549 [pdf, other]
Title: Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[337] arXiv:2309.03550 [pdf, other]
Title: Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model
Sungwon Hwang, Junha Hyung, Jaegul Choo
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338] arXiv:2309.03558 [pdf, other]
Title: Region Generation and Assessment Network for Occluded Person Re-Identification
Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding
Journal-ref: IEEE TIFS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2309.03575 [pdf, other]
Title: Toward High Quality Facial Representation Learning
Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang
Comments: ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340] arXiv:2309.03576 [pdf, other]
Title: DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tong Wang, Zhaoxiang Zhang
Comments: Accepted by NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2309.03598 [pdf, other]
Title: Enhancing Sample Utilization through Sample Adaptive Augmentation in Semi-Supervised Learning
Guan Gui, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi
Comments: Accepted as International Conference on Computer Vision (ICCV) 2023
Journal-ref: International Conference on Computer Vision (ICCV) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2309.03599 [pdf, other]
Title: Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang, Wenhao Chai, Jiayi Ye, Dapeng Tao, Yibing Zhan, Gaoang Wang
Comments: 9 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2309.03640 [pdf, other]
Title: Context-Aware 3D Object Localization from Single Calibrated Images: A Study of Basketballs
Marcello Davide Caio (1), Gabriel Van Zandycke (1 and 2), Christophe De Vleeschouwer (2) ((1) Sportradar AG, (2) UCLouvain)
Comments: 5 pages, 4 figures, MMSports'23, in proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports (MMSports '23), October 29, 2023, Ottawa, ON, Canada
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[344] arXiv:2309.03659 [pdf, other]
Title: Towards Comparable Knowledge Distillation in Semantic Image Segmentation
Onno Niemann, Christopher Vox, Thorben Werner
Comments: Accepted by the ECML PKDD 2023 workshop track: Simplification, Compression, Efficiency, and Frugality for Artificial Intelligence (SCEFA). This preprint has not undergone peer review or any post-submission improvements or corrections
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[345] arXiv:2309.03661 [pdf, html, other]
Title: Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation
Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346] arXiv:2309.03671 [pdf, other]
Title: Dataset Generation and Bonobo Classification from Weakly Labelled Videos
Pierre-Etienne Martin
Comments: IntelliSys 2023 paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[347] arXiv:2309.03696 [pdf, other]
Title: Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory
Ting Lei, Fabian Caba, Qingchao Chen, Hailin Jin, Yuxin Peng, Yang Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2309.03722 [pdf, html, other]
Title: A boundary-aware point clustering approach in Euclidean and embedding spaces for roof plane segmentation
Li Li, Qingqing Li, Guozheng Xu, Pengwei Zhou, Jingmin Tu, Jie Li, Mingming Li, Jian Yao
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing,2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2309.03726 [pdf, other]
Title: Interpretable Visual Question Answering via Reasoning Supervision
Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[350] arXiv:2309.03729 [pdf, other]
Title: Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption
Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2309.03734 [pdf, other]
Title: ClusterFusion: Leveraging Radar Spatial Features for Radar-Camera 3D Object Detection in Autonomous Vehicles
Irfan Tito Kurniawan, Bambang Riyanto Trilaksono
Comments: Accepted for publication in IEEE Access
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2309.03750 [pdf, html, other]
Title: PBP: Path-based Trajectory Prediction for Autonomous Driving
Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui
Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2309.03763 [pdf, other]
Title: dacl1k: Real-World Bridge Damage Dataset Putting Open-Source Data to the Test
Johannes Flotzinger, Philipp J. Rösch, Norbert Oswald, Thomas Braml
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354] arXiv:2309.03764 [pdf, other]
Title: $L_{2,1}$-Norm Regularized Quaternion Matrix Completion Using Sparse Representation and Quaternion QR Decomposition
Juan Han, Kit Ian Kou, Jifei Miao, Lizhi Liu, Haojiang Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[355] arXiv:2309.03799 [pdf, other]
Title: FisheyePP4AV: A privacy-preserving method for autonomous vehicles on fisheye camera images
Linh Trinh, Bach Ha, Tu Tran
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[356] arXiv:2309.03809 [pdf, html, other]
Title: SimNP: Learning Self-Similarity Priors Between Neural Points
Christopher Wewer, Eddy Ilg, Bernt Schiele, Jan Eric Lenssen
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2309.03811 [pdf, other]
Title: Panoramas from Photons
Sacha Jungerman, Atul Ingle, Mohit Gupta
Comments: Proc. ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2309.03812 [pdf, other]
Title: AnthroNet: Conditional Generation of Humans via Anthropometrics
Francesco Picetti, Shrinath Deshpande, Jonathan Leban, Soroosh Shahtalebi, Jay Patel, Peifeng Jing, Chunpu Wang, Charles Metze III, Cameron Sun, Cera Laidlaw, James Warren, Kathy Huynh, River Page, Jonathan Hogins, Adam Crespi, Sujoy Ganguly, Salehe Erfanian Ebadi
Comments: AnthroNet's Unity data generator source code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359] arXiv:2309.03815 [pdf, other]
Title: T2IW: Joint Text to Image & Watermark Generation
An-An Liu, Guokai Zhang, Yuting Su, Ning Xu, Yongdong Zhang, Lanjun Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[360] arXiv:2309.03827 [pdf, other]
Title: ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation
Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall
Comments: Accepted in Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Taipei, Taiwan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[361] arXiv:2309.03837 [pdf, other]
Title: Cross-Task Attention Network: Improving Multi-Task Learning for Medical Imaging Applications
Sangwook Kim, Thomas G. Purdie, Chris McIntosh
Comments: 13 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[362] arXiv:2309.03869 [pdf, other]
Title: Text-to-feature diffusion for audio-visual few-shot learning
Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata
Comments: DAGM GCPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363] arXiv:2309.03874 [pdf, other]
Title: Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks
Eyal Gomel, Tal Shaharabany, Lior Wolf
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364] arXiv:2309.03893 [pdf, other]
Title: DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma
Comments: Code and Models are publicly available. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[365] arXiv:2309.03895 [pdf, other]
Title: InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2309.03897 [pdf, other]
Title: ProPainter: Improving Propagation and Transformer for Video Inpainting
Shangchen Zhou, Chongyi Li, Kelvin C.K. Chan, Chen Change Loy
Comments: Accepted by ICCV 2023. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367] arXiv:2309.03899 [pdf, other]
Title: The Making and Breaking of Camouflage
Hala Lamdouar, Weidi Xie, Andrew Zisserman
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2309.03903 [pdf, other]
Title: Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee
Comments: Accepted to ICCV 2023. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2309.03904 [pdf, other]
Title: Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
Jiapeng Zhu, Ceyuan Yang, Kecheng Zheng, Yinghao Xu, Zifan Shi, Yujun Shen
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2309.03921 [pdf, other]
Title: C-CLIP: Contrastive Image-Text Encoders to Close the Descriptive-Commentative Gap
William Theisen, Walter Scheirer
Comments: 11 Pages, 5 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2309.03930 [pdf, other]
Title: Random Expert Sampling for Deep Learning Segmentation of Acute Ischemic Stroke on Non-contrast CT
Sophie Ostmeier, Brian Axelrod, Benjamin Pulli, Benjamin F.J. Verhaaren, Abdelkader Mahammedi, Yongkai Liu, Christian Federau, Greg Zaharchuk, Jeremy J. Heit
Journal-ref: https://jnis.bmj.com/content/early/2024/02/01/jnis-2023-021283
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[372] arXiv:2309.03933 [pdf, other]
Title: BluNF: Blueprint Neural Field
Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton
Comments: ICCV-W (AI3DCC) 2023. Project page with videos and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2309.03955 [pdf, other]
Title: SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions
Nagabhushan Somraj, Adithyan Karanayil, Rajiv Soundararajan
Comments: SIGGRAPH Asia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[374] arXiv:2309.03979 [pdf, other]
Title: Separable Self and Mixed Attention Transformers for Efficient Object Tracking
Goutam Yelluru Gopal, Maria A. Amer
Comments: Accepted by WACV2024. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375] arXiv:2309.03989 [pdf, other]
Title: CDFSL-V: Cross-Domain Few-Shot Learning for Videos
Sarinda Samarasinghe, Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2309.03999 [pdf, html, other]
Title: Adapting Self-Supervised Representations to Multi-Domain Setups
Neha Kalibhat, Sam Sharpe, Jeremy Goodsitt, Bayan Bruss, Soheil Feizi
Comments: Published at BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[377] arXiv:2309.04001 [pdf, html, other]
Title: MMSFormer: Multimodal Transformer for Material and Semantic Segmentation
Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif
Comments: Accepted by IEEE Open Journal of Signal Processing. 15 pages, 3 figures, 9 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[378] arXiv:2309.04022 [pdf, other]
Title: Improving the Accuracy of Beauty Product Recommendations by Assessing Face Illumination Quality
Parnian Afshar, Jenny Yeon, Andriy Levitskyy, Rahul Suresh, Amin Banitalebi-Dehkordi
Comments: 7 pages, 5 figures. Presented in FAccTRec2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2309.04038 [pdf, html, other]
Title: S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens
Rizhao Cai, Zitong Yu, Chenqi Kong, Haoliang Li, Changsheng Chen, Yongjian Hu, Alex Kot
Comments: Accepted by IEEE Transactions on Information Forensics Security (June 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2309.04041 [pdf, html, other]
Title: Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models
Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
Comments: This paper has been accepted to the AAAI'24 Workshop on Responsible Language Models (ReLM 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[381] arXiv:2309.04063 [pdf, other]
Title: INSURE: An Information Theory Inspired Disentanglement and Purification Model for Domain Generalization
Xi Yu, Huan-Hsin Tseng, Shinjae Yoo, Haibin Ling, Yuewei Lin
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382] arXiv:2309.04084 [pdf, html, other]
Title: Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation
Xiangyu Chen, Zheyuan Li, Zhengwen Zhang, Jimmy S. Ren, Yihao Liu, Jingwen He, Yu Qiao, Jiantao Zhou, Chao Dong
Comments: Extended version of HDRTVNet
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[383] arXiv:2309.04089 [pdf, html, other]
Title: Toward Sufficient Spatial-Frequency Interaction for Gradient-aware Underwater Image Enhancement
Chen Zhao, Weiling Cai, Chenyu Dong, Ziqi Zeng
Comments: Accepted by ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[384] arXiv:2309.04105 [pdf, other]
Title: Weakly Supervised Point Clouds Transformer for 3D Object Detection
Zuojin Tang, Bo Sun, Tongwei Ma, Daosheng Li, Zhenhui Xu
Comments: International Conference on Intelligent Transportation Systems (ITSC), 2022
Journal-ref: International Conference on Intelligent Transportation Systems (ITSC 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[385] arXiv:2309.04109 [pdf, html, other]
Title: From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao, Qi Yang, Feng Zhou, Changshui Zhang
Comments: A revised version of this paper will be published in Neurocomputing, see this https URL
Journal-ref: Neurocomputing, Volume 610, 2024, 128437
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[386] arXiv:2309.04145 [pdf, html, other]
Title: Depth Completion with Multiple Balanced Bases and Confidence for Dense Monocular SLAM
Weijian Xie, Guanyi Chu, Quanhao Qian, Yihao Yu, Hai Li, Danpeng Chen, Shangjin Zhai, Nan Wang, Hujun Bao, Guofeng Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2309.04147 [pdf, other]
Title: Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry
Akankshya Kar, Sajal Maheshwari, Shamit Lal, Vinay Sameer Raja Kad
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2309.04148 [pdf, html, other]
Title: Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning
Hiroki Nakamura, Masashi Okada, Tadahiro Taniguchi
Comments: Accepted to the IEEE Open Journal of Signal Processing (ICIP2024 track)
Journal-ref: IEEE Open Journal of Signal Processing, vol. 5, pp. 831-840, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2309.04153 [pdf, other]
Title: Mapping EEG Signals to Visual Stimuli: A Deep Learning Approach to Match vs. Mismatch Classification
Yiqian Yang, Zhengqiao Zhao, Qian Wang, Yan Yang, Jingdong Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE)
[390] arXiv:2309.04158 [pdf, other]
Title: Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment
Hongyu Hu, Tiancheng Lin, Jie Wang, Zhenbang Sun, Yi Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391] arXiv:2309.04169 [pdf, other]
Title: Grouping Boundary Proposals for Fast Interactive Image Segmentation
Li Liu, Da Chen, Minglei Shu, Laurent D. Cohen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2309.04171 [pdf, other]
Title: PRISTA-Net: Deep Iterative Shrinkage Thresholding Network for Coded Diffraction Patterns Phase Retrieval
Aoxu Liu, Xiaohong Fan, Yin Yang, Jianping Zhang
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Information Theory (cs.IT); Image and Video Processing (eess.IV)
[393] arXiv:2309.04172 [pdf, other]
Title: Unsupervised Object Localization with Representer Point Selection
Yeonghwan Song, Seokwoo Jang, Dina Katabi, Jeany Son
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394] arXiv:2309.04183 [pdf, other]
Title: Stereo Matching in Time: 100+ FPS Video Stereo Matching for Extended Reality
Ziang Cheng, Jiayu Yang, Hongdong Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2309.04220 [pdf, other]
Title: Score-PA: Score-based 3D Part Assembly
Junfeng Cheng, Mingdong Wu, Ruiyuan Zhang, Guanqi Zhan, Chao Wu, Hao Dong
Comments: BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2309.04225 [pdf, other]
Title: Long-Range Correlation Supervision for Land-Cover Classification from Remote Sensing Images
Dawen Yu, Shunping Ji
Comments: 14 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2309.04228 [pdf, other]
Title: FIVA: Facial Image and Video Anonymization and Anonymization Defense
Felix Rosberg, Eren Erdal Aksoy, Cristofer Englund, Fernando Alonso-Fernandez
Comments: Accepted to ICCVW 2023 - DFAD 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2309.04247 [pdf, other]
Title: Towards Practical Capture of High-Fidelity Relightable Avatars
Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma
Comments: Accepted to SIGGRAPH Asia 2023 (Conference); Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2309.04302 [pdf, other]
Title: Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes
Youssef Shoeb, Robin Chan, Gesina Schwalbe, Azarm Nowzard, Fatma Güney, Hanno Gottschalk
Comments: 11 pages, 7 figures, and 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2309.04312 [pdf, other]
Title: AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation
Xiangtao Wang, Ruizhi Wang, Jie Zhou, Thomas Lukasiewicz, Zhenghua Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2309.04331 [pdf, html, other]
Title: Leveraging Model Fusion for Improved License Plate Recognition
Rayson Laroca, Luiz A. Zanlorensi, Valter Estevam, Rodrigo Minetto, David Menotti
Comments: Accepted for presentation at the Iberoamerican Congress on Pattern Recognition (CIARP) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2309.04354 [pdf, other]
Title: Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts
Erik Daxberger, Floris Weers, Bowen Zhang, Tom Gunter, Ruoming Pang, Marcin Eichner, Michael Emmersberger, Yinfei Yang, Alexander Toshev, Xianzhi Du
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[403] arXiv:2309.04357 [pdf, other]
Title: SSIG: A Visually-Guided Graph Edit Distance for Floor Plan Similarity
Casper van Engelenburg, Seyran Khademi, Jan van Gemert
Comments: To be published in ICCVW 2023, 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[404] arXiv:2309.04366 [pdf, other]
Title: CNN Injected Transformer for Image Exposure Correction
Shuning Xu, Xiangyu Chen, Binbin Song, Jiantao Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2309.04372 [pdf, html, other]
Title: MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers
Sijia Li, Chen Chen, Haonan Lu
Comments: 6 pages,6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[406] arXiv:2309.04379 [pdf, html, other]
Title: Language Prompt for Autonomous Driving
Dongming Wu, Wencheng Han, Yingfei Liu, Tiancai Wang, Cheng-zhong Xu, Xiangyu Zhang, Jianbing Shen
Comments: Accepted by AAAI2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2309.04399 [pdf, other]
Title: MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou, Daquan Zhou, Zuo-Liang Zhu, Yaxing Wang, Qibin Hou, Jiashi Feng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2309.04410 [pdf, other]
Title: DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields
Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy
Comments: ICCV 2023. Code: this https URL Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[409] arXiv:2309.04421 [pdf, html, other]
Title: SynthoGestures: A Novel Framework for Synthetic Dynamic Hand Gesture Generation for Driving Scenarios
Amr Gomaa, Robin Zitt, Guillermo Reyes, Antonio Krüger
Comments: Accepted at IEEE IV'24. Shorter versions were accepted as AutomotiveUI2023 Work in Progress and UIST2023 Poster Papers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[410] arXiv:2309.04422 [pdf, other]
Title: Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving
Thomas E. Huang, Yifan Liu, Luc Van Gool, Fisher Yu
Comments: ICCV 2023, project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2309.04430 [pdf, other]
Title: Create Your World: Lifelong Text-to-Image Diffusion
Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, Yang Cong
Comments: 15 pages,10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[412] arXiv:2309.04437 [pdf, html, other]
Title: Single View Refractive Index Tomography with Neural Fields
Brandon Zhao, Aviad Levis, Liam Connor, Pratul P. Srinivasan, Katherine L. Bouman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cosmology and Nongalactic Astrophysics (astro-ph.CO)
[413] arXiv:2309.04447 [pdf, html, other]
Title: Impact of Blur and Resolution on Demographic Disparities in 1-to-Many Facial Identification
Aman Bhatta, Gabriella Pangelinan, Michael C. King, Kevin W. Bowyer
Comments: 9 pages, 8 figures, Conference submission
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[414] arXiv:2309.04453 [pdf, other]
Title: WiSARD: A Labeled Visual and Thermal Image Dataset for Wilderness Search and Rescue
Daniel Broyles, Christopher R. Hayner, Karen Leung
Journal-ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 9467-9474
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415] arXiv:2309.04462 [pdf, other]
Title: Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays
Aroof Aimen, Arsh Verma, Makarand Tapaswi, Narayanan C. Krishnan
Comments: 17 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2309.04502 [pdf, other]
Title: On the Efficacy of Multi-scale Data Samplers for Vision Applications
Elvis Nunez, Thomas Merth, Anish Prabhu, Mehrdad Farajtabar, Mohammad Rastegari, Sachin Mehta, Maxwell Horton
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2309.04506 [pdf, other]
Title: Unsupervised Gaze-aware Contrastive Learning with Subject-specific Condition
Lingyu Du, Xucong Zhang, Guohao Lan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2309.04542 [pdf, other]
Title: Examining Autoexposure for Challenging Scenes
SaiKiran Tedla, Beixuan Yang, Michael S. Brown
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419] arXiv:2309.04549 [pdf, other]
Title: Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression
Jin Heo, Gregorie Phillips, Per-Erik Brodin, Ada Gavrilovska
Comments: extended abstract of 2 pages, 2 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[420] arXiv:2309.04561 [pdf, html, other]
Title: Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding
Ozan Unal, Christos Sakaridis, Suman Saha, Luc Van Gool
Comments: Accepted at ECCV 2024. Winner of the ICCV 2023 ScanRefer Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[421] arXiv:2309.04573 [pdf, other]
Title: Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation
Shyam Nandan Rai, Fabio Cermelli, Barbara Caputo, Carlo Masone
Comments: 16 pages. arXiv admin note: substantial text overlap with arXiv:2307.13316
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2309.04579 [pdf, other]
Title: EGOFALLS: A visual-audio dataset and benchmark for fall detection using egocentric cameras
Xueyi Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[423] arXiv:2309.04608 [pdf, other]
Title: Style Generation: Image Synthesis based on Coarsely Matched Texts
Mengyao Cui, Zhe Zhu, Shao-Ping Lu, Yulu Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[424] arXiv:2309.04650 [pdf, other]
Title: Exploring Robust Features for Improving Adversarial Robustness
Hong Wang, Yuefan Deng, Shinjae Yoo, Yuewei Lin
Comments: 12 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2309.04657 [pdf, other]
Title: Generation and Recombination for Multifocus Image Fusion with Free Number of Inputs
Huafeng Li, Dan Wang, Yuxin Huang, Yafei Zhang, Zhengtao Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426] arXiv:2309.04659 [pdf, other]
Title: Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models
Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix
Comments: to appear at ICCVW2023 (Workshop on Visual Continual Learning)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2309.04669 [pdf, html, other]
Title: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
Comments: ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2309.04675 [pdf, other]
Title: BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
Takuro Fujii, Shuhei Tarashima
Comments: Accepted at ICCVW 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429] arXiv:2309.04682 [pdf, other]
Title: DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu, Xiaocong Wang, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue
Comments: ACM Multimedia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2309.04702 [pdf, other]
Title: A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Chao Qin, Jiale Cao, Huazhu Fu, Rao Muhammad Anwer, Fahad Shahbaz Khan
Comments: Accepted by MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2309.04708 [pdf, html, other]
Title: UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection
Zhuoyan Liu, Bo Wang, Ye Li, Jiaxian He, Yunfeng Li
Comments: 15 pages, 10 figures, 13 tables, accepted by PR
Journal-ref: Pattern Recognition 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432] arXiv:2309.04723 [pdf, other]
Title: Frequency-Aware Self-Supervised Long-Tailed Learning
Ci-Siang Lin, Min-Hung Chen, Yu-Chiang Frank Wang
Comments: ICCV Workshop 2023 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433] arXiv:2309.04734 [pdf, other]
Title: Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering
Yifan Dong, Suhang Wu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jianxin Lin, Jinsong Su
Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[434] arXiv:2309.04747 [pdf, other]
Title: When to Learn What: Model-Adaptive Data Augmentation Curriculum
Chengkai Hou, Jieyu Zhang, Tianyi Zhou
Comments: Our paper is accpeted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2309.04750 [pdf, html, other]
Title: Mirror-Aware Neural Humans
Daniel Ajisafe, James Tang, Shih-Yang Su, Bastian Wandt, Helge Rhodin
Comments: The 11th International Conference on 3D Vision (3DV 2024). Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2309.04752 [pdf, other]
Title: Deep Video Restoration for Under-Display Camera
Xuanxi Chen, Tao Wang, Ziqian Shao, Kaihao Zhang, Wenhan Luo, Tong Lu, Zikun Liu, Tae-Kyun Kim, Hongdong Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2309.04756 [pdf, other]
Title: Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
Boyuan Jiang, Lei Hu, Shihong Xia
Comments: 9pages, 5figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2309.04763 [pdf, other]
Title: Visual Material Characteristics Learning for Circular Healthcare
Federico Zocco, Shahin Rahimifard
Comments: To be submitted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439] arXiv:2309.04780 [pdf, html, other]
Title: Latent Degradation Representation Constraint for Single Image Deraining
Yuhong He, Long Peng, Lu Wang, Jun Cheng
Comments: This paper is accepted to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[440] arXiv:2309.04795 [pdf, html, other]
Title: Latent Spatiotemporal Adaptation for Generalized Face Forgery Video Detection
Daichi Zhang, Zihao Xiao, Jianmin Li, Shiming Ge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441] arXiv:2309.04800 [pdf, other]
Title: VeRi3D: Generative Vertex-based Radiance Fields for 3D Controllable Human Image Synthesis
Xinya Chen, Jiaxin Huang, Yanrui Bin, Lu Yu, Yiyi Liao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2309.04801 [pdf, other]
Title: TMComposites: Plug-and-Play Collaboration Between Specialized Tsetlin Machines
Ole-Christoffer Granmo
Comments: 8 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[443] arXiv:2309.04803 [pdf, other]
Title: Towards Real-World Burst Image Super-Resolution: Benchmark and Method
Pengxu Wei, Yujing Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin
Comments: Accepted by ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[444] arXiv:2309.04806 [pdf, html, other]
Title: Timely Fusion of Surround Radar/Lidar for Object Detection in Autonomous Driving Systems
Wenjing Xie, Tao Hu, Neiwen Ling, Guoliang Xing, Chun Jason Xue, Nan Guan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[445] arXiv:2309.04814 [pdf, other]
Title: Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Xiuzhe Wu, Pengfei Hu, Yang Wu, Xiaoyang Lyu, Yan-Pei Cao, Ying Shan, Wenming Yang, Zhongqian Sun, Xiaojuan Qi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2309.04820 [pdf, html, other]
Title: ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting
Michael A. Hobley, Victor A. Prisacariu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[447] arXiv:2309.04825 [pdf, other]
Title: Few-Shot Medical Image Segmentation via a Region-enhanced Prototypical Transformer
Yazhou Zhu, Shidong Wang, Tong Xin, Haofeng Zhang
Comments: Accepted by MICCAI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2309.04836 [pdf, html, other]
Title: Neural Semantic Surface Maps
Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra
Comments: Accepted at Eurographics 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[449] arXiv:2309.04840 [pdf, other]
Title: AnyPose: Anytime 3D Human Pose Forecasting via Neural Ordinary Differential Equations
Zixing Wang, Ahmed H. Qureshi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2309.04887 [pdf, other]
Title: SortedAP: Rethinking evaluation metrics for instance segmentation
Long Chen, Yuli Wu, Johannes Stegmaier, Dorit Merhof
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2309.04888 [pdf, other]
Title: Semi-supervised Instance Segmentation with a Learned Shape Prior
Long Chen, Weiwen Zhang, Yuli Wu, Martin Strauch, Dorit Merhof
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[452] arXiv:2309.04891 [pdf, html, other]
Title: How to Evaluate Semantic Communications for Images with ViTScore Metric?
Tingting Zhu, Bo Peng, Jifan Liang, Tingchen Han, Hai Wan, Jingqiao Fu, Junjie Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[453] arXiv:2309.04902 [pdf, other]
Title: Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art
Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed bennamoun
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454] arXiv:2309.04907 [pdf, other]
Title: Effective Real Image Editing with Accelerated Iterative Diffusion Inversion
Zhihong Pan, Riccardo Gherardi, Xiufeng Xie, Stephen Huang
Comments: Accepted to ICCV 2023 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[455] arXiv:2309.04914 [pdf, other]
Title: MFPNet: Multi-scale Feature Propagation Network For Lightweight Semantic Segmentation
Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen
Comments: 5 pages, 3 figures, 5tables, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[456] arXiv:2309.04917 [pdf, other]
Title: Editing 3D Scenes via Text Prompts without Retraining
Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[457] arXiv:2309.04958 [pdf, other]
Title: Semi-Supervised learning for Face Anti-Spoofing using Apex frame
Usman Muhammad, Mourad Oussalah, Jorma Laaksonen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458] arXiv:2309.04965 [pdf, other]
Title: Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo, Yanqing Guo
Comments: 11 pages,4 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[459] arXiv:2309.04967 [pdf, html, other]
Title: Towards Fully Decoupled End-to-End Person Search
Pengcheng Zhang, Xiao Bai, Jin Zheng, Xin Ning
Comments: DICTA 2023 Best Student Paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460] arXiv:2309.05013 [pdf, other]
Title: Geometrically Consistent Partial Shape Matching
Viktoria Ehm, Paul Roetzer, Marvin Eisenberger, Maolin Gao, Florian Bernard, Daniel Cremers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461] arXiv:2309.05015 [pdf, other]
Title: DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices
Guanyu Xu, Zhiwei Hao, Yong Luo, Han Hu, Jianping An, Shiwen Mao
Comments: Accepted by IEEE Transactions on Mobile Computing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[462] arXiv:2309.05028 [pdf, other]
Title: SC-NeRF: Self-Correcting Neural Radiance Field with Sparse Views
Liang Song, Guangming Wang, Jiuming Liu, Zhenyang Fu, Yanzi Miao, Hesheng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2309.05032 [pdf, other]
Title: Unified Contrastive Fusion Transformer for Multimodal Human Action Recognition
Kyoung Ok Yang, Junho Koh, Jun Won Choi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[464] arXiv:2309.05049 [pdf, other]
Title: Multi-view Self-supervised Disentanglement for General Image Denoising
Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao
Comments: International Conference on Computer Vision 2023 (ICCV 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[465] arXiv:2309.05069 [pdf, other]
Title: Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels
Bo Wan, Tinne Tuytelaars
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466] arXiv:2309.05073 [pdf, html, other]
Title: FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions
Jiong Wang, Fengyu Yang, Wenbo Gou, Bingliang Li, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Yanqing Jing, Ruimao Zhang
Comments: CVPR2024 camera ready version. 19 pages, 16 figures. Project page: this https URL ; API: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467] arXiv:2309.05090 [pdf, other]
Title: Sculpting Efficiency: Pruning Medical Imaging Models for On-Device Inference
Sudarshan Sreeram, Bernhard Kainz
Comments: Accepted at MedNeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468] arXiv:2309.05095 [pdf, other]
Title: MaskRenderer: 3D-Infused Multi-Mask Realistic Face Reenactment
Tina Behrouzi, Atefeh Shahroudnejad, Payam Mousavi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[469] arXiv:2309.05098 [pdf, other]
Title: 3D Implicit Transporter for Temporally Consistent Keypoint Discovery
Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao
Comments: ICCV2023 oral paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470] arXiv:2309.05132 [pdf, other]
Title: DAD++: Improved Data-free Test Time Adversarial Defense
Gaurav Kumar Nayak, Inder Khatri, Shubham Randive, Ruchit Rawal, Anirban Chakraborty
Comments: IJCV Journal (Under Review)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[471] arXiv:2309.05139 [pdf, other]
Title: A Skeleton-based Approach For Rock Crack Detection Towards A Climbing Robot Application
Josselin Somerville Roberts, Paul-Emile Giacomelli, Yoni Gozlan, Julia Di
Journal-ref: IEEE IRC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[472] arXiv:2309.05148 [pdf, other]
Title: Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color
William Thong, Przemyslaw Joniak, Alice Xiang
Comments: Accepted at the International Conference on Computer Vision (ICCV) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[473] arXiv:2309.05150 [pdf, other]
Title: Faster, Lighter, More Accurate: A Deep Learning Ensemble for Content Moderation
Mohammad Hosseini, Mahmudul Hasan
Comments: 6 pages, 22nd IEEE International Conference on Machine Learning and Applications (IEEE ICMLA'23), December 15-17, 2023, Jacksonville Riverfront, Florida, USA. arXiv admin note: substantial text overlap with arXiv:2103.10350
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[474] arXiv:2309.05180 [pdf, html, other]
Title: What's color got to do with it? Face recognition in grayscale
Aman Bhatta, Domingo Mery, Haiyu Wu, Joyce Annan, Micheal C. King, Kevin W. Bowyer
Comments: This is replacement version of the previous arxiv submission: 2309.05180 (Our Deep CNN Face Matchers Have Developed Achromatopsia). The past version is published in CVPRW and available in IEEE proceedings. This submitted version is an extension of the conference paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[475] arXiv:2309.05186 [pdf, html, other]
Title: HiLM-D: Enhancing MLLMs with Multi-Scale High-Resolution Details for Autonomous Driving
Xinpeng Ding, Jianhua Han, Hang Xu, Wei Zhang, Xiaomeng Li
Comments: Accepted by IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476] arXiv:2309.05192 [pdf, other]
Title: Towards Viewpoint Robustness in Bird's Eye View Segmentation
Tzofi Klinghoffer, Jonah Philion, Wenzheng Chen, Or Litany, Zan Gojcic, Jungseock Joo, Ramesh Raskar, Sanja Fidler, Jose M. Alvarez
Comments: ICCV 2023. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477] arXiv:2309.05209 [pdf, other]
Title: Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer
Puxun Tu, Hongfei Ye, Haochen Shi, Jeff Young, Meng Xie, Peiquan Zhao, Ce Zheng, Xiaoyi Jiang, Xiaojun Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2309.05214 [pdf, other]
Title: Angle Range and Identity Similarity Enhanced Gaze and Head Redirection based on Synthetic data
Jiawei Qin, Xueting Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2309.05224 [pdf, other]
Title: SparseSwin: Swin Transformer with Sparse Transformer Block
Krisna Pinasthika, Blessius Sheldo Putra Laksono, Riyandi Banovbi Putera Irsal, Syifa Hukma Shabiyya, Novanto Yudistira
Journal-ref: Neurocomputing, Volume 580, 2024, 127433
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[480] arXiv:2309.05239 [pdf, html, other]
Title: HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao Zhou, Chao Dong
Comments: Extended version of HAT. arXiv admin note: text overlap with arXiv:2205.04437
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2309.05251 [pdf, other]
Title: Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Yiming Zhang, ZeMing Gong, Angel X. Chang
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482] arXiv:2309.05254 [pdf, html, other]
Title: Towards Better Data Exploitation in Self-Supervised Monocular Depth Estimation
Jinfeng Liu, Lingtong Kong, Jie Yang, Wei Liu
Comments: 8 pages, 6 figures, accepted by IEEE Robotics and Automation Letters (RA-L 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[483] arXiv:2309.05257 [pdf, other]
Title: FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection
Chunyong Hu, Hang Zheng, Kun Li, Jianyun Xu, Weibo Mao, Maochun Luo, Lingxuan Wang, Mingxia Chen, Qihao Peng, Kaixuan Liu, Yiru Zhao, Peihan Hao, Minzhe Liu, Kaicheng Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2309.05261 [pdf, other]
Title: Gall Bladder Cancer Detection from US Images with Only Image Level Labels
Soumen Basu, Ashish Papanai, Mayank Gupta, Pankaj Gupta, Chetan Arora
Comments: Accepted at MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2309.05262 [pdf, other]
Title: A horizon line annotation tool for streamlining autonomous sea navigation experiments
Yassir Zardoua, Abdelhamid El Wahabi, Mohammed Boulaala, Abdelali Astito
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486] arXiv:2309.05267 [pdf, other]
Title: Diving into Darkness: A Dual-Modulated Framework for High-Fidelity Super-Resolution in Ultra-Dark Environments
Jiaxin Gao, Ziyu Yue, Yaohua Liu, Sihan Xie, Xin Fan, Risheng Liu
Comments: 9 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2309.05277 [pdf, other]
Title: Interactive Class-Agnostic Object Counting
Yifeng Huang, Viresh Ranjan, Minh Hoai
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[488] arXiv:2309.05281 [pdf, other]
Title: Class-Incremental Grouping Network for Continual Audio-Visual Learning
Shentong Mo, Weiguo Pian, Yapeng Tian
Comments: ICCV 2023. arXiv admin note: text overlap with arXiv:2303.17056
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[489] arXiv:2309.05282 [pdf, other]
Title: Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving
Ali Keysan, Andreas Look, Eitan Kosman, Gonca Gürsun, Jörg Wagner, Yu Yao, Barbara Rakitsch
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[490] arXiv:2309.05289 [pdf, other]
Title: Task-driven Compression for Collision Encoding based on Depth Images
Mihir Kulkarni, Kostas Alexis
Comments: 14 pages, 5, figures. Accepted to the International Symposium on Visual Computing 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[491] arXiv:2309.05300 [pdf, html, other]
Title: Decoupling Common and Unique Representations for Multimodal Self-supervised Learning
Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham, Chenying Liu, Zhitong Xiong, Xiao Xiang Zhu
Comments: Accepted to ECCV 2024. 27 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[492] arXiv:2309.05314 [pdf, other]
Title: Semantic Latent Decomposition with Normalizing Flows for Face Editing
Binglei Li, Zhizhong Huang, Hongming Shan, Junping Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[493] arXiv:2309.05330 [pdf, other]
Title: Diff-Privacy: Diffusion-based Face Privacy Protection
Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao
Comments: 17pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[494] arXiv:2309.05334 [pdf, html, other]
Title: MultIOD: Rehearsal-free Multihead Incremental Object Detector
Eden Belouadah, Arnaud Dapogny, Kevin Bailly
Comments: Accepted at the archival track of the Workshop on Continual Learning in Computer Vision (CVPR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495] arXiv:2309.05375 [pdf, other]
Title: Toward a Deeper Understanding: RetNet Viewed through Convolution
Chenghao Li, Chaoning Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[496] arXiv:2309.05380 [pdf, other]
Title: Collective PV-RCNN: A Novel Fusion Technique using Collective Detections for Enhanced Local LiDAR-Based Perception
Sven Teufel, Jörg Gamerdinger, Georg Volk, Oliver Bringmann
Comments: accepted at IEEE ITSC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[497] arXiv:2309.05388 [pdf, html, other]
Title: Robust Single Rotation Averaging Revisited
Seong Hun Lee, Javier Civera
Comments: Accepted to ECCV 2024 Workshop on Recovering 6D Object Pose (R6D)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[498] arXiv:2309.05418 [pdf, html, other]
Title: FlowIBR: Leveraging Pre-Training for Efficient Neural Image-Based Rendering of Dynamic Scenes
Marcel Büsching, Josef Bengtson, David Nilsson, Mårten Björkman
Comments: Accepted to CVPR 2024 Workshop on Efficient Deep Learning for Computer Vision. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[499] arXiv:2309.05438 [pdf, other]
Title: Towards Content-based Pixel Retrieval in Revisited Oxford and Paris
Guoyuan An, Woo Jae Kim, Saelyne Yang, Rong Li, Yuchi Huo, Sung-Eui Yoon
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[500] arXiv:2309.05448 [pdf, html, other]
Title: Panoptic Vision-Language Feature Fields
Haoran Chen, Kenneth Blomqvist, Francesco Milano, Roland Siegwart
Comments: This work has been accepted by IEEE Robotics and Automation Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[501] arXiv:2309.05451 [pdf, other]
Title: Dual-view Curricular Optimal Transport for Cross-lingual Cross-modal Retrieval
Yabing Wang, Shuhui Wang, Hao Luo, Jianfeng Dong, Fan Wang, Meng Han, Xun Wang, Meng Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[502] arXiv:2309.05490 [pdf, html, other]
Title: Learning Semantic Segmentation with Query Points Supervision on Aerial Images
Santiago Rivier, Carlos Hinojosa, Silvio Giancola, Bernard Ghanem
Comments: Paper Accepted at ICIP 2024 (Oral Presentation)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[503] arXiv:2309.05499 [pdf, html, other]
Title: Zero-Shot Co-salient Object Detection Framework
Haoke Xiao, Lv Tang, Bo Li, Zhiming Luo, Shaozi Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[504] arXiv:2309.05517 [pdf, other]
Title: Stream-based Active Learning by Exploiting Temporal Properties in Perception with Temporal Predicted Loss
Sebastian Schmidt, Stephan Günnemann
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[505] arXiv:2309.05527 [pdf, other]
Title: ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation
Bo Zhang, Xinyu Cai, Jiakang Yuan, Donglin Yang, Jianfei Guo, Xiangchao Yan, Renqiu Xia, Botian Shi, Min Dou, Tao Chen, Si Liu, Junchi Yan, Yu Qiao
Comments: Accepted by ICLR 2024. Code and simulated points are available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[506] arXiv:2309.05528 [pdf, other]
Title: On the detection of Out-Of-Distribution samples in Multiple Instance Learning
Loïc Le Bescond, Maria Vakalopoulou, Stergios Christodoulidis, Fabrice André, Hugues Talbot
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[507] arXiv:2309.05548 [pdf, other]
Title: Distance-Aware eXplanation Based Learning
Misgina Tsighe Hagos, Niamh Belton, Kathleen M. Curran, Brian Mac Namee
Comments: Accepted at the 35th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[508] arXiv:2309.05551 [pdf, other]
Title: OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data
Giuseppe Cartella, Alberto Baldrati, Davide Morelli, Marcella Cornia, Marco Bertini, Rita Cucchiara
Comments: International Conference on Image Analysis and Processing (ICIAP) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[509] arXiv:2309.05569 [pdf, other]
Title: ITI-GEN: Inclusive Text-to-Image Generation
Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre
Comments: Accepted to ICCV 2023 (Oral Presentation)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
[510] arXiv:2309.05573 [pdf, other]
Title: UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase
Youquan Liu, Runnan Chen, Xin Li, Lingdong Kong, Yuchen Yang, Zhaoyang Xia, Yeqi Bai, Xinge Zhu, Yuexin Ma, Yikang Li, Yu Qiao, Yuenan Hou
Comments: ICCV 2023; 21 pages; 9 figures; 18 tables; Code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[511] arXiv:2309.05590 [pdf, other]
Title: Temporal Action Localization with Enhanced Instant Discriminability
Dingfeng Shi, Qiong Cao, Yujie Zhong, Shan An, Jian Cheng, Haogang Zhu, Dacheng Tao
Comments: An extended version of the CVPR paper arXiv:2303.07347, submitted to IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[512] arXiv:2309.05613 [pdf, other]
Title: Learning the Geodesic Embedding with Graph Neural Networks
Bo Pang, Zhongtian Zheng, Guoping Wang, Peng-Shuai Wang
Comments: SIGGRAPH Asia 2023, Journal Track
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[513] arXiv:2309.05645 [pdf, html, other]
Title: CitDet: A Benchmark Dataset for Citrus Fruit Detection
Jordan A. James, Heather K. Manching, Matthew R. Mattia, Kim D. Bowman, Amanda M. Hulse-Kemp, William J. Beksi
Comments: To be published in IEEE Robotics and Automation Letters (RA-L)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[514] arXiv:2309.05652 [pdf, other]
Title: An Effective Two-stage Training Paradigm Detector for Small Dataset
Zheng Wang, Dong Xie, Hanzhi Wang, Jiang Tian
Comments: 4 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[515] arXiv:2309.05663 [pdf, other]
Title: Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips
Yufei Ye, Poorvi Hebbar, Abhinav Gupta, Shubham Tulsiani
Comments: Accepted to ICCV23 (Oral). Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[516] arXiv:2309.05747 [pdf, other]
Title: Evaluating the Reliability of CNN Models on Classifying Traffic and Road Signs using LIME
Md. Atiqur Rahman, Ahmed Saad Tanim, Sanjid Islam, Fahim Pranto, G.M. Shahariar, Md. Tanvir Rouf Shawon
Comments: Accepted for publication in the 2nd International Conference on Big Data, IoT and Machine Learning (BIM 2023), 16 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[517] arXiv:2309.05756 [pdf, html, other]
Title: GlobalDoc: A Cross-Modal Vision-Language Framework for Real-World Document Image Retrieval and Classification
Souhail Bakkali, Sanket Biswas, Zuheng Ming, Mickaël Coustaty, Marçal Rusiñol, Oriol Ramos Terrades, Josep Lladós
Comments: Accepted at WACV 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[518] arXiv:2309.05782 [pdf, other]
Title: Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction
Ivan Grishchenko, Geng Yan, Eduard Gabriel Bazavan, Andrei Zanfir, Nikolai Chinaev, Karthik Raveendran, Matthias Grundmann, Cristian Sminchisescu
Comments: 4 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[519] arXiv:2309.05793 [pdf, other]
Title: PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Li Chen, Mengyi Zhao, Yiheng Liu, Mingxu Ding, Yangyang Song, Shizun Wang, Xu Wang, Hao Yang, Jing Liu, Kang Du, Min Zheng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[520] arXiv:2309.05809 [pdf, other]
Title: Divergences in Color Perception between Deep Neural Networks and Humans
Ethan O. Nadler, Elise Darragh-Ford, Bhargav Srinivasa Desikan, Christian Conaway, Mark Chu, Tasker Hull, Douglas Guilbeault
Comments: 22 pages, 8 figures + SI Appendix; to appear in Cognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[521] arXiv:2309.05810 [pdf, other]
Title: SHIFT3D: Synthesizing Hard Inputs For Tricking 3D Detectors
Hongge Chen, Zhao Chen, Gregory P. Meyer, Dennis Park, Carl Vondrick, Ashish Shrivastava, Yuning Chai
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Robotics (cs.RO)
[522] arXiv:2309.05818 [pdf, other]
Title: Rice Plant Disease Detection and Diagnosis using Deep Convolutional Neural Networks and Multispectral Imaging
Yara Ali Alnaggar, Ahmad Sebaq, Karim Amer, ElSayed Naeem, Mohamed Elhelw
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[523] arXiv:2309.05829 [pdf, other]
Title: Mobile Vision Transformer-based Visual Object Tracking
Goutam Yelluru Gopal, Maria A. Amer
Comments: Accepted by BMVC2023. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[524] arXiv:2309.05832 [pdf, other]
Title: Instance-Agnostic Geometry and Contact Dynamics Learning
Mengti Sun, Bowen Jiang, Bibit Bianchini, Camillo Jose Taylor, Michael Posa
Comments: IROS 2023 Workshop on Leveraging Models for Contact-Rich Manipulation
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[525] arXiv:2309.05834 [pdf, other]
Title: SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition
Cong Wu, Xiao-Jun Wu, Josef Kittler, Tianyang Xu, Sara Atito, Muhammad Awais, Zhenhua Feng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[526] arXiv:2309.05840 [pdf, other]
Title: Self-Correlation and Cross-Correlation Learning for Few-Shot Remote Sensing Image Semantic Segmentation
Linhan Wang, Shuo Lei, Jianfeng He, Shengkun Wang, Min Zhang, Chang-Tien Lu
Comments: 10 pages, 6 figures. Accepted to Sigspatial 2023. arXiv admin note: text overlap with arXiv:2104.01538 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[527] arXiv:2309.05883 [pdf, other]
Title: Hierarchical Conditional Semi-Paired Image-to-Image Translation For Multi-Task Image Defect Correction On Shopping Websites
Moyan Li, Jinmiao Fu, Shaoyuan Xu, Huidong Liu, Jia Liu, Bryan Wang
Comments: 6 pages, 6 figures, 3 tables. To be published in ICIP 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[528] arXiv:2309.05900 [pdf, other]
Title: Adversarial Attacks Assessment of Salient Object Detection via Symbolic Learning
Gustavo Olague, Roberto Pineda, Gerardo Ibarra-Vazquez, Matthieu Olague, Axel Martinez, Sambit Bakshi, Jonathan Vargas, Isnardo Reducindo
Comments: 14 pages, 8 figures, 6 tables, IEEE Transactions on Emerging Topics in Computing, Accepted for publication
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[529] arXiv:2309.05904 [pdf, html, other]
Title: Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Masked Contrastive Learning
Weijian Huang, Cheng Li, Hong-Yu Zhou, Hao Yang, Jiarun Liu, Yong Liang, Hairong Zheng, Shaoting Zhang, Shanshan Wang
Journal-ref: Nature Communications 15, 7620 (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[530] arXiv:2309.05911 [pdf, other]
Title: Quality-Agnostic Deepfake Detection with Intra-model Collaborative Learning
Binh M. Le, Simon S. Woo
Journal-ref: International Conference on Computer Vision 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[531] arXiv:2309.05914 [pdf, other]
Title: Medical Image Segmentation with Belief Function Theory and Deep Learning
Ling Huang
Comments: Ph.D. Thesis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[532] arXiv:2309.05930 [pdf, html, other]
Title: Combining Deep Learning and Street View Imagery to Map Smallholder Crop Types
Jordi Laguarta Soler, Thomas Friedel, Sherrie Wang
Comments: Accepted to AAAI-24: Special Track on AI for Social Impact
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[533] arXiv:2309.05943 [pdf, other]
Title: Knowledge-Guided Short-Context Action Anticipation in Human-Centric Videos
Sarthak Bhagat, Simon Stepputtis, Joseph Campbell, Katia Sycara
Comments: ICCV 2023 Workshop on AI for Creative Video Editing and Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[534] arXiv:2309.05956 [pdf, other]
Title: Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation
Yunhao Ge, Jiashu Xu, Brian Nlong Zhao, Neel Joshi, Laurent Itti, Vibhav Vineet
Comments: Code in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[535] arXiv:2309.05972 [pdf, other]
Title: Self-supervised Extraction of Human Motion Structures via Frame-wise Discrete Features
Tetsuya Abe, Ryusuke Sagawa, Ko Ayusawa, Wataru Takano
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[536] arXiv:2309.05987 [pdf, other]
Title: FLDNet: A Foreground-Aware Network for Polyp Segmentation Leveraging Long-Distance Dependencies
Xuefeng Wei, Xuan Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[537] arXiv:2309.05994 [pdf, other]
Title: ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation
Zhitong Gao, Shipeng Yan, Xuming He
Comments: Published in NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[538] arXiv:2309.06004 [pdf, other]
Title: TSSAT: Two-Stage Statistics-Aware Transformation for Artistic Style Transfer
Haibo Chen, Lei Zhao, Jun Li, Jian Yang
Comments: Accepted by ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[539] arXiv:2309.06006 [pdf, other]
Title: SoccerNet 2023 Challenges Results
Anthony Cioppa, Silvio Giancola, Vladimir Somers, Floriane Magera, Xin Zhou, Hassan Mkhallati, Adrien Deliège, Jan Held, Carlos Hinojosa, Amir M. Mansourian, Pierre Miralles, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdullah Kamal, Adrien Maglo, Albert Clapés, Amr Abdelaziz, Artur Xarles, Astrid Orcesi, Atom Scott, Bin Liu, Byoungkwon Lim, Chen Chen, Fabian Deuser, Feng Yan, Fufu Yu, Gal Shitrit, Guanshuo Wang, Gyusik Choi, Hankyul Kim, Hao Guo, Hasby Fahrudin, Hidenari Koguchi, Håkan Ardö, Ibrahim Salah, Ido Yerushalmy, Iftikar Muhammad, Ikuma Uchida, Ishay Be'ery, Jaonary Rabarisoa, Jeongae Lee, Jiajun Fu, Jianqin Yin, Jinghang Xu, Jongho Nang, Julien Denize, Junjie Li, Junpei Zhang, Juntae Kim, Kamil Synowiec, Kenji Kobayashi, Kexin Zhang, Konrad Habel, Kota Nakajima, Licheng Jiao, Lin Ma, Lizhi Wang, Luping Wang, Menglong Li, Mengying Zhou, Mohamed Nasr, Mohamed Abdelwahed, Mykola Liashuha, Nikolay Falaleev, Norbert Oswald, Qiong Jia, Quoc-Cuong Pham, Ran Song, Romain Hérault, Rui Peng, Ruilong Chen, Ruixuan Liu, Ruslan Baikulov, Ryuto Fukushima, Sergio Escalera, Seungcheon Lee, Shimin Chen, Shouhong Ding, Taiga Someya, Thomas B. Moeslund, Tianjiao Li, Wei Shen, Wei Zhang, Wei Li, Wei Dai, Weixin Luo, Wending Zhao, Wenjie Zhang, Xinquan Yang, Yanbiao Ma, Yeeun Joo, Yingsen Zeng, Yiyang Gan, Yongqiang Zhu, Yujie Zhong, Zheng Ruan, Zhiheng Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[540] arXiv:2309.06017 [pdf, other]
Title: Feature Aggregation Network for Building Extraction from High-resolution Remote Sensing Images
Xuan Zhou, Xuefeng Wei
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[541] arXiv:2309.06023 [pdf, other]
Title: Learning from History: Task-agnostic Model Contrastive Learning for Image Restoration
Gang Wu, Junjun Jiang, Kui Jiang, Xianming Liu
Comments: Camera Ready Version. Accepted to The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[542] arXiv:2309.06027 [pdf, other]
Title: A new meteor detection application robust to camera movements
Clara Ciocan (ALSOC), Mathuran Kandeepan (ALSOC), Adrien Cassagne (ALSOC), Jeremie Vaubaillon (IMCCE), Fabian Zander (USQ), Lionel Lacassagne (ALSOC)
Comments: in French language, Groupe de Recherche et d'{É}tudes de Traitement du Signal et des Images (GRETSI), Aug 2023, Grenoble, France
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[543] arXiv:2309.06030 [pdf, html, other]
Title: Federated Learning for Large-Scale Scene Modeling with Neural Radiance Fields
Teppei Suzuki
Comments: Our subsequent work is available at arXiv:2403.11460
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[544] arXiv:2309.06047 [pdf, other]
Title: Real-Time Semantic Segmentation: A Brief Survey & Comparative Study in Remote Sensing
Clifford Broni-Bediako, Junshi Xia, Naoto Yokoya
Comments: Submitted to IEEE GRSM
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[545] arXiv:2309.06095 [pdf, other]
Title: Estimating exercise-induced fatigue from thermal facial images
Manuel Lage Cañellas, Constantino Álvarez Casado, Le Nguyen, Miguel Bordallo López
Comments: 5 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[546] arXiv:2309.06102 [pdf, other]
Title: Can we predict the Most Replayed data of video streaming platforms?
Alessandro Duico, Ombretta Strafforello, Jan van Gemert
Comments: Accepted Extended Abstract at ICCV 2023 Workshop on AI for Creative Video Editing and Understanding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[547] arXiv:2309.06105 [pdf, other]
Title: Towards Visual Taxonomy Expansion
Tinghui Zhu, Jingping Liu, Jiaqing Liang, Haiyun Jiang, Yanghua Xiao, Zongyu Wang, Rui Xie, Yunsen Xian
Comments: ACMMM accepted paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[548] arXiv:2309.06107 [pdf, other]
Title: HOC-Search: Efficient CAD Model and Pose Retrieval from RGB-D Scans
Stefan Ainetter, Sinisa Stekovic, Friedrich Fraundorfer, Vincent Lepetit
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[549] arXiv:2309.06118 [pdf, html, other]
Title: CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion
Keying Du, Huafeng Li, Yafei Zhang, Zhengtao Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[550] arXiv:2309.06123 [pdf, other]
Title: Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning
Chunqing Ruan, Hongjian Wang
Comments: accepted by 2023 PRCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[551] arXiv:2309.06129 [pdf, html, other]
Title: LEyes: A Lightweight Framework for Deep Learning-Based Eye Tracking using Synthetic Eye Images
Sean Anthony Byrne, Virmarie Maquiling, Marcus Nyström, Enkelejda Kasneci, Diederick C. Niehorster
Comments: 32 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[552] arXiv:2309.06130 [pdf, other]
Title: JOADAA: joint online action detection and action anticipation
Mohammed Guermal, Francois Bremond, Rui Dai, Abid Ali
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[553] arXiv:2309.06142 [pdf, other]
Title: Towards Reliable Domain Generalization: A New Dataset and Evaluations
Jiao Zhang, Xu-Yao Zhang, Cheng-Lin Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[554] arXiv:2309.06159 [pdf, other]
Title: Active Label Refinement for Semantic Segmentation of Satellite Images
Tuan Pham Minh, Jayan Wijesingha, Daniel Kottke, Marek Herde, Denis Huseljic, Bernhard Sick, Michael Wachendorf, Thomas Esch
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[555] arXiv:2309.06176 [pdf, other]
Title: Dual-Path Temporal Map Optimization for Make-up Temporal Video Grounding
Jiaxiu Li, Kun Li, Jia Li, Guoliang Chen, Dan Guo, Meng Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[556] arXiv:2309.06188 [pdf, other]
Title: Computer Vision Pipeline for Automated Antarctic Krill Analysis
Mazvydas Gudelis, Michal Mackiewicz, Julie Bremner, Sophie Fielding
Comments: Accepted to MVEO @ BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[557] arXiv:2309.06194 [pdf, other]
Title: A 3M-Hybrid Model for the Restoration of Unique Giant Murals: A Case Study on the Murals of Yongle Palace
Jing Yang, Nur Intan Raihana Ruhaiyem, Chichun Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[558] arXiv:2309.06197 [pdf, other]
Title: 360$^\circ$ from a Single Camera: A Few-Shot Approach for LiDAR Segmentation
Laurenz Reichardt, Nikolas Ebert, Oliver Wasenmüller
Comments: ICCV Workshop 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[559] arXiv:2309.06199 [pdf, other]
Title: SCP: Scene Completion Pre-training for 3D Object Detection
Yiming Shan, Yan Xia, Yuhong Chen, Daniel Cremers
Comments: Wins the best paper award at ISPRS Geospatial Week 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[560] arXiv:2309.06202 [pdf, other]
Title: Fast Sparse PCA via Positive Semidefinite Projection for Unsupervised Feature Selection
Junjing Zheng, Xinyu Zhang, Yongxiang Liu, Weidong Jiang, Kai Huo, Li Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[561] arXiv:2309.06207 [pdf, html, other]
Title: SGNet: Salient Geometric Network for Point Cloud Registration
Qianliang Wu, Yaqing Ding, Lei Luo, Haobo Jiang, Shuo Gu, Chuanwei Zhou, Jin Xie, Jian Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[562] arXiv:2309.06219 [pdf, html, other]
Title: Human Action Co-occurrence in Lifestyle Vlogs using Graph Link Prediction
Oana Ignat, Santiago Castro, Weiji Li, Rada Mihalcea
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Computers and Society (cs.CY); Information Retrieval (cs.IR)
[563] arXiv:2309.06221 [pdf, other]
Title: Use neural networks to recognize students' handwritten letters and incorrect symbols
JiaJun Zhu, Zichuan Yang, Binjie Hong, Jiacheng Song, Jiwei Wang, Tianhao Chen, Shuilan Yang, Zixun Lan, Fei Ma
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[564] arXiv:2309.06255 [pdf, html, other]
Title: Enhancing multimodal cooperation via sample-level modality valuation
Yake Wei, Ruoxuan Feng, Zihe Wang, Di Hu
Comments: Accepted by CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[565] arXiv:2309.06262 [pdf, other]
Title: Modality Unifying Network for Visible-Infrared Person Re-Identification
Hao Yu, Xu Cheng, Wei Peng, Weihao Liu, Guoying Zhao
Comments: 11 pages, 5 figures. Accepted as the poster paper in ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[566] arXiv:2309.06276 [pdf, other]
Title: OTAS: Unsupervised Boundary Detection for Object-Centric Temporal Action Segmentation
Yuerong Li, Zhengrong Xue, Huazhe Xu
Comments: Accepted to WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[567] arXiv:2309.06282 [pdf, other]
Title: IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation
Qiyu Sun, Huilin Chen, Meng Zheng, Ziyan Wu, Michael Felsberg, Yang Tang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[568] arXiv:2309.06284 [pdf, other]
Title: Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model
Yin Wang, Zhiying Leng, Frederick W. B. Li, Shun-Cheng Wu, Xiaohui Liang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[569] arXiv:2309.06285 [pdf, other]
Title: Jersey Number Recognition using Keyframe Identification from Low-Resolution Broadcast Videos
Bavesh Balaji, Jerrin Bright, Harish Prakash, Yuhao Chen, David A Clausi, John Zelek
Comments: Accepted in the 6th International Workshop on Multimedia Content Analysis in Sports (MMSports'23) @ ACM Multimedia
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[570] arXiv:2309.06288 [pdf, other]
Title: Self-Training and Multi-Task Learning for Limited Data: Evaluation Study on Object Detection
Hoàng-Ân Lê, Minh-Tan Pham
Comments: Accepted for International Conference in Computer Vision workshop (ICCVW) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[571] arXiv:2309.06302 [pdf, other]
Title: Towards High-Quality Specular Highlight Removal by Leveraging Large-Scale Synthetic Data
Gang Fu, Qing Zhang, Lei Zhu, Chunxia Xiao, Ping Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[572] arXiv:2309.06308 [pdf, other]
Title: AI4Food-NutritionFW: A Novel Framework for the Automatic Synthesis and Analysis of Eating Behaviours
Sergio Romero-Tapiador, Ruben Tolosana, Aythami Morales, Isabel Espinosa-Salinas, Gala Freixer, Julian Fierrez, Ruben Vera-Rodriguez, Enrique Carrillo de Santa Pau, Ana Ramírez de Molina, Javier Ortega-Garcia
Comments: 10 pages, 5 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Databases (cs.DB)
[573] arXiv:2309.06313 [pdf, other]
Title: Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle
Maria Priisalu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[574] arXiv:2309.06323 [pdf, other]
Title: SAMPLING: Scene-adaptive Hierarchical Multiplane Images Representation for Novel View Synthesis from a Single Image
Xiaoyu Zhou, Zhiwei Lin, Xiaojun Shan, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[575] arXiv:2309.06335 [pdf, other]
Title: Grounded Language Acquisition From Object and Action Imagery
James Robert Kubricht, Zhaoyuan Yang, Jianwei Qiu, Peter Henry Tu
Comments: 9 pages, 7 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[576] arXiv:2309.06337 [pdf, other]
Title: Exploring Flat Minima for Domain Generalization with Large Learning Rates
Jian Zhang, Lei Qi, Yinghuan Shi, Yang Gao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[577] arXiv:2309.06370 [pdf, other]
Title: Padding-free Convolution based on Preservation of Differential Characteristics of Kernels
Kuangdai Leng, Jeyan Thiyagalingam
Comments: 8 pages, 3 figures, 1 table, ICLMA 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[578] arXiv:2309.06438 [pdf, other]
Title: Exploring Non-additive Randomness on ViT against Query-Based Black-Box Attacks
Jindong Gu, Fangyun Wei, Philip Torr, Han Hu
Comments: Accepted to BMVC2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[579] arXiv:2309.06439 [pdf, other]
Title: Attention De-sparsification Matters: Inducing Diversity in Digital Pathology Representation Learning
Saarthak Kapse, Srijan Das, Jingwei Zhang, Rajarsi R. Gupta, Joel Saltz, Dimitris Samaras, Prateek Prasanna
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[580] arXiv:2309.06441 [pdf, other]
Title: Learning Disentangled Avatars with Hybrid 3D Representations
Yao Feng, Weiyang Liu, Timo Bolkart, Jinlong Yang, Marc Pollefeys, Michael J. Black
Comments: home page: this https URL. arXiv admin note: text overlap with arXiv:2210.01868
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[581] arXiv:2309.06462 [pdf, html, other]
Title: Action Segmentation Using 2D Skeleton Heatmaps and Multi-Modality Fusion
Syed Waleed Hyder, Muhammad Usama, Anas Zafar, Muhammad Naufil, Fawad Javed Fateh, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran
Comments: Accepted to ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[582] arXiv:2309.06511 [pdf, other]
Title: DF-TransFusion: Multimodal Deepfake Detection via Lip-Audio Cross-Attention and Facial Self-Attention
Aaditya Kharel, Manas Paranjape, Aniket Bera
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[583] arXiv:2309.06521 [pdf, other]
Title: Ethnicity and Biometric Uniqueness: Iris Pattern Individuality in a West African Database
John Daugman, Cathryn Downing, Oluwatobi Noah Akande, Oluwakemi Christiana Abikoye
Comments: 8 pages, 8 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[584] arXiv:2309.06528 [pdf, other]
Title: Strong-Weak Integrated Semi-supervision for Unsupervised Single and Multi Target Domain Adaptation
Xiaohu Lu, Hayder Radha
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[585] arXiv:2309.06547 [pdf, html, other]
Title: AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving
Ahmed Rida Sekkat, Rohit Mohan, Oliver Sawade, Elmar Matthes, Abhinav Valada
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[586] arXiv:2309.06581 [pdf, other]
Title: Zero-Shot Visual Classification with Guided Cropping
Piyapat Saranrittichai, Mauricio Munoz, Volker Fischer, Chaithanya Kumar Mummadi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[587] arXiv:2309.06597 [pdf, other]
Title: Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning
Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel Kochenderfer, Chiho Choi, Behzad Dariush
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[588] arXiv:2309.06618 [pdf, html, other]
Title: Multi-dimensional Fusion and Consistency for Semi-supervised Medical Image Segmentation
Yixing Lu, Zhaoxin Fan, Min Xu
Comments: Accepted by the 30th International Conference on MultiMedia Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[589] arXiv:2309.06626 [pdf, other]
Title: Accelerating Deep Neural Networks via Semi-Structured Activation Sparsity
Matteo Grimaldi, Darshan C. Ganji, Ivan Lazarevich, Sudhakar Sah
Comments: Code is available at this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[590] arXiv:2309.06670 [pdf, html, other]
Title: ShaDocFormer: A Shadow-Attentive Threshold Detector With Cascaded Fusion Refiner for Document Shadow Removal
Weiwen Chen, Yingtie Lei, Shenghong Luo, Ziyang Zhou, Mingxian Li, Chi-Man Pun
Comments: Accepted by IJCNN 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[591] arXiv:2309.06677 [pdf, other]
Title: SHARM: Segmented Head Anatomical Reference Models
Essam A. Rashed, Mohammad al-Shatouri, Ilkka Laakso, Akimasa Hirata
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[592] arXiv:2309.06680 [pdf, html, other]
Title: STUPD: A Synthetic Dataset for Spatial and Temporal Relation Reasoning
Palaash Agrawal, Haidi Azaman, Cheston Tan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[593] arXiv:2309.06701 [pdf, other]
Title: Transparent Object Tracking with Enhanced Fusion Module
Kalyan Garigapati, Erik Blasch, Jie Wei, Haibin Ling
Comments: IEEE IROS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[594] arXiv:2309.06703 [pdf, other]
Title: VLSlice: Interactive Vision-and-Language Slice Discovery
Eric Slyman, Minsuk Kahng, Stefan Lee
Comments: Conference paper at ICCV 2023. 17 pages, 11 figures. this https URL
Journal-ref: 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[595] arXiv:2309.06714 [pdf, other]
Title: MPI-Flow: Learning Realistic Optical Flow with Multiplane Images
Yingping Liang, Jiaming Liu, Debing Zhang, Ying Fu
Comments: Accepted to ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[596] arXiv:2309.06720 [pdf, other]
Title: Deep Attentive Time Warping
Shinnosuke Matsuo, Xiaomeng Wu, Gantugs Atarsaikhan, Akisato Kimura, Kunio Kashino, Brian Kenji Iwana, Seiichi Uchida
Comments: Accepted at Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[597] arXiv:2309.06721 [pdf, other]
Title: Dynamic Spectrum Mixer for Visual Recognition
Zhiqiang Hu, Tao Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[598] arXiv:2309.06724 [pdf, other]
Title: Deep Nonparametric Convexified Filtering for Computational Photography, Image Synthesis and Adversarial Defense
Jianqiao Wangni
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Optimization and Control (math.OC); Machine Learning (stat.ML)
[599] arXiv:2309.06728 [pdf, other]
Title: Leveraging Foundation models for Unsupervised Audio-Visual Segmentation
Swapnil Bhosale, Haosen Yang, Diptesh Kanojia, Xiatian Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[600] arXiv:2309.06735 [pdf, other]
Title: GelFlow: Self-supervised Learning of Optical Flow for Vision-Based Tactile Sensor Displacement Measurement
Zhiyuan Zhang, Hua Yang, Zhouping Yin
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[601] arXiv:2309.06742 [pdf, other]
Title: MTD: Multi-Timestep Detector for Delayed Streaming Perception
Yihui Huang, Ningjiang Chen
Comments: 12 pages, accepted by PRCV 2023 (The 6th Chinese Conference on Pattern Recognition and Computer Vision)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[602] arXiv:2309.06745 [pdf, other]
Title: VEATIC: Video-based Emotion and Affect Tracking in Context Dataset
Zhihang Ren, Jefferson Ortega, Yifan Wang, Zhimin Chen, Yunhui Guo, Stella X. Yu, David Whitney
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Multimedia (cs.MM)
[603] arXiv:2309.06747 [pdf, other]
Title: Integrating GAN and Texture Synthesis for Enhanced Road Damage Detection
Tengyang Chen, Jiangtao Ren
Comments: 10 pages, 13 figures, 2 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[604] arXiv:2309.06750 [pdf, other]
Title: MFL-YOLO: An Object Detection Model for Damaged Traffic Signs
Tengyang Chen, Jiangtao Ren
Comments: 11 pages, 8 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[605] arXiv:2309.06751 [pdf, other]
Title: Remote Sensing Object Detection Meets Deep Learning: A Meta-review of Challenges and Advances
Xiangrong Zhang, Tianyang Zhang, Guanchun Wang, Peng Zhu, Xu Tang, Xiuping Jia, Licheng Jiao
Comments: Accepted with IEEE Geoscience and Remote Sensing Magazine. More than 300 papers relevant to the RSOD filed were reviewed in this survey
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[606] arXiv:2309.06792 [pdf, other]
Title: Motion-Bias-Free Feature-Based SLAM
Alejandro Fontan, Javier Civera, Michael Milford
Comments: BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[607] arXiv:2309.06802 [pdf, other]
Title: Dynamic NeRFs for Soccer Scenes
Sacha Lewin, Maxime Vandegar, Thomas Hoyoux, Olivier Barnich, Gilles Louppe
Comments: Accepted at the 6th International ACM Workshop on Multimedia Content Analysis in Sports. 8 pages, 9 figures. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[608] arXiv:2309.06807 [pdf, html, other]
Title: Bayesian uncertainty-weighted loss for improved generalisability on polyp segmentation task
Rebecca S. Stone, Pedro E. Chavarrias-Solano, Andrew J. Bulpitt, David C. Hogg, Sharib Ali
Comments: To be presented at the Fairness of AI in Medical Imaging (FAIMI) MICCAI 2023 Workshop and published in volumes of the Springer Lecture Notes Computer Science (LNCS) series
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[609] arXiv:2309.06809 [pdf, other]
Title: TAP: Targeted Prompting for Task Adaptive Generation of Textual Training Instances for Visual Classification
M. Jehanzeb Mirza, Leonid Karlinsky, Wei Lin, Horst Possegger, Rogerio Feris, Horst Bischof
Comments: Code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[610] arXiv:2309.06810 [pdf, html, other]
Title: Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly
Ruihai Wu, Chenrui Tie, Yushi Du, Yan Zhao, Hao Dong
Comments: ICCV 2023, Project page: this https URL , Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[611] arXiv:2309.06819 [pdf, other]
Title: Tracking Particles Ejected From Active Asteroid Bennu With Event-Based Vision
Loïc J. Azzalini, Dario Izzo
Comments: 6 pages, 3 figures, presented at the XXVII Italian Association of Aeronautics and Astronautics (AIDAA) Congress, 4-7 September 2023, Padova Italy
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[612] arXiv:2309.06824 [pdf, html, other]
Title: Beyond Adapting SAM: Towards End-to-End Ultrasound Image Segmentation via Auto Prompting
Xian Lin, Yangyang Xiang, Li Yu, Zengqiang Yan
Comments: Also known as SAMUS. Officially accepted by MICCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[613] arXiv:2309.06828 [pdf, other]
Title: UniBrain: Universal Brain MRI Diagnosis with Hierarchical Knowledge-enhanced Pre-training
Jiayu Lei, Lisong Dai, Haoyun Jiang, Chaoyi Wu, Xiaoman Zhang, Yao Zhang, Jiangchao Yao, Weidi Xie, Yanyong Zhang, Yuehua Li, Ya Zhang, Yanfeng Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[614] arXiv:2309.06877 [pdf, other]
Title: Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization
Zhenguang Liu, Xinyang Yu, Ruili Wang, Shuai Ye, Zhe Ma, Jianfeng Dong, Sifeng He, Feng Qian, Xiaobo Zhang, Roger Zimmermann, Lei Yang
Comments: This paper is accepted by ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[615] arXiv:2309.06884 [pdf, other]
Title: Autoencoder-Based Visual Anomaly Localization for Manufacturing Quality Control
Devang Mehta, Noah Klarmann
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[616] arXiv:2309.06891 [pdf, other]
Title: Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit?
Bill Psomas, Ioannis Kakogeorgiou, Konstantinos Karantzalos, Yannis Avrithis
Comments: ICCV 2023. Code and models: this https URL
Journal-ref: International Conference on Computer Vision (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[617] arXiv:2309.06895 [pdf, other]
Title: MagiCapture: High-Resolution Multi-Concept Portrait Customization
Junha Hyung, Jaeyo Shin, Jaegul Choo
Comments: 18 pages, 17 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[618] arXiv:2309.06902 [pdf, other]
Title: CCSPNet-Joint: Efficient Joint Training Method for Traffic Sign Detection Under Extreme Conditions
Haoqin Hong, Yue Zhou, Xiangyu Shu, Xiaofang Hu
Journal-ref: 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 2024, pp. 1-8
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[619] arXiv:2309.06922 [pdf, other]
Title: Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning
Sanghyeon Kim, Hyunmo Yang, Younghyun Kim, Youngjoon Hong, Eunbyung Park
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[620] arXiv:2309.06924 [pdf, other]
Title: Contrast-Phys+: Unsupervised and Weakly-supervised Video-based Remote Physiological Measurement via Spatiotemporal Contrast
Zhaodong Sun, Xiaobai Li
Comments: Accepted by TPAMI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[621] arXiv:2309.06933 [pdf, html, other]
Title: DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
Namhyuk Ahn, Junsoo Lee, Chunggi Lee, Kunhee Kim, Daesik Kim, Seung-Hun Nam, Kibeom Hong
Comments: AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[622] arXiv:2309.06941 [pdf, html, other]
Title: DEFormer: DCT-driven Enhancement Transformer for Low-light Image and Dark Vision
Xiangchen Yin, Zhenda Yu, Xin Gao, Xiao Sun
Comments: Accepted by ICASSP
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[623] arXiv:2309.06951 [pdf, other]
Title: TransNet: A Transfer Learning-Based Network for Human Action Recognition
K. Alomar, X. Cai
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[624] arXiv:2309.06958 [pdf, other]
Title: Neural network-based coronary dominance classification of RCA angiograms
Ivan Kruzhilov, Egor Ikryannikov, Artem Shadrin, Ruslan Utegenov, Galina Zubkova, Ivan Bessonov
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[625] arXiv:2309.06961 [pdf, html, other]
Title: Towards Reliable Dermatology Evaluation Benchmarks
Fabian Gröger, Simone Lionetti, Philippe Gottfrois, Alvaro Gonzalez-Jimenez, Matthew Groh, Roxana Daneshjou, Labelling Consortium, Alexander A. Navarini, Marc Pouly
Comments: Link to the revised file lists: this https URL
Journal-ref: Proceedings of the 3rd Machine Learning for Health Symposium, PMLR 225:101-128, 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[626] arXiv:2309.06978 [pdf, other]
Title: Differentiable JPEG: The Devil is in the Details
Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar
Comments: Accepted at WACV 2024. Project page: this https URL WACV paper: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[627] arXiv:2309.06987 [pdf, other]
Title: Instance Adaptive Prototypical Contrastive Embedding for Generalized Zero Shot Learning
Riti Paul, Sahil Vora, Baoxin Li
Comments: 7 pages, 4 figures. Accepted in IJCAI 2023 Workshop on Generalizing from Limited Resources in the Open World
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[628] arXiv:2309.07021 [pdf, other]
Title: Exploiting Multiple Priors for Neural 3D Indoor Reconstruction
Federico Lincetto, Gianluca Agresti, Mattia Rossi, Pietro Zanuttigh
Comments: Accepted at the British Machine Vision Conference (BMVC) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[629] arXiv:2309.07054 [pdf, html, other]
Title: Aggregating Nearest Sharp Features via Hybrid Transformers for Video Deblurring
Wei Shang, Dongwei Ren, Yi Yang, Wangmeng Zuo
Comments: Accepted by Information Sciences 2024, and the code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[630] arXiv:2309.07068 [pdf, other]
Title: FAIR: Frequency-aware Image Restoration for Industrial Visual Anomaly Detection
Tongkun Liu, Bing Li, Xiao Du, Bingke Jiang, Leqi Geng, Feiyang Wang, Zhuo Zhao
Comments: 12 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[631] arXiv:2309.07084 [pdf, other]
Title: SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection
Yiran Qin, Chaoqun Wang, Zijian Kang, Ningning Ma, Zhen Li, Ruimao Zhang
Comments: Accepted to ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[632] arXiv:2309.07087 [pdf, other]
Title: Developing a Novel Image Marker to Predict the Clinical Outcome of Neoadjuvant Chemotherapy (NACT) for Ovarian Cancer Patients
Ke Zhang, Neman Abdoli, Patrik Gilley, Youkabed Sadri, Xuxin Chen, Theresa C. Thai, Lauren Dockery, Kathleen Moore, Robert S. Mannel, Yuchen Qiu
Journal-ref: Computers in Biology and Medicine 172 (2024): 108240
Subjects: Computer Vision and Pattern Recognition (cs.CV); Data Analysis, Statistics and Probability (physics.data-an); Medical Physics (physics.med-ph)
[633] arXiv:2309.07104 [pdf, other]
Title: Polygon Intersection-over-Union Loss for Viewpoint-Agnostic Monocular 3D Vehicle Detection
Derek Gloudemans, Xinxuan Lu, Shepard Xia, Daniel B. Work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[634] arXiv:2309.07106 [pdf, other]
Title: Hardening RGB-D Object Recognition Systems against Adversarial Patch Attacks
Yang Zheng, Luca Demetrio, Antonio Emanuele Cinà, Xiaoyi Feng, Zhaoqiang Xia, Xiaoyue Jiang, Ambra Demontis, Battista Biggio, Fabio Roli
Comments: Accepted for publication in the Information Sciences journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[635] arXiv:2309.07113 [pdf, other]
Title: Contrastive Deep Encoding Enables Uncertainty-aware Machine-learning-assisted Histopathology
Nirhoshan Sivaroopan, Chamuditha Jayanga, Chalani Ekanayake, Hasindri Watawana, Jathurshan Pradeepkumar, Mithunjha Anandakumar, Ranga Rodrigo, Chamira U. S. Edussooriya, Dushan N. Wadduwage
Comments: 18 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[636] arXiv:2309.07122 [pdf, other]
Title: Tree-Structured Shading Decomposition
Chen Geng, Hong-Xing Yu, Sharon Zhang, Maneesh Agrawala, Jiajun Wu
Comments: Accepted at ICCV 2023. Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[637] arXiv:2309.07125 [pdf, other]
Title: Text-Guided Generation and Editing of Compositional 3D Avatars
Hao Zhang, Yao Feng, Peter Kulits, Yandong Wen, Justus Thies, Michael J. Black
Comments: Home page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[638] arXiv:2309.07186 [pdf, other]
Title: LCReg: Long-Tailed Image Classification with Latent Categories based Recognition
Weide Liu, Zhonghua Wu, Yiming Wang, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin
Comments: accepted by Pattern Recognition. arXiv admin note: substantial text overlap with arXiv:2206.01010
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[639] arXiv:2309.07243 [pdf, other]
Title: LInKs "Lifting Independent Keypoints" -- Partial Pose Lifting for Occlusion Handling with Improved Accuracy in 2D-3D Human Pose Estimation
Peter Hardy, Hansung Kim
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[640] arXiv:2309.07254 [pdf, html, other]
Title: Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement
Chenghao Li, Dake Chen, Yuke Zhang, Peter A. Beerel
Comments: This paper has been accepted for presentation at 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
[641] arXiv:2309.07268 [pdf, other]
Title: So you think you can track?
Derek Gloudemans, Gergely Zachár, Yanbing Wang, Junyi Ji, Matt Nice, Matt Bunting, William Barbour, Jonathan Sprinkle, Benedetto Piccoli, Maria Laura Delle Monache, Alexandre Bayen, Benjamin Seibold, Daniel B. Work
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[642] arXiv:2309.07277 [pdf, html, other]
Title: Limitations of Face Image Generation
Harrison Rosenberg, Shimaa Ahmed, Guruprasad V Ramesh, Ramya Korlakai Vinayak, Kassem Fawaz
Comments: Accepted to The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[643] arXiv:2309.07293 [pdf, other]
Title: GAN-based Algorithm for Efficient Image Inpainting
Zhengyang Han, Zehao Jiang, Yuan Ju
Comments: 6 pages, 3 figures
Journal-ref: The 3rd International Conference on Artificial Intelligence and Computer Engineering(ICAICE 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[644] arXiv:2309.07297 [pdf, other]
Title: Multi-Modal Hybrid Learning and Sequential Training for RGB-T Saliency Detection
Guangyu Ren, Jitesh Joshi, Youngjun Cho
Comments: 8 Pages main text, 3 pages supplementary information, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[645] arXiv:2309.07322 [pdf, other]
Title: $\texttt{NePhi}$: Neural Deformation Fields for Approximately Diffeomorphic Medical Image Registration
Lin Tian, Hastings Greer, Raúl San José Estépar, Roni Sengupta, Marc Niethammer
Comments: ECCV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[646] arXiv:2309.07330 [pdf, other]
Title: Automated Assessment of Critical View of Safety in Laparoscopic Cholecystectomy
Yunfan Li, Himanshu Gupta, Haibin Ling, IV Ramakrishnan, Prateek Prasanna, Georgios Georgakis, Aaron Sasson
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[647] arXiv:2309.07361 [pdf, other]
Title: Judging a video by its bitstream cover
Yuxing Han, Yunan Ding, Jiangtao Wen, Chen Ye Gan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[648] arXiv:2309.07390 [pdf, other]
Title: Unleashing the Power of Depth and Pose Estimation Neural Networks by Designing Compatible Endoscopic Images
Junyang Wu, Yun Gu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[649] arXiv:2309.07394 [pdf, other]
Title: Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images
Zhiyun Song, Penghui Du, Junpeng Yan, Kailu Li, Jianzhong Shou, Maode Lai, Yubo Fan, Yan Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[650] arXiv:2309.07398 [pdf, other]
Title: Semantic Adversarial Attacks via Diffusion Models
Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm, Kaidi Xu
Comments: To appear in BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[651] arXiv:2309.07400 [pdf, other]
Title: HIGT: Hierarchical Interaction Graph-Transformer for Whole Slide Image Analysis
Ziyu Guo, Weiqin Zhao, Shujun Wang, Lequan Yu
Comments: Accepted by MICCAI2023; Code is available in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[652] arXiv:2309.07403 [pdf, other]
Title: Flexible Visual Recognition by Evidential Modeling of Confusion and Ignorance
Lei Fan, Bo Liu, Haoxiang Li, Ying Wu, Gang Hua
Comments: Accepted by ICCV23
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[653] arXiv:2309.07409 [pdf, other]
Title: Masked Diffusion with Task-awareness for Procedure Planning in Instructional Videos
Fen Fang, Yun Liu, Ali Koksal, Qianli Xu, Joo-Hwee Lim
Comments: 7 pages (main text excluding references), 3 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[654] arXiv:2309.07425 [pdf, other]
Title: JSMNet Improving Indoor Point Cloud Semantic and Instance Segmentation through Self-Attention and Multiscale
Shuochen Xu, Zhenxin Zhang
Journal-ref: The ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[655] arXiv:2309.07428 [pdf, other]
Title: Physical Invisible Backdoor Based on Camera Imaging
Yusheng Guo, Nan Zhong, Zhenxing Qian, Xinpeng Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[656] arXiv:2309.07439 [pdf, html, other]
Title: DePT: Decoupled Prompt Tuning
Ji Zhang, Shihan Wu, Lianli Gao, Heng Tao Shen, Jingkuan Song
Comments: 13 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[657] arXiv:2309.07444 [pdf, other]
Title: Research on self-cross transformer model of point cloud change detecter
Xiaoxu Ren, Haili Sun, Zhenxin Zhang
Journal-ref: ISPRS Annals of the Photogrammetry Remote Sensing and Spatial Information Sciences2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[658] arXiv:2309.07471 [pdf, other]
Title: EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization
Minjung Kim, Junseo Koo, Gunhee Kim
Comments: Accepted to ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[659] arXiv:2309.07495 [pdf, other]
Title: HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods
Yongyuan Li, Xiuyuan Qin, Chao Liang, Mingqiang Wei
Comments: 15pages, 6 figures, PRCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[660] arXiv:2309.07499 [pdf, other]
Title: Efficiently Robustify Pre-trained Models
Nishant Jain, Harkirat Behl, Yogesh Singh Rawat, Vibhav Vineet
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[661] arXiv:2309.07509 [pdf, other]
Title: DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang
Comments: submmit to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[662] arXiv:2309.07513 [pdf, other]
Title: RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement
Gregor Koehler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jaeger, Jörg K.H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein
Comments: Accepted at 2024 Winter Conference on Applications of Computer Vision (WACV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[663] arXiv:2309.07515 [pdf, other]
Title: Dhan-Shomadhan: A Dataset of Rice Leaf Disease Classification for Bangladeshi Local Rice
Md. Fahad Hossain
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[664] arXiv:2309.07524 [pdf, html, other]
Title: A Multi-scale Generalized Shrinkage Threshold Network for Image Blind Deblurring in Remote Sensing
Yujie Feng, Yin Yang, Xiaohong Fan, Zhengpeng Zhang, Jianping Zhang
Comments: 16 pages,Accepted to IEEE Transactions on Geoscience and Remote Sensing,2024
Journal-ref: IEEE Transactions on Geoscience and Remote Sensing,2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Image and Video Processing (eess.IV)
[665] arXiv:2309.07537 [pdf, other]
Title: Towards a universal mechanism for successful deep learning
Yuval Meir, Yarden Tzach, Shiri Hodassman, Ofek Tevet, Ido Kanter
Comments: 31 pages,7 figures, 9 tables. arXiv admin note: text overlap with arXiv:2305.18078
Journal-ref: Scientific Reports volume 14, Article number: 5881 (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[666] arXiv:2309.07616 [pdf, other]
Title: Road Disease Detection based on Latent Domain Background Feature Separation and Suppression
Juwu Zheng, Jiangtao Ren
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[667] arXiv:2309.07623 [pdf, other]
Title: SwitchGPT: Adapting Large Language Models for Non-Text Outputs
Xinyu Wang, Bohan Zhuang, Qi Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[668] arXiv:2309.07640 [pdf, html, other]
Title: Indoor Scene Reconstruction with Fine-Grained Details Using Hybrid Representation and Normal Prior Enhancement
Sheng Ye, Yubin Hu, Matthieu Lin, Yu-Hui Wen, Wang Zhao, Yong-Jin Liu, Wenping Wang
Comments: accepted by TVCG
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[669] arXiv:2309.07654 [pdf, other]
Title: Towards Robust and Unconstrained Full Range of Rotation Head Pose Estimation
Thorsten Hempel, Ahmed A. Abdelrahman, Ayoub Al-Hamadi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[670] arXiv:2309.07668 [pdf, other]
Title: ChromaDistill: Colorizing Monochrome Radiance Fields with Knowledge Distillation
Ankit Dhiman, R Srinath, Srinjay Sarkar, Lokesh R Boregowda, R Venkatesh Babu
Comments: WACV 2025, AI3DCC @ ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[671] arXiv:2309.07698 [pdf, other]
Title: Dataset Condensation via Generative Model
David Junhao Zhang, Heng Wang, Chuhui Xue, Rui Yan, Wenqing Zhang, Song Bai, Mike Zheng Shou
Comments: old work,done in 2022
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[672] arXiv:2309.07704 [pdf, html, other]
Title: NutritionVerse: Empirical Study of Various Dietary Intake Estimation Approaches
Chi-en Amy Tai, Matthew Keller, Saeejith Nair, Yuhao Chen, Yifan Wu, Olivia Markham, Krish Parmar, Pengcheng Xi, Heather Keller, Sharon Kirkpatrick, Alexander Wong
Comments: Corrections made to Tables 6, 7, and 8, and corrections made to Experiments Part C. Additional clarification made in Section 4
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[673] arXiv:2309.07749 [pdf, other]
Title: OmnimatteRF: Robust Omnimatte with 3D Background Modeling
Geng Lin, Chen Gao, Jia-Bin Huang, Changil Kim, Yipeng Wang, Matthias Zwicker, Ayush Saraf
Comments: ICCV 2023. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[674] arXiv:2309.07752 [pdf, other]
Title: DT-NeRF: Decomposed Triplane-Hash Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Yaoyu Su, Shaohui Wang, Haoqian Wang
Comments: 5 pages, 5 figures. Submitted to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[675] arXiv:2309.07753 [pdf, other]
Title: Co-Salient Object Detection with Semantic-Level Consensus Extraction and Dispersion
Peiran Xu, Yadong Mu
Comments: Accepted by ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[676] arXiv:2309.07760 [pdf, html, other]
Title: PRE: Vision-Language Prompt Learning with Reparameterization Encoder
Thi Minh Anh Pham, An Duc Nguyen, Cephas Svosve, Vasileios Argyriou, Georgios Tzimiropoulos
Comments: 10 pages excluding References and Appendix
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[677] arXiv:2309.07796 [pdf, other]
Title: For A More Comprehensive Evaluation of 6DoF Object Pose Tracking
Yang Li, Fan Zhong, Xin Wang, Shuangbing Song, Jiachen Li, Xueying Qin, Changhe Tu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[678] arXiv:2309.07808 [pdf, html, other]
Title: What Matters to Enhance Traffic Rule Compliance of Imitation Learning for End-to-End Autonomous Driving
Hongkuan Zhou, Wei Cao, Aifen Sui, Zhenshan Bing
Comments: 14 pages, 3 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Robotics (cs.RO)
[679] arXiv:2309.07819 [pdf, other]
Title: Decomposition of linear tensor transformations
Claudio Turchetti
Comments: arXiv admin note: text overlap with arXiv:2305.02803
Subjects: Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
[680] arXiv:2309.07823 [pdf, other]
Title: Large-scale Weakly Supervised Learning for Road Extraction from Satellite Imagery
Shiqiao Meng, Zonglin Di, Siwei Yang, Yin Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[681] arXiv:2309.07846 [pdf, html, other]
Title: MC-NeRF: Multi-Camera Neural Radiance Fields for Multi-Camera Image Acquisition Systems
Yu Gao, Lutong Su, Hao Liang, Yufeng Yue, Yi Yang, Mengyin Fu
Comments: This manuscript is currently under review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[682] arXiv:2309.07849 [pdf, html, other]
Title: TFNet: Exploiting Temporal Cues for Fast and Accurate LiDAR Semantic Segmentation
Rong Li, ShiJie Li, Xieyuanli Chen, Teli Ma, Juergen Gall, Junwei Liang
Comments: accepted by CVPR2024 Workshop on Autonomous Driving
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[683] arXiv:2309.07866 [pdf, other]
Title: Gradient constrained sharpness-aware prompt learning for vision-language models
Liangchen Liu, Nannan Wang, Dawei Zhou, Xinbo Gao, Decheng Liu, Xi Yang, Tongliang Liu
Comments: 19 pages 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[684] arXiv:2309.07880 [pdf, html, other]
Title: mEBAL2 Database and Benchmark: Image-based Multispectral Eyeblink Detection
Roberto Daza, Aythami Morales, Julian Fierrez, Ruben Tolosana, Ruben Vera-Rodriguez
Comments: Published in the journal Pattern Recognition Letters in June 2024. Accessible from this https URL
Journal-ref: Pattern Recognition Letters, vol. 182, pp. 83-89, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[685] arXiv:2309.07888 [pdf, other]
Title: A Novel Local-Global Feature Fusion Framework for Body-weight Exercise Recognition with Pressure Mapping Sensors
Davinder Pal Singh, Lala Shakti Swarup Ray, Bo Zhou, Sungho Suh, Paul Lukowicz
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[686] arXiv:2309.07891 [pdf, html, other]
Title: HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image
Hongsuk Choi, Nikhil Chavan-Dafle, Jiacheng Yuan, Volkan Isler, Hyunsoo Park
Comments: In ICRA 2024; 13 pages including the supplementary material, 8 tables, 12 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[687] arXiv:2309.07906 [pdf, html, other]
Title: Generative Image Dynamics
Zhengqi Li, Richard Tucker, Noah Snavely, Aleksander Holynski
Comments: Project website: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[688] arXiv:2309.07910 [pdf, other]
Title: TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting
Rohan Choudhury, Kris Kitani, Laszlo A. Jeni
Comments: Accepted at ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[689] arXiv:2309.07911 [pdf, other]
Title: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
Zhiwu Qing, Shiwei Zhang, Ziyuan Huang, Yingya Zhang, Changxin Gao, Deli Zhao, Nong Sang
Comments: ICCV2023. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[690] arXiv:2309.07914 [pdf, other]
Title: ALWOD: Active Learning for Weakly-Supervised Object Detection
Yuting Wang, Velibor Ilic, Jiatong Li, Branislav Kisacanin, Vladimir Pavlovic
Comments: published in ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[691] arXiv:2309.07917 [pdf, other]
Title: Looking at words and points with attention: a benchmark for text-to-shape coherence
Andrea Amaduzzi, Giuseppe Lisanti, Samuele Salti, Luigi Di Stefano
Comments: ICCV 2023 Workshop "AI for 3D Content Creation", Project page: this https URL, 26 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[692] arXiv:2309.07918 [pdf, html, other]
Title: Unified Human-Scene Interaction via Prompted Chain-of-Contacts
Zeqi Xiao, Tai Wang, Jingbo Wang, Jinkun Cao, Wenwei Zhang, Bo Dai, Dahua Lin, Jiangmiao Pang
Comments: A unified Human-Scene Interaction framework that supports versatile interactions through language this http URL URL: this https URL . Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[693] arXiv:2309.07920 [pdf, other]
Title: Large-Vocabulary 3D Diffusion Model with Transformer
Ziang Cao, Fangzhou Hong, Tong Wu, Liang Pan, Ziwei Liu
Comments: Project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[694] arXiv:2309.07921 [pdf, other]
Title: OpenIllumination: A Multi-Illumination Dataset for Inverse Rendering Evaluation on Real Objects
Isabella Liu, Linghao Chen, Ziyang Fu, Liwen Wu, Haian Jin, Zhong Li, Chin Ming Ryan Wong, Yi Xu, Ravi Ramamoorthi, Zexiang Xu, Hao Su
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[695] arXiv:2309.07929 [pdf, other]
Title: Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer
Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li
Comments: Accepted by AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[696] arXiv:2309.07944 [pdf, other]
Title: Text-to-Image Models for Counterfactual Explanations: a Black-Box Approach
Guillaume Jeanneret, Loïc Simon, Frédéric Jurie
Comments: WACV 2024 Camera ready + supplementary material
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[697] arXiv:2309.07986 [pdf, other]
Title: Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models
James Burgess, Kuan-Chieh Wang, Serena Yeung-Levy
Comments: ECCV 2024 (European Conference on Computer Vision). Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[698] arXiv:2309.08006 [pdf, html, other]
Title: Facial Kinship Verification from remote photoplethysmography
Xiaoting Wu, Xiaoyi Feng, Constantino Álvarez Casado, Lili Liu, Miguel Bordallo López
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[699] arXiv:2309.08009 [pdf, other]
Title: Measuring the Quality of Text-to-Video Model Outputs: Metrics and Dataset
Iya Chivileva, Philip Lynch, Tomas E. Ward, Alan F. Smeaton
Comments: 13 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[700] arXiv:2309.08020 [pdf, other]
Title: Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation
Zhaochong An, Guolei Sun, Zongwei Wu, Hao Tang, Luc Van Gool
Comments: BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[701] arXiv:2309.08021 [pdf, other]
Title: Vision-based Analysis of Driver Activity and Driving Performance Under the Influence of Alcohol
Ross Greer, Akshay Gopalkrishnan, Sumega Mandadi, Pujitha Gunaratne, Mohan M. Trivedi, Thomas D. Marcotte
Comments: Withdrawn at the request of industry research collaborators, per contract agreement
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[702] arXiv:2309.08022 [pdf, other]
Title: Empowering Visually Impaired Individuals: A Novel Use of Apple Live Photos and Android Motion Photos
Seyedalireza Khoshsirat, Chandra Kambhamettu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[703] arXiv:2309.08033 [pdf, other]
Title: Depth Estimation from a Single Optical Encoded Image using a Learned Colored-Coded Aperture
Jhon Lopez, Edwin Vargas, Henry Arguello
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[704] arXiv:2309.08035 [pdf, html, other]
Title: Interpretability-Aware Vision Transformer
Yao Qiang, Chengyin Li, Prashant Khanduri, Dongxiao Zhu
Comments: 10 pages, 4 figures, 5 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[705] arXiv:2309.08036 [pdf, other]
Title: BEA: Revisiting anchor-based object detection DNN using Budding Ensemble Architecture
Syed Sha Qutub, Neslihan Kose, Rafael Rosales, Michael Paulitsch, Korbinian Hagn, Florian Geissler, Yang Peng, Gereon Hinz, Alois Knoll
Comments: 14 pages, 5 pages supplementary material. Accepted at BMVC-2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[706] arXiv:2309.08042 [pdf, other]
Title: Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved
Yao Sun, Anna Kruspe, Liqiu Meng, Yifan Tian, Eike J Hoffmann, Stefan Auer, Xiao Xiang Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[707] arXiv:2309.08048 [pdf, other]
Title: Padding Aware Neurons
Dario Garcia-Gasulla, Victor Gimenez-Abalos, Pablo Martin-Torres
Comments: In 4th Visual Inductive Priors for Data-Efficient Deep Learning Workshop, ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[708] arXiv:2309.08066 [pdf, other]
Title: Morphologically-Aware Consensus Computation via Heuristics-based IterATive Optimization (MACCHIatO)
Dimitri Hamzaoui, Sarah Montagne, Raphaële Renard-Penna, Nicholas Ayache, Hervé Delingette
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL
Journal-ref: Machine.Learning.for.Biomedical.Imaging. 2 (2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Optimization and Control (math.OC)
[709] arXiv:2309.08087 [pdf, other]
Title: hear-your-action: human action recognition by ultrasound active sensing
Risako Tanigawa, Yasunori Ishii
Subjects: Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[710] arXiv:2309.08097 [pdf, html, other]
Title: Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions
Tianxu Wu, Shuo Ye, Shuhuang Chen, Qinmu Peng, Xinge You
Comments: Accepted by TETCI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[711] arXiv:2309.08113 [pdf, other]
Title: MetaF2N: Blind Image Super-Resolution by Learning Efficient Model Adaptation from Faces
Zhicun Yin, Ming Liu, Xiaoming Li, Hui Yang, Longan Xiao, Wangmeng Zuo
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[712] arXiv:2309.08134 [pdf, other]
Title: AnyOKP: One-Shot and Instance-Aware Object Keypoint Extraction with Pretrained ViT
Fangbo Qin, Taogang Hou, Shan Lin, Kaiyuan Wang, Michael C. Yip, Shan Yu
Comments: Submitted to IEEE ICRA 2024 as a contributed paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[713] arXiv:2309.08136 [pdf, other]
Title: Let's Roll: Synthetic Dataset Analysis for Pedestrian Detection Across Different Shutter Types
Yue Hu, Gourav Datta, Kira Beerel, Peter Beerel
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[714] arXiv:2309.08139 [pdf, other]
Title: Multi-Scale Estimation for Omni-Directional Saliency Maps Using Learnable Equator Bias
Takao Yamanaka, Tatsuya Suzuki, Taiki Nobutsune, Chenjunlin Wu
Comments: Accepted for publication in IEICE Transactions on Information and Systems, Vol. E106-D, No. 10, 2023. this https URL The code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[715] arXiv:2309.08152 [pdf, html, other]
Title: DA-RAW: Domain Adaptive Object Detection for Real-World Adverse Weather Conditions
Minsik Jeon, Junwon Seo, Jihong Min
Comments: Accepted to ICRA 2024. Our project website can be found at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[716] arXiv:2309.08154 [pdf, other]
Title: Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking
Wenzhang Wei, Zhipeng Gui, Changguang Wu, Anqi Zhao, Dehua Peng, Huayi Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[717] arXiv:2309.08159 [pdf, other]
Title: AdSEE: Investigating the Impact of Image Style Editing on Advertisement Attractiveness
Liyao Jiang, Chenglin Li, Haolan Chen, Xiaodong Gao, Xinwang Zhong, Yang Qiu, Shani Ye, Di Niu
Comments: Accepted to KDD 2023 Applied Data Science Track
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Machine Learning (cs.LG)
[718] arXiv:2309.08164 [pdf, other]
Title: A Ground Segmentation Method Based on Point Cloud Map for Unstructured Roads
Zixuan Li, Haiying Lin, Zhangyu Wang, Huazhi Li, Miao Yu, Jie Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[719] arXiv:2309.08167 [pdf, other]
Title: Differentiable Resolution Compression and Alignment for Efficient Video Classification and Retrieval
Rui Deng, Qian Wu, Yuke Li, Haoran Fu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[720] arXiv:2309.08179 [pdf, other]
Title: STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation
Xukun Zhou, Zhenbo Song, Jun He, Hongyan Liu, Zhaoxin Fan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[721] arXiv:2309.08196 [pdf, other]
Title: ECEA: Extensible Co-Existing Attention for Few-Shot Object Detection
Zhimeng Xin, Tianxu Wu, Shiming Chen, Yixiong Zou, Ling Shao, Xinge You
Comments: 12 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[722] arXiv:2309.08204 [pdf, html, other]
Title: One-stage Modality Distillation for Incomplete Multimodal Learning
Shicai Wei, Yang Luo, Chunbo Luo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[723] arXiv:2309.08206 [pdf, other]
Title: Salient Object Detection in Optical Remote Sensing Images Driven by Transformer
Gongyang Li, Zhen Bai, Zhi Liu, Xinpeng Zhang, Haibin Ling
Comments: 13 pages, 6 figures, Accepted by IEEE Transactions on Image Processing 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[724] arXiv:2309.08220 [pdf, other]
Title: UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection
Junwen Xiong, Peng Zhang, Chuanyue Li, Wei Huang, Yufei Zha, Tao You
Comments: 11 pages, 7 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[725] arXiv:2309.08239 [pdf, other]
Title: Human-Inspired Topological Representations for Visual Object Recognition in Unseen Environments
Ekta U. Samani, Ashis G. Banerjee
Comments: Accepted for presentation at the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Workshop on Robotic Perception and Mapping: Frontier Vision & Learning Techniques
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[726] arXiv:2309.08244 [pdf, other]
Title: A Real-time Faint Space Debris Detector With Learning-based LCM
Zherui Lu, Gangyi Wang, Xinguo Wei, Jian Li
Comments: 13 pages, 28 figures, normal article
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[727] arXiv:2309.08250 [pdf, other]
Title: Optimization of Rank Losses for Image Retrieval
Elias Ramzi, Nicolas Audebert, Clément Rambour, André Araujo, Xavier Bitot, Nicolas Thome
Comments: arXiv admin note: text overlap with arXiv:2207.04873
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[728] arXiv:2309.08251 [pdf, other]
Title: Cartoondiff: Training-free Cartoon Image Generation with Diffusion Transformer Models
Feihong He, Gang Li, Lingyu Si, Leilei Yan, Shimeng Hou, Hongwei Dong, Fanzhang Li
Comments: 5 pages,5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[729] arXiv:2309.08259 [pdf, other]
Title: BROW: Better featuRes fOr Whole slide image based on self-distillation
Yuanfeng Wu, Shaojie Li, Zhiqiang Du, Wentao Zhu
Comments: 14 pages including reference part, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[730] arXiv:2309.08264 [pdf, other]
Title: Leveraging the Power of Data Augmentation for Transformer-based Tracking
Jie Zhao, Johan Edstedt, Michael Felsberg, Dong Wang, Huchuan Lu
Comments: 10 pages, 5 figures, 7 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[731] arXiv:2309.08265 [pdf, other]
Title: Edge Based Oriented Object Detection
Jianghu Shen, Xiaojun Wu
Comments: 9 pages, 8 figures, 1 algorithm,
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[732] arXiv:2309.08273 [pdf, html, other]
Title: A Generative Framework for Self-Supervised Facial Representation Learning
Ruian He, Zhen Xing, Weimin Tan, Bo Yan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[733] arXiv:2309.08289 [pdf, html, other]
Title: Large Intestine 3D Shape Refinement Using Point Diffusion Models for Digital Phantom Generation
Kaouther Mouheb, Mobina Ghojogh Nejad, Lavsen Dahal, Ehsan Samei, Kyle J. Lafata, W. Paul Segars, Joseph Y. Lo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[734] arXiv:2309.08302 [pdf, other]
Title: T-UDA: Temporal Unsupervised Domain Adaptation in Sequential Point Clouds
Awet Haileslassie Gebrehiwot, David Hurych, Karel Zimmermann, Patrick Pérez, Tomáš Svoboda
Comments: Will appear at IEEE/RSJ International Conference on Intelligent Robots and Systems 2023 (IROS 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[735] arXiv:2309.08353 [pdf, other]
Title: Continual Learning with Deep Streaming Regularized Discriminant Analysis
Joe Khawand, Peter Hanappe, David Colliaux
Journal-ref: In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 3455-3462) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[736] arXiv:2309.08365 [pdf, other]
Title: M$^3$Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection
Yao Yuan, Pan Gao, XiaoYang Tan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[737] arXiv:2309.08368 [pdf, other]
Title: Robust Burned Area Delineation through Multitask Learning
Edoardo Arnaudo, Luca Barco, Matteo Merlo, Claudio Rossi
Comments: Accepted at ECML PKDD 2023 - MACLEAN Workshop (11 pages, 3 figures)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[738] arXiv:2309.08369 [pdf, other]
Title: An Efficient Wide-Range Pseudo-3D Vehicle Detection Using A Single Camera
Zhupeng Ye, Yinqi Li, Zejian Yuan
Comments: 11 pages, 27 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[739] arXiv:2309.08372 [pdf, other]
Title: Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval
Kejun Lin, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Shin'ichi Satoh
Comments: ACM Multimedia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[740] arXiv:2309.08379 [pdf, other]
Title: PatFig: Generating Short and Long Captions for Patent Figures
Dana Aubakirova, Kim Gerdes, Lufei Liu
Comments: accepted to the ICCV 2023, CLVL: 5th Workshop on Closing the Loop Between Vision and Language
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[741] arXiv:2309.08382 [pdf, other]
Title: Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance
Jingxiang Qu, Ryan Wen Liu, Yuan Gao, Yu Guo, Fenghua Zhu, Fei-yue Wang
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[742] arXiv:2309.08416 [pdf, other]
Title: Deformable Neural Radiance Fields using RGB and Event Cameras
Qi Ma, Danda Pani Paudel, Ajad Chhatkuli, Luc Van Gool
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[743] arXiv:2309.08424 [pdf, other]
Title: X-PDNet: Accurate Joint Plane Instance Segmentation and Monocular Depth Estimation with Cross-Task Distillation and Boundary Correction
Cao Dinh Duc, Jongwoo Lim
Comments: Accepted to BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[744] arXiv:2309.08442 [pdf, other]
Title: Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups
Parsa Rahimi, Christophe Ecabert, Sebastien Marcel
Comments: IJCB 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[745] arXiv:2309.08471 [pdf, html, other]
Title: TreeLearn: A deep learning method for segmenting individual trees from ground-based LiDAR forest point clouds
Jonathan Henrich, Jan van Delden, Dominik Seidel, Thomas Kneib, Alexander Ecker
Journal-ref: Ecological Informatics, Volume 84, 2024, 102888
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[746] arXiv:2309.08480 [pdf, html, other]
Title: PoseFix: Correcting 3D Human Poses with Natural Language
Ginger Delmas, Philippe Weinzaepfel, Francesc Moreno-Noguer, Grégory Rogez
Comments: Published in ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[747] arXiv:2309.08481 [pdf, other]
Title: 3D Arterial Segmentation via Single 2D Projections and Depth Supervision in Contrast-Enhanced CT Images
Alina F. Dima, Veronika A. Zimmer, Martin J. Menten, Hongwei Bran Li, Markus Graf, Tristan Lemke, Philipp Raffler, Robert Graf, Jan S. Kirschke, Rickmer Braren, Daniel Rueckert
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[748] arXiv:2309.08482 [pdf, html, other]
Title: YCB-Ev 1.1: Event-vision dataset for 6DoF object pose estimation
Pavel Rojtberg, Thomas Pöllabauer
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[749] arXiv:2309.08513 [pdf, html, other]
Title: SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels
Henry Hengyuan Zhao, Pichao Wang, Yuyang Zhao, Hao Luo, Fan Wang, Mike Zheng Shou
Comments: This work has been accepted by IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[750] arXiv:2309.08523 [pdf, other]
Title: Breathing New Life into 3D Assets with Generative Repainting
Tianfu Wang, Menelaos Kanakis, Konrad Schindler, Luc Van Gool, Anton Obukhov
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Total of 2022 entries : 251-750 501-1000 1001-1500 1501-2000 ... 2001-2022
Showing up to 500 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack