Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries : 1-250 251-500 501-750 751-1000 1001-1250 ... 2001-2022

Showing up to 250 entries per page: fewer | more | all

[251] arXiv:2309.02420 [pdf, other]: Title: Doppelgangers: Learning to Disambiguate Images of Similar Structures

Ruojin Cai, Joseph Tung, Qianqian Wang, Hadar Averbuch-Elor, Bharath Hariharan, Noah Snavely

Comments: Published in ICCV 2023 (Oral); Project page: this http URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252] arXiv:2309.02423 [pdf, other]: Title: EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding

Yue Xu, Yong-Lu Li, Zhemin Huang, Michael Xu Liu, Cewu Lu, Yu-Wing Tai, Chi-Keung Tang

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2309.02429 [pdf, other]: Title: Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach

Vimal K B, Saketh Bachu, Tanmay Garg, Niveditha Lakshmi Narasimhan, Raghavan Konuru, Vineeth N Balasubramanian

Comments: To appear at ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[254] arXiv:2309.02434 [pdf, other]: Title: ReliTalk: Relightable Talking Portrait Generation from a Single Video

Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, Ziwei Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[255] arXiv:2309.02436 [pdf, other]: Title: GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction

Youmin Zhang, Fabio Tosi, Stefano Mattoccia, Matteo Poggi

Comments: ICCV 2023. Code: this https URL - Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[256] arXiv:2309.02450 [pdf, other]: Title: Self-Supervised Video Transformers for Isolated Sign Language Recognition

Marcelo Sandoval-Castaneda, Yanhong Li, Diane Brentari, Karen Livescu, Gregory Shakhnarovich

Comments: 14 pages. Submitted to WACV 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2309.02455 [pdf, html, other]: Title: RSDiff: Remote Sensing Image Generation from Text Using Diffusion Model

Ahmad Sebaq, Mohamed ElHelw

Journal-ref: Neural Comput & Applic (2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2309.02527 [pdf, other]: Title: A skeletonization algorithm for gradient-based optimization

Martin J. Menten, Johannes C. Paetzold, Veronika A. Zimmer, Suprosanna Shit, Ivan Ezhov, Robbie Holland, Monika Probst, Julia A. Schnabel, Daniel Rueckert

Comments: Accepted at ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2309.02556 [pdf, other]: Title: Domain Adaptation for Efficiently Fine-tuning Vision Transformer with Encrypted Images

Teru Nagamori, Sayaka Shiota, Hitoshi Kiya

Comments: Accepted by APSIPA 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[260] arXiv:2309.02562 [pdf, other]: Title: Recurrence-Free Survival Prediction for Anal Squamous Cell Carcinoma Chemoradiotherapy using Planning CT-based Radiomics Model

Shanshan Tang, Kai Wang, David Hein, Gloria Lin, Nina N. Sanford, Jing Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[261] arXiv:2309.02578 [pdf, other]: Title: Anatomy-Driven Pathology Detection on Chest X-rays

Philip Müller, Felix Meissen, Johannes Brandt, Georgios Kaissis, Daniel Rueckert

Comments: Accepted at MICCAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[262] arXiv:2309.02596 [pdf, other]: Title: Self-Supervised Pretraining Improves Performance and Inference Efficiency in Multiple Lung Ultrasound Interpretation Tasks

Blake VanBerlo, Brian Li, Jesse Hoey, Alexander Wong

Comments: 10 pages, 5 figures, submitted to IEEE Access

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[263] arXiv:2309.02617 [pdf, other]: Title: Compressing Vision Transformers for Low-Resource Visual Learning

Eric Youn, Sai Mitheran J, Sanjana Prabhu, Siyuan Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[264] arXiv:2309.02636 [pdf, other]: Title: Multiclass Alignment of Confidence and Certainty for Network Calibration

Vinith Kugathasan, Muhammad Haris Khan

Comments: Accepted at GCPR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[265] arXiv:2309.02666 [pdf, other]: Title: Fast and Resource-Efficient Object Tracking on Edge Devices: A Measurement Study

Sanjana Vijay Ganesh, Yanzhao Wu, Gaowen Liu, Ramana Kompella, Ling Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[266] arXiv:2309.02676 [pdf, other]: Title: Efficient Training for Visual Tracking with Deformable Transformer

Qingmao Wei, Guotian Zeng, Bi Zeng

Comments: arXiv admin note: text overlap with arXiv:2303.16580 by other authors

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[267] arXiv:2309.02702 [pdf, other]: Title: Gene-induced Multimodal Pre-training for Image-omic Classification

Ting Jin, Xingran Xie, Renjie Wan, Qingli Li, Yan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2309.02713 [pdf, other]: Title: SlAction: Non-intrusive, Lightweight Obstructive Sleep Apnea Detection using Infrared Video

You Rim Choi, Gyeongseon Eo, Wonhyuck Youn, Hyojin Lee, Haemin Jang, Dongyoon Kim, Hyunwoo Shin, Hyung-Sin Kim

Comments: Accepted to ICCV CVAMD 2023, poster

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[269] arXiv:2309.02719 [pdf, other]: Title: DMKD: Improving Feature-based Knowledge Distillation for Object Detection Via Dual Masking Augmentation

Guang Yang, Yin Tang, Zhijian Wu, Jun Li, Jianhua Xu, Xili Wan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2309.02742 [pdf, html, other]: Title: MLN-net: A multi-source medical image segmentation method for clustered microcalcifications using multiple layer normalization

Ke Wang, Zanting Ye, Xiang Xie, Haidong Cui, Tao Chen, Banteng Liu

Comments: 17 pages, 9 figures, 3 tables

Journal-ref: Knowledge-Based Systems, 2024, 283: 111127

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[271] arXiv:2309.02773 [pdf, html, other]: Title: Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter

Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[272] arXiv:2309.02777 [pdf, html, other]: Title: LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline

Víctor M. Batlle, José M. M. Montiel, Pascal Fua, Juan D. Tardós

Comments: 13 pages, 7 figures, 1 table

Journal-ref: MICCAI 2023. Lecture Notes in Computer Science, vol 14229 (2023) pp 502-512

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273] arXiv:2309.02801 [pdf, other]: Title: 3D Trajectory Reconstruction of Drones using a Single Camera

Seobin Hwang, Hanyoung Kim, Chaeyeon Heo, Youkyoung Na, Cheongeun Lee, Yeongjun Cho

Comments: 10 pages, 9 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[274] arXiv:2309.02833 [pdf, html, other]: Title: Image-Object-Specific Prompt Learning for Few-Shot Class-Incremental Learning

In-Ug Yoon, Tae-Min Choi, Sun-Kyung Lee, Young-Min Kim, Jong-Hwan Kim

Comments: 8 pages, 4 figures, 4 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275] arXiv:2309.02843 [pdf, other]: Title: Knowledge Distillation Layer that Lets the Student Decide

Ada Gorgun, Yeti Z. Gurbuz, A. Aydin Alatan

Comments: Accepted at the British Machine Vision Conference 2023 (BMVC 2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[276] arXiv:2309.02855 [pdf, other]: Title: Bandwidth-efficient Inference for Neural Image Compression

Shanzhi Yin, Tongda Xu, Yongsheng Liang, Yuanyuan Wang, Yanghao Li, Yan Wang, Jingjing Liu

Comments: 9 pages, 6 figures, submitted to ICASSP 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[277] arXiv:2309.02861 [pdf, other]: Title: Image Aesthetics Assessment via Learnable Queries

Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu

Comments: This work has been submitted to the IEEE for possible publication

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2309.02875 [pdf, other]: Title: MAD: Modality Agnostic Distance Measure for Image Registration

Vasiliki Sideri-Lampretsa, Veronika A. Zimmer, Huaqi Qiu, Georgios Kaissis, Daniel Rueckert

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[279] arXiv:2309.02903 [pdf, other]: Title: Towards Efficient Training with Negative Samples in Visual Tracking

Qingmao Wei, Bi Zeng, Guotian Zeng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280] arXiv:2309.02923 [pdf, other]: Title: Patched Line Segment Learning for Vector Road Mapping

Jiakun Xu, Bowen Xu, Gui-Song Xia, Liang Dong, Nan Xue

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2309.02954 [pdf, other]: Title: M3D-NCA: Robust 3D Segmentation with Built-in Quality Control

John Kalkhof, Anirban Mukhopadhyay

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[282] arXiv:2309.02964 [pdf, other]: Title: Hierarchical-level rain image generative model based on GAN

Zhenyuan Liu, Tong Jia, Xingyu Xing, Jianfeng Wu, Junyi Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[283] arXiv:2309.02965 [pdf, other]: Title: Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction

Zhiying Leng, Shun-Cheng Wu, Mahdi Saleh, Antonio Montanaro, Hao Yu, Yin Wang, Nassir Navab, Xiaohui Liang, Federico Tombari

Comments: Accpeted by ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2309.02975 [pdf, other]: Title: FishMOT: A Simple and Effective Method for Fish Tracking Based on IoU Matching

Shuo Liu, Lulu Han, Xiaoyang Liu, Junli Ren, Fang Wang, YingLiu, Yuanshan Lin

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[285] arXiv:2309.02995 [pdf, other]: Title: Continual Evidential Deep Learning for Out-of-Distribution Detection

Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost Van de Weijer

Comments: Accepted at Visual Continual Learning workshop (ICCV2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[286] arXiv:2309.02999 [pdf, other]: Title: Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2309.03008 [pdf, html, other]: Title: Sparse 3D Reconstruction via Object-Centric Ray Sampling

Llukman Cerkezi, Paolo Favaro

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2309.03020 [pdf, html, other]: Title: SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution

Wenlong Zhang, Xiaohui Li, Xiangyu Chen, Yu Qiao, Xiao-Ming Wu, Chao Dong

Comments: ICLR 2024, Spotlight. The source code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289] arXiv:2309.03031 [pdf, other]: Title: MCM: Multi-condition Motion Synthesis Framework for Multi-scenario

Zeyu Ling, Bo Han, Yongkang Wong, Mohan Kangkanhalli, Weidong Geng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290] arXiv:2309.03047 [pdf, other]: Title: Combining pre-trained Vision Transformers and CIDER for Out Of Domain Detection

Grégor Jouet, Clément Duhart, Francis Rousseaux, Julio Laborde, Cyril de Runz

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[291] arXiv:2309.03048 [pdf, html, other]: Title: Exploring Semantic Consistency in Unpaired Image Translation to Generate Data for Surgical Applications

Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Fiona Kolbinger, Marius Distler, Jürgen Weitz, Stefanie Speidel

Comments: Accepted at IPCAI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292] arXiv:2309.03049 [pdf, other]: Title: Adaptive Growth: Real-time CNN Layer Expansion

Yunjie Zhu, Yunhao Chen

Comments: Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293] arXiv:2309.03063 [pdf, other]: Title: Prompt-based Ingredient-Oriented All-in-One Image Restoration

Hu Gao, Depeng Dang

Comments: IEEE Transactions on Circuits and Systems for Video Technology (2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[294] arXiv:2309.03072 [pdf, other]: Title: Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation

Michael Jungo, Beat Wolf, Andrii Maksai, Claudiu Musat, Andreas Fischer

Comments: ICDAR 2023 Best Student Paper Award. Code available at this https URL

Journal-ref: International Conference on Document Analysis and Recognition - ICDAR 2023, pp. 98-114. Cham: Springer Nature Switzerland

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[295] arXiv:2309.03100 [pdf, other]: Title: FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests

Ali Abdari, Alex Falcon, Giuseppe Serra

Comments: accepted for presentation at the ICCV2023 CV4Metaverse workshop

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[296] arXiv:2309.03110 [pdf, other]: Title: Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration

Johannes Gilg, Torben Teepe, Fabian Herzog, Philipp Wolters, Gerhard Rigoll

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2309.03160 [pdf, html, other]: Title: ResFields: Residual Neural Fields for Spatiotemporal Signals

Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys, Siyu Tang

Comments: [ICLR 2024 Spotlight] Project and code at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298] arXiv:2309.03173 [pdf, other]: Title: PDiscoNet: Semantically consistent part discovery for fine-grained recognition

Robert van der Klis, Stephan Alaniz, Massimiliano Mancini, Cassio F. Dantas, Dino Ienco, Zeynep Akata, Diego Marcos

Comments: 9 pages, 8 figures, ICCV

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[299] arXiv:2309.03179 [pdf, html, other]: Title: SLiMe: Segment Like Me

Aliasghar Khani, Saeid Asgari Taghanaki, Aditya Sanghi, Ali Mahdavi Amiri, Ghassan Hamarneh

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300] arXiv:2309.03185 [pdf, other]: Title: Bayes' Rays: Uncertainty Quantification for Neural Radiance Fields

Lily Goli, Cody Reading, Silvia Sellán, Alec Jacobson, Andrea Tagliasacchi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[301] arXiv:2309.03198 [pdf, other]: Title: My Art My Choice: Adversarial Protection Against Unruly AI

Anthony Rhodes, Ram Bhagat, Umur Aybars Ciftci, Ilke Demir

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[302] arXiv:2309.03216 [pdf, other]: Title: A Multisensor Hyperspectral Benchmark Dataset For Unmixing of Intimate Mixtures

Bikram Koirala, Behnood Rasti, Zakaria Bnoulkacem, Andrea de Lima Ribeiro, Yuleika Madriz, Erik Herrmann, Arthur Gestels, Thomas De Kerf, Sandra Lorenz, Margret Fuchs, Koen Janssens, Gunther Steenackers, Richard Gloaguen, Paul Scheunders

Comments: Currently, this paper is under review in IEEE

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303] arXiv:2309.03240 [pdf, other]: Title: RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation

Hengyue Liu, Bir Bhanu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2309.03247 [pdf, other]: Title: Robust Visual Tracking by Motion Analyzing

Mohammed Leo, Kurban Ubul, ShengJie Cheng, Michael Ma

Comments: found some key point that is missed,considering that it will take a lot of time to reproduce the results and revise our mistakes,we would like to withdraw the manuscript to avoid further mislead

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[305] arXiv:2309.03295 [pdf, other]: Title: Comparative Analysis of Deep-Fake Algorithms

Nikhil Sontakke, Sejal Utekar, Shivansh Rastogi, Shriraj Sonawane

Comments: 7 pages, 4 figures, 2 tables, Published with International Journal of Computer Science Trends and Technology (IJCST)

Journal-ref: International Journal of Computer Science Trends and Technology (IJCST) V11(4): Page(109-115) Jul - Aug 2023. ISSN: 2347-8578

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[306] arXiv:2309.03329 [pdf, other]: Title: MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation

Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le

Comments: Accepted at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[307] arXiv:2309.03331 [pdf, other]: Title: Expert Uncertainty and Severity Aware Chest X-Ray Classification by Multi-Relationship Graph Learning

Mengliang Zhang, Xinyue Hu, Lin Gu, Liangchen Liu, Kazuma Kobayashi, Tatsuya Harada, Ronald M. Summers, Yingying Zhu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[308] arXiv:2309.03335 [pdf, other]: Title: SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction

Nivetha Jayakumar, Tonmoy Hossain, Miaomiao Zhang

Comments: ShapeMI MICCAI 2023: Workshop on Shape in Medical Imaging

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[309] arXiv:2309.03350 [pdf, other]: Title: Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[310] arXiv:2309.03351 [pdf, other]: Title: Using Neural Networks for Fast SAR Roughness Estimation of High Resolution Images

Li Fan, Jeova Farias Sales Rocha Neto

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Applications (stat.AP)
[311] arXiv:2309.03353 [pdf, other]: Title: Source Camera Identification and Detection in Digital Videos through Blind Forensics

Venkata Udaya Sameer, Shilpa Mukhopadhyay, Ruchira Naskar, Ishaan Dali

Comments: Submitted to IEEE for inclusion in Xplore- Digital Library. Paper presented at the International Conference on Recent Trends in Computational Engineering & Technologies (ICRTCET 18)with Paper Id: ICRTCET-227

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[312] arXiv:2309.03360 [pdf, other]: Title: ViewMix: Augmentation for Robust Representation in Self-Supervised Learning

Arjon Das, Xin Zhong

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[313] arXiv:2309.03367 [pdf, other]: Title: Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks

Priyam Mazumdar, Aiman Soliman, Volodymyr Kindratenko, Luigi Marini, Kenton McHenry

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[314] arXiv:2309.03381 [pdf, other]: Title: Active shooter detection and robust tracking utilizing supplemental synthetic data

Joshua R. Waite, Jiale Feng, Riley Tavassoli, Laura Harris, Sin Yong Tan, Subhadeep Chakraborty, Soumik Sarkar

Comments: 11 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315] arXiv:2309.03390 [pdf, other]: Title: A novel method for iris recognition using BP neural network and parallel computing by the aid of GPUs (Graphics Processing Units)

Farahnaz Hosseini, Hossein Ebrahimpour, Samaneh Askari

Comments: 8 pages,

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316] arXiv:2309.03401 [pdf, other]: Title: Reasonable Anomaly Detection in Long Sequences

Yalong Jiang, Changkang Li

Comments: 8 pages, 1 figure

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317] arXiv:2309.03406 [pdf, other]: Title: Distribution-Aware Prompt Tuning for Vision-Language Models

Eulrang Cho, Jooyeon Kim, Hyunwoo J. Kim

Comments: Accepted to ICCV2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2309.03445 [pdf, other]: Title: Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy

Yi Tang, Takafumi Iwaguchi, Hiroshi Kawasaki

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2309.03452 [pdf, html, other]: Title: Multimodal Guidance Network for Missing-Modality Inference in Content Moderation

Zhuokai Zhao, Harish Palani, Tianyi Liu, Lena Evans, Ruth Toner

Comments: ICME 2024 Camera Ready. Code is available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[320] arXiv:2309.03453 [pdf, html, other]: Title: SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang

Comments: ICLR 2024 Spotlight. Project page: this https URL Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[321] arXiv:2309.03467 [pdf, html, other]: Title: Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation

Zhuqiang Lu, Kun Hu, Chaoyue Wang, Lei Bai, Zhiyong Wang

Comments: Accepted by AAAI 24

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[322] arXiv:2309.03468 [pdf, html, other]: Title: Support-Set Context Matters for Bongard Problems

Nikhil Raghuraman, Adam W. Harley, Leonidas Guibas

Comments: TMLR October 2024. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[323] arXiv:2309.03472 [pdf, other]: Title: Perceptual Quality Assessment of 360$^\circ$ Images Based on Generative Scanpath Representation

Xiangjie Sui, Hanwei Zhu, Xuelin Liu, Yuming Fang, Shiqi Wang, Zhou Wang

Comments: 12 pages, 5 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[324] arXiv:2309.03473 [pdf, other]: Title: Temporal Collection and Distribution for Referring Video Object Segmentation

Jiajin Tang, Ge Zheng, Sibei Yang

Comments: Accepted by ICCV 2023; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2309.03483 [pdf, other]: Title: DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing using Determiners

Clarence Lee, M Ganesh Kumar, Cheston Tan

Comments: 10 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2309.03499 [pdf, other]: Title: Instance Segmentation of Dislocations in TEM Images

Karina Ruzaeva, Kishan Govind, Marc Legros, Stefan Sandfeld

Journal-ref: IEEE 23rd International Conference on Nanotechnology (2023) 1-6

Subjects: Computer Vision and Pattern Recognition (cs.CV); Materials Science (cond-mat.mtrl-sci)
[327] arXiv:2309.03504 [pdf, other]: Title: Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region

Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma

Comments: ACM MM 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2309.03506 [pdf, other]: Title: Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis

Thanh-Huy Nguyen, Quang Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen Quoc Khanh Le

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[329] arXiv:2309.03508 [pdf, other]: Title: Dynamic Frame Interpolation in Wavelet Domain

Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Ying Tai, Chengjie Wang, Jie Yang

Comments: Accepted by IEEE TIP

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2309.03509 [pdf, other]: Title: BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications

Jiatai Lin, Guoqiang Han, Xuemiao Xu, Changhong Liang, Tien-Tsin Wong, C. L. Philip Chen, Zaiyi Liu, Chu Han

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331] arXiv:2309.03530 [pdf, other]: Title: Efficient Single Object Detection on Image Patches with Early Exit Enhanced High-Precision CNNs

Arne Moos

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[332] arXiv:2309.03531 [pdf, other]: Title: A Robust Negative Learning Approach to Partial Domain Adaptation Using Source Prototypes

Sandipan Choudhuri, Suli Adeniye, Arunabha Sen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[333] arXiv:2309.03539 [pdf, other]: Title: YOLO series target detection algorithms for underwater environments

Chenjie Zhang, Pengcheng Jiao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2309.03542 [pdf, other]: Title: Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction

Jiankai Li, Yunhong Wang, Weixin Li

Comments: Accept in TOMM 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[335] arXiv:2309.03548 [pdf, other]: Title: Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation

Xiaohan Cui, Long Ma, Tengyu Ma, Jinyuan Liu, Xin Fan, Risheng Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2309.03549 [pdf, other]: Title: Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation

Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[337] arXiv:2309.03550 [pdf, other]: Title: Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model

Sungwon Hwang, Junha Hyung, Jaegul Choo

Comments: Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338] arXiv:2309.03558 [pdf, other]: Title: Region Generation and Assessment Network for Occluded Person Re-Identification

Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Journal-ref: IEEE TIFS 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2309.03575 [pdf, other]: Title: Toward High Quality Facial Representation Learning

Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang

Comments: ACM MM 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340] arXiv:2309.03576 [pdf, other]: Title: DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tong Wang, Zhaoxiang Zhang

Comments: Accepted by NeurIPS 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2309.03598 [pdf, other]: Title: Enhancing Sample Utilization through Sample Adaptive Augmentation in Semi-Supervised Learning

Guan Gui, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

Comments: Accepted as International Conference on Computer Vision (ICCV) 2023

Journal-ref: International Conference on Computer Vision (ICCV) 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2309.03599 [pdf, other]: Title: Chasing Consistency in Text-to-3D Generation from a Single Image

Yichen Ouyang, Wenhao Chai, Jiayi Ye, Dapeng Tao, Yibing Zhan, Gaoang Wang

Comments: 9 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2309.03640 [pdf, other]: Title: Context-Aware 3D Object Localization from Single Calibrated Images: A Study of Basketballs

Marcello Davide Caio (1), Gabriel Van Zandycke (1 and 2), Christophe De Vleeschouwer (2) ((1) Sportradar AG, (2) UCLouvain)

Comments: 5 pages, 4 figures, MMSports'23, in proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports (MMSports '23), October 29, 2023, Ottawa, ON, Canada

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[344] arXiv:2309.03659 [pdf, other]: Title: Towards Comparable Knowledge Distillation in Semantic Image Segmentation

Onno Niemann, Christopher Vox, Thorben Werner

Comments: Accepted by the ECML PKDD 2023 workshop track: Simplification, Compression, Efficiency, and Frugality for Artificial Intelligence (SCEFA). This preprint has not undergone peer review or any post-submission improvements or corrections

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[345] arXiv:2309.03661 [pdf, html, other]: Title: Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation

Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346] arXiv:2309.03671 [pdf, other]: Title: Dataset Generation and Bonobo Classification from Weakly Labelled Videos

Pierre-Etienne Martin

Comments: IntelliSys 2023 paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[347] arXiv:2309.03696 [pdf, other]: Title: Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory

Ting Lei, Fabian Caba, Qingchao Chen, Hailin Jin, Yuxin Peng, Yang Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2309.03722 [pdf, html, other]: Title: A boundary-aware point clustering approach in Euclidean and embedding spaces for roof plane segmentation

Li Li, Qingqing Li, Guozheng Xu, Pengwei Zhou, Jingmin Tu, Jie Li, Mingming Li, Jian Yao

Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing,2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2309.03726 [pdf, other]: Title: Interpretable Visual Question Answering via Reasoning Supervision

Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[350] arXiv:2309.03729 [pdf, other]: Title: Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption

Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma

Comments: Accepted by ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2309.03734 [pdf, other]: Title: ClusterFusion: Leveraging Radar Spatial Features for Radar-Camera 3D Object Detection in Autonomous Vehicles

Irfan Tito Kurniawan, Bambang Riyanto Trilaksono

Comments: Accepted for publication in IEEE Access

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2309.03750 [pdf, html, other]: Title: PBP: Path-based Trajectory Prediction for Autonomous Driving

Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui

Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2309.03763 [pdf, other]: Title: dacl1k: Real-World Bridge Damage Dataset Putting Open-Source Data to the Test

Johannes Flotzinger, Philipp J. Rösch, Norbert Oswald, Thomas Braml

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354] arXiv:2309.03764 [pdf, other]: Title: $L_{2,1}$-Norm Regularized Quaternion Matrix Completion Using Sparse Representation and Quaternion QR Decomposition

Juan Han, Kit Ian Kou, Jifei Miao, Lizhi Liu, Haojiang Li

Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[355] arXiv:2309.03799 [pdf, other]: Title: FisheyePP4AV: A privacy-preserving method for autonomous vehicles on fisheye camera images

Linh Trinh, Bach Ha, Tu Tran

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[356] arXiv:2309.03809 [pdf, html, other]: Title: SimNP: Learning Self-Similarity Priors Between Neural Points

Christopher Wewer, Eddy Ilg, Bernt Schiele, Jan Eric Lenssen

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2309.03811 [pdf, other]: Title: Panoramas from Photons

Sacha Jungerman, Atul Ingle, Mohit Gupta

Comments: Proc. ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2309.03812 [pdf, other]: Title: AnthroNet: Conditional Generation of Humans via Anthropometrics

Francesco Picetti, Shrinath Deshpande, Jonathan Leban, Soroosh Shahtalebi, Jay Patel, Peifeng Jing, Chunpu Wang, Charles Metze III, Cameron Sun, Cera Laidlaw, James Warren, Kathy Huynh, River Page, Jonathan Hogins, Adam Crespi, Sujoy Ganguly, Salehe Erfanian Ebadi

Comments: AnthroNet's Unity data generator source code is available at: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359] arXiv:2309.03815 [pdf, other]: Title: T2IW: Joint Text to Image & Watermark Generation

An-An Liu, Guokai Zhang, Yuting Su, Ning Xu, Yongdong Zhang, Lanjun Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[360] arXiv:2309.03827 [pdf, other]: Title: ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation

Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall

Comments: Accepted in Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Taipei, Taiwan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[361] arXiv:2309.03837 [pdf, other]: Title: Cross-Task Attention Network: Improving Multi-Task Learning for Medical Imaging Applications

Sangwook Kim, Thomas G. Purdie, Chris McIntosh

Comments: 13 pages, 2 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[362] arXiv:2309.03869 [pdf, other]: Title: Text-to-feature diffusion for audio-visual few-shot learning

Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata

Comments: DAGM GCPR 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363] arXiv:2309.03874 [pdf, other]: Title: Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks

Eyal Gomel, Tal Shaharabany, Lior Wolf

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364] arXiv:2309.03893 [pdf, other]: Title: DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection

Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma

Comments: Code and Models are publicly available. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[365] arXiv:2309.03895 [pdf, other]: Title: InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2309.03897 [pdf, other]: Title: ProPainter: Improving Propagation and Transformer for Video Inpainting

Shangchen Zhou, Chongyi Li, Kelvin C.K. Chan, Chen Change Loy

Comments: Accepted by ICCV 2023. Code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367] arXiv:2309.03899 [pdf, other]: Title: The Making and Breaking of Camouflage

Hala Lamdouar, Weidi Xie, Andrew Zisserman

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2309.03903 [pdf, other]: Title: Tracking Anything with Decoupled Video Segmentation

Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee

Comments: Accepted to ICCV 2023. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2309.03904 [pdf, other]: Title: Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

Jiapeng Zhu, Ceyuan Yang, Kecheng Zheng, Yinghao Xu, Zifan Shi, Yujun Shen

Comments: Technical report

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2309.03921 [pdf, other]: Title: C-CLIP: Contrastive Image-Text Encoders to Close the Descriptive-Commentative Gap

William Theisen, Walter Scheirer

Comments: 11 Pages, 5 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2309.03930 [pdf, other]: Title: Random Expert Sampling for Deep Learning Segmentation of Acute Ischemic Stroke on Non-contrast CT

Sophie Ostmeier, Brian Axelrod, Benjamin Pulli, Benjamin F.J. Verhaaren, Abdelkader Mahammedi, Yongkai Liu, Christian Federau, Greg Zaharchuk, Jeremy J. Heit

Journal-ref: https://jnis.bmj.com/content/early/2024/02/01/jnis-2023-021283

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[372] arXiv:2309.03933 [pdf, other]: Title: BluNF: Blueprint Neural Field

Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton

Comments: ICCV-W (AI3DCC) 2023. Project page with videos and code: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2309.03955 [pdf, other]: Title: SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions

Nagabhushan Somraj, Adithyan Karanayil, Rajiv Soundararajan

Comments: SIGGRAPH Asia 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[374] arXiv:2309.03979 [pdf, other]: Title: Separable Self and Mixed Attention Transformers for Efficient Object Tracking

Goutam Yelluru Gopal, Maria A. Amer

Comments: Accepted by WACV2024. Code available at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375] arXiv:2309.03989 [pdf, other]: Title: CDFSL-V: Cross-Domain Few-Shot Learning for Videos

Sarinda Samarasinghe, Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2309.03999 [pdf, html, other]: Title: Adapting Self-Supervised Representations to Multi-Domain Setups

Neha Kalibhat, Sam Sharpe, Jeremy Goodsitt, Bayan Bruss, Soheil Feizi

Comments: Published at BMVC 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[377] arXiv:2309.04001 [pdf, html, other]: Title: MMSFormer: Multimodal Transformer for Material and Semantic Segmentation

Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif

Comments: Accepted by IEEE Open Journal of Signal Processing. 15 pages, 3 figures, 9 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[378] arXiv:2309.04022 [pdf, other]: Title: Improving the Accuracy of Beauty Product Recommendations by Assessing Face Illumination Quality

Parnian Afshar, Jenny Yeon, Andriy Levitskyy, Rahul Suresh, Amin Banitalebi-Dehkordi

Comments: 7 pages, 5 figures. Presented in FAccTRec2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2309.04038 [pdf, html, other]: Title: S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens

Rizhao Cai, Zitong Yu, Chenqi Kong, Haoliang Li, Changsheng Chen, Yongjian Hu, Alex Kot

Comments: Accepted by IEEE Transactions on Information Forensics Security (June 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2309.04041 [pdf, html, other]: Title: Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models

Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang

Comments: This paper has been accepted to the AAAI'24 Workshop on Responsible Language Models (ReLM 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[381] arXiv:2309.04063 [pdf, other]: Title: INSURE: An Information Theory Inspired Disentanglement and Purification Model for Domain Generalization

Xi Yu, Huan-Hsin Tseng, Shinjae Yoo, Haibin Ling, Yuewei Lin

Comments: 10 pages, 4 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382] arXiv:2309.04084 [pdf, html, other]: Title: Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation

Xiangyu Chen, Zheyuan Li, Zhengwen Zhang, Jimmy S. Ren, Yihao Liu, Jingwen He, Yu Qiao, Jiantao Zhou, Chao Dong

Comments: Extended version of HDRTVNet

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[383] arXiv:2309.04089 [pdf, html, other]: Title: Toward Sufficient Spatial-Frequency Interaction for Gradient-aware Underwater Image Enhancement

Chen Zhao, Weiling Cai, Chenyu Dong, Ziqi Zeng

Comments: Accepted by ICASSP 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[384] arXiv:2309.04105 [pdf, other]: Title: Weakly Supervised Point Clouds Transformer for 3D Object Detection

Zuojin Tang, Bo Sun, Tongwei Ma, Daosheng Li, Zhenhui Xu

Comments: International Conference on Intelligent Transportation Systems (ITSC), 2022

Journal-ref: International Conference on Intelligent Transportation Systems (ITSC 2022)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[385] arXiv:2309.04109 [pdf, html, other]: Title: From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models

Changming Xiao, Qi Yang, Feng Zhou, Changshui Zhang

Comments: A revised version of this paper will be published in Neurocomputing, see this https URL

Journal-ref: Neurocomputing, Volume 610, 2024, 128437

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[386] arXiv:2309.04145 [pdf, html, other]: Title: Depth Completion with Multiple Balanced Bases and Confidence for Dense Monocular SLAM

Weijian Xie, Guanyi Chu, Quanhao Qian, Yihao Yu, Hai Li, Danpeng Chen, Shangjin Zhai, Nan Wang, Hujun Bao, Guofeng Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2309.04147 [pdf, other]: Title: Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry

Akankshya Kar, Sajal Maheshwari, Shamit Lal, Vinay Sameer Raja Kad

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2309.04148 [pdf, html, other]: Title: Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning

Hiroki Nakamura, Masashi Okada, Tadahiro Taniguchi

Comments: Accepted to the IEEE Open Journal of Signal Processing (ICIP2024 track)

Journal-ref: IEEE Open Journal of Signal Processing, vol. 5, pp. 831-840, 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2309.04153 [pdf, other]: Title: Mapping EEG Signals to Visual Stimuli: A Deep Learning Approach to Match vs. Mismatch Classification

Yiqian Yang, Zhengqiao Zhao, Qian Wang, Yan Yang, Jingdong Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE)
[390] arXiv:2309.04158 [pdf, other]: Title: Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment

Hongyu Hu, Tiancheng Lin, Jie Wang, Zhenbang Sun, Yi Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391] arXiv:2309.04169 [pdf, other]: Title: Grouping Boundary Proposals for Fast Interactive Image Segmentation

Li Liu, Da Chen, Minglei Shu, Laurent D. Cohen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2309.04171 [pdf, other]: Title: PRISTA-Net: Deep Iterative Shrinkage Thresholding Network for Coded Diffraction Patterns Phase Retrieval

Aoxu Liu, Xiaohong Fan, Yin Yang, Jianping Zhang

Comments: 12 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Information Theory (cs.IT); Image and Video Processing (eess.IV)
[393] arXiv:2309.04172 [pdf, other]: Title: Unsupervised Object Localization with Representer Point Selection

Yeonghwan Song, Seokwoo Jang, Dina Katabi, Jeany Son

Comments: Accepted by ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394] arXiv:2309.04183 [pdf, other]: Title: Stereo Matching in Time: 100+ FPS Video Stereo Matching for Extended Reality

Ziang Cheng, Jiayu Yang, Hongdong Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2309.04220 [pdf, other]: Title: Score-PA: Score-based 3D Part Assembly

Junfeng Cheng, Mingdong Wu, Ruiyuan Zhang, Guanqi Zhan, Chao Wu, Hao Dong

Comments: BMVC 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2309.04225 [pdf, other]: Title: Long-Range Correlation Supervision for Land-Cover Classification from Remote Sensing Images

Dawen Yu, Shunping Ji

Comments: 14 pages, 11 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2309.04228 [pdf, other]: Title: FIVA: Facial Image and Video Anonymization and Anonymization Defense

Felix Rosberg, Eren Erdal Aksoy, Cristofer Englund, Fernando Alonso-Fernandez

Comments: Accepted to ICCVW 2023 - DFAD 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2309.04247 [pdf, other]: Title: Towards Practical Capture of High-Fidelity Relightable Avatars

Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma

Comments: Accepted to SIGGRAPH Asia 2023 (Conference); Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2309.04302 [pdf, other]: Title: Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes

Youssef Shoeb, Robin Chan, Gesina Schwalbe, Azarm Nowzard, Fatma Güney, Hanno Gottschalk

Comments: 11 pages, 7 figures, and 3 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2309.04312 [pdf, other]: Title: AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation

Xiangtao Wang, Ruizhi Wang, Jie Zhou, Thomas Lukasiewicz, Zhenghua Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2309.04331 [pdf, html, other]: Title: Leveraging Model Fusion for Improved License Plate Recognition

Rayson Laroca, Luiz A. Zanlorensi, Valter Estevam, Rodrigo Minetto, David Menotti

Comments: Accepted for presentation at the Iberoamerican Congress on Pattern Recognition (CIARP) 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2309.04354 [pdf, other]: Title: Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts

Erik Daxberger, Floris Weers, Bowen Zhang, Tom Gunter, Ruoming Pang, Marcin Eichner, Michael Emmersberger, Yinfei Yang, Alexander Toshev, Xianzhi Du

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[403] arXiv:2309.04357 [pdf, other]: Title: SSIG: A Visually-Guided Graph Edit Distance for Floor Plan Similarity

Casper van Engelenburg, Seyran Khademi, Jan van Gemert

Comments: To be published in ICCVW 2023, 10 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[404] arXiv:2309.04366 [pdf, other]: Title: CNN Injected Transformer for Image Exposure Correction

Shuning Xu, Xiangyu Chen, Binbin Song, Jiantao Zhou

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2309.04372 [pdf, html, other]: Title: MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers

Sijia Li, Chen Chen, Haonan Lu

Comments: 6 pages,6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[406] arXiv:2309.04379 [pdf, html, other]: Title: Language Prompt for Autonomous Driving

Dongming Wu, Wencheng Han, Yingfei Liu, Tiancai Wang, Cheng-zhong Xu, Xiangyu Zhang, Jianbing Shen

Comments: Accepted by AAAI2025

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2309.04399 [pdf, other]: Title: MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask

Yupeng Zhou, Daquan Zhou, Zuo-Liang Zhu, Yaxing Wang, Qibin Hou, Jiashi Feng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2309.04410 [pdf, other]: Title: DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields

Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy

Comments: ICCV 2023. Code: this https URL Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[409] arXiv:2309.04421 [pdf, html, other]: Title: SynthoGestures: A Novel Framework for Synthetic Dynamic Hand Gesture Generation for Driving Scenarios

Amr Gomaa, Robin Zitt, Guillermo Reyes, Antonio Krüger

Comments: Accepted at IEEE IV'24. Shorter versions were accepted as AutomotiveUI2023 Work in Progress and UIST2023 Poster Papers

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[410] arXiv:2309.04422 [pdf, other]: Title: Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving

Thomas E. Huang, Yifan Liu, Luc Van Gool, Fisher Yu

Comments: ICCV 2023, project page at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2309.04430 [pdf, other]: Title: Create Your World: Lifelong Text-to-Image Diffusion

Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, Yang Cong

Comments: 15 pages,10 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[412] arXiv:2309.04437 [pdf, html, other]: Title: Single View Refractive Index Tomography with Neural Fields

Brandon Zhao, Aviad Levis, Liam Connor, Pratul P. Srinivasan, Katherine L. Bouman

Subjects: Computer Vision and Pattern Recognition (cs.CV); Cosmology and Nongalactic Astrophysics (astro-ph.CO)
[413] arXiv:2309.04447 [pdf, html, other]: Title: Impact of Blur and Resolution on Demographic Disparities in 1-to-Many Facial Identification

Aman Bhatta, Gabriella Pangelinan, Michael C. King, Kevin W. Bowyer

Comments: 9 pages, 8 figures, Conference submission

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[414] arXiv:2309.04453 [pdf, other]: Title: WiSARD: A Labeled Visual and Thermal Image Dataset for Wilderness Search and Rescue

Daniel Broyles, Christopher R. Hayner, Karen Leung

Journal-ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 9467-9474

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415] arXiv:2309.04462 [pdf, other]: Title: Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays

Aroof Aimen, Arsh Verma, Makarand Tapaswi, Narayanan C. Krishnan

Comments: 17 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2309.04502 [pdf, other]: Title: On the Efficacy of Multi-scale Data Samplers for Vision Applications

Elvis Nunez, Thomas Merth, Anish Prabhu, Mehrdad Farajtabar, Mohammad Rastegari, Sachin Mehta, Maxwell Horton

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2309.04506 [pdf, other]: Title: Unsupervised Gaze-aware Contrastive Learning with Subject-specific Condition

Lingyu Du, Xucong Zhang, Guohao Lan

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2309.04542 [pdf, other]: Title: Examining Autoexposure for Challenging Scenes

SaiKiran Tedla, Beixuan Yang, Michael S. Brown

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419] arXiv:2309.04549 [pdf, other]: Title: Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression

Jin Heo, Gregorie Phillips, Per-Erik Brodin, Ada Gavrilovska

Comments: extended abstract of 2 pages, 2 figures, 1 table

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[420] arXiv:2309.04561 [pdf, html, other]: Title: Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding

Ozan Unal, Christos Sakaridis, Suman Saha, Luc Van Gool

Comments: Accepted at ECCV 2024. Winner of the ICCV 2023 ScanRefer Challenge

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[421] arXiv:2309.04573 [pdf, other]: Title: Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation

Shyam Nandan Rai, Fabio Cermelli, Barbara Caputo, Carlo Masone

Comments: 16 pages. arXiv admin note: substantial text overlap with arXiv:2307.13316

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2309.04579 [pdf, other]: Title: EGOFALLS: A visual-audio dataset and benchmark for fall detection using egocentric cameras

Xueyi Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[423] arXiv:2309.04608 [pdf, other]: Title: Style Generation: Image Synthesis based on Coarsely Matched Texts

Mengyao Cui, Zhe Zhu, Shao-Ping Lu, Yulu Yang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[424] arXiv:2309.04650 [pdf, other]: Title: Exploring Robust Features for Improving Adversarial Robustness

Hong Wang, Yuefan Deng, Shinjae Yoo, Yuewei Lin

Comments: 12 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2309.04657 [pdf, other]: Title: Generation and Recombination for Multifocus Image Fusion with Free Number of Inputs

Huafeng Li, Dan Wang, Yuxin Huang, Yafei Zhang, Zhengtao Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426] arXiv:2309.04659 [pdf, other]: Title: Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models

Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix

Comments: to appear at ICCVW2023 (Workshop on Visual Continual Learning)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2309.04669 [pdf, html, other]: Title: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu

Comments: ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2309.04675 [pdf, other]: Title: BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification

Takuro Fujii, Shuhei Tarashima

Comments: Accepted at ICCVW 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429] arXiv:2309.04682 [pdf, other]: Title: DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions

Teng Fu, Xiaocong Wang, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue

Comments: ACM Multimedia 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2309.04702 [pdf, other]: Title: A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos

Chao Qin, Jiale Cao, Huazhu Fu, Rao Muhammad Anwer, Fahad Shahbaz Khan

Comments: Accepted by MICCAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2309.04708 [pdf, html, other]: Title: UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection

Zhuoyan Liu, Bo Wang, Ye Li, Jiaxian He, Yunfeng Li

Comments: 15 pages, 10 figures, 13 tables, accepted by PR

Journal-ref: Pattern Recognition 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432] arXiv:2309.04723 [pdf, other]: Title: Frequency-Aware Self-Supervised Long-Tailed Learning

Ci-Siang Lin, Min-Hung Chen, Yu-Chiang Frank Wang

Comments: ICCV Workshop 2023 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433] arXiv:2309.04734 [pdf, other]: Title: Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering

Yifan Dong, Suhang Wu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jianxin Lin, Jinsong Su

Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[434] arXiv:2309.04747 [pdf, other]: Title: When to Learn What: Model-Adaptive Data Augmentation Curriculum

Chengkai Hou, Jieyu Zhang, Tianyi Zhou

Comments: Our paper is accpeted by ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2309.04750 [pdf, html, other]: Title: Mirror-Aware Neural Humans

Daniel Ajisafe, James Tang, Shih-Yang Su, Bastian Wandt, Helge Rhodin

Comments: The 11th International Conference on 3D Vision (3DV 2024). Project website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2309.04752 [pdf, other]: Title: Deep Video Restoration for Under-Display Camera

Xuanxi Chen, Tao Wang, Ziqian Shao, Kaihao Zhang, Wenhan Luo, Tong Lu, Zikun Liu, Tae-Kyun Kim, Hongdong Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2309.04756 [pdf, other]: Title: Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation

Boyuan Jiang, Lei Hu, Shihong Xia

Comments: 9pages, 5figures, conference

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2309.04763 [pdf, other]: Title: Visual Material Characteristics Learning for Circular Healthcare

Federico Zocco, Shahin Rahimifard

Comments: To be submitted

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439] arXiv:2309.04780 [pdf, html, other]: Title: Latent Degradation Representation Constraint for Single Image Deraining

Yuhong He, Long Peng, Lu Wang, Jun Cheng

Comments: This paper is accepted to ICASSP 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[440] arXiv:2309.04795 [pdf, html, other]: Title: Latent Spatiotemporal Adaptation for Generalized Face Forgery Video Detection

Daichi Zhang, Zihao Xiao, Jianmin Li, Shiming Ge

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441] arXiv:2309.04800 [pdf, other]: Title: VeRi3D: Generative Vertex-based Radiance Fields for 3D Controllable Human Image Synthesis

Xinya Chen, Jiaxin Huang, Yanrui Bin, Lu Yu, Yiyi Liao

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2309.04801 [pdf, other]: Title: TMComposites: Plug-and-Play Collaboration Between Specialized Tsetlin Machines

Ole-Christoffer Granmo

Comments: 8 pages, 6 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[443] arXiv:2309.04803 [pdf, other]: Title: Towards Real-World Burst Image Super-Resolution: Benchmark and Method

Pengxu Wei, Yujing Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin

Comments: Accepted by ICCV2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[444] arXiv:2309.04806 [pdf, html, other]: Title: Timely Fusion of Surround Radar/Lidar for Object Detection in Autonomous Driving Systems

Wenjing Xie, Tao Hu, Neiwen Ling, Guoliang Xing, Chun Jason Xue, Nan Guan

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[445] arXiv:2309.04814 [pdf, other]: Title: Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video

Xiuzhe Wu, Pengfei Hu, Yang Wu, Xiaoyang Lyu, Yan-Pei Cao, Ying Shan, Wenming Yang, Zhongqian Sun, Xiaojuan Qi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2309.04820 [pdf, html, other]: Title: ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting

Michael A. Hobley, Victor A. Prisacariu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[447] arXiv:2309.04825 [pdf, other]: Title: Few-Shot Medical Image Segmentation via a Region-enhanced Prototypical Transformer

Yazhou Zhu, Shidong Wang, Tong Xin, Haofeng Zhang

Comments: Accepted by MICCAI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2309.04836 [pdf, html, other]: Title: Neural Semantic Surface Maps

Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra

Comments: Accepted at Eurographics 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[449] arXiv:2309.04840 [pdf, other]: Title: AnyPose: Anytime 3D Human Pose Forecasting via Neural Ordinary Differential Equations

Zixing Wang, Ahmed H. Qureshi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2309.04887 [pdf, other]: Title: SortedAP: Rethinking evaluation metrics for instance segmentation

Long Chen, Yuli Wu, Johannes Stegmaier, Dorit Merhof

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2309.04888 [pdf, other]: Title: Semi-supervised Instance Segmentation with a Learned Shape Prior

Long Chen, Weiwen Zhang, Yuli Wu, Martin Strauch, Dorit Merhof

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[452] arXiv:2309.04891 [pdf, html, other]: Title: How to Evaluate Semantic Communications for Images with ViTScore Metric?

Tingting Zhu, Bo Peng, Jifan Liang, Tingchen Han, Hai Wan, Jingqiao Fu, Junjie Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[453] arXiv:2309.04902 [pdf, other]: Title: Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art

Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed bennamoun

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454] arXiv:2309.04907 [pdf, other]: Title: Effective Real Image Editing with Accelerated Iterative Diffusion Inversion

Zhihong Pan, Riccardo Gherardi, Xiufeng Xie, Stephen Huang

Comments: Accepted to ICCV 2023 (Oral)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[455] arXiv:2309.04914 [pdf, other]: Title: MFPNet: Multi-scale Feature Propagation Network For Lightweight Semantic Segmentation

Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen

Comments: 5 pages, 3 figures, 5tables, conference

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[456] arXiv:2309.04917 [pdf, other]: Title: Editing 3D Scenes via Text Prompts without Retraining

Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang

Comments: Project Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[457] arXiv:2309.04958 [pdf, other]: Title: Semi-Supervised learning for Face Anti-Spoofing using Apex frame

Usman Muhammad, Mourad Oussalah, Jorma Laaksonen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458] arXiv:2309.04965 [pdf, other]: Title: Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning

Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo, Yanqing Guo

Comments: 11 pages,4 figures, 6 tables

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[459] arXiv:2309.04967 [pdf, html, other]: Title: Towards Fully Decoupled End-to-End Person Search

Pengcheng Zhang, Xiao Bai, Jin Zheng, Xin Ning

Comments: DICTA 2023 Best Student Paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460] arXiv:2309.05013 [pdf, other]: Title: Geometrically Consistent Partial Shape Matching

Viktoria Ehm, Paul Roetzer, Marvin Eisenberger, Maolin Gao, Florian Bernard, Daniel Cremers

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461] arXiv:2309.05015 [pdf, other]: Title: DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices

Guanyu Xu, Zhiwei Hao, Yong Luo, Han Hu, Jianping An, Shiwen Mao

Comments: Accepted by IEEE Transactions on Mobile Computing

Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[462] arXiv:2309.05028 [pdf, other]: Title: SC-NeRF: Self-Correcting Neural Radiance Field with Sparse Views

Liang Song, Guangming Wang, Jiuming Liu, Zhenyang Fu, Yanzi Miao, Hesheng

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2309.05032 [pdf, other]: Title: Unified Contrastive Fusion Transformer for Multimodal Human Action Recognition

Kyoung Ok Yang, Junho Koh, Jun Won Choi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[464] arXiv:2309.05049 [pdf, other]: Title: Multi-view Self-supervised Disentanglement for General Image Denoising

Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao

Comments: International Conference on Computer Vision 2023 (ICCV 2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[465] arXiv:2309.05069 [pdf, other]: Title: Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels

Bo Wan, Tinne Tuytelaars

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466] arXiv:2309.05073 [pdf, html, other]: Title: FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions

Jiong Wang, Fengyu Yang, Wenbo Gou, Bingliang Li, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Yanqing Jing, Ruimao Zhang

Comments: CVPR2024 camera ready version. 19 pages, 16 figures. Project page: this https URL ; API: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467] arXiv:2309.05090 [pdf, other]: Title: Sculpting Efficiency: Pruning Medical Imaging Models for On-Device Inference

Sudarshan Sreeram, Bernhard Kainz

Comments: Accepted at MedNeurIPS 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468] arXiv:2309.05095 [pdf, other]: Title: MaskRenderer: 3D-Infused Multi-Mask Realistic Face Reenactment

Tina Behrouzi, Atefeh Shahroudnejad, Payam Mousavi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[469] arXiv:2309.05098 [pdf, other]: Title: 3D Implicit Transporter for Temporally Consistent Keypoint Discovery

Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao

Comments: ICCV2023 oral paper

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470] arXiv:2309.05132 [pdf, other]: Title: DAD++: Improved Data-free Test Time Adversarial Defense

Gaurav Kumar Nayak, Inder Khatri, Shubham Randive, Ruchit Rawal, Anirban Chakraborty

Comments: IJCV Journal (Under Review)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[471] arXiv:2309.05139 [pdf, other]: Title: A Skeleton-based Approach For Rock Crack Detection Towards A Climbing Robot Application

Josselin Somerville Roberts, Paul-Emile Giacomelli, Yoni Gozlan, Julia Di

Journal-ref: IEEE IRC 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[472] arXiv:2309.05148 [pdf, other]: Title: Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color

William Thong, Przemyslaw Joniak, Alice Xiang

Comments: Accepted at the International Conference on Computer Vision (ICCV) 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[473] arXiv:2309.05150 [pdf, other]: Title: Faster, Lighter, More Accurate: A Deep Learning Ensemble for Content Moderation

Mohammad Hosseini, Mahmudul Hasan

Comments: 6 pages, 22nd IEEE International Conference on Machine Learning and Applications (IEEE ICMLA'23), December 15-17, 2023, Jacksonville Riverfront, Florida, USA. arXiv admin note: substantial text overlap with arXiv:2103.10350

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[474] arXiv:2309.05180 [pdf, html, other]: Title: What's color got to do with it? Face recognition in grayscale

Aman Bhatta, Domingo Mery, Haiyu Wu, Joyce Annan, Micheal C. King, Kevin W. Bowyer

Comments: This is replacement version of the previous arxiv submission: 2309.05180 (Our Deep CNN Face Matchers Have Developed Achromatopsia). The past version is published in CVPRW and available in IEEE proceedings. This submitted version is an extension of the conference paper

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[475] arXiv:2309.05186 [pdf, html, other]: Title: HiLM-D: Enhancing MLLMs with Multi-Scale High-Resolution Details for Autonomous Driving

Xinpeng Ding, Jianhua Han, Hang Xu, Wei Zhang, Xiaomeng Li

Comments: Accepted by IJCV

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476] arXiv:2309.05192 [pdf, other]: Title: Towards Viewpoint Robustness in Bird's Eye View Segmentation

Tzofi Klinghoffer, Jonah Philion, Wenzheng Chen, Or Litany, Zan Gojcic, Jungseock Joo, Ramesh Raskar, Sanja Fidler, Jose M. Alvarez

Comments: ICCV 2023. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477] arXiv:2309.05209 [pdf, other]: Title: Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer

Puxun Tu, Hongfei Ye, Haochen Shi, Jeff Young, Meng Xie, Peiquan Zhao, Ce Zheng, Xiaoyi Jiang, Xiaojun Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2309.05214 [pdf, other]: Title: Angle Range and Identity Similarity Enhanced Gaze and Head Redirection based on Synthetic data

Jiawei Qin, Xueting Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2309.05224 [pdf, other]: Title: SparseSwin: Swin Transformer with Sparse Transformer Block

Krisna Pinasthika, Blessius Sheldo Putra Laksono, Riyandi Banovbi Putera Irsal, Syifa Hukma Shabiyya, Novanto Yudistira

Journal-ref: Neurocomputing, Volume 580, 2024, 127433

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[480] arXiv:2309.05239 [pdf, html, other]: Title: HAT: Hybrid Attention Transformer for Image Restoration

Xiangyu Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao Zhou, Chao Dong

Comments: Extended version of HAT. arXiv admin note: text overlap with arXiv:2205.04437

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2309.05251 [pdf, other]: Title: Multi3DRefer: Grounding Text Description to Multiple 3D Objects

Yiming Zhang, ZeMing Gong, Angel X. Chang

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482] arXiv:2309.05254 [pdf, html, other]: Title: Towards Better Data Exploitation in Self-Supervised Monocular Depth Estimation

Jinfeng Liu, Lingtong Kong, Jie Yang, Wei Liu

Comments: 8 pages, 6 figures, accepted by IEEE Robotics and Automation Letters (RA-L 2023)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[483] arXiv:2309.05257 [pdf, other]: Title: FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection

Chunyong Hu, Hang Zheng, Kun Li, Jianyun Xu, Weibo Mao, Maochun Luo, Lingxuan Wang, Mingxia Chen, Qihao Peng, Kaixuan Liu, Yiru Zhao, Peihan Hao, Minzhe Liu, Kaicheng Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2309.05261 [pdf, other]: Title: Gall Bladder Cancer Detection from US Images with Only Image Level Labels

Soumen Basu, Ashish Papanai, Mayank Gupta, Pankaj Gupta, Chetan Arora

Comments: Accepted at MICCAI 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2309.05262 [pdf, other]: Title: A horizon line annotation tool for streamlining autonomous sea navigation experiments

Yassir Zardoua, Abdelhamid El Wahabi, Mohammed Boulaala, Abdelali Astito

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486] arXiv:2309.05267 [pdf, other]: Title: Diving into Darkness: A Dual-Modulated Framework for High-Fidelity Super-Resolution in Ultra-Dark Environments

Jiaxin Gao, Ziyu Yue, Yaohua Liu, Sihan Xie, Xin Fan, Risheng Liu

Comments: 9 pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2309.05277 [pdf, other]: Title: Interactive Class-Agnostic Object Counting

Yifeng Huang, Viresh Ranjan, Minh Hoai

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[488] arXiv:2309.05281 [pdf, other]: Title: Class-Incremental Grouping Network for Continual Audio-Visual Learning

Shentong Mo, Weiguo Pian, Yapeng Tian

Comments: ICCV 2023. arXiv admin note: text overlap with arXiv:2303.17056

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[489] arXiv:2309.05282 [pdf, other]: Title: Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving

Ali Keysan, Andreas Look, Eitan Kosman, Gonca Gürsun, Jörg Wagner, Yu Yao, Barbara Rakitsch

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[490] arXiv:2309.05289 [pdf, other]: Title: Task-driven Compression for Collision Encoding based on Depth Images

Mihir Kulkarni, Kostas Alexis

Comments: 14 pages, 5, figures. Accepted to the International Symposium on Visual Computing 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[491] arXiv:2309.05300 [pdf, html, other]: Title: Decoupling Common and Unique Representations for Multimodal Self-supervised Learning

Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham, Chenying Liu, Zhitong Xiong, Xiao Xiang Zhu

Comments: Accepted to ECCV 2024. 27 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[492] arXiv:2309.05314 [pdf, other]: Title: Semantic Latent Decomposition with Normalizing Flows for Face Editing

Binglei Li, Zhizhong Huang, Hongming Shan, Junping Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[493] arXiv:2309.05330 [pdf, other]: Title: Diff-Privacy: Diffusion-based Face Privacy Protection

Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao

Comments: 17pages

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[494] arXiv:2309.05334 [pdf, html, other]: Title: MultIOD: Rehearsal-free Multihead Incremental Object Detector

Eden Belouadah, Arnaud Dapogny, Kevin Bailly

Comments: Accepted at the archival track of the Workshop on Continual Learning in Computer Vision (CVPR 2024)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495] arXiv:2309.05375 [pdf, other]: Title: Toward a Deeper Understanding: RetNet Viewed through Convolution

Chenghao Li, Chaoning Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[496] arXiv:2309.05380 [pdf, other]: Title: Collective PV-RCNN: A Novel Fusion Technique using Collective Detections for Enhanced Local LiDAR-Based Perception

Sven Teufel, Jörg Gamerdinger, Georg Volk, Oliver Bringmann

Comments: accepted at IEEE ITSC 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[497] arXiv:2309.05388 [pdf, html, other]: Title: Robust Single Rotation Averaging Revisited

Seong Hun Lee, Javier Civera

Comments: Accepted to ECCV 2024 Workshop on Recovering 6D Object Pose (R6D)

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[498] arXiv:2309.05418 [pdf, html, other]: Title: FlowIBR: Leveraging Pre-Training for Efficient Neural Image-Based Rendering of Dynamic Scenes

Marcel Büsching, Josef Bengtson, David Nilsson, Mårten Björkman

Comments: Accepted to CVPR 2024 Workshop on Efficient Deep Learning for Computer Vision. Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[499] arXiv:2309.05438 [pdf, other]: Title: Towards Content-based Pixel Retrieval in Revisited Oxford and Paris

Guoyuan An, Woo Jae Kim, Saelyne Yang, Rong Li, Yuchi Huo, Sung-Eui Yoon

Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[500] arXiv:2309.05448 [pdf, html, other]: Title: Panoptic Vision-Language Feature Fields

Haoran Chen, Kenneth Blomqvist, Francesco Milano, Roland Siegwart

Comments: This work has been accepted by IEEE Robotics and Automation Letters

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)

Total of 2022 entries : 1-250 251-500 501-750 751-1000 1001-1250 ... 2001-2022

Showing up to 250 entries per page: fewer | more | all