Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries
[251] arXiv:2309.02420 [pdf, other]
Title: Doppelgangers: Learning to Disambiguate Images of Similar Structures
Ruojin Cai, Joseph Tung, Qianqian Wang, Hadar Averbuch-Elor, Bharath Hariharan, Noah Snavely
Comments: Published in ICCV 2023 (Oral); Project page: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[252] arXiv:2309.02423 [pdf, other]
Title: EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
Yue Xu, Yong-Lu Li, Zhemin Huang, Michael Xu Liu, Cewu Lu, Yu-Wing Tai, Chi-Keung Tang
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[253] arXiv:2309.02429 [pdf, other]
Title: Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach
Vimal K B, Saketh Bachu, Tanmay Garg, Niveditha Lakshmi Narasimhan, Raghavan Konuru, Vineeth N Balasubramanian
Comments: To appear at ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[254] arXiv:2309.02434 [pdf, other]
Title: ReliTalk: Relightable Talking Portrait Generation from a Single Video
Haonan Qiu, Zhaoxi Chen, Yuming Jiang, Hang Zhou, Xiangyu Fan, Lei Yang, Wayne Wu, Ziwei Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[255] arXiv:2309.02436 [pdf, other]
Title: GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction
Youmin Zhang, Fabio Tosi, Stefano Mattoccia, Matteo Poggi
Comments: ICCV 2023. Code: this https URL - Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[256] arXiv:2309.02450 [pdf, other]
Title: Self-Supervised Video Transformers for Isolated Sign Language Recognition
Marcelo Sandoval-Castaneda, Yanhong Li, Diane Brentari, Karen Livescu, Gregory Shakhnarovich
Comments: 14 pages. Submitted to WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[257] arXiv:2309.02455 [pdf, html, other]
Title: RSDiff: Remote Sensing Image Generation from Text Using Diffusion Model
Ahmad Sebaq, Mohamed ElHelw
Journal-ref: Neural Comput & Applic (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[258] arXiv:2309.02527 [pdf, other]
Title: A skeletonization algorithm for gradient-based optimization
Martin J. Menten, Johannes C. Paetzold, Veronika A. Zimmer, Suprosanna Shit, Ivan Ezhov, Robbie Holland, Monika Probst, Julia A. Schnabel, Daniel Rueckert
Comments: Accepted at ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[259] arXiv:2309.02556 [pdf, other]
Title: Domain Adaptation for Efficiently Fine-tuning Vision Transformer with Encrypted Images
Teru Nagamori, Sayaka Shiota, Hitoshi Kiya
Comments: Accepted by APSIPA 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[260] arXiv:2309.02562 [pdf, other]
Title: Recurrence-Free Survival Prediction for Anal Squamous Cell Carcinoma Chemoradiotherapy using Planning CT-based Radiomics Model
Shanshan Tang, Kai Wang, David Hein, Gloria Lin, Nina N. Sanford, Jing Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[261] arXiv:2309.02578 [pdf, other]
Title: Anatomy-Driven Pathology Detection on Chest X-rays
Philip Müller, Felix Meissen, Johannes Brandt, Georgios Kaissis, Daniel Rueckert
Comments: Accepted at MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[262] arXiv:2309.02596 [pdf, other]
Title: Self-Supervised Pretraining Improves Performance and Inference Efficiency in Multiple Lung Ultrasound Interpretation Tasks
Blake VanBerlo, Brian Li, Jesse Hoey, Alexander Wong
Comments: 10 pages, 5 figures, submitted to IEEE Access
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[263] arXiv:2309.02617 [pdf, other]
Title: Compressing Vision Transformers for Low-Resource Visual Learning
Eric Youn, Sai Mitheran J, Sanjana Prabhu, Siyuan Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[264] arXiv:2309.02636 [pdf, other]
Title: Multiclass Alignment of Confidence and Certainty for Network Calibration
Vinith Kugathasan, Muhammad Haris Khan
Comments: Accepted at GCPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[265] arXiv:2309.02666 [pdf, other]
Title: Fast and Resource-Efficient Object Tracking on Edge Devices: A Measurement Study
Sanjana Vijay Ganesh, Yanzhao Wu, Gaowen Liu, Ramana Kompella, Ling Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[266] arXiv:2309.02676 [pdf, other]
Title: Efficient Training for Visual Tracking with Deformable Transformer
Qingmao Wei, Guotian Zeng, Bi Zeng
Comments: arXiv admin note: text overlap with arXiv:2303.16580 by other authors
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[267] arXiv:2309.02702 [pdf, other]
Title: Gene-induced Multimodal Pre-training for Image-omic Classification
Ting Jin, Xingran Xie, Renjie Wan, Qingli Li, Yan Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[268] arXiv:2309.02713 [pdf, other]
Title: SlAction: Non-intrusive, Lightweight Obstructive Sleep Apnea Detection using Infrared Video
You Rim Choi, Gyeongseon Eo, Wonhyuck Youn, Hyojin Lee, Haemin Jang, Dongyoon Kim, Hyunwoo Shin, Hyung-Sin Kim
Comments: Accepted to ICCV CVAMD 2023, poster
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[269] arXiv:2309.02719 [pdf, other]
Title: DMKD: Improving Feature-based Knowledge Distillation for Object Detection Via Dual Masking Augmentation
Guang Yang, Yin Tang, Zhijian Wu, Jun Li, Jianhua Xu, Xili Wan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[270] arXiv:2309.02742 [pdf, html, other]
Title: MLN-net: A multi-source medical image segmentation method for clustered microcalcifications using multiple layer normalization
Ke Wang, Zanting Ye, Xiang Xie, Haidong Cui, Tao Chen, Banteng Liu
Comments: 17 pages, 9 figures, 3 tables
Journal-ref: Knowledge-Based Systems, 2024, 283: 111127
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[271] arXiv:2309.02773 [pdf, html, other]
Title: Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter
Jinglong Wang, Xiawei Li, Jing Zhang, Qingyuan Xu, Qin Zhou, Qian Yu, Lu Sheng, Dong Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[272] arXiv:2309.02777 [pdf, html, other]
Title: LightNeuS: Neural Surface Reconstruction in Endoscopy using Illumination Decline
Víctor M. Batlle, José M. M. Montiel, Pascal Fua, Juan D. Tardós
Comments: 13 pages, 7 figures, 1 table
Journal-ref: MICCAI 2023. Lecture Notes in Computer Science, vol 14229 (2023) pp 502-512
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[273] arXiv:2309.02801 [pdf, other]
Title: 3D Trajectory Reconstruction of Drones using a Single Camera
Seobin Hwang, Hanyoung Kim, Chaeyeon Heo, Youkyoung Na, Cheongeun Lee, Yeongjun Cho
Comments: 10 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[274] arXiv:2309.02833 [pdf, html, other]
Title: Image-Object-Specific Prompt Learning for Few-Shot Class-Incremental Learning
In-Ug Yoon, Tae-Min Choi, Sun-Kyung Lee, Young-Min Kim, Jong-Hwan Kim
Comments: 8 pages, 4 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[275] arXiv:2309.02843 [pdf, other]
Title: Knowledge Distillation Layer that Lets the Student Decide
Ada Gorgun, Yeti Z. Gurbuz, A. Aydin Alatan
Comments: Accepted at the British Machine Vision Conference 2023 (BMVC 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[276] arXiv:2309.02855 [pdf, other]
Title: Bandwidth-efficient Inference for Neural Image Compression
Shanzhi Yin, Tongda Xu, Yongsheng Liang, Yuanyuan Wang, Yanghao Li, Yan Wang, Jingjing Liu
Comments: 9 pages, 6 figures, submitted to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[277] arXiv:2309.02861 [pdf, other]
Title: Image Aesthetics Assessment via Learnable Queries
Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[278] arXiv:2309.02875 [pdf, other]
Title: MAD: Modality Agnostic Distance Measure for Image Registration
Vasiliki Sideri-Lampretsa, Veronika A. Zimmer, Huaqi Qiu, Georgios Kaissis, Daniel Rueckert
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[279] arXiv:2309.02903 [pdf, other]
Title: Towards Efficient Training with Negative Samples in Visual Tracking
Qingmao Wei, Bi Zeng, Guotian Zeng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[280] arXiv:2309.02923 [pdf, other]
Title: Patched Line Segment Learning for Vector Road Mapping
Jiakun Xu, Bowen Xu, Gui-Song Xia, Liang Dong, Nan Xue
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[281] arXiv:2309.02954 [pdf, other]
Title: M3D-NCA: Robust 3D Segmentation with Built-in Quality Control
John Kalkhof, Anirban Mukhopadhyay
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[282] arXiv:2309.02964 [pdf, other]
Title: Hierarchical-level rain image generative model based on GAN
Zhenyuan Liu, Tong Jia, Xingyu Xing, Jianfeng Wu, Junyi Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[283] arXiv:2309.02965 [pdf, other]
Title: Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction
Zhiying Leng, Shun-Cheng Wu, Mahdi Saleh, Antonio Montanaro, Hao Yu, Yin Wang, Nassir Navab, Xiaohui Liang, Federico Tombari
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[284] arXiv:2309.02975 [pdf, other]
Title: FishMOT: A Simple and Effective Method for Fish Tracking Based on IoU Matching
Shuo Liu, Lulu Han, Xiaoyang Liu, Junli Ren, Fang Wang, Ying Liu, Yuanshan Lin
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[285] arXiv:2309.02995 [pdf, other]
Title: Continual Evidential Deep Learning for Out-of-Distribution Detection
Eduardo Aguilar, Bogdan Raducanu, Petia Radeva, Joost Van de Weijer
Comments: Accepted at Visual Continual Learning workshop (ICCV2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[286] arXiv:2309.02999 [pdf, other]
Title: Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Sijin Chen, Hongyuan Zhu, Mingsheng Li, Xin Chen, Peng Guo, Yinjie Lei, Gang Yu, Taihao Li, Tao Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[287] arXiv:2309.03008 [pdf, html, other]
Title: Sparse 3D Reconstruction via Object-Centric Ray Sampling
Llukman Cerkezi, Paolo Favaro
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[288] arXiv:2309.03020 [pdf, html, other]
Title: SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution
Wenlong Zhang, Xiaohui Li, Xiangyu Chen, Yu Qiao, Xiao-Ming Wu, Chao Dong
Comments: ICLR 2024, Spotlight. The source code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[289] arXiv:2309.03031 [pdf, other]
Title: MCM: Multi-condition Motion Synthesis Framework for Multi-scenario
Zeyu Ling, Bo Han, Yongkang Wong, Mohan Kangkanhalli, Weidong Geng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[290] arXiv:2309.03047 [pdf, other]
Title: Combining pre-trained Vision Transformers and CIDER for Out Of Domain Detection
Grégor Jouet, Clément Duhart, Francis Rousseaux, Julio Laborde, Cyril de Runz
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[291] arXiv:2309.03048 [pdf, html, other]
Title: Exploring Semantic Consistency in Unpaired Image Translation to Generate Data for Surgical Applications
Danush Kumar Venkatesh, Dominik Rivoir, Micha Pfeiffer, Fiona Kolbinger, Marius Distler, Jürgen Weitz, Stefanie Speidel
Comments: Accepted at IPCAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[292] arXiv:2309.03049 [pdf, other]
Title: Adaptive Growth: Real-time CNN Layer Expansion
Yunjie Zhu, Yunhao Chen
Comments: Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[293] arXiv:2309.03063 [pdf, other]
Title: Prompt-based Ingredient-Oriented All-in-One Image Restoration
Hu Gao, Depeng Dang
Comments: IEEE Transactions on Circuits and Systems for Video Technology (2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[294] arXiv:2309.03072 [pdf, other]
Title: Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation
Michael Jungo, Beat Wolf, Andrii Maksai, Claudiu Musat, Andreas Fischer
Comments: ICDAR 2023 Best Student Paper Award. Code available at this https URL
Journal-ref: International Conference on Document Analysis and Recognition - ICDAR 2023, pp. 98-114. Cham: Springer Nature Switzerland
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[295] arXiv:2309.03100 [pdf, other]
Title: FArMARe: a Furniture-Aware Multi-task methodology for Recommending Apartments based on the user interests
Ali Abdari, Alex Falcon, Giuseppe Serra
Comments: accepted for presentation at the ICCV2023 CV4Metaverse workshop
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[296] arXiv:2309.03110 [pdf, other]
Title: Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration
Johannes Gilg, Torben Teepe, Fabian Herzog, Philipp Wolters, Gerhard Rigoll
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[297] arXiv:2309.03160 [pdf, html, other]
Title: ResFields: Residual Neural Fields for Spatiotemporal Signals
Marko Mihajlovic, Sergey Prokudin, Marc Pollefeys, Siyu Tang
Comments: [ICLR 2024 Spotlight] Project and code at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[298] arXiv:2309.03173 [pdf, other]
Title: PDiscoNet: Semantically consistent part discovery for fine-grained recognition
Robert van der Klis, Stephan Alaniz, Massimiliano Mancini, Cassio F. Dantas, Dino Ienco, Zeynep Akata, Diego Marcos
Comments: 9 pages, 8 figures, ICCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[299] arXiv:2309.03179 [pdf, html, other]
Title: SLiMe: Segment Like Me
Aliasghar Khani, Saeid Asgari Taghanaki, Aditya Sanghi, Ali Mahdavi Amiri, Ghassan Hamarneh
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[300] arXiv:2309.03185 [pdf, other]
Title: Bayes' Rays: Uncertainty Quantification for Neural Radiance Fields
Lily Goli, Cody Reading, Silvia Sellán, Alec Jacobson, Andrea Tagliasacchi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[301] arXiv:2309.03198 [pdf, other]
Title: My Art My Choice: Adversarial Protection Against Unruly AI
Anthony Rhodes, Ram Bhagat, Umur Aybars Ciftci, Ilke Demir
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[302] arXiv:2309.03216 [pdf, other]
Title: A Multisensor Hyperspectral Benchmark Dataset For Unmixing of Intimate Mixtures
Bikram Koirala, Behnood Rasti, Zakaria Bnoulkacem, Andrea de Lima Ribeiro, Yuleika Madriz, Erik Herrmann, Arthur Gestels, Thomas De Kerf, Sandra Lorenz, Margret Fuchs, Koen Janssens, Gunther Steenackers, Richard Gloaguen, Paul Scheunders
Comments: Currently, this paper is under review in IEEE
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[303] arXiv:2309.03240 [pdf, other]
Title: RepSGG: Novel Representations of Entities and Relationships for Scene Graph Generation
Hengyue Liu, Bir Bhanu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[304] arXiv:2309.03247 [pdf, other]
Title: Robust Visual Tracking by Motion Analyzing
Mohammed Leo, Kurban Ubul, ShengJie Cheng, Michael Ma
Comments: We found that a key point was missed. Because reproducing the results and correcting our mistakes will take considerable time, we would like to withdraw the manuscript to avoid misleading readers further
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[305] arXiv:2309.03295 [pdf, other]
Title: Comparative Analysis of Deep-Fake Algorithms
Nikhil Sontakke, Sejal Utekar, Shivansh Rastogi, Shriraj Sonawane
Comments: 7 pages, 4 figures, 2 tables, Published with International Journal of Computer Science Trends and Technology (IJCST)
Journal-ref: International Journal of Computer Science Trends and Technology (IJCST) V11(4): Page(109-115) Jul - Aug 2023. ISSN: 2347-8578
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[306] arXiv:2309.03329 [pdf, other]
Title: MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation
Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le
Comments: Accepted at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[307] arXiv:2309.03331 [pdf, other]
Title: Expert Uncertainty and Severity Aware Chest X-Ray Classification by Multi-Relationship Graph Learning
Mengliang Zhang, Xinyue Hu, Lin Gu, Liangchen Liu, Kazuma Kobayashi, Tatsuya Harada, Ronald M. Summers, Yingying Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[308] arXiv:2309.03335 [pdf, other]
Title: SADIR: Shape-Aware Diffusion Models for 3D Image Reconstruction
Nivetha Jayakumar, Tonmoy Hossain, Miaomiao Zhang
Comments: ShapeMI MICCAI 2023: Workshop on Shape in Medical Imaging
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[309] arXiv:2309.03350 [pdf, other]
Title: Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[310] arXiv:2309.03351 [pdf, other]
Title: Using Neural Networks for Fast SAR Roughness Estimation of High Resolution Images
Li Fan, Jeova Farias Sales Rocha Neto
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Applications (stat.AP)
[311] arXiv:2309.03353 [pdf, other]
Title: Source Camera Identification and Detection in Digital Videos through Blind Forensics
Venkata Udaya Sameer, Shilpa Mukhopadhyay, Ruchira Naskar, Ishaan Dali
Comments: Submitted to IEEE for inclusion in Xplore Digital Library. Paper presented at the International Conference on Recent Trends in Computational Engineering & Technologies (ICRTCET 18) with Paper Id: ICRTCET-227
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[312] arXiv:2309.03360 [pdf, other]
Title: ViewMix: Augmentation for Robust Representation in Self-Supervised Learning
Arjon Das, Xin Zhong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[313] arXiv:2309.03367 [pdf, other]
Title: Self-Supervised Masked Digital Elevation Models Encoding for Low-Resource Downstream Tasks
Priyam Mazumdar, Aiman Soliman, Volodymyr Kindratenko, Luigi Marini, Kenton McHenry
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[314] arXiv:2309.03381 [pdf, other]
Title: Active shooter detection and robust tracking utilizing supplemental synthetic data
Joshua R. Waite, Jiale Feng, Riley Tavassoli, Laura Harris, Sin Yong Tan, Subhadeep Chakraborty, Soumik Sarkar
Comments: 11 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[315] arXiv:2309.03390 [pdf, other]
Title: A novel method for iris recognition using BP neural network and parallel computing by the aid of GPUs (Graphics Processing Units)
Farahnaz Hosseini, Hossein Ebrahimpour, Samaneh Askari
Comments: 8 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[316] arXiv:2309.03401 [pdf, other]
Title: Reasonable Anomaly Detection in Long Sequences
Yalong Jiang, Changkang Li
Comments: 8 pages, 1 figure
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[317] arXiv:2309.03406 [pdf, other]
Title: Distribution-Aware Prompt Tuning for Vision-Language Models
Eulrang Cho, Jooyeon Kim, Hyunwoo J. Kim
Comments: Accepted to ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[318] arXiv:2309.03445 [pdf, other]
Title: Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Yi Tang, Takafumi Iwaguchi, Hiroshi Kawasaki
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[319] arXiv:2309.03452 [pdf, html, other]
Title: Multimodal Guidance Network for Missing-Modality Inference in Content Moderation
Zhuokai Zhao, Harish Palani, Tianyi Liu, Lena Evans, Ruth Toner
Comments: ICME 2024 Camera Ready. Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[320] arXiv:2309.03453 [pdf, html, other]
Title: SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
Yuan Liu, Cheng Lin, Zijiao Zeng, Xiaoxiao Long, Lingjie Liu, Taku Komura, Wenping Wang
Comments: ICLR 2024 Spotlight. Project page: this https URL Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR)
[321] arXiv:2309.03467 [pdf, html, other]
Title: Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation
Zhuqiang Lu, Kun Hu, Chaoyue Wang, Lei Bai, Zhiyong Wang
Comments: Accepted by AAAI 24
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[322] arXiv:2309.03468 [pdf, html, other]
Title: Support-Set Context Matters for Bongard Problems
Nikhil Raghuraman, Adam W. Harley, Leonidas Guibas
Comments: TMLR October 2024. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[323] arXiv:2309.03472 [pdf, other]
Title: Perceptual Quality Assessment of 360$^\circ$ Images Based on Generative Scanpath Representation
Xiangjie Sui, Hanwei Zhu, Xuelin Liu, Yuming Fang, Shiqi Wang, Zhou Wang
Comments: 12 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[324] arXiv:2309.03473 [pdf, other]
Title: Temporal Collection and Distribution for Referring Video Object Segmentation
Jiajin Tang, Ge Zheng, Sibei Yang
Comments: Accepted by ICCV 2023; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[325] arXiv:2309.03483 [pdf, other]
Title: DetermiNet: A Large-Scale Diagnostic Dataset for Complex Visually-Grounded Referencing using Determiners
Clarence Lee, M Ganesh Kumar, Cheston Tan
Comments: 10 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[326] arXiv:2309.03499 [pdf, other]
Title: Instance Segmentation of Dislocations in TEM Images
Karina Ruzaeva, Kishan Govind, Marc Legros, Stefan Sandfeld
Journal-ref: IEEE 23rd International Conference on Nanotechnology (2023) 1-6
Subjects: Computer Vision and Pattern Recognition (cs.CV); Materials Science (cond-mat.mtrl-sci)
[327] arXiv:2309.03504 [pdf, other]
Title: Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region
Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma
Comments: ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[328] arXiv:2309.03506 [pdf, other]
Title: Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis
Thanh-Huy Nguyen, Quang Hien Kha, Thai Ngoc Toan Truong, Ba Thinh Lam, Ba Hung Ngo, Quang Vinh Dinh, Nguyen Quoc Khanh Le
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[329] arXiv:2309.03508 [pdf, other]
Title: Dynamic Frame Interpolation in Wavelet Domain
Lingtong Kong, Boyuan Jiang, Donghao Luo, Wenqing Chu, Ying Tai, Chengjie Wang, Jie Yang
Comments: Accepted by IEEE TIP
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[330] arXiv:2309.03509 [pdf, other]
Title: BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications
Jiatai Lin, Guoqiang Han, Xuemiao Xu, Changhong Liang, Tien-Tsin Wong, C. L. Philip Chen, Zaiyi Liu, Chu Han
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[331] arXiv:2309.03530 [pdf, other]
Title: Efficient Single Object Detection on Image Patches with Early Exit Enhanced High-Precision CNNs
Arne Moos
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[332] arXiv:2309.03531 [pdf, other]
Title: A Robust Negative Learning Approach to Partial Domain Adaptation Using Source Prototypes
Sandipan Choudhuri, Suli Adeniye, Arunabha Sen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[333] arXiv:2309.03539 [pdf, other]
Title: YOLO series target detection algorithms for underwater environments
Chenjie Zhang, Pengcheng Jiao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[334] arXiv:2309.03542 [pdf, other]
Title: Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction
Jiankai Li, Yunhong Wang, Weixin Li
Comments: Accepted in TOMM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[335] arXiv:2309.03548 [pdf, other]
Title: Trash to Treasure: Low-Light Object Detection via Decomposition-and-Aggregation
Xiaohan Cui, Long Ma, Tengyu Ma, Jinyuan Liu, Xin Fan, Risheng Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[336] arXiv:2309.03549 [pdf, other]
Title: Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation
Jiaxi Gu, Shicong Wang, Haoyu Zhao, Tianyi Lu, Xing Zhang, Zuxuan Wu, Songcen Xu, Wei Zhang, Yu-Gang Jiang, Hang Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Multimedia (cs.MM)
[337] arXiv:2309.03550 [pdf, other]
Title: Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model
Sungwon Hwang, Junha Hyung, Jaegul Choo
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[338] arXiv:2309.03558 [pdf, other]
Title: Region Generation and Assessment Network for Occluded Person Re-Identification
Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding
Journal-ref: IEEE TIFS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[339] arXiv:2309.03575 [pdf, other]
Title: Toward High Quality Facial Representation Learning
Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang
Comments: ACM MM 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[340] arXiv:2309.03576 [pdf, other]
Title: DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
Haochen Wang, Junsong Fan, Yuxi Wang, Kaiyou Song, Tong Wang, Zhaoxiang Zhang
Comments: Accepted by NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[341] arXiv:2309.03598 [pdf, other]
Title: Enhancing Sample Utilization through Sample Adaptive Augmentation in Semi-Supervised Learning
Guan Gui, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi
Comments: Accepted at the International Conference on Computer Vision (ICCV) 2023
Journal-ref: International Conference on Computer Vision (ICCV) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[342] arXiv:2309.03599 [pdf, other]
Title: Chasing Consistency in Text-to-3D Generation from a Single Image
Yichen Ouyang, Wenhao Chai, Jiayi Ye, Dapeng Tao, Yibing Zhan, Gaoang Wang
Comments: 9 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[343] arXiv:2309.03640 [pdf, other]
Title: Context-Aware 3D Object Localization from Single Calibrated Images: A Study of Basketballs
Marcello Davide Caio (1), Gabriel Van Zandycke (1 and 2), Christophe De Vleeschouwer (2) ((1) Sportradar AG, (2) UCLouvain)
Comments: 5 pages, 4 figures, MMSports'23, in proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports (MMSports '23), October 29, 2023, Ottawa, ON, Canada
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[344] arXiv:2309.03659 [pdf, other]
Title: Towards Comparable Knowledge Distillation in Semantic Image Segmentation
Onno Niemann, Christopher Vox, Thorben Werner
Comments: Accepted by the ECML PKDD 2023 workshop track: Simplification, Compression, Efficiency, and Frugality for Artificial Intelligence (SCEFA). This preprint has not undergone peer review or any post-submission improvements or corrections
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[345] arXiv:2309.03661 [pdf, html, other]
Title: Prompt-based Context- and Domain-aware Pretraining for Vision and Language Navigation
Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[346] arXiv:2309.03671 [pdf, other]
Title: Dataset Generation and Bonobo Classification from Weakly Labelled Videos
Pierre-Etienne Martin
Comments: IntelliSys 2023 paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[347] arXiv:2309.03696 [pdf, other]
Title: Efficient Adaptive Human-Object Interaction Detection with Concept-guided Memory
Ting Lei, Fabian Caba, Qingchao Chen, Hailin Jin, Yuxin Peng, Yang Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[348] arXiv:2309.03722 [pdf, html, other]
Title: A boundary-aware point clustering approach in Euclidean and embedding spaces for roof plane segmentation
Li Li, Qingqing Li, Guozheng Xu, Pengwei Zhou, Jingmin Tu, Jie Li, Mingming Li, Jian Yao
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[349] arXiv:2309.03726 [pdf, other]
Title: Interpretable Visual Question Answering via Reasoning Supervision
Maria Parelli, Dimitrios Mallis, Markos Diomataris, Vassilis Pitsikalis
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[350] arXiv:2309.03729 [pdf, other]
Title: Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption
Teng Hu, Jiangning Zhang, Liang Liu, Ran Yi, Siqi Kou, Haokun Zhu, Xu Chen, Yabiao Wang, Chengjie Wang, Lizhuang Ma
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[351] arXiv:2309.03734 [pdf, other]
Title: ClusterFusion: Leveraging Radar Spatial Features for Radar-Camera 3D Object Detection in Autonomous Vehicles
Irfan Tito Kurniawan, Bambang Riyanto Trilaksono
Comments: Accepted for publication in IEEE Access
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[352] arXiv:2309.03750 [pdf, html, other]
Title: PBP: Path-based Trajectory Prediction for Autonomous Driving
Sepideh Afshar, Nachiket Deo, Akshay Bhagat, Titas Chakraborty, Yunming Shao, Balarama Raju Buddharaju, Adwait Deshpande, Henggang Cui
Comments: Published at ICRA 2024; Sepideh Afshar and Nachiket Deo contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[353] arXiv:2309.03763 [pdf, other]
Title: dacl1k: Real-World Bridge Damage Dataset Putting Open-Source Data to the Test
Johannes Flotzinger, Philipp J. Rösch, Norbert Oswald, Thomas Braml
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[354] arXiv:2309.03764 [pdf, other]
Title: $L_{2,1}$-Norm Regularized Quaternion Matrix Completion Using Sparse Representation and Quaternion QR Decomposition
Juan Han, Kit Ian Kou, Jifei Miao, Lizhi Liu, Haojiang Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[355] arXiv:2309.03799 [pdf, other]
Title: FisheyePP4AV: A privacy-preserving method for autonomous vehicles on fisheye camera images
Linh Trinh, Bach Ha, Tu Tran
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[356] arXiv:2309.03809 [pdf, html, other]
Title: SimNP: Learning Self-Similarity Priors Between Neural Points
Christopher Wewer, Eddy Ilg, Bernt Schiele, Jan Eric Lenssen
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[357] arXiv:2309.03811 [pdf, other]
Title: Panoramas from Photons
Sacha Jungerman, Atul Ingle, Mohit Gupta
Comments: Proc. ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[358] arXiv:2309.03812 [pdf, other]
Title: AnthroNet: Conditional Generation of Humans via Anthropometrics
Francesco Picetti, Shrinath Deshpande, Jonathan Leban, Soroosh Shahtalebi, Jay Patel, Peifeng Jing, Chunpu Wang, Charles Metze III, Cameron Sun, Cera Laidlaw, James Warren, Kathy Huynh, River Page, Jonathan Hogins, Adam Crespi, Sujoy Ganguly, Salehe Erfanian Ebadi
Comments: AnthroNet's Unity data generator source code is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[359] arXiv:2309.03815 [pdf, other]
Title: T2IW: Joint Text to Image & Watermark Generation
An-An Liu, Guokai Zhang, Yuting Su, Ning Xu, Yongdong Zhang, Lanjun Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[360] arXiv:2309.03827 [pdf, other]
Title: ArtHDR-Net: Perceptually Realistic and Accurate HDR Content Creation
Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Kalin Stefanov, Abhinav Dhall
Comments: Accepted in Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Taipei, Taiwan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[361] arXiv:2309.03837 [pdf, other]
Title: Cross-Task Attention Network: Improving Multi-Task Learning for Medical Imaging Applications
Sangwook Kim, Thomas G. Purdie, Chris McIntosh
Comments: 13 pages, 2 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[362] arXiv:2309.03869 [pdf, other]
Title: Text-to-feature diffusion for audio-visual few-shot learning
Otniel-Bogdan Mercea, Thomas Hummel, A. Sophia Koepke, Zeynep Akata
Comments: DAGM GCPR 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[363] arXiv:2309.03874 [pdf, other]
Title: Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks
Eyal Gomel, Tal Shaharabany, Lior Wolf
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[364] arXiv:2309.03893 [pdf, other]
Title: DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma
Comments: Code and Models are publicly available. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[365] arXiv:2309.03895 [pdf, other]
Title: InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
Zigang Geng, Binxin Yang, Tiankai Hang, Chen Li, Shuyang Gu, Ting Zhang, Jianmin Bao, Zheng Zhang, Han Hu, Dong Chen, Baining Guo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[366] arXiv:2309.03897 [pdf, other]
Title: ProPainter: Improving Propagation and Transformer for Video Inpainting
Shangchen Zhou, Chongyi Li, Kelvin C.K. Chan, Chen Change Loy
Comments: Accepted by ICCV 2023. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[367] arXiv:2309.03899 [pdf, other]
Title: The Making and Breaking of Camouflage
Hala Lamdouar, Weidi Xie, Andrew Zisserman
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[368] arXiv:2309.03903 [pdf, other]
Title: Tracking Anything with Decoupled Video Segmentation
Ho Kei Cheng, Seoung Wug Oh, Brian Price, Alexander Schwing, Joon-Young Lee
Comments: Accepted to ICCV 2023. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[369] arXiv:2309.03904 [pdf, other]
Title: Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
Jiapeng Zhu, Ceyuan Yang, Kecheng Zheng, Yinghao Xu, Zifan Shi, Yujun Shen
Comments: Technical report
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[370] arXiv:2309.03921 [pdf, other]
Title: C-CLIP: Contrastive Image-Text Encoders to Close the Descriptive-Commentative Gap
William Theisen, Walter Scheirer
Comments: 11 Pages, 5 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[371] arXiv:2309.03930 [pdf, other]
Title: Random Expert Sampling for Deep Learning Segmentation of Acute Ischemic Stroke on Non-contrast CT
Sophie Ostmeier, Brian Axelrod, Benjamin Pulli, Benjamin F.J. Verhaaren, Abdelkader Mahammedi, Yongkai Liu, Christian Federau, Greg Zaharchuk, Jeremy J. Heit
Journal-ref: https://jnis.bmj.com/content/early/2024/02/01/jnis-2023-021283
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[372] arXiv:2309.03933 [pdf, other]
Title: BluNF: Blueprint Neural Field
Robin Courant, Xi Wang, Marc Christie, Vicky Kalogeiton
Comments: ICCV-W (AI3DCC) 2023. Project page with videos and code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[373] arXiv:2309.03955 [pdf, other]
Title: SimpleNeRF: Regularizing Sparse Input Neural Radiance Fields with Simpler Solutions
Nagabhushan Somraj, Adithyan Karanayil, Rajiv Soundararajan
Comments: SIGGRAPH Asia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[374] arXiv:2309.03979 [pdf, other]
Title: Separable Self and Mixed Attention Transformers for Efficient Object Tracking
Goutam Yelluru Gopal, Maria A. Amer
Comments: Accepted by WACV2024. Code available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[375] arXiv:2309.03989 [pdf, other]
Title: CDFSL-V: Cross-Domain Few-Shot Learning for Videos
Sarinda Samarasinghe, Mamshad Nayeem Rizve, Navid Kardan, Mubarak Shah
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[376] arXiv:2309.03999 [pdf, html, other]
Title: Adapting Self-Supervised Representations to Multi-Domain Setups
Neha Kalibhat, Sam Sharpe, Jeremy Goodsitt, Bayan Bruss, Soheil Feizi
Comments: Published at BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[377] arXiv:2309.04001 [pdf, html, other]
Title: MMSFormer: Multimodal Transformer for Material and Semantic Segmentation
Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif
Comments: Accepted by IEEE Open Journal of Signal Processing. 15 pages, 3 figures, 9 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[378] arXiv:2309.04022 [pdf, other]
Title: Improving the Accuracy of Beauty Product Recommendations by Assessing Face Illumination Quality
Parnian Afshar, Jenny Yeon, Andriy Levitskyy, Rahul Suresh, Amin Banitalebi-Dehkordi
Comments: 7 pages, 5 figures. Presented in FAccTRec2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[379] arXiv:2309.04038 [pdf, html, other]
Title: S-Adapter: Generalizing Vision Transformer for Face Anti-Spoofing with Statistical Tokens
Rizhao Cai, Zitong Yu, Chenqi Kong, Haoliang Li, Changsheng Chen, Yongjian Hu, Alex Kot
Comments: Accepted by IEEE Transactions on Information Forensics and Security (June 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[380] arXiv:2309.04041 [pdf, html, other]
Title: Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models
Jiaying Lu, Jinmeng Rao, Kezhen Chen, Xiaoyuan Guo, Yawen Zhang, Baochen Sun, Carl Yang, Jie Yang
Comments: This paper has been accepted to the AAAI'24 Workshop on Responsible Language Models (ReLM 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[381] arXiv:2309.04063 [pdf, other]
Title: INSURE: An Information Theory Inspired Disentanglement and Purification Model for Domain Generalization
Xi Yu, Huan-Hsin Tseng, Shinjae Yoo, Haibin Ling, Yuewei Lin
Comments: 10 pages, 4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[382] arXiv:2309.04084 [pdf, html, other]
Title: Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation
Xiangyu Chen, Zheyuan Li, Zhengwen Zhang, Jimmy S. Ren, Yihao Liu, Jingwen He, Yu Qiao, Jiantao Zhou, Chao Dong
Comments: Extended version of HDRTVNet
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[383] arXiv:2309.04089 [pdf, html, other]
Title: Toward Sufficient Spatial-Frequency Interaction for Gradient-aware Underwater Image Enhancement
Chen Zhao, Weiling Cai, Chenyu Dong, Ziqi Zeng
Comments: Accepted by ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[384] arXiv:2309.04105 [pdf, other]
Title: Weakly Supervised Point Clouds Transformer for 3D Object Detection
Zuojin Tang, Bo Sun, Tongwei Ma, Daosheng Li, Zhenhui Xu
Comments: International Conference on Intelligent Transportation Systems (ITSC), 2022
Journal-ref: International Conference on Intelligent Transportation Systems (ITSC 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[385] arXiv:2309.04109 [pdf, html, other]
Title: From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao, Qi Yang, Feng Zhou, Changshui Zhang
Comments: A revised version of this paper will be published in Neurocomputing, see this https URL
Journal-ref: Neurocomputing, Volume 610, 2024, 128437
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[386] arXiv:2309.04145 [pdf, html, other]
Title: Depth Completion with Multiple Balanced Bases and Confidence for Dense Monocular SLAM
Weijian Xie, Guanyi Chu, Quanhao Qian, Yihao Yu, Hai Li, Danpeng Chen, Shangjin Zhai, Nan Wang, Hujun Bao, Guofeng Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[387] arXiv:2309.04147 [pdf, other]
Title: Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry
Akankshya Kar, Sajal Maheshwari, Shamit Lal, Vinay Sameer Raja Kad
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[388] arXiv:2309.04148 [pdf, html, other]
Title: Representation Synthesis by Probabilistic Many-Valued Logic Operation in Self-Supervised Learning
Hiroki Nakamura, Masashi Okada, Tadahiro Taniguchi
Comments: Accepted to the IEEE Open Journal of Signal Processing (ICIP2024 track)
Journal-ref: IEEE Open Journal of Signal Processing, vol. 5, pp. 831-840, 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[389] arXiv:2309.04153 [pdf, other]
Title: Mapping EEG Signals to Visual Stimuli: A Deep Learning Approach to Match vs. Mismatch Classification
Yiqian Yang, Zhengqiao Zhao, Qian Wang, Yan Yang, Jingdong Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computational Engineering, Finance, and Science (cs.CE)
[390] arXiv:2309.04158 [pdf, other]
Title: Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment
Hongyu Hu, Tiancheng Lin, Jie Wang, Zhenbang Sun, Yi Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[391] arXiv:2309.04169 [pdf, other]
Title: Grouping Boundary Proposals for Fast Interactive Image Segmentation
Li Liu, Da Chen, Minglei Shu, Laurent D. Cohen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[392] arXiv:2309.04171 [pdf, other]
Title: PRISTA-Net: Deep Iterative Shrinkage Thresholding Network for Coded Diffraction Patterns Phase Retrieval
Aoxu Liu, Xiaohong Fan, Yin Yang, Jianping Zhang
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Information Theory (cs.IT); Image and Video Processing (eess.IV)
[393] arXiv:2309.04172 [pdf, other]
Title: Unsupervised Object Localization with Representer Point Selection
Yeonghwan Song, Seokwoo Jang, Dina Katabi, Jeany Son
Comments: Accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[394] arXiv:2309.04183 [pdf, other]
Title: Stereo Matching in Time: 100+ FPS Video Stereo Matching for Extended Reality
Ziang Cheng, Jiayu Yang, Hongdong Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[395] arXiv:2309.04220 [pdf, other]
Title: Score-PA: Score-based 3D Part Assembly
Junfeng Cheng, Mingdong Wu, Ruiyuan Zhang, Guanqi Zhan, Chao Wu, Hao Dong
Comments: BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[396] arXiv:2309.04225 [pdf, other]
Title: Long-Range Correlation Supervision for Land-Cover Classification from Remote Sensing Images
Dawen Yu, Shunping Ji
Comments: 14 pages, 11 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[397] arXiv:2309.04228 [pdf, other]
Title: FIVA: Facial Image and Video Anonymization and Anonymization Defense
Felix Rosberg, Eren Erdal Aksoy, Cristofer Englund, Fernando Alonso-Fernandez
Comments: Accepted to ICCVW 2023 - DFAD 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[398] arXiv:2309.04247 [pdf, other]
Title: Towards Practical Capture of High-Fidelity Relightable Avatars
Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma
Comments: Accepted to SIGGRAPH Asia 2023 (Conference); Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[399] arXiv:2309.04302 [pdf, other]
Title: Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes
Youssef Shoeb, Robin Chan, Gesina Schwalbe, Azarm Nowzard, Fatma Güney, Hanno Gottschalk
Comments: 11 pages, 7 figures, and 3 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[400] arXiv:2309.04312 [pdf, other]
Title: AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation
Xiangtao Wang, Ruizhi Wang, Jie Zhou, Thomas Lukasiewicz, Zhenghua Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[401] arXiv:2309.04331 [pdf, html, other]
Title: Leveraging Model Fusion for Improved License Plate Recognition
Rayson Laroca, Luiz A. Zanlorensi, Valter Estevam, Rodrigo Minetto, David Menotti
Comments: Accepted for presentation at the Iberoamerican Congress on Pattern Recognition (CIARP) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[402] arXiv:2309.04354 [pdf, other]
Title: Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts
Erik Daxberger, Floris Weers, Bowen Zhang, Tom Gunter, Ruoming Pang, Marcin Eichner, Michael Emmersberger, Yinfei Yang, Alexander Toshev, Xianzhi Du
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[403] arXiv:2309.04357 [pdf, other]
Title: SSIG: A Visually-Guided Graph Edit Distance for Floor Plan Similarity
Casper van Engelenburg, Seyran Khademi, Jan van Gemert
Comments: To be published in ICCVW 2023, 10 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[404] arXiv:2309.04366 [pdf, other]
Title: CNN Injected Transformer for Image Exposure Correction
Shuning Xu, Xiangyu Chen, Binbin Song, Jiantao Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[405] arXiv:2309.04372 [pdf, html, other]
Title: MoEController: Instruction-based Arbitrary Image Manipulation with Mixture-of-Expert Controllers
Sijia Li, Chen Chen, Haonan Lu
Comments: 6 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[406] arXiv:2309.04379 [pdf, html, other]
Title: Language Prompt for Autonomous Driving
Dongming Wu, Wencheng Han, Yingfei Liu, Tiancai Wang, Cheng-zhong Xu, Xiangyu Zhang, Jianbing Shen
Comments: Accepted by AAAI2025
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[407] arXiv:2309.04399 [pdf, other]
Title: MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
Yupeng Zhou, Daquan Zhou, Zuo-Liang Zhu, Yaxing Wang, Qibin Hou, Jiashi Feng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[408] arXiv:2309.04410 [pdf, other]
Title: DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields
Junzhe Zhang, Yushi Lan, Shuai Yang, Fangzhou Hong, Quan Wang, Chai Kiat Yeo, Ziwei Liu, Chen Change Loy
Comments: ICCV 2023. Code: this https URL Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[409] arXiv:2309.04421 [pdf, html, other]
Title: SynthoGestures: A Novel Framework for Synthetic Dynamic Hand Gesture Generation for Driving Scenarios
Amr Gomaa, Robin Zitt, Guillermo Reyes, Antonio Krüger
Comments: Accepted at IEEE IV'24. Shorter versions were accepted as AutomotiveUI2023 Work in Progress and UIST2023 Poster Papers
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
[410] arXiv:2309.04422 [pdf, other]
Title: Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving
Thomas E. Huang, Yifan Liu, Luc Van Gool, Fisher Yu
Comments: ICCV 2023, project page at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[411] arXiv:2309.04430 [pdf, other]
Title: Create Your World: Lifelong Text-to-Image Diffusion
Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, Yang Cong
Comments: 15 pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[412] arXiv:2309.04437 [pdf, html, other]
Title: Single View Refractive Index Tomography with Neural Fields
Brandon Zhao, Aviad Levis, Liam Connor, Pratul P. Srinivasan, Katherine L. Bouman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cosmology and Nongalactic Astrophysics (astro-ph.CO)
[413] arXiv:2309.04447 [pdf, html, other]
Title: Impact of Blur and Resolution on Demographic Disparities in 1-to-Many Facial Identification
Aman Bhatta, Gabriella Pangelinan, Michael C. King, Kevin W. Bowyer
Comments: 9 pages, 8 figures, Conference submission
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[414] arXiv:2309.04453 [pdf, other]
Title: WiSARD: A Labeled Visual and Thermal Image Dataset for Wilderness Search and Rescue
Daniel Broyles, Christopher R. Hayner, Karen Leung
Journal-ref: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 9467-9474
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[415] arXiv:2309.04462 [pdf, other]
Title: Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays
Aroof Aimen, Arsh Verma, Makarand Tapaswi, Narayanan C. Krishnan
Comments: 17 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[416] arXiv:2309.04502 [pdf, other]
Title: On the Efficacy of Multi-scale Data Samplers for Vision Applications
Elvis Nunez, Thomas Merth, Anish Prabhu, Mehrdad Farajtabar, Mohammad Rastegari, Sachin Mehta, Maxwell Horton
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[417] arXiv:2309.04506 [pdf, other]
Title: Unsupervised Gaze-aware Contrastive Learning with Subject-specific Condition
Lingyu Du, Xucong Zhang, Guohao Lan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[418] arXiv:2309.04542 [pdf, other]
Title: Examining Autoexposure for Challenging Scenes
SaiKiran Tedla, Beixuan Yang, Michael S. Brown
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[419] arXiv:2309.04549 [pdf, other]
Title: Poster: Making Edge-assisted LiDAR Perceptions Robust to Lossy Point Cloud Compression
Jin Heo, Gregorie Phillips, Per-Erik Brodin, Ada Gavrilovska
Comments: extended abstract of 2 pages, 2 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[420] arXiv:2309.04561 [pdf, html, other]
Title: Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding
Ozan Unal, Christos Sakaridis, Suman Saha, Luc Van Gool
Comments: Accepted at ECCV 2024. Winner of the ICCV 2023 ScanRefer Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[421] arXiv:2309.04573 [pdf, other]
Title: Mask2Anomaly: Mask Transformer for Universal Open-set Segmentation
Shyam Nandan Rai, Fabio Cermelli, Barbara Caputo, Carlo Masone
Comments: 16 pages. arXiv admin note: substantial text overlap with arXiv:2307.13316
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[422] arXiv:2309.04579 [pdf, other]
Title: EGOFALLS: A visual-audio dataset and benchmark for fall detection using egocentric cameras
Xueyi Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[423] arXiv:2309.04608 [pdf, other]
Title: Style Generation: Image Synthesis based on Coarsely Matched Texts
Mengyao Cui, Zhe Zhu, Shao-Ping Lu, Yulu Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[424] arXiv:2309.04650 [pdf, other]
Title: Exploring Robust Features for Improving Adversarial Robustness
Hong Wang, Yuefan Deng, Shinjae Yoo, Yuewei Lin
Comments: 12 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[425] arXiv:2309.04657 [pdf, other]
Title: Generation and Recombination for Multifocus Image Fusion with Free Number of Inputs
Huafeng Li, Dan Wang, Yuxin Huang, Yafei Zhang, Zhengtao Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[426] arXiv:2309.04659 [pdf, other]
Title: Progressive Feature Adjustment for Semi-supervised Learning from Pretrained Models
Hai-Ming Xu, Lingqiao Liu, Hao Chen, Ehsan Abbasnejad, Rafael Felix
Comments: to appear at ICCVW2023 (Workshop on Visual Continual Learning)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[427] arXiv:2309.04669 [pdf, html, other]
Title: Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization
Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu
Comments: ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[428] arXiv:2309.04675 [pdf, other]
Title: BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification
Takuro Fujii, Shuhei Tarashima
Comments: Accepted at ICCVW 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[429] arXiv:2309.04682 [pdf, other]
Title: DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions
Teng Fu, Xiaocong Wang, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue
Comments: ACM Multimedia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[430] arXiv:2309.04702 [pdf, other]
Title: A Spatial-Temporal Deformable Attention based Framework for Breast Lesion Detection in Videos
Chao Qin, Jiale Cao, Huazhu Fu, Rao Muhammad Anwer, Fahad Shahbaz Khan
Comments: Accepted by MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[431] arXiv:2309.04708 [pdf, html, other]
Title: UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection
Zhuoyan Liu, Bo Wang, Ye Li, Jiaxian He, Yunfeng Li
Comments: 15 pages, 10 figures, 13 tables, accepted by PR
Journal-ref: Pattern Recognition 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[432] arXiv:2309.04723 [pdf, other]
Title: Frequency-Aware Self-Supervised Long-Tailed Learning
Ci-Siang Lin, Min-Hung Chen, Yu-Chiang Frank Wang
Comments: ICCV Workshop 2023 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[433] arXiv:2309.04734 [pdf, other]
Title: Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering
Yifan Dong, Suhang Wu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jianxin Lin, Jinsong Su
Comments: Accepted in Proceedings of the 31st ACM International Conference on Multimedia (MM '23)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Multimedia (cs.MM)
[434] arXiv:2309.04747 [pdf, other]
Title: When to Learn What: Model-Adaptive Data Augmentation Curriculum
Chengkai Hou, Jieyu Zhang, Tianyi Zhou
Comments: Our paper is accepted by ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[435] arXiv:2309.04750 [pdf, html, other]
Title: Mirror-Aware Neural Humans
Daniel Ajisafe, James Tang, Shih-Yang Su, Bastian Wandt, Helge Rhodin
Comments: The 11th International Conference on 3D Vision (3DV 2024). Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[436] arXiv:2309.04752 [pdf, other]
Title: Deep Video Restoration for Under-Display Camera
Xuanxi Chen, Tao Wang, Ziqian Shao, Kaihao Zhang, Wenhan Luo, Tong Lu, Zikun Liu, Tae-Kyun Kim, Hongdong Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[437] arXiv:2309.04756 [pdf, other]
Title: Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation
Boyuan Jiang, Lei Hu, Shihong Xia
Comments: 9 pages, 5 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[438] arXiv:2309.04763 [pdf, other]
Title: Visual Material Characteristics Learning for Circular Healthcare
Federico Zocco, Shahin Rahimifard
Comments: To be submitted
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[439] arXiv:2309.04780 [pdf, html, other]
Title: Latent Degradation Representation Constraint for Single Image Deraining
Yuhong He, Long Peng, Lu Wang, Jun Cheng
Comments: This paper is accepted to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[440] arXiv:2309.04795 [pdf, html, other]
Title: Latent Spatiotemporal Adaptation for Generalized Face Forgery Video Detection
Daichi Zhang, Zihao Xiao, Jianmin Li, Shiming Ge
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[441] arXiv:2309.04800 [pdf, other]
Title: VeRi3D: Generative Vertex-based Radiance Fields for 3D Controllable Human Image Synthesis
Xinya Chen, Jiaxin Huang, Yanrui Bin, Lu Yu, Yiyi Liao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[442] arXiv:2309.04801 [pdf, other]
Title: TMComposites: Plug-and-Play Collaboration Between Specialized Tsetlin Machines
Ole-Christoffer Granmo
Comments: 8 pages, 6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[443] arXiv:2309.04803 [pdf, other]
Title: Towards Real-World Burst Image Super-Resolution: Benchmark and Method
Pengxu Wei, Yujing Sun, Xingbei Guo, Chang Liu, Jie Chen, Xiangyang Ji, Liang Lin
Comments: Accepted by ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[444] arXiv:2309.04806 [pdf, html, other]
Title: Timely Fusion of Surround Radar/Lidar for Object Detection in Autonomous Driving Systems
Wenjing Xie, Tao Hu, Neiwen Ling, Guoliang Xing, Chun Jason Xue, Nan Guan
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[445] arXiv:2309.04814 [pdf, other]
Title: Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video
Xiuzhe Wu, Pengfei Hu, Yang Wu, Xiaoyang Lyu, Yan-Pei Cao, Ying Shan, Wenming Yang, Zhongqian Sun, Xiaojuan Qi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[446] arXiv:2309.04820 [pdf, html, other]
Title: ABC Easy as 123: A Blind Counter for Exemplar-Free Multi-Class Class-agnostic Counting
Michael A. Hobley, Victor A. Prisacariu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[447] arXiv:2309.04825 [pdf, other]
Title: Few-Shot Medical Image Segmentation via a Region-enhanced Prototypical Transformer
Yazhou Zhu, Shidong Wang, Tong Xin, Haofeng Zhang
Comments: Accepted by MICCAI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[448] arXiv:2309.04836 [pdf, html, other]
Title: Neural Semantic Surface Maps
Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra
Comments: Accepted at Eurographics 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[449] arXiv:2309.04840 [pdf, other]
Title: AnyPose: Anytime 3D Human Pose Forecasting via Neural Ordinary Differential Equations
Zixing Wang, Ahmed H. Qureshi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[450] arXiv:2309.04887 [pdf, other]
Title: SortedAP: Rethinking evaluation metrics for instance segmentation
Long Chen, Yuli Wu, Johannes Stegmaier, Dorit Merhof
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[451] arXiv:2309.04888 [pdf, other]
Title: Semi-supervised Instance Segmentation with a Learned Shape Prior
Long Chen, Weiwen Zhang, Yuli Wu, Martin Strauch, Dorit Merhof
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[452] arXiv:2309.04891 [pdf, html, other]
Title: How to Evaluate Semantic Communications for Images with ViTScore Metric?
Tingting Zhu, Bo Peng, Jifan Liang, Tingchen Han, Hai Wan, Jingqiao Fu, Junjie Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Information Theory (cs.IT)
[453] arXiv:2309.04902 [pdf, other]
Title: Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art
Aref Miri Rekavandi, Shima Rashidi, Farid Boussaid, Stephen Hoefs, Emre Akbas, Mohammed Bennamoun
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[454] arXiv:2309.04907 [pdf, other]
Title: Effective Real Image Editing with Accelerated Iterative Diffusion Inversion
Zhihong Pan, Riccardo Gherardi, Xiufeng Xie, Stephen Huang
Comments: Accepted to ICCV 2023 (Oral)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[455] arXiv:2309.04914 [pdf, other]
Title: MFPNet: Multi-scale Feature Propagation Network For Lightweight Semantic Segmentation
Guoan Xu, Wenjing Jia, Tao Wu, Ligeng Chen
Comments: 5 pages, 3 figures, 5 tables, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[456] arXiv:2309.04917 [pdf, other]
Title: Editing 3D Scenes via Text Prompts without Retraining
Shuangkang Fang, Yufeng Wang, Yi Yang, Yi-Hsuan Tsai, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang
Comments: Project Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[457] arXiv:2309.04958 [pdf, other]
Title: Semi-Supervised learning for Face Anti-Spoofing using Apex frame
Usman Muhammad, Mourad Oussalah, Jorma Laaksonen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[458] arXiv:2309.04965 [pdf, other]
Title: Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu, Xiangyang Luo, Yanqing Guo
Comments: 11 pages, 4 figures, 6 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[459] arXiv:2309.04967 [pdf, html, other]
Title: Towards Fully Decoupled End-to-End Person Search
Pengcheng Zhang, Xiao Bai, Jin Zheng, Xin Ning
Comments: DICTA 2023 Best Student Paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[460] arXiv:2309.05013 [pdf, other]
Title: Geometrically Consistent Partial Shape Matching
Viktoria Ehm, Paul Roetzer, Marvin Eisenberger, Maolin Gao, Florian Bernard, Daniel Cremers
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[461] arXiv:2309.05015 [pdf, other]
Title: DeViT: Decomposing Vision Transformers for Collaborative Inference in Edge Devices
Guanyu Xu, Zhiwei Hao, Yong Luo, Han Hu, Jianping An, Shiwen Mao
Comments: Accepted by IEEE Transactions on Mobile Computing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
[462] arXiv:2309.05028 [pdf, other]
Title: SC-NeRF: Self-Correcting Neural Radiance Field with Sparse Views
Liang Song, Guangming Wang, Jiuming Liu, Zhenyang Fu, Yanzi Miao, Hesheng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[463] arXiv:2309.05032 [pdf, other]
Title: Unified Contrastive Fusion Transformer for Multimodal Human Action Recognition
Kyoung Ok Yang, Junho Koh, Jun Won Choi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[464] arXiv:2309.05049 [pdf, other]
Title: Multi-view Self-supervised Disentanglement for General Image Denoising
Hao Chen, Chenyuan Qu, Yu Zhang, Chen Chen, Jianbo Jiao
Comments: International Conference on Computer Vision 2023 (ICCV 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[465] arXiv:2309.05069 [pdf, other]
Title: Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels
Bo Wan, Tinne Tuytelaars
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[466] arXiv:2309.05073 [pdf, html, other]
Title: FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions
Jiong Wang, Fengyu Yang, Wenbo Gou, Bingliang Li, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Yanqing Jing, Ruimao Zhang
Comments: CVPR 2024 camera-ready version. 19 pages, 16 figures. Project page: this https URL; API: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[467] arXiv:2309.05090 [pdf, other]
Title: Sculpting Efficiency: Pruning Medical Imaging Models for On-Device Inference
Sudarshan Sreeram, Bernhard Kainz
Comments: Accepted at MedNeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[468] arXiv:2309.05095 [pdf, other]
Title: MaskRenderer: 3D-Infused Multi-Mask Realistic Face Reenactment
Tina Behrouzi, Atefeh Shahroudnejad, Payam Mousavi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[469] arXiv:2309.05098 [pdf, other]
Title: 3D Implicit Transporter for Temporally Consistent Keypoint Discovery
Chengliang Zhong, Yuhang Zheng, Yupeng Zheng, Hao Zhao, Li Yi, Xiaodong Mu, Ling Wang, Pengfei Li, Guyue Zhou, Chao Yang, Xinliang Zhang, Jian Zhao
Comments: ICCV2023 oral paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[470] arXiv:2309.05132 [pdf, other]
Title: DAD++: Improved Data-free Test Time Adversarial Defense
Gaurav Kumar Nayak, Inder Khatri, Shubham Randive, Ruchit Rawal, Anirban Chakraborty
Comments: IJCV Journal (Under Review)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[471] arXiv:2309.05139 [pdf, other]
Title: A Skeleton-based Approach For Rock Crack Detection Towards A Climbing Robot Application
Josselin Somerville Roberts, Paul-Emile Giacomelli, Yoni Gozlan, Julia Di
Journal-ref: IEEE IRC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[472] arXiv:2309.05148 [pdf, other]
Title: Beyond Skin Tone: A Multidimensional Measure of Apparent Skin Color
William Thong, Przemyslaw Joniak, Alice Xiang
Comments: Accepted at the International Conference on Computer Vision (ICCV) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[473] arXiv:2309.05150 [pdf, other]
Title: Faster, Lighter, More Accurate: A Deep Learning Ensemble for Content Moderation
Mohammad Hosseini, Mahmudul Hasan
Comments: 6 pages, 22nd IEEE International Conference on Machine Learning and Applications (IEEE ICMLA'23), December 15-17, 2023, Jacksonville Riverfront, Florida, USA. arXiv admin note: substantial text overlap with arXiv:2103.10350
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[474] arXiv:2309.05180 [pdf, html, other]
Title: What's color got to do with it? Face recognition in grayscale
Aman Bhatta, Domingo Mery, Haiyu Wu, Joyce Annan, Micheal C. King, Kevin W. Bowyer
Comments: This is a replacement version of the previous arXiv submission: 2309.05180 (Our Deep CNN Face Matchers Have Developed Achromatopsia). The past version is published in CVPRW and available in IEEE proceedings. This submitted version is an extension of the conference paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
[475] arXiv:2309.05186 [pdf, html, other]
Title: HiLM-D: Enhancing MLLMs with Multi-Scale High-Resolution Details for Autonomous Driving
Xinpeng Ding, Jianhua Han, Hang Xu, Wei Zhang, Xiaomeng Li
Comments: Accepted by IJCV
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[476] arXiv:2309.05192 [pdf, other]
Title: Towards Viewpoint Robustness in Bird's Eye View Segmentation
Tzofi Klinghoffer, Jonah Philion, Wenzheng Chen, Or Litany, Zan Gojcic, Jungseock Joo, Ramesh Raskar, Sanja Fidler, Jose M. Alvarez
Comments: ICCV 2023. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[477] arXiv:2309.05209 [pdf, other]
Title: Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer
Puxun Tu, Hongfei Ye, Haochen Shi, Jeff Young, Meng Xie, Peiquan Zhao, Ce Zheng, Xiaoyi Jiang, Xiaojun Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[478] arXiv:2309.05214 [pdf, other]
Title: Angle Range and Identity Similarity Enhanced Gaze and Head Redirection based on Synthetic data
Jiawei Qin, Xueting Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[479] arXiv:2309.05224 [pdf, other]
Title: SparseSwin: Swin Transformer with Sparse Transformer Block
Krisna Pinasthika, Blessius Sheldo Putra Laksono, Riyandi Banovbi Putera Irsal, Syifa Hukma Shabiyya, Novanto Yudistira
Journal-ref: Neurocomputing, Volume 580, 2024, 127433
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[480] arXiv:2309.05239 [pdf, html, other]
Title: HAT: Hybrid Attention Transformer for Image Restoration
Xiangyu Chen, Xintao Wang, Wenlong Zhang, Xiangtao Kong, Yu Qiao, Jiantao Zhou, Chao Dong
Comments: Extended version of HAT. arXiv admin note: text overlap with arXiv:2205.04437
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[481] arXiv:2309.05251 [pdf, other]
Title: Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Yiming Zhang, ZeMing Gong, Angel X. Chang
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[482] arXiv:2309.05254 [pdf, html, other]
Title: Towards Better Data Exploitation in Self-Supervised Monocular Depth Estimation
Jinfeng Liu, Lingtong Kong, Jie Yang, Wei Liu
Comments: 8 pages, 6 figures, accepted by IEEE Robotics and Automation Letters (RA-L 2023)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[483] arXiv:2309.05257 [pdf, other]
Title: FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection
Chunyong Hu, Hang Zheng, Kun Li, Jianyun Xu, Weibo Mao, Maochun Luo, Lingxuan Wang, Mingxia Chen, Qihao Peng, Kaixuan Liu, Yiru Zhao, Peihan Hao, Minzhe Liu, Kaicheng Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[484] arXiv:2309.05261 [pdf, other]
Title: Gall Bladder Cancer Detection from US Images with Only Image Level Labels
Soumen Basu, Ashish Papanai, Mayank Gupta, Pankaj Gupta, Chetan Arora
Comments: Accepted at MICCAI 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[485] arXiv:2309.05262 [pdf, other]
Title: A horizon line annotation tool for streamlining autonomous sea navigation experiments
Yassir Zardoua, Abdelhamid El Wahabi, Mohammed Boulaala, Abdelali Astito
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[486] arXiv:2309.05267 [pdf, other]
Title: Diving into Darkness: A Dual-Modulated Framework for High-Fidelity Super-Resolution in Ultra-Dark Environments
Jiaxin Gao, Ziyu Yue, Yaohua Liu, Sihan Xie, Xin Fan, Risheng Liu
Comments: 9 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[487] arXiv:2309.05277 [pdf, other]
Title: Interactive Class-Agnostic Object Counting
Yifeng Huang, Viresh Ranjan, Minh Hoai
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[488] arXiv:2309.05281 [pdf, other]
Title: Class-Incremental Grouping Network for Continual Audio-Visual Learning
Shentong Mo, Weiguo Pian, Yapeng Tian
Comments: ICCV 2023. arXiv admin note: text overlap with arXiv:2303.17056
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[489] arXiv:2309.05282 [pdf, other]
Title: Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving
Ali Keysan, Andreas Look, Eitan Kosman, Gonca Gürsun, Jörg Wagner, Yu Yao, Barbara Rakitsch
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[490] arXiv:2309.05289 [pdf, other]
Title: Task-driven Compression for Collision Encoding based on Depth Images
Mihir Kulkarni, Kostas Alexis
Comments: 14 pages, 5 figures. Accepted to the International Symposium on Visual Computing 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[491] arXiv:2309.05300 [pdf, html, other]
Title: Decoupling Common and Unique Representations for Multimodal Self-supervised Learning
Yi Wang, Conrad M Albrecht, Nassim Ait Ali Braham, Chenying Liu, Zhitong Xiong, Xiao Xiang Zhu
Comments: Accepted to ECCV 2024. 27 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[492] arXiv:2309.05314 [pdf, other]
Title: Semantic Latent Decomposition with Normalizing Flows for Face Editing
Binglei Li, Zhizhong Huang, Hongming Shan, Junping Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[493] arXiv:2309.05330 [pdf, other]
Title: Diff-Privacy: Diffusion-based Face Privacy Protection
Xiao He, Mingrui Zhu, Dongxin Chen, Nannan Wang, Xinbo Gao
Comments: 17 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[494] arXiv:2309.05334 [pdf, html, other]
Title: MultIOD: Rehearsal-free Multihead Incremental Object Detector
Eden Belouadah, Arnaud Dapogny, Kevin Bailly
Comments: Accepted at the archival track of the Workshop on Continual Learning in Computer Vision (CVPR 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[495] arXiv:2309.05375 [pdf, other]
Title: Toward a Deeper Understanding: RetNet Viewed through Convolution
Chenghao Li, Chaoning Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[496] arXiv:2309.05380 [pdf, other]
Title: Collective PV-RCNN: A Novel Fusion Technique using Collective Detections for Enhanced Local LiDAR-Based Perception
Sven Teufel, Jörg Gamerdinger, Georg Volk, Oliver Bringmann
Comments: Accepted at IEEE ITSC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[497] arXiv:2309.05388 [pdf, html, other]
Title: Robust Single Rotation Averaging Revisited
Seong Hun Lee, Javier Civera
Comments: Accepted to ECCV 2024 Workshop on Recovering 6D Object Pose (R6D)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[498] arXiv:2309.05418 [pdf, html, other]
Title: FlowIBR: Leveraging Pre-Training for Efficient Neural Image-Based Rendering of Dynamic Scenes
Marcel Büsching, Josef Bengtson, David Nilsson, Mårten Björkman
Comments: Accepted to CVPR 2024 Workshop on Efficient Deep Learning for Computer Vision. Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[499] arXiv:2309.05438 [pdf, other]
Title: Towards Content-based Pixel Retrieval in Revisited Oxford and Paris
Guoyuan An, Woo Jae Kim, Saelyne Yang, Rong Li, Yuchi Huo, Sung-Eui Yoon
Subjects: Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[500] arXiv:2309.05448 [pdf, html, other]
Title: Panoptic Vision-Language Feature Fields
Haoran Chen, Kenneth Blomqvist, Francesco Milano, Roland Siegwart
Comments: This work has been accepted by IEEE Robotics and Automation Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)