Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for January 2024

Total of 1882 entries : 51-150 101-200 201-300 301-400 ... 1801-1882
Showing up to 100 entries per page: fewer | more | all
[51] arXiv:2401.00617 [pdf, html, other]
Title: Towards Improved Proxy-based Deep Metric Learning via Data-Augmented Domain Adaptation
Li Ren, Chen Chen, Liqiang Wang, Kien Hua
Comments: Accepted by AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2401.00639 [pdf, html, other]
Title: Geometry Depth Consistency in RGBD Relative Pose Estimation
Sourav Kumar, Chiang-Heng Chien, Benjamin Kimia
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2401.00652 [pdf, html, other]
Title: From Covert Hiding to Visual Editing: Robust Generative Video Steganography
Xueying Mao, Xiaoxiao Hu, Wanli Peng, Zhenliang Gan, Qichao Ying, Zhenxing Qian, Sheng Li, Xinpeng Zhang
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[54] arXiv:2401.00653 [pdf, html, other]
Title: PROMPT-IML: Image Manipulation Localization with Pre-trained Foundation Models Through Prompt Tuning
Xuntao Liu, Yuzhou Yang, Qichao Ying, Zhenxing Qian, Xinpeng Zhang, Sheng Li
Comments: Under Review
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[55] arXiv:2401.00663 [pdf, html, other]
Title: 1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation
Zhuoyan Luo, Yicheng Xiao, Yong Liu, Yitong Wang, Yansong Tang, Xiu Li, Yujiu Yang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[56] arXiv:2401.00695 [pdf, html, other]
Title: Credible Teacher for Semi-Supervised Object Detection in Open Scene
Jingyu Zhuang, Kuo Wang, Liang Lin, Guanbin Li
Comments: Accpet by ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[57] arXiv:2401.00701 [pdf, html, other]
Title: Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning
Kaibin Tian, Yanhua Cheng, Yi Liu, Xinglin Hou, Quan Chen, Han Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2401.00708 [pdf, html, other]
Title: Revisiting Nonlocal Self-Similarity from Continuous Representation
Yisi Luo, Xile Zhao, Deyu Meng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[59] arXiv:2401.00711 [pdf, html, other]
Title: Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute
Chaoqun Gong, Yuqin Dai, Ronghui Li, Achun Bao, Jun Li, Jian Yang, Yachao Zhang, Xiu Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[60] arXiv:2401.00719 [pdf, html, other]
Title: Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face Recognition
Ruizhuo Xu, Ke Wang, Chao Deng, Mei Wang, Xi Chen, Wenhui Huang, Junlan Feng, Weihong Deng
Comments: Accepted by Pattern Recognition
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[61] arXiv:2401.00722 [pdf, other]
Title: BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation
Libin Lan, Pengzhou Cai, Lu Jiang, Xiaojuan Liu, Yongmei Li, Yudong Zhang
Comments: 13 pages, 7 figures, 9 tables. This work has been submitted to the IEEE TETCI for possible publication. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[62] arXiv:2401.00729 [pdf, html, other]
Title: NightRain: Nighttime Video Deraining via Adaptive-Rain-Removal and Adaptive-Correction
Beibei Lin, Yeying Jin, Wending Yan, Wei Ye, Yuan Yuan, Shunli Zhang, Robby Tan
Comments: Accepted by AAAI24
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[63] arXiv:2401.00736 [pdf, html, other]
Title: Diffusion Models, Image Super-Resolution And Everything: A Survey
Brian B. Moser, Arundhati S. Shanbhag, Federico Raue, Stanislav Frolov, Sebastian Palacio, Andreas Dengel
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
[64] arXiv:2401.00739 [pdf, other]
Title: DiffMorph: Text-less Image Morphing with Diffusion Models
Shounak Chatterjee
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[65] arXiv:2401.00766 [pdf, html, other]
Title: Exposure Bracketing Is All You Need For A High-Quality Image
Zhilu Zhang, Shuohao Zhang, Renlong Wu, Zifei Yan, Wangmeng Zuo
Comments: ICLR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[66] arXiv:2401.00789 [pdf, html, other]
Title: Retrieval-Augmented Egocentric Video Captioning
Jilan Xu, Yifei Huang, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie
Comments: CVPR 2024. Project page is available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[67] arXiv:2401.00816 [pdf, html, other]
Title: Glimpse: Generalized Locality for Scalable and Robust CT
AmirEhsan Khorashadizadeh, Valentin Debarnot, Tianlin Liu, Ivan Dokmanić
Comments: 21 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[68] arXiv:2401.00825 [pdf, html, other]
Title: Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior
Byeonghyeon Lee, Howoong Lee, Usman Ali, Eunbyung Park
Comments: Accepted to WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[69] arXiv:2401.00833 [pdf, html, other]
Title: Rethinking RAFT for Efficient Optical Flow
Navid Eslami, Farnoosh Arefi, Amir M. Mansourian, Shohreh Kasaei
Comments: 7 pages, 5 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[70] arXiv:2401.00834 [pdf, html, other]
Title: Deblurring 3D Gaussian Splatting
Byeonghyeon Lee, Howoong Lee, Xiangyu Sun, Usman Ali, Eunbyung Park
Comments: 29 pages, 16 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2401.00847 [pdf, html, other]
Title: Mocap Everyone Everywhere: Lightweight Motion Capture With Smartwatches and a Head-Mounted Camera
Jiye Lee, Hanbyul Joo
Comments: Accepted to CVPR 2024; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[72] arXiv:2401.00849 [pdf, html, other]
Title: COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training
Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou
Comments: 16 pages; Website: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2401.00850 [pdf, html, other]
Title: Refining Pre-Trained Motion Models
Xinglong Sun, Adam W. Harley, Leonidas J. Guibas
Comments: Accepted at ICRA 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[74] arXiv:2401.00869 [pdf, html, other]
Title: FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
Bin Lei, le Chen, Caiwen Ding
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[75] arXiv:2401.00871 [pdf, html, other]
Title: PlanarNeRF: Online Learning of Planar Primitives with Neural Radiance Fields
Zheng Chen, Qingan Yan, Huangying Zhan, Changjiang Cai, Xiangyu Xu, Yuzhong Huang, Weihan Wang, Ziyue Feng, Yi Xu, Lantao Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2401.00889 [pdf, html, other]
Title: 3D Human Pose Perception from Egocentric Stereo Videos
Hiroyasu Akada, Jian Wang, Vladislav Golyanik, Christian Theobalt
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2401.00896 [pdf, html, other]
Title: TrailBlazer: Trajectory Control for Diffusion-Based Video Generation
Wan-Duo Kurt Ma, J.P. Lewis, W. Bastiaan Kleijn
Comments: 14 pages, 18 figures, Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[78] arXiv:2401.00897 [pdf, other]
Title: Masked Modeling for Self-supervised Representation Learning on Vision and Beyond
Siyuan Li, Luyuan Zhang, Zedong Wang, Di Wu, Lirong Wu, Zicheng Liu, Jun Xia, Cheng Tan, Yang Liu, Baigui Sun, Stan Z. Li
Comments: Preprint v2 (fix typos and citations). GitHub project at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[79] arXiv:2401.00901 [pdf, html, other]
Title: Video-GroundingDINO: Towards Open-Vocabulary Spatio-Temporal Video Grounding
Syed Talal Wasim, Muzammal Naseer, Salman Khan, Ming-Hsuan Yang, Fahad Shahbaz Khan
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2401.00909 [pdf, html, other]
Title: Taming Mode Collapse in Score Distillation for Text-to-3D Generation
Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Sreyas Mohan, Forrest Iandola, Rakesh Ranjan, Yilei Li, Qiang Liu, Zhangyang Wang, Vikas Chandra
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[81] arXiv:2401.00910 [pdf, other]
Title: WoodScape Motion Segmentation for Autonomous Driving -- CVPR 2023 OmniCV Workshop Challenge
Saravanabalagi Ramachandran, Nathaniel Cibik, Ganesh Sistu, John McDonald
Comments: CVPR 2023 OmniCV Workshop Challenge
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[82] arXiv:2401.00912 [pdf, html, other]
Title: ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention
Chenhang He, Ruihuang Li, Guowen Zhang, Lei Zhang
Comments: 14 pages, 6 figures, Accepted to ECCV2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2401.00921 [pdf, html, other]
Title: Skeleton2vec: A Self-supervised Learning Framework with Contextualized Target Representations for Skeleton Sequence
Ruizhuo Xu, Linzhi Huang, Mei Wang, Jiani Hu, Weihong Deng
Comments: Submitted to CVPR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[84] arXiv:2401.00926 [pdf, html, other]
Title: Accurate Leukocyte Detection Based on Deformable-DETR and Multi-Level Feature Fusion for Aiding Diagnosis of Blood Diseases
Yifei Chen, Chenyan Zhang, Ben Chen, Yiyu Huang, Yifei Sun, Changmiao Wang, Xianjun Fu, Yuxing Dai, Feiwei Qin, Yong Peng, Yu Gao
Comments: 15 pages, 11 figures, accept Computers in Biology and Medicine 2024
Journal-ref: Computers in Biology and Medicine 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[85] arXiv:2401.00935 [pdf, html, other]
Title: Boundary Attention: Learning curves, corners, junctions and grouping
Mia Gaia Polansky, Charles Herrmann, Junhwa Hur, Deqing Sun, Dor Verbin, Todd Zickler
Comments: Project website at this http URL: this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2401.00964 [pdf, html, other]
Title: Data Augmentation Techniques for Cross-Domain WiFi CSI-based Human Activity Recognition
Julian Strohmayer, Martin Kampel
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[87] arXiv:2401.00971 [pdf, html, other]
Title: Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters
Jiayou Chao, Wei Zhu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2401.00979 [pdf, html, other]
Title: 3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang
Comments: Accepted by AAAI-24
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[89] arXiv:2401.00986 [pdf, other]
Title: Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning
Syed Muhammad Aamir, Hongbin Ma, Malak Abid Ali Khan, Muhammad Aaqib
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[90] arXiv:2401.00988 [pdf, html, other]
Title: Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models
Xinpeng Ding, Jinahua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[91] arXiv:2401.00989 [pdf, html, other]
Title: Diversity-aware Buffer for Coping with Temporally Correlated Data Streams in Online Test-time Adaptation
Mario Döbler, Florian Marencke, Robert A. Marsden, Bin Yang
Comments: Accepted at ICASSP 2024. arXiv admin note: text overlap with arXiv:2306.00650
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[92] arXiv:2401.01002 [pdf, other]
Title: AI Mobile Application for Archaeological Dating of Bronze Dings
Chuntao Li, Ruihua Qi, Chuan Tang, Jiafu Wei, Xi Yang, Qian Zhang, Rixin Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[93] arXiv:2401.01003 [pdf, other]
Title: Rink-Agnostic Hockey Rink Registration
Jia Cheng Shang, Yuhao Chen, Mohammad Javad Shafiee, David A. Clausi
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[94] arXiv:2401.01008 [pdf, html, other]
Title: Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models
Rosco Hunter, Łukasz Dudziak, Mohamed S. Abdelfattah, Abhinav Mehrotra, Sourav Bhattacharya, Hongkai Wen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[95] arXiv:2401.01010 [pdf, html, other]
Title: Unsupervised Continual Anomaly Detection with Contrastively-learned Prompt
Jiaqi Liu, Kai Wu, Qiang Nie, Ying Chen, Bin-Bin Gao, Yong Liu, Jinbao Wang, Chengjie Wang, Feng Zheng
Comments: Accepted by AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[96] arXiv:2401.01018 [pdf, html, other]
Title: Small Bird Detection using YOLOv7 with Test-Time Augmentation
Kosuke Shigematsu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[97] arXiv:2401.01021 [pdf, other]
Title: Class Relevance Learning For Out-of-distribution Detection
Butian Xiong, Liguang Zhou, Tin Lun Lam, Yangsheng Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[98] arXiv:2401.01032 [pdf, other]
Title: A Comparison of Bounding Box and Landmark Detection Methods for Video-Based Heart Rate Estimation
Laurence Liang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[99] arXiv:2401.01035 [pdf, html, other]
Title: Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations
Serban Stan, Mohammad Rostami
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[100] arXiv:2401.01042 [pdf, html, other]
Title: Relating Events and Frames Based on Self-Supervised Learning and Uncorrelated Conditioning for Unsupervised Domain Adaptation
Mohammad Rostami, Dayuan Jian, Ruitong Sun
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2401.01065 [pdf, html, other]
Title: BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving
Tao Tang, Dafeng Wei, Zhengyu Jia, Tian Gao, Changwei Cai, Chengkai Hou, Peng Jia, Kun Zhan, Haiyang Sun, Jingchen Fan, Yixing Zhao, Fu Liu, Xiaodan Liang, Xianpeng Lang, Yang Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[102] arXiv:2401.01066 [pdf, html, other]
Title: DTBS: Dual-Teacher Bi-directional Self-training for Domain Adaptation in Nighttime Semantic Segmentation
Fanding Huang, Zihao Yao, Wenhui Zhou
Journal-ref: ECAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2401.01074 [pdf, html, other]
Title: AliFuse: Aligning and Fusing Multi-modal Medical Data for Computer-Aided Diagnosis
Qiuhui Chen, Yi Hong
Comments: BIBM 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2401.01075 [pdf, html, other]
Title: Depth-discriminative Metric Learning for Monocular 3D Object Detection
Wonhyeok Choi, Mingyu Shin, Sunghoon Im
Comments: Accepted at NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[105] arXiv:2401.01093 [pdf, html, other]
Title: Exploring Hyperspectral Anomaly Detection with Human Vision: A Small Target Aware Detector
Jitao Ma, Weiying Xie, Yunsong Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[106] arXiv:2401.01097 [pdf, html, other]
Title: Robust single-particle cryo-EM image denoising and restoration
Jing Zhang, Tengfei Zhao, ShiYu Hu, Xin Zhao
Comments: This paper is accepted to ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2401.01102 [pdf, html, other]
Title: Dual Teacher Knowledge Distillation with Domain Alignment for Face Anti-spoofing
Zhe Kong, Wentian Zhang, Tao Wang, Kaihao Zhang, Yuexiang Li, Xiaoying Tang, Wenhan Luo
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2401.01107 [pdf, html, other]
Title: CityPulse: Fine-Grained Assessment of Urban Change with Street View Time Series
Tianyuan Huang, Zejia Wu, Jiajun Wu, Jackelyn Hwang, Ram Rajagopal
Comments: Accepted by AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2401.01117 [pdf, html, other]
Title: Q-Refine: A Perceptual Quality Refiner for AI-Generated Image
Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai
Comments: 6 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[110] arXiv:2401.01128 [pdf, html, other]
Title: SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM
Weijin Cheng, Jianzhi Liu, Jiawen Deng, Fuji Ren
Comments: 10 pages, 8 figures
Journal-ref: 2024 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2401.01130 [pdf, html, other]
Title: Joint Generative Modeling of Grounded Scene Graphs and Images via Diffusion Models
Bicheng Xu, Qi Yan, Renjie Liao, Lele Wang, Leonid Sigal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[112] arXiv:2401.01134 [pdf, html, other]
Title: Hybrid Pooling and Convolutional Network for Improving Accuracy and Training Convergence Speed in Object Detection
Shiwen Zhao, Wei Wang, Junhui Hou, Hai Wu
Comments: 10 pages,5 figures, conference
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2401.01163 [pdf, html, other]
Title: NU-Class Net: A Novel Approach for Video Quality Enhancement
Parham Zilouchian Moghaddam, Mehdi Modarressi, Mohammad Amin Sadeghi
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[114] arXiv:2401.01164 [pdf, html, other]
Title: Distilling Local Texture Features for Colorectal Tissue Classification in Low Data Regimes
Dmitry Demidov, Roba Al Majzoub, Amandeep Kumar, Fahad Khan
Journal-ref: Machine Learning in Medical Imaging (MLMI) 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2401.01173 [pdf, html, other]
Title: En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Yifang Men, Biwen Lei, Yuan Yao, Miaomiao Cui, Zhouhui Lian, Xuansong Xie
Comments: Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[116] arXiv:2401.01175 [pdf, html, other]
Title: Learning Surface Scattering Parameters From SAR Images Using Differentiable Ray Tracing
Jiangtao Wei, Yixiang Luomei, Xu Zhang, Feng Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2401.01178 [pdf, other]
Title: GBSS:a global building semantic segmentation dataset for large-scale remote sensing building extraction
Yuping Hu, Xin Huang, Jiayi Li, Zhen Zhang
Comments: 5 pages,6 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2401.01179 [pdf, html, other]
Title: Freeze the backbones: A Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-training
Jiuming Qin, Che Liu, Sibo Cheng, Yike Guo, Rossella Arcucci
Comments: Accepted by ICASSP 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[119] arXiv:2401.01180 [pdf, html, other]
Title: Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone Imagery
Asim Khan, Umair Nawaz, Anwaar Ulhaq, Iqbal Gondal, Sajid Javed
Comments: 8 Pages, 7 figures and 5 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[120] arXiv:2401.01181 [pdf, html, other]
Title: Query-Based Knowledge Sharing for Open-Vocabulary Multi-Label Classification
Xuelin Zhu, Jian Liu, Dongqi Tang, Jiawei Ge, Weijia Liu, Bo Liu, Jiuxin Cao
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2401.01200 [pdf, html, other]
Title: Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms
Flavio P. Loss, Pedro H. da Cunha, Matheus B. Rocha, Madson Poltronieri Zanoni, Leandro M. de Lima, Isadora Tavares Nascimento, Isabella Rezende, Tania R. P. Canuto, Luciana de Paula Vieira, Renan Rossoni, Maria C. S. Santos, Patricia Lyra Frasson, Wanderson Romão, Paulo R. Filgueiras, Renato A. Krohling
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[122] arXiv:2401.01201 [pdf, html, other]
Title: Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans
Lorenzo Venturini, Samuel Budd, Alfonso Farruggia, Robert Wright, Jacqueline Matthew, Thomas G. Day, Bernhard Kainz, Reza Razavi, Jo V. Hajnal
Comments: 14 pages, 16 figures. Submitted to NPJ digital medicine. For associated video file, see this http URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[123] arXiv:2401.01207 [pdf, other]
Title: Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation
Renshuai Liu, Bowen Ma, Wei Zhang, Zhipeng Hu, Changjie Fan, Tangjie Lv, Yu Ding, Xuan Cheng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[124] arXiv:2401.01208 [pdf, html, other]
Title: FGENet: Fine-Grained Extraction Network for Congested Crowd Counting
Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei
Comments: Accepted by 30th International Conference on MultiMedia Modeling
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[125] arXiv:2401.01214 [pdf, html, other]
Title: YOLO algorithm with hybrid attention feature pyramid network for solder joint defect detection
Li Ang, Siti Khatijah Nor Abdul Rahim, Raseeda Hamzah, Raihah Aminuddin, Gao Yousheng
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2401.01216 [pdf, html, other]
Title: Noise-NeRF: Hide Information in Neural Radiance Fields using Trainable Noise
Qinglong Huang, Haoran Li, Yong Liao, Yanbin Hao, Pengyuan Zhou
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2401.01219 [pdf, html, other]
Title: Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond
Dimitrios Kollias, Viktoriia Sharmanska, Stefanos Zafeiriou
Comments: accepted at AAAI 2024. arXiv admin note: text overlap with arXiv:2105.03790
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[128] arXiv:2401.01227 [pdf, other]
Title: IdentiFace : A VGG Based Multimodal Facial Biometric System
Mahmoud Rabea, Hanya Ahmed, Sohaila Mahmoud, Nourhan Sayed
Comments: 12 pages, 22 figures and 9 images
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[129] arXiv:2401.01244 [pdf, html, other]
Title: Temporal Adaptive RGBT Tracking with Modality Prompt
Hongyu Wang, Xiaotao Liu, Yifan Li, Meng Sun, Dian Yuan, Jing Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[130] arXiv:2401.01247 [pdf, html, other]
Title: Deep Learning-Based Computational Model for Disease Identification in Cocoa Pods (Theobroma cacao L.)
Darlyn Buenaño Vera, Byron Oviedo, Washington Chiriboga Casanova, Cristian Zambrano-Vega
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2401.01256 [pdf, html, other]
Title: VideoStudio: Generating Consistent-Content and Multi-Scene Videos
Fuchen Long, Zhaofan Qiu, Ting Yao, Tao Mei
Comments: ECCV 2024. Source code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[132] arXiv:2401.01272 [pdf, html, other]
Title: MOC-RVQ: Multilevel Codebook-Assisted Digital Generative Semantic Communication
Yingbin Zhou, Yaping Sun, Guanying Chen, Xiaodong Xu, Hao Chen, Binhong Huang, Shuguang Cui, Ping Zhang
Comments: Accepted by GLOBECOM 2024. Project code at: $\href{this https URL}{\text{this https URL}}$
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[133] arXiv:2401.01339 [pdf, html, other]
Title: Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
Yunzhi Yan, Haotong Lin, Chenxu Zhou, Weijie Wang, Haiyang Sun, Kun Zhan, Xianpeng Lang, Xiaowei Zhou, Sida Peng
Comments: Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
[134] arXiv:2401.01345 [pdf, other]
Title: A Synthetic Modal Generation of Additive Manufacturing Roughness Surfaces from Images
T.B. Keesom, P.P. Popov, P. Dhyani, G.B. Jacobs
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2401.01361 [pdf, html, other]
Title: Optimizing Convolutional Neural Network Architecture
Luis Balderas, Miguel Lastra, José M. Benítez
Journal-ref: Mathematics 2024, 12(19), 3032
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[136] arXiv:2401.01362 [pdf, other]
Title: Assisting Blind People Using Object Detection with Vocal Feedback
Heba Najm, Khirallah Elferjani, Alhaam Alariyibi
Journal-ref: 2022 IEEE 2nd International Maghreb Meeting of the Conference on Sciences and Techniques of Automatic Control and Computer Engineering (MI-STA), pp. 48-52
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2401.01370 [pdf, html, other]
Title: Fast Quantum Convolutional Neural Networks for Low-Complexity Object Detection in Autonomous Driving Applications
Hankyul Baek, Donghyeon Kim, Joongheon Kim
Comments: 11 pages, 9 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET)
[138] arXiv:2401.01373 [pdf, html, other]
Title: Boosting Defect Detection in Manufacturing using Tensor Convolutional Neural Networks
Pablo Martin-Ramiro, Unai Sainz de la Maza, Sukhbinder Singh, Roman Orus, Samuel Mugel
Comments: 12 pages, 4 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Quantum Physics (quant-ph)
[139] arXiv:2401.01375 [pdf, html, other]
Title: Mapping Walnut Water Stress with High Resolution Multispectral UAV Imagery and Machine Learning
Kaitlyn Wang, Yufang Jin
Comments: 17 pages and 22 figures. To be published in Remote Sensing
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[140] arXiv:2401.01387 [pdf, html, other]
Title: DiffAugment: Diffusion based Long-Tailed Visual Relationship Recognition
Parul Gupta, Tuan Nguyen, Abhinav Dhall, Munawar Hayat, Trung Le, Thanh-Toan Do
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[141] arXiv:2401.01388 [pdf, html, other]
Title: Directional Antenna Systems for Long-Range Through-Wall Human Activity Recognition
Julian Strohmayer, Martin Kampel
Comments: arXiv admin note: text overlap with arXiv:2401.00964
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[142] arXiv:2401.01391 [pdf, html, other]
Title: On Optimal Sampling for Learning SDF Using MLPs Equipped with Positional Encoding
Guying Lin, Lei Yang, Yuan Liu, Congyi Zhang, Junhui Hou, Xiaogang Jin, Taku Komura, John Keyser, Wenping Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
[143] arXiv:2401.01395 [pdf, html, other]
Title: Deep autoregressive modeling for land use land cover
Christopher Krapu, Mark Borsuk, Ryan Calder
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[144] arXiv:2401.01439 [pdf, html, other]
Title: Off-Road LiDAR Intensity Based Semantic Segmentation
Kasi Viswanath, Peng Jiang, Sujit PB, Srikanth Saripalli
Comments: Accepted to ISER 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[145] arXiv:2401.01445 [pdf, html, other]
Title: Indoor Obstacle Discovery on Reflective Ground via Monocular Camera
Feng Xue, Yicong Chang, Tianxi Wang, Yu Zhou, Anlong Ming
Comments: International Journal of Computer Vision (IJCV) 2023. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[146] arXiv:2401.01448 [pdf, html, other]
Title: ProbMCL: Simple Probabilistic Contrastive Learning for Multi-label Visual Classification
Ahmad Sajedi, Samir Khaki, Yuri A. Lawryshyn, Konstantinos N. Plataniotis
Comments: This paper has been accepted for the ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[147] arXiv:2401.01454 [pdf, html, other]
Title: A Survey on Autonomous Driving Datasets: Statistics, Annotation Quality, and a Future Outlook
Mingyu Liu, Ekim Yurtsever, Jonathan Fossaert, Xingcheng Zhou, Walter Zimmer, Yuning Cui, Bare Luka Zagar, Alois C. Knoll
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2401.01456 [pdf, html, other]
Title: ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text
Dingkun Yan, Liang Yuan, Erwin Wu, Yuma Nishioka, Issei Fujishiro, Suguru Saito
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2401.01461 [pdf, html, other]
Title: Efficient Hybrid Zoom using Camera Fusion on Mobile Phones
Xiaotong Wu, Wei-Sheng Lai, YiChang Shih, Charles Herrmann, Michael Krainin, Deqing Sun, Chia-Kai Liang
Comments: Accepted to SIGGRAPH Asia 2023 (ACM TOG). Project website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2401.01470 [pdf, other]
Title: TPC-ViT: Token Propagation Controller for Efficient Vision Transformer
Wentao Zhu
Comments: Accepted by the main conference of WACV 2024; well-formatted PDF is in this https URL ; supplementary is in this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Neural and Evolutionary Computing (cs.NE)
Total of 1882 entries : 51-150 101-200 201-300 301-400 ... 1801-1882
Showing up to 100 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack