Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries : 1-500 501-1000 1001-1500 1501-2000 2001-2022

Showing up to 500 entries per page: fewer | more | all

[1501] arXiv:2309.16949 [pdf, other]: Title: CrossZoom: Simultaneously Motion Deblurring and Event Super-Resolving

Chi Zhang, Xiang Zhang, Mingyuan Lin, Cheng Li, Chu He, Wen Yang, Gui-Song Xia, Lei Yu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1502] arXiv:2309.16956 [pdf, other]: Title: Model2Scene: Learning 3D Scene Representation via Contrastive Language-CAD Models Pre-training

Runnan Chen, Xinge Zhu, Nenglun Chen, Dawei Wang, Wei Li, Yuexin Ma, Ruigang Yang, Tongliang Liu, Wenping Wang

Comments: arXiv admin note: substantial text overlap with arXiv:2203.10546

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1503] arXiv:2309.16959 [pdf, other]: Title: COMNet: Co-Occurrent Matching for Weakly Supervised Semantic Segmentation

Yukun Su, Jingliang Deng, Zonghan Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1504] arXiv:2309.16964 [pdf, html, other]: Title: AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi

Yunjiao Zhou, Jianfei Yang, He Huang, Lihua Xie

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1505] arXiv:2309.16967 [pdf, html, other]: Title: nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance

Yunxiang Li, Bowen Jing, Zihan Li, Jing Wang, You Zhang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1506] arXiv:2309.16968 [pdf, other]: Title: Synthetic Data Generation and Deep Learning for the Topological Analysis of 3D Data

Dylan Peek, Matt P. Skerritt, Stephan Chalup

Comments: 8 pages, 7 figures, Dicta 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1507] arXiv:2309.16975 [pdf, other]: Title: Perceptual Tone Mapping Model for High Dynamic Range Imaging

Imran Mehmood, Xinye Shi, M. Usman Khan, Ming Ronnier Luo

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1508] arXiv:2309.16987 [pdf, other]: Title: SpikeMOT: Event-based Multi-Object Tracking with Sparse Motion Features

Song Wang, Zhu Wang, Can Li, Xiaojuan Qi, Hayden Kwok-Hay So

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1509] arXiv:2309.16992 [pdf, html, other]: Title: Segment Anything Model is a Good Teacher for Local Feature Learning

Jingqian Wu, Rongtao Xu, Zach Wood-Doughty, Changwei Wang, Shibiao Xu, Edmund Y. Lam

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1510] arXiv:2309.17024 [pdf, other]: Title: HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel Joshi, Marc Pollefeys

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1511] arXiv:2309.17031 [pdf, other]: Title: Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process

Zhuo Zheng, Shiqi Tian, Ailong Ma, Liangpei Zhang, Yanfei Zhong

Comments: ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1512] arXiv:2309.17033 [pdf, other]: Title: Unveiling Document Structures with YOLOv5 Layout Detection

Herman Sugiharto, Yorissa Silviana, Yani Siti Nurpazrin

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1513] arXiv:2309.17051 [pdf, other]: Title: On Uniform Scalar Quantization for Learned Image Compression

Haotian Zhang, Li Li, Dong Liu

Comments: 30 pages, 19 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1514] arXiv:2309.17054 [pdf, other]: Title: A 5-Point Minimal Solver for Event Camera Relative Motion Estimation

Ling Gao, Hang Su, Daniel Gehrig, Marco Cannici, Davide Scaramuzza, Laurent Kneip

Journal-ref: IEEE/CVF International Conference on Computer Vision (ICCV), 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1515] arXiv:2309.17058 [pdf, other]: Title: Imagery Dataset for Condition Monitoring of Synthetic Fibre Ropes

Anju Rani, Daniel O. Arroyo, Petar Durdevic

Comments: 7 pages, 3 figures, database

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1516] arXiv:2309.17059 [pdf, html, other]: Title: GSDC Transformer: An Efficient and Effective Cue Fusion for Monocular Multi-Frame Depth Estimation

Naiyu Fang, Lemiao Qiu, Shuyou Zhang, Zili Wang, Zheyuan Zhou, Kerui Hu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1517] arXiv:2309.17074 [pdf, html, other]: Title: AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation

Shengkun Tang, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1518] arXiv:2309.17080 [pdf, other]: Title: GAIA-1: A Generative World Model for Autonomous Driving

Anthony Hu, Lloyd Russell, Hudson Yeo, Zak Murez, George Fedoseev, Alex Kendall, Jamie Shotton, Gianluca Corrado

Comments: Technical Report

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[1519] arXiv:2309.17083 [pdf, other]: Title: SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Risa Shinoda, Ryo Hayamizu, Kodai Nakashima, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka

Comments: ICCV2023. Code: this https URL, Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1520] arXiv:2309.17093 [pdf, html, other]: Title: Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval

Hao Li, Jingkuan Song, Lianli Gao, Xiaosu Zhu, Heng Tao Shen

Comments: Accepted to NeurIPS 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1521] arXiv:2309.17102 [pdf, other]: Title: Guiding Instruction-based Image Editing via Multimodal Large Language Models

Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan

Comments: ICLR'24 (Spotlight) ; Project at this https URL ; Code at this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1522] arXiv:2309.17104 [pdf, other]: Title: Prototype-guided Cross-modal Completion and Alignment for Incomplete Text-based Person Re-identification

Tiantian Gong, Guodong Du, Junsheng Wang, Yongkang Ding, Liyan Zhang

Comments: Sorry, some collaborators do not agree to publish it on Arxiv, so please withdraw this paper

Journal-ref: ACM International Conference on Multimedia 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1523] arXiv:2309.17105 [pdf, html, other]: Title: Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling

Yuan-Ming Li, Ling-An Zeng, Jing-Ke Meng, Wei-Shi Zheng

Comments: 16 pages, 8 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1524] arXiv:2309.17123 [pdf, other]: Title: Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining

Tianyu Han, Laura Žigutytė, Luisa Huck, Marc Huppertz, Robert Siepmann, Yossi Gandelsman, Christian Blüthgen, Firas Khader, Christiane Kuhl, Sven Nebelung, Jakob Kather, Daniel Truhn

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1525] arXiv:2309.17128 [pdf, other]: Title: HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

Xiaochen Zhao, Lizhen Wang, Jingxiang Sun, Hongwen Zhang, Jinli Suo, Yebin Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1526] arXiv:2309.17143 [pdf, other]: Title: Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head

Qian Wu, Si Yong Yeo, Yufei Chen, Jun Liu

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1527] arXiv:2309.17144 [pdf, other]: Title: Prototype Generation: Robust Feature Visualisation for Data Independent Interpretability

Arush Tagade, Jessica Rumbelow

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1528] arXiv:2309.17162 [pdf, other]: Title: APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds

Weijie Wei, Martin R. Oswald, Fatemeh Karimi Nejadasl, Theo Gevers

Comments: Accepted by ICCV Workshop 2023 and selected as an oral

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1529] arXiv:2309.17164 [pdf, html, other]: Title: Retail-786k: a Large-Scale Dataset for Visual Entity Matching

Bianca Lamm (1 and 2), Janis Keuper (1) ((1) IMLA, Offenburg University, (2) Markant Services International GmbH)

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1530] arXiv:2309.17166 [pdf, html, other]: Title: Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation

Zhan Xiong, Junling He, Pieter Valkema, Tri Q. Nguyen, Maarten Naesens, Jesper Kers, Fons J. Verbeek

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1531] arXiv:2309.17172 [pdf, other]: Title: Domain-Adaptive Learning: Unsupervised Adaptation for Histology Images with Improved Loss Function Combination

Ravi Kant Gupta, Shounak Das, Amit Sethi

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1532] arXiv:2309.17175 [pdf, html, other]: Title: TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

Tianyu Huang, Yihan Zeng, Bowen Dong, Hang Xu, Songcen Xu, Rynson W.H. Lau, Wangmeng Zuo

Comments: Accepted by ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1533] arXiv:2309.17187 [pdf, html, other]: Title: TBD Pedestrian Data Collection: Towards Rich, Portable, and Large-Scale Natural Pedestrian Data

Allan Wang, Daisuke Sato, Yasser Corzo, Sonya Simkin, Abhijat Biswas, Aaron Steinfeld

Comments: This work has been accepted by IEEE ICRA 2024. arXiv admin note: substantial text overlap with arXiv:2203.01974

Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[1534] arXiv:2309.17190 [pdf, other]: Title: PARF: Primitive-Aware Radiance Fusion for Indoor Scene Novel View Synthesis

Haiyang Ying, Baowei Jiang, Jinzhi Zhang, Di Xu, Tao Yu, Qionghai Dai, Lu Fang

Comments: Accepted to ICCV 2023; Project page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1535] arXiv:2309.17205 [pdf, other]: Title: Towards Complex-query Referring Image Segmentation: A Novel Benchmark

Wei Ji, Li Li, Hao Fei, Xiangyan Liu, Xun Yang, Juncheng Li, Roger Zimmermann

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1536] arXiv:2309.17211 [pdf, html, other]: Title: Data-Free Dynamic Compression of CNNs for Tractable Efficiency

Lukas Meiner, Jens Mehnert, Alexandru Paul Condurache

Comments: Accepted at VISAPP 2025

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1537] arXiv:2309.17218 [pdf, other]: Title: When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo

Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao

Comments: ICCV2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1538] arXiv:2309.17239 [pdf, other]: Title: EGVD: Event-Guided Video Deraining

Yueyi Zhang, Jin Wang, Wenming Weng, Xiaoyan Sun, Zhiwei Xiong

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1539] arXiv:2309.17257 [pdf, other]: Title: A Survey on Deep Learning Techniques for Action Anticipation

Zeyun Zhong, Manuel Martin, Michael Voit, Juergen Gall, Jürgen Beyerer

Comments: Submitted to TPAMI

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1540] arXiv:2309.17261 [pdf, html, other]: Title: Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors

Yukang Lin, Haonan Han, Chaoqun Gong, Zunnan Xu, Yachao Zhang, Xiu Li

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1541] arXiv:2309.17264 [pdf, html, other]: Title: A Foundation Model for General Moving Object Segmentation in Medical Images

Zhongnuo Yan, Tong Han, Yuhao Huang, Lian Liu, Han Zhou, Jiongquan Chen, Wenlong Shi, Yan Cao, Xin Yang, Dong Ni

Comments: 5 pages, 7 figures, 3 tables. This paper has been accepted by ISBI 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1542] arXiv:2309.17265 [pdf, other]: Title: Effect of structure-based training on 3D localization precision and quality

Armin Abdehkakha, Craig Snoeyink

Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1543] arXiv:2309.17281 [pdf, html, other]: Title: Information Flow in Self-Supervised Learning

Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan, Yifan Zhang

Comments: Published at ICML 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1544] arXiv:2309.17285 [pdf, other]: Title: Efficient Large Scale Medical Image Dataset Preparation for Machine Learning Applications

Stefan Denner, Jonas Scherer, Klaus Kades, Dimitrios Bounias, Philipp Schader, Lisa Kausch, Markus Bujotzek, Andreas Michael Bucher, Tobias Penzkofer, Klaus Maier-Hein

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1545] arXiv:2309.17327 [pdf, html, other]: Title: Telling Stories for Common Sense Zero-Shot Action Recognition

Shreyank N Gowda, Laura Sevilla-Lara

Comments: Accepted in ACCV 2024!

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1546] arXiv:2309.17329 [pdf, html, other]: Title: Efficient Anatomical Labeling of Pulmonary Tree Structures via Deep Point-Graph Representation-based Implicit Fields

Kangxian Xie, Jiancheng Yang, Donglai Wei, Ziqiao Weng, Pascal Fua

Comments: Accepted by Medical Image Analysis

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1547] arXiv:2309.17336 [pdf, html, other]: Title: Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation

Jianning Deng, Gabriel Chan, Hantao Zhong, Chris Xiaoxuan Lu

Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Equal contribution for Gabriel Chan and Hantao Zhong, listed randomly

Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1548] arXiv:2309.17342 [pdf, other]: Title: Towards Free Data Selection with General-Purpose Models

Yichen Xie, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan

Comments: accepted by NeurIPS 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1549] arXiv:2309.17361 [pdf, other]: Title: Network Memory Footprint Compression Through Jointly Learnable Codebooks and Mappings

Edouard Yvinec, Arnaud Dapogny, Kevin Bailly

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1550] arXiv:2309.17389 [pdf, html, other]: Title: Prompt-based test-time real image dehazing: a novel pipeline

Zixuan Chen, Zewei He, Ziqian Lu, Xuecheng Sun, Zhe-Ming Lu

Comments: Accepted by ECCV 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1551] arXiv:2309.17390 [pdf, other]: Title: Forward Flow for Novel View Synthesis of Dynamic Scenes

Xiang Guo, Jiadai Sun, Yuchao Dai, Guanying Chen, Xiaoqing Ye, Xiao Tan, Errui Ding, Yumeng Zhang, Jingdong Wang

Comments: Accepted by ICCV2023 as oral. Project page: this https URL

Journal-ref: ICCV2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1552] arXiv:2309.17399 [pdf, other]: Title: IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images

Jiancheng Huang, Donghao Zhou, Shifeng Chen

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1553] arXiv:2309.17400 [pdf, html, other]: Title: Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Kevin Clark, Paul Vicol, Kevin Swersky, David J Fleet

Comments: Published at ICLR 2024

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1554] arXiv:2309.17421 [pdf, other]: Title: The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)

Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang

Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[1555] arXiv:2309.17426 [pdf, other]: Title: Classification of Potholes Based on Surface Area Using Pre-Trained Models of Convolutional Neural Network

Chauhdary Fazeel Ahmad, Abdullah Cheema, Waqas Qayyum, Rana Ehtisham, Muhammad Haroon Yousaf, Junaid Mir, Nasim Shakouri Mahmoudabadi, Afaq Ahmad

Comments: 24 Pages, 26 Figures

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1556] arXiv:2309.17430 [pdf, other]: Title: FACTS: First Amplify Correlations and Then Slice to Discover Bias

Sriram Yenamandra, Pratik Ramesh, Viraj Prabhu, Judy Hoffman

Comments: Accepted to ICCV 2023

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1557] arXiv:2309.17444 [pdf, html, other]: Title: LLM-grounded Video Diffusion Models

Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li

Comments: ICLR 2024. Project Page: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[1558] arXiv:2309.17448 [pdf, html, other]: Title: SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

Zhongang Cai, Wanqi Yin, Ailing Zeng, Chen Wei, Qingping Sun, Yanjun Wang, Hui En Pang, Haiyi Mei, Mingyuan Zhang, Lei Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

Comments: Homepage: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1559] arXiv:2309.17450 [pdf, other]: Title: Multi-task View Synthesis with Neural Radiance Fields

Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang

Comments: ICCV 2023, Website: this https URL

Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1560] arXiv:2309.00006 (cross-list from eess.SP) [pdf, other]: Title: Dual Radar SAR Controller

Josiah Smith

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1561] arXiv:2309.00027 (cross-list from eess.IV) [pdf, other]: Title: A Sequential Framework for Detection and Classification of Abnormal Teeth in Panoramic X-rays

Tudor Dascalu, Shaqayeq Ramezanzade, Azam Bakhshandeh, Lars Bjorndal, Bulat Ibragimov

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1562] arXiv:2309.00140 (cross-list from cs.SD) [pdf, other]: Title: Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder

Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik

Journal-ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1563] arXiv:2309.00147 (cross-list from eess.IV) [pdf, other]: Title: Optimized Deep Feature Selection for Pneumonia Detection: A Novel RegNet and XOR-Based PSO Approach

Fatemehsadat Ghanadi Ladani, Samaneh Hosseini Semnani

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1564] arXiv:2309.00187 (cross-list from eess.SY) [pdf, other]: Title: Vision-aided nonlinear control framework for shake table tests

Zhongwei Chen, T.Y. Yang, Yifei Xiao, Xiao Pan, Wanyan Yang

Comments: 10 pages, 7 figures, accepted in the Canadian Conference - Pacific Conference on Earthquake Engineering 2023, Vancouver, British Columbia

Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[1565] arXiv:2309.00265 (cross-list from eess.IV) [pdf, other]: Title: Application of Machine Learning in Melanoma Detection and the Identification of 'Ugly Duckling' and Suspicious Naevi: A Review

Fatima Al Zegair, Nathasha Naranpanawa, Brigid Betz-Stablein, Monika Janda, H. Peter Soyer, Shekhar S. Chandra

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1566] arXiv:2309.00305 (cross-list from cs.LG) [pdf, other]: Title: Efficient Surrogate Models for Materials Science Simulations: Machine Learning-based Prediction of Microstructure Properties

Binh Duong Nguyen, Pavlo Potapenko, Aytekin Dermici, Kishan Govind, Sébastien Bompas, Stefan Sandfeld

Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV)
[1567] arXiv:2309.00347 (cross-list from cs.IR) [pdf, other]: Title: Towards Contrastive Learning in Music Video Domain

Karel Veldkamp, Mariya Hendriksen, Zoltán Szlávik, Alexander Keijser

Comments: 6 pages, 2 figures, 2 tables

Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1568] arXiv:2309.00350 (cross-list from eess.IV) [pdf, other]: Title: How You Split Matters: Data Leakage and Subject Characteristics Studies in Longitudinal Brain MRI Analysis

Dewinda Julianensi Rumala

Comments: submitted to MICCAI FAIMI 2023

Journal-ref: MICCAI FAIMI 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1569] arXiv:2309.00359 (cross-list from cs.CL) [pdf, html, other]: Title: Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior

Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya, Yaman K Singla, Somesh Singh, Uttaran Bhattacharya, Ishita Dasgupta, Stefano Petrangeli, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1570] arXiv:2309.00372 (cross-list from eess.IV) [pdf, other]: Title: On the Localization of Ultrasound Image Slices within Point Distribution Models

Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab

Comments: ShapeMI Workshop @ MICCAI 2023; 12 pages 2 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1571] arXiv:2309.00378 (cross-list from cs.CL) [pdf, html, other]: Title: Long-Term Ad Memorability: Understanding & Generating Memorable Ads

Harini SI, Somesh Singh, Yaman K Singla, Aanisha Bhattacharyya, Veeky Baths, Changyou Chen, Rajiv Ratn Shah, Balaji Krishnamurthy

Comments: Published in WACV-2025

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1572] arXiv:2309.00472 (cross-list from cs.IR) [pdf, other]: Title: General and Practical Tuning Method for Off-the-Shelf Graph-Based Index: SISAP Indexing Challenge Report by Team UTokyo

Yutaro Oguri, Yusuke Matsui

Comments: Accepted paper on 2nd place solution of SISAP 2023 Indexing Challenge Task A

Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[1573] arXiv:2309.00494 (cross-list from eess.IV) [pdf, html, other]: Title: Multi-stage Deep Learning Artifact Reduction for Pallel-beam Computed Tomography

Jiayang Shi, Daniel M. Pelt, K. Joost Batenburg

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1574] arXiv:2309.00569 (cross-list from eess.IV) [pdf, other]: Title: Amyloid-Beta Axial Plane PET Synthesis from Structural MRI: An Image Translation Approach for Screening Alzheimer's Disease

Fernando Vega, Abdoljalil Addeh, M. Ethan MacDonald

Comments: Abstract submitted and presented to the International Society of Magnetic Resonance in Medicine (ISMRM 2023), Toronto, Canada

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1575] arXiv:2309.00570 (cross-list from stat.ML) [pdf, other]: Title: Mechanism of feature learning in convolutional neural networks

Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1576] arXiv:2309.00727 (cross-list from eess.IV) [pdf, other]: Title: Deep learning in medical image registration: introduction and survey

Ahmad Hammoudeh, Stéphane Dupont

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1577] arXiv:2309.00769 (cross-list from eess.IV) [pdf, other]: Title: Full Reference Video Quality Assessment for Machine Learning-Based Video Codecs

Abrar Majeedi, Babak Naderi, Yasaman Hosseinkashi, Juhee Cho, Ruben Alvarez Martinez, Ross Cutler

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1578] arXiv:2309.00831 (cross-list from eess.IV) [pdf, other]: Title: Multi-scale, Data-driven and Anatomically Constrained Deep Learning Image Registration for Adult and Fetal Echocardiography

Md. Kamrul Hasan, Haobo Zhu, Guang Yang, Choon Hwai Yap

Comments: Our data-driven and anatomically constrained DLIR method's source code will be publicly available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1579] arXiv:2309.00853 (cross-list from eess.IV) [pdf, other]: Title: Correlated and Multi-frequency Diffusion Modeling for Highly Under-sampled MRI Reconstruction

Yu Guan, Chuanming Yu, Shiyu Lu, Zhuoxu Cui, Dong Liang, Qiegen Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1580] arXiv:2309.00864 (cross-list from cs.LG) [pdf, other]: Title: Equitable-FL: Federated Learning with Sparsity for Resource-Constrained Environment

Indrajeet Kumar Sinha, Shekhar Verma, Krishna Pratap Singh

Comments: 12 pages, 7 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[1581] arXiv:2309.00885 (cross-list from eess.IV) [pdf, other]: Title: A Generic Fundus Image Enhancement Network Boosted by Frequency Self-supervised Representation Learning

Heng Li, Haofeng Liu, Huazhu Fu, Yanwu Xu, Hui Shu, Ke Niu, Yan Hu, Jiang Liu

Comments: Accepted by Medical Image Analysis in Auguest, 2023

Journal-ref: Medical Image Analysis, 2023, 90:102945

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1582] arXiv:2309.00911 (cross-list from eess.IV) [pdf, other]: Title: A novel framework employing deep multi-attention channels network for the autonomous detection of metastasizing cells through fluorescence microscopy

Michail Mamalakis, Sarah C. Macfarlane, Scott V. Notley, Annica K.B Gad, George Panoutsos

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1583] arXiv:2309.00962 (cross-list from cs.RO) [pdf, other]: Title: NTU4DRadLM: 4D Radar-centric Multi-Modal Dataset for Localization and Mapping

Jun Zhang, Huayang Zhuge, Yiyao Liu, Guohao Peng, Zhenyu Wu, Haoyuan Zhang, Qiyang Lyu, Heshan Li, Chunyang Zhao, Dogan Kircali, Sanat Mharolkar, Xun Yang, Su Yi, Yuanzhe Wang, Danwei Wang

Comments: 2023 IEEE International Intelligent Transportation Systems Conference (ITSC 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1584] arXiv:2309.00971 (cross-list from eess.IV) [pdf, other]: Title: AdLER: Adversarial Training with Label Error Rectification for One-Shot Medical Image Segmentation

Xiangyu Zhao, Sheng Wang, Zhiyun Song, Zhenrong Shen, Linlin Yao, Haolei Yuan, Qian Wang, Lichi Zhang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1585] arXiv:2309.00995 (cross-list from eess.IV) [pdf, other]: Title: Constrained CycleGAN for Effective Generation of Ultrasound Sector Images of Improved Spatial Resolution

Xiaofei Sun, He Li, Wei-Ning Lee

Journal-ref: Physics in Medicine & Biology 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1586] arXiv:2309.01007 (cross-list from eess.IV) [pdf, other]: Title: Comparative Analysis of Deep Learning Architectures for Breast Cancer Diagnosis Using the BreaKHis Dataset

İrem Sayın, Muhammed Ali Soydaş, Yunus Emre Mert, Arda Yarkataş, Berk Ergun, Selma Sözen Yeh, Hüseyin Üvet

Comments: 7 pages, 1 figure, 2 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1587] arXiv:2309.01072 (cross-list from eess.IV) [pdf, other]: Title: Channel Attention Separable Convolution Network for Skin Lesion Segmentation

Changlu Guo, Jiangyan Dai, Marton Szemenyei, Yugen Yi

Comments: Accepted by ICONIP 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1588] arXiv:2309.01077 (cross-list from cs.LG) [pdf, other]: Title: Robust Adversarial Defense by Tensor Factorization

Manish Bhattarai, Mehmet Cagri Kaymak, Ryan Barron, Ben Nebgen, Kim Rasmussen, Boian Alexandrov

Comments: Accepted at 2023 ICMLA Conference

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1589] arXiv:2309.01171 (cross-list from eess.IV) [pdf, html, other]: Title: Deep Unfolding Convolutional Dictionary Model for Multi-Contrast MRI Super-resolution and Reconstruction

Pengcheng Lei, Faming Fang, Guixu Zhang, Ming Xu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1590] arXiv:2309.01202 (cross-list from cs.GR) [pdf, other]: Title: MAGMA: Music Aligned Generative Motion Autodecoder

Sohan Anisetty, Amit Raj, James Hays

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1591] arXiv:2309.01207 (cross-list from eess.IV) [pdf, other]: Title: Spectral Adversarial MixUp for Few-Shot Unsupervised Domain Adaptation

Jiajin Zhang, Hanqing Chao, Amit Dhurandhar, Pin-Yu Chen, Ali Tajer, Yangyang Xu, Pingkun Yan

Comments: Accepted by MICCAI 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1592] arXiv:2309.01235 (cross-list from eess.IV) [pdf, other]: Title: Generalizability and Application of the Skin Reflectance Estimate Based on Dichromatic Separation (SREDS)

Joseph Drahos, Richard Plesh, Keivan Bahmani, Mahesh Banavar, Stephanie Schuckers

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1593] arXiv:2309.01312 (cross-list from eess.IV) [pdf, other]: Title: Enhancing Automated and Early Detection of Alzheimer's Disease Using Out-Of-Distribution Detection

Audrey Paleczny, Shubham Parab, Maxwell Zhang

Comments: 10 pages, 8 figures, 3 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1594] arXiv:2309.01322 (cross-list from eess.IV) [pdf, other]: Title: FAU-Net: An Attention U-Net Extension with Feature Pyramid Attention for Prostate Cancer Segmentation

Pablo Cesar Quihui-Rubio, Daniel Flores-Araiza, Miguel Gonzalez-Mendoza, Christian Mata, Gilberto Ochoa-Ruiz

Comments: This paper has been accepted at the 22nd Mexican International Conference on Artificial Intelligence (MICAI 2023)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1595] arXiv:2309.01339 (cross-list from cs.CL) [pdf, other]: Title: UniSA: Unified Generative Framework for Sentiment Analysis

Zaijing Li, Ting-En Lin, Yuchuan Wu, Meng Liu, Fengxiao Tang, Ming Zhao, Yongbin Li

Comments: Accepted to ACM MM 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1596] arXiv:2309.01340 (cross-list from cs.SD) [pdf, other]: Title: MDSC: Towards Evaluating the Style Consistency Between Music and Dance

Zixiang Zhou, Weiyuan Li, Baoyuan Wang

Comments: 19 pages, 19 figure

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1597] arXiv:2309.01361 (cross-list from cs.ET) [pdf, other]: Title: High Frequency, High Accuracy Pointing onboard Nanosats using Neuromorphic Event Sensing and Piezoelectric Actuation

Yasir Latif, Peter Anastasiou, Yonhon Ng, Zebb Prime, Tien-Fu Lu, Matthew Tetlow, Robert Mahony, Tat-Jun Chin

Subjects: Emerging Technologies (cs.ET); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1598] arXiv:2309.01446 (cross-list from cs.CL) [pdf, html, other]: Title: Open Sesame! Universal Black Box Jailbreaking of Large Language Models

Raz Lapid, Ron Langberg, Moshe Sipper

Comments: Accepted at SeT-LLM @ ICLR 2024

Journal-ref: ICLR 2024 Workshop on Secure and Trustworthy Large Language Models

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1599] arXiv:2309.01532 (cross-list from cs.LG) [pdf, other]: Title: Are We Using Autoencoders in a Wrong Way?

Gabriele Martino, Davide Moroni, Massimo Martinelli

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1600] arXiv:2309.01587 (cross-list from cs.AR) [pdf, other]: Title: SATAY: A Streaming Architecture Toolflow for Accelerating YOLO Models on FPGA Devices

Alexander Montgomerie-Corcoran, Petros Toupas, Zhewen Yu, Christos-Savvas Bouganis

Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1601] arXiv:2309.01590 (cross-list from cs.LG) [pdf, other]: Title: Probabilistic Precision and Recall Towards Reliable Evaluation of Generative Models

Dogyun Park, Suhyun Kim

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1602] arXiv:2309.01646 (cross-list from cs.RO) [pdf, other]: Title: ReLoc-PDR: Visual Relocalization Enhanced Pedestrian Dead Reckoning via Graph Optimization

Zongyang Chen, Xianfei Pan, Changhao Chen

Comments: 11 pages, 14 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1603] arXiv:2309.01729 (cross-list from cs.LG) [pdf, other]: Title: Softmax Bias Correction for Quantized Generative Models

Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1604] arXiv:2309.01740 (cross-list from eess.IV) [pdf, other]: Title: An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports

Ethan Dack, Lorenzo Brigato, Matthew McMurray, Matthias Fontanellaz, Thomas Frauenfelder, Hanno Hoppe, Aristomenis Exadaktylos, Thomas Geiser, Manuela Funke-Chambour, Andreas Christe, Lukas Ebner, Stavroula Mougiakakou

Comments: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops 2023

Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1605] arXiv:2309.01751 (cross-list from eess.IV) [pdf, html, other]: Title: Multispectral Indices for Wildfire Management

Afonso Oliveira, João P. Matos-Carvalho, Filipe Moutinho, Nuno Fachada

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[1606] arXiv:2309.01823 (cross-list from eess.IV) [pdf, other]: Title: Multi-dimension unified Swin Transformer for 3D Lesion Segmentation in Multiple Anatomical Locations

Shaoyan Pan, Yiqiao Liu, Sarah Halek, Michal Tomaszewski, Shubing Wang, Richard Baumgartner, Jianda Yuan, Gregory Goldmacher, Antong Chen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1607] arXiv:2309.01904 (cross-list from cs.RO) [pdf, other]: Title: Improving Drone Imagery For Computer Vision/Machine Learning in Wilderness Search and Rescue

Robin Murphy, Thomas Manzini

Comments: 6 pages, 4 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1608] arXiv:2309.02007 (cross-list from eess.IV) [pdf, other]: Title: Logarithmic Mathematical Morphology: theory and applications

Guillaume Noyel (LHC)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Functional Analysis (math.FA); Numerical Analysis (math.NA)
[1609] arXiv:2309.02020 (cross-list from eess.IV) [pdf, other]: Title: RawHDR: High Dynamic Range Image Reconstruction from a Single Raw Image

Yunhao Zou, Chenggang Yan, Ying Fu

Comments: ICCV 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1610] arXiv:2309.02022 (cross-list from cs.LG) [pdf, other]: Title: Dynamic Early Exiting Predictive Coding Neural Networks

Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1611] arXiv:2309.02140 (cross-list from eess.IV) [pdf, other]: Title: A Lightweight, Rapid and Efficient Deep Convolutional Network for Chest X-Ray Tuberculosis Detection

Daniel Capellán-Martín, Juan J. Gómez-Valverde, David Bermejo-Peláez, María J. Ledesma-Carbayo

Comments: 5 pages, 3 figures, 3 tables. This paper has been accepted at ISBI 2023

Journal-ref: 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia, 2023

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1612] arXiv:2309.02147 (cross-list from eess.IV) [pdf, other]: Title: INCEPTNET: Precise And Early Disease Detection Application For Medical Images Analyses

Amirhossein Sajedi, Mohammad Javad Fadaeieslam

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1613] arXiv:2309.02159 (cross-list from cs.CR) [pdf, other]: Title: The Adversarial Implications of Variable-Time Inference

Dudi Biton, Aditi Misra, Efrat Levy, Jaidip Kotak, Ron Bitton, Roei Schuster, Nicolas Papernot, Yuval Elovici, Ben Nassi

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1614] arXiv:2309.02179 (cross-list from eess.IV) [pdf, other]: Title: High-resolution 3D Maps of Left Atrial Displacements using an Unsupervised Image Registration Neural Network

Christoforos Galazis, Anil Anthony Bharath, Marta Varela

Journal-ref: Medical Imaging with Deep Learning, short paper track, 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1615] arXiv:2309.02335 (cross-list from eess.IV) [pdf, other]: Title: DEEPBEAS3D: Deep Learning and B-Spline Explicit Active Surfaces

Helena Williams, João Pedrosa, Muhammad Asad, Laura Cattani, Tom Vercauteren, Jan Deprest, Jan D'hooge

Comments: 4 pages, 3 figures, 1 table, conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1616] arXiv:2309.02404 (cross-list from cs.SD) [pdf, other]: Title: Voice Morphing: Two Identities in One Voice

Sushanta K. Pani, Anurag Chowdhury, Morgan Sandler, Arun Ross

Comments: Accepted oral paper at BIOSIG 2023

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1617] arXiv:2309.02435 (cross-list from cs.LG) [pdf, other]: Title: Efficient RL via Disentangled Environment and Agent Representations

Kevin Gmelin, Shikhar Bahl, Russell Mendonca, Deepak Pathak

Comments: ICML 2023. Website at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[1618] arXiv:2309.02555 (cross-list from cs.LG) [pdf, other]: Title: A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images

Blake VanBerlo, Jesse Hoey, Alexander Wong

Comments: 32 pages, 6 figures, a literature survey submitted to BMC Medical Imaging

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1619] arXiv:2309.02561 (cross-list from cs.RO) [pdf, html, other]: Title: Physically Grounded Vision-Language Models for Robotic Manipulation

Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh

Comments: Updated version for ICRA 2024

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1620] arXiv:2309.02563 (cross-list from eess.IV) [pdf, other]: Title: Evaluation Kidney Layer Segmentation on Whole Slide Imaging using Convolutional Neural Networks and Transformers

Muhao Liu, Chenyang Qi, Shunxing Bao, Quan Liu, Ruining Deng, Yu Wang, Shilin Zhao, Haichun Yang, Yuankai Huo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1621] arXiv:2309.02576 (cross-list from eess.IV) [pdf, other]: Title: Emphysema Subtyping on Thoracic Computed Tomography Scans using Deep Neural Networks

Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Dirk Jan Slebos, Bram van Ginneken

Journal-ref: Sci Rep. 2023 Aug 29;13(1):14147

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1622] arXiv:2309.02591 (cross-list from cs.LG) [pdf, other]: Title: Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1623] arXiv:2309.02670 (cross-list from eess.IV) [pdf, other]: Title: Progressive Attention Guidance for Whole Slide Vulvovaginal Candidiasis Screening

Jiangdong Cai, Honglin Xiong, Maosong Cao, Luyan Liu, Lichi Zhang, Qian Wang

Comments: Accepted in the main conference MICCAI 2023

Journal-ref: 26th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1624] arXiv:2309.02681 (cross-list from eess.IV) [pdf, other]: Title: Improving Image Classification of Knee Radiographs: An Automated Image Labeling Approach

Jikai Zhang, Carlos Santos, Christine Park, Maciej Mazurowski, Roy Colglazier

Comments: This is the preprint version

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1625] arXiv:2309.02691 (cross-list from cs.CL) [pdf, other]: Title: A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models

Noriyuki Kojima, Hadar Averbuch-Elor, Yoav Artzi

Comments: This was published in TMLR in 2024, on January 24th

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1626] arXiv:2309.02783 (cross-list from eess.IV) [pdf, other]: Title: Improving diagnosis and prognosis of lung cancer using vision transformers: A scoping review

Hazrat Ali, Farida Mohsen, Zubair Shah

Comments: submitted to BMC Medical Imaging journal

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1627] arXiv:2309.02841 (cross-list from cs.IT) [pdf, other]: Title: Adjacency-hopping de Bruijn Sequences for Non-repetitive Coding

Bin Chen, Zhenglin Liang, Shiqian Wu

Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Discrete Mathematics (cs.DM)
[1628] arXiv:2309.02898 (cross-list from cs.LG) [pdf, other]: Title: A Unified Framework for Discovering Discrete Symmetries

Pavan Karjol, Rohan Kashyap, Aditya Gopalan, Prathosh A.P

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1629] arXiv:2309.02959 (cross-list from eess.IV) [pdf, html, other]: Title: A Non-Invasive Interpretable NAFLD Diagnostic Method Combining TCM Tongue Features

Shan Cao, Qunsheng Ruan, Qingfeng Wu, Weiqiang Lin

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1630] arXiv:2309.02961 (cross-list from eess.SP) [pdf, html, other]: Title: LuViRA Dataset Validation and Discussion: Comparing Vision, Radio, and Audio Sensors for Indoor Localization

Ilayda Yaman, Guoda Tian, Erik Tegler, Jens Gulin, Nikhil Challa, Fredrik Tufvesson, Ove Edfors, Kalle Astrom, Steffen Malkowsky, Liang Liu

Comments: 10 pages, 11 figures

Journal-ref: IEEE Journal of Indoor and Seamless Positioning and Navigation (2024) 1-11

Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1631] arXiv:2309.03064 (cross-list from cs.CL) [pdf, other]: Title: A Multimodal Analysis of Influencer Content on Twitter

Danae Sánchez Villegas, Catalina Goanta, Nikolaos Aletras

Comments: Accepted at AACL 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1632] arXiv:2309.03113 (cross-list from cs.LG) [pdf, other]: Title: Detecting Manufacturing Defects in PCBs via Data-Centric Machine Learning on Solder Paste Inspection Features

Jubilee Prasad-Rao, Roohollah Heidary, Jesse Williams

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1633] arXiv:2309.03177 (cross-list from eess.SY) [pdf, other]: Title: 3D Object Positioning Using Differentiable Multimodal Learning

Sean Zanyk-McLean, Krishna Kumar, Paul Navratil

Comments: 7 pages, 8 figures

Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1634] arXiv:2309.03183 (cross-list from eess.IV) [pdf, other]: Title: 3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia

Huy-Dung Nguyen, Michaël Clément, Boris Mansencal, Pierrick Coupé

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1635] arXiv:2309.03215 (cross-list from cs.AI) [pdf, other]: Title: Explainable and Trustworthy Traffic Sign Detection for Safe Autonomous Driving: An Inductive Logic Programming Approach

Zahra Chaghazardi (University of Surrey), Saber Fallah (University of Surrey), Alireza Tamaddoni-Nezhad (University of Surrey)

Comments: In Proceedings ICLP 2023, arXiv:2308.14898

Journal-ref: EPTCS 385, 2023, pp. 201-212

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Logic in Computer Science (cs.LO)
[1636] arXiv:2309.03232 (cross-list from cs.LG) [pdf, other]: Title: Retail store customer behavior analysis system: Design and Implementation

Tuan Dinh Nguyen, Keisuke Hihara, Tung Cao Hoang, Yumeka Utada, Akihiko Torii, Naoki Izumi, Nguyen Thanh Thuy, Long Quoc Tran

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1637] arXiv:2309.03244 (cross-list from eess.IV) [pdf, html, other]: Title: EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

Nikolai Körber, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn Schuller

Comments: ECCV 2024 Camera Ready

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1638] arXiv:2309.03320 (cross-list from eess.IV) [pdf, html, other]: Title: CoNeS: Conditional neural fields with shift modulation for multi-sequence MRI translation

Yunjie Chen, Marius Staring, Olaf M. Neve, Stephan R. Romeijn, Erik F. Hensen, Berit M. Verbist, Jelmer M. Wolterink, Qian Tao

Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL

Journal-ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1639] arXiv:2309.03383 (cross-list from eess.IV) [pdf, other]: Title: Kidney abnormality segmentation in thorax-abdomen CT scans

Gabriel Efrain Humpire Mamani, Nikolas Lessmann, Ernst Th. Scholten, Mathias Prokop, Colin Jacobs, Bram van Ginneken

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1640] arXiv:2309.03440 (cross-list from eess.IV) [pdf, other]: Title: Punctate White Matter Lesion Segmentation in Preterm Infants Powered by Counterfactually Generative Learning

Zehua Ren, Yongheng Sun, Miaomiao Wang, Yuying Feng, Xianjun Li, Chao Jin, Jian Yang, Chunfeng Lian, Fan Wang

Comments: 10 pages, 3 figures, Medical Image Computing and Computer Assisted Intervention(MICCAI)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1641] arXiv:2309.03469 (cross-list from cs.LG) [pdf, other]: Title: Fast FixMatch: Faster Semi-Supervised Learning with Curriculum Batch Size

John Chen, Chen Dun, Anastasios Kyrillidis

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1642] arXiv:2309.03477 (cross-list from eess.IV) [pdf, other]: Title: TSI-Net: A Timing Sequence Image Segmentation Network for Intracranial Artery Segmentation in Digital Subtraction Angiography

Lemeng Wang, Wentao Liu, Weijin Xu, Haoyuan Li, Huihua Yang, Feng Gao

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1643] arXiv:2309.03493 (cross-list from eess.IV) [pdf, html, other]: Title: SAM3D: Segment Anything Model in Volumetric Medical Images

Nhat-Tan Bui, Dinh-Hieu Hoang, Minh-Triet Tran, Gianfranco Doretto, Donald Adjeroh, Brijesh Patel, Arabinda Choudhary, Ngan Le

Comments: Accepted at ISBI 2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1644] arXiv:2309.03494 (cross-list from eess.IV) [pdf, other]: Title: Evaluating Deep Learning-based Melanoma Classification using Immunohistochemistry and Routine Histology: A Three Center Study

Christoph Wies, Lucas Schneider, Sarah Haggenmueller, Tabea-Clara Bucher, Sarah Hobelsberger, Markus V. Heppt, Gerardo Ferrara, Eva I. Krieghoff-Henning, Titus J. Brinker

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[1645] arXiv:2309.03535 (cross-list from eess.IV) [pdf, other]: Title: Feature Enhancer Segmentation Network (FES-Net) for Vessel Segmentation

Tariq M. Khan, Muhammad Arsalan, Shahzaib Iqbal, Imran Razzak, Erik Meijering

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1646] arXiv:2309.03569 (cross-list from cs.LG) [pdf, other]: Title: Sparse Federated Training of Object Detection in the Internet of Vehicles

Luping Rao, Chuan Ma, Ming Ding, Yuwen Qian, Lu Zhou, Zhe Liu

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1647] arXiv:2309.03590 (cross-list from eess.IV) [pdf, other]: Title: Spatial encoding of BOLD fMRI time series for categorizing static images across visual datasets: A pilot study on human vision

Vamshi K. Kancharala, Debanjali Bhattacharya, Neelam Sinha

Comments: This paper is accepted for publication in IEEE Region 10 Technical conference, TENCON 2023, to be held in Chiang Mai, Thailand from 31 October - 3 November, 2023

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1648] arXiv:2309.03641 (cross-list from cs.SD) [pdf, html, other]: Title: Spiking Structured State Space Model for Monaural Speech Enhancement

Yu Du, Xu Liu, Yansong Chua

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1649] arXiv:2309.03652 (cross-list from eess.IV) [pdf, other]: Title: Anatomy-informed Data Augmentation for Enhanced Prostate Cancer Detection

Balint Kovacs, Nils Netzer, Michael Baumgartner, Carolin Eith, Dimitrios Bounias, Clara Meinzer, Paul F. Jaeger, Kevin S. Zhang, Ralf Floca, Adrian Schrader, Fabian Isensee, Regula Gnirs, Magdalena Goertz, Viktoria Schuetz, Albrecht Stenzinger, Markus Hohenfellner, Heinz-Peter Schlemmer, Ivo Wolf, David Bonekamp, Klaus H. Maier-Hein

Comments: Accepted at MICCAI 2023

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1650] arXiv:2309.03686 (cross-list from eess.IV) [pdf, other]: Title: MS-UNet-v2: Adaptive Denoising Method and Training Strategy for Medical Image Segmentation with Small Training Data

Haoyuan Chen, Yufei Han, Pin Xu, Yanyi Li, Kuan Li, Jianping Yin

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1651] arXiv:2309.03702 (cross-list from cs.LG) [pdf, other]: Title: DiffDefense: Defending against Adversarial Attacks via Diffusion Models

Hondamunige Prasanna Silva, Lorenzo Seidenari, Alberto Del Bimbo

Comments: Paper published at ICIAP23

Journal-ref: ICIAP 2023

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1652] arXiv:2309.03744 (cross-list from eess.IV) [pdf, html, other]: Title: Label-efficient Contrastive Learning-based model for nuclei detection and classification in 3D Cardiovascular Immunofluorescent Images

Nazanin Moradinasab, Rebecca A. Deaton, Laura S. Shankman, Gary K. Owens, Donald E. Brown

Comments: 11 pages, 5 figures, MICCAI Workshop Conference 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1653] arXiv:2309.03759 (cross-list from eess.IV) [pdf, other]: Title: M(otion)-mode Based Prediction of Ejection Fraction using Echocardiograms

Ece Ozkan, Thomas M. Sutter, Yurong Hu, Sebastian Balzer, Julia E. Vogt

Comments: Accepted at GCPR 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1654] arXiv:2309.03774 (cross-list from cs.LG) [pdf, html, other]: Title: Deep Learning Safety Concerns in Automated Driving Perception

Stephanie Abrecht, Alexander Hirsch, Shervin Raafatnia, Matthias Woehrle

Comments: Added note regarding accepted version at IEEE Transactions on Intelligent Vehicles with DOI

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[1655] arXiv:2309.03851 (cross-list from cs.LG) [pdf, html, other]: Title: CenTime: Event-Conditional Modelling of Censoring in Survival Analysis

Ahmed H. Shahin, An Zhao, Alexander C. Whitehead, Daniel C. Alexander, Joseph Jacob, David Barber

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1656] arXiv:2309.03879 (cross-list from cs.LG) [pdf, other]: Title: Better Practices for Domain Adaptation

Linus Ericsson, Da Li, Timothy M. Hospedales

Comments: AutoML 2023 (Best paper award)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1657] arXiv:2309.03891 (cross-list from cs.RO) [pdf, html, other]: Title: ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation

Hui Zhang, Sammy Christen, Zicong Fan, Luocheng Zheng, Jemin Hwangbo, Jie Song, Otmar Hilliges

Comments: 3DV-2024 camera ready. Project page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1658] arXiv:2309.03900 (cross-list from eess.IV) [pdf, other]: Title: Learning Continuous Exposure Value Representations for Single-Image HDR Reconstruction

Su-Kai Chen, Hung-Lin Yen, Yu-Lun Liu, Min-Hung Chen, Hou-Ning Hu, Wen-Hsiao Peng, Yen-Yu Lin

Comments: ICCV 2023. Project page: this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1659] arXiv:2309.03905 (cross-list from cs.MM) [pdf, other]: Title: ImageBind-LLM: Multi-modality Instruction Tuning

Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao

Comments: Code is available at this https URL

Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1660] arXiv:2309.03906 (cross-list from eess.IV) [pdf, other]: Title: A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation

Ziyan Huang, Zhongying Deng, Jin Ye, Haoyu Wang, Yanzhou Su, Tianbin Li, Hui Sun, Junlong Cheng, Jianpin Chen, Junjun He, Yun Gu, Shaoting Zhang, Lixu Gu, Yu Qiao

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1661] arXiv:2309.03964 (cross-list from cs.LG) [pdf, other]: Title: REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation

Skyler Seto, Barry-John Theobald, Federico Danieli, Navdeep Jaitly, Dan Busbridge

Comments: Accepted at WACV 2024, 17 pages, 7 figures, 11 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1662] arXiv:2309.03965 (cross-list from cs.LG) [pdf, other]: Title: Improving Resnet-9 Generalization Trained on Small Datasets

Omar Mohamed Awad, Habib Hajimolahoseini, Michael Lim, Gurpreet Gosal, Walid Ahmed, Yang Liu, Gordon Deng

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1663] arXiv:2309.04028 (cross-list from math.AG) [pdf, other]: Title: Algebra and Geometry of Camera Resectioning

Erin Connelly, Timothy Duff, Jessie Loucks-Tavitas

Comments: 27 pages

Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV); Commutative Algebra (math.AC)
[1664] arXiv:2309.04071 (cross-list from eess.IV) [pdf, html, other]: Title: Enhancing Hierarchical Transformers for Whole Brain Segmentation with Intracranial Measurements Integration

Xin Yu, Yucheng Tang, Qi Yang, Ho Hin Lee, Shunxing Bao, Yuankai Huo, Bennett A. Landman

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1665] arXiv:2309.04081 (cross-list from cs.LG) [pdf, other]: Title: UER: A Heuristic Bias Addressing Approach for Online Continual Learning

Huiwei Lin, Shanshan Feng, Baoquan Zhang, Hongliang Qiao, Xutao Li, Yunming Ye

Comments: 9 pages, 12 figures, ACM MM2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1666] arXiv:2309.04190 (cross-list from eess.IV) [pdf, html, other]: Title: SegmentAnything helps microscopy images based automatic and quantitative organoid detection and analysis

Xiaodan Xing, Chunling Tang, Yunzhe Guo, Nicholas Kurniawan, Guang Yang

Comments: Replace Figure 4 with the correct version. The original version is wrong due to a column name mismatch

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1667] arXiv:2309.04293 (cross-list from eess.IV) [pdf, other]: Title: How Can We Tame the Long-Tail of Chest X-ray Datasets?

Arsh Verma

Comments: Extended Abstract presented at Computer Vision for Automated Medical Diagnosis Workshop at the International Conference on Computer Vision 2023, October 2nd 2023, Paris, France, & Virtual, this https URL, 7 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1668] arXiv:2309.04342 (cross-list from physics.optics) [pdf, other]: Title: Revealing the preference for correcting separated aberrations in joint optic-image design

Jingwen Zhou, Shiqi Chen, Zheng Ren, Wenguan Zhang, Jiapu Yan, Huajun Feng, Qi Li, Yueting Chen

Comments: 19 pages

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV)
[1669] arXiv:2309.04441 (cross-list from cs.RO) [pdf, other]: Title: Comparative Study of Visual SLAM-Based Mobile Robot Localization Using Fiducial Markers

Jongwon Lee, Su Yeon Choi, David Hanley, Timothy Bretl

Comments: IEEE 2023 IROS Workshop "Closing the Loop on Localization". For more information, see this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1670] arXiv:2309.04461 (cross-list from cs.CL) [pdf, html, other]: Title: Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran

Comments: NAACL 2024 Main Conference. The data is released at this https URL

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1671] arXiv:2309.04509 (cross-list from cs.SD) [pdf, other]: Title: The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion

Yujin Jeong, Wonjeong Ryoo, Seunghyun Lee, Dabin Seo, Wonmin Byeon, Sangpil Kim, Jinkyu Kim

Comments: ICCV2023

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
[1672] arXiv:2309.04511 (cross-list from eess.IV) [pdf, other]: Title: Systematic Review of Techniques in Brain Image Synthesis using Deep Learning

Shubham Singh, Ammar Ranapurwala, Mrunal Bewoor, Sheetal Patil, Satyam Rai

Comments: 8 pages

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1673] arXiv:2309.04581 (cross-list from cs.GR) [pdf, other]: Title: Dynamic Mesh-Aware Radiance Fields

Yi-Ling Qiao, Alexander Gao, Yiran Xu, Yue Feng, Jia-Bin Huang, Ming C. Lin

Comments: ICCV 2023

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1674] arXiv:2309.04631 (cross-list from q-bio.TO) [pdf, other]: Title: Open and reusable deep learning for pathology with WSInfer and QuPath

Jakub R. Kaczmarzyk, Alan O'Callaghan, Fiona Inglis, Tahsin Kurc, Rajarsi Gupta, Erich Bremer, Peter Bankhead, Joel H. Saltz

Subjects: Tissues and Organs (q-bio.TO); Computer Vision and Pattern Recognition (cs.CV)
[1675] arXiv:2309.04651 (cross-list from eess.IV) [pdf, other]: Title: Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis

Nikhil J. Dhinagar, Amit Singh, Saket Ozarkar, Ketaki Buwa, Sophia I. Thomopoulos, Conor Owens-Walton, Emily Laltoo, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Paul M. Thompson

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1676] arXiv:2309.04672 (cross-list from eess.IV) [pdf, html, other]: Title: SSHNN: Semi-Supervised Hybrid NAS Network for Echocardiographic Image Segmentation

Renqi Chen, Jingjing Luo, Fan Nian, Yuhui Cen, Yiheng Peng, Zekuan Yu

Comments: Accepted by ICASSP2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1677] arXiv:2309.04710 (cross-list from cs.RO) [pdf, other]: Title: Jade: A Differentiable Physics Engine for Articulated Rigid Bodies with Intersection-Free Frictional Contact

Gang Yang, Siyuan Luo, Lin Shao

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Systems and Control (eess.SY)
[1678] arXiv:2309.04760 (cross-list from cs.LG) [pdf, other]: Title: RR-CP: Reliable-Region-Based Conformal Prediction for Trustworthy Medical Image Classification

Yizhe Zhang, Shuo Wang, Yejia Zhang, Danny Z. Chen

Comments: UNSURE2023 (Uncertainty for Safe Utilization of Machine Learning in Medical Imaging) at MICCAI2023; Spotlight

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1679] arXiv:2309.04762 (cross-list from cs.SD) [pdf, other]: Title: AudRandAug: Random Image Augmentations for Audio Classification

Teerath Kumar, Muhammad Turab, Alessandra Mileo, Malika Bendechache, Takfarinas Saber

Comments: Paper has accepted at 25th Irish Machine Vision and Image Processing Conference

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1680] arXiv:2309.04777 (cross-list from cs.CR) [pdf, other]: Title: Towards Robust Model Watermark via Reducing Parametric Vulnerability

Guanhao Gan, Yiming Li, Dongxian Wu, Shu-Tao Xia

Comments: This paper is accepted by ICCV 2023

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1681] arXiv:2309.04946 (cross-list from cs.SD) [pdf, other]: Title: Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation

Yuan Gan, Zongxin Yang, Xihang Yue, Lingyun Sun, Yi Yang

Comments: Accepted to ICCV 2023. Project page: this https URL

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
[1682] arXiv:2309.04956 (cross-list from eess.IV) [pdf, other]: Title: Anatomy Completor: A Multi-class Completion Framework for 3D Anatomy Reconstruction

Jianning Li, Antonio Pepe, Gijs Luijten, Christina Schwarz-Gsaxner, Jens Kleesiek, Jan Egger

Comments: 15 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1683] arXiv:2309.04960 (cross-list from eess.IV) [pdf, other]: Title: SdCT-GAN: Reconstructing CT from Biplanar X-Rays with Self-driven Generative Adversarial Networks

Shuangqin Cheng, Qingliang Chen, Qiyi Zhang, Ming Li, Yamuhanmode Alike, Kaile Su, Pengcheng Wen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1684] arXiv:2309.04961 (cross-list from cs.IR) [pdf, other]: Title: Multi-modal Extreme Classification

Anshul Mittal, Kunal Dahiya, Shreya Malani, Janani Ramaswamy, Seba Kuruvilla, Jitendra Ajmera, Keng-hao Chang, Sumeet Agarwal, Purushottam Kar, Manik Varma

Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022

Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV)
[1685] arXiv:2309.05036 (cross-list from cs.RO) [pdf, other]: Title: What Is Near?: Room Locality Learning for Enhanced Robot Vision-Language-Navigation in Indoor Living Environments

Muraleekrishna Gopinathan, Jumana Abu-Khalaf, David Suter, Sidike Paheding, Nathir A. Rawashdeh

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1686] arXiv:2309.05071 (cross-list from math.AP) [pdf, other]: Title: Super-Resolution Surface Reconstruction from Few Low-Resolution Slices

Yiyao Zhang, Ke Chen, Shang-Hua Yang

Comments: 33 pages, 25 figures

Journal-ref: AIMS Journal Inverse Problems and Imaging (IPI) 2023

Subjects: Analysis of PDEs (math.AP); Computer Vision and Pattern Recognition (cs.CV)
[1687] arXiv:2309.05162 (cross-list from cs.CL) [pdf, other]: Title: Collecting Visually-Grounded Dialogue with A Game Of Sorts

Bram Willemsen, Dmytro Kalpakchi, Gabriel Skantze

Comments: Published at LREC 2022

Journal-ref: Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC 2022), pages 2257-2268, Marseille, France. European Language Resources Association

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1688] arXiv:2309.05173 (cross-list from cs.CL) [pdf, other]: Title: DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning

Zhengxiang Shi, Aldo Lipani

Comments: ICLR 2024. Code is available at this https URL

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1689] arXiv:2309.05197 (cross-list from cs.RO) [pdf, other]: Title: Learning Sequential Acquisition Policies for Robot-Assisted Feeding

Priya Sundaresan, Jiajun Wu, Dorsa Sadigh

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1690] arXiv:2309.05271 (cross-list from eess.IV) [pdf, other]: Title: AutoFuse: Automatic Fusion Networks for Deformable Medical Image Registration

Mingyuan Meng, Michael Fulham, Dagan Feng, Lei Bi, Jinman Kim

Comments: Published at Pattern Recognition

Journal-ref: Pattern Recognition, vol. 161, p. 111338, 2025

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1691] arXiv:2309.05339 (cross-list from cs.RO) [pdf, other]: Title: PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics

Claus Smitt, Michael Halstead, Patrick Zimmer, Thomas Läbe, Esra Guclu, Cyrill Stachniss, Chris McCool

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1692] arXiv:2309.05346 (cross-list from cs.LG) [pdf, other]: Title: Learning Geometric Representations of Objects via Interaction

Alfredo Reichlin, Giovanni Luca Marchetti, Hang Yin, Anastasiia Varava, Danica Kragic

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1693] arXiv:2309.05405 (cross-list from eess.IV) [pdf, other]: Title: Two-Stage Hybrid Supervision Framework for Fast, Low-resource, and Accurate Organ and Pan-cancer Segmentation in Abdomen CT

Wentao Liu, Tong Tian, Weijin Xu, Lemeng Wang, Haoyuan Li, Huihua Yang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1694] arXiv:2309.05406 (cross-list from eess.IV) [pdf, html, other]: Title: Treatment-aware Diffusion Probabilistic Model for Longitudinal MRI Generation and Diffuse Glioma Growth Prediction

Qinghui Liu, Elies Fuster-Garcia, Ivar Thokle Hovden, Bradley J MacIntosh, Edvard Grødem, Petter Brandal, Carles Lopez-Mateu, Donatas Sederevicius, Karoline Skogen, Till Schellhorn, Atle Bjørnerud, Kyrre Eeg Emblem

Comments: preprints in IEEE-TMI, 14 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1695] arXiv:2309.05446 (cross-list from eess.IV) [pdf, other]: Title: A Localization-to-Segmentation Framework for Automatic Tumor Segmentation in Whole-Body PET/CT Images

Linghan Cai, Jianhao Huang, Zihang Zhu, Jinpeng Lu, Yongbing Zhang

Comments: 7 pages,3 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1696] arXiv:2309.05534 (cross-list from cs.CL) [pdf, other]: Title: PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud

Chengyu Wang, Zhongjie Duan, Bingyan Liu, Xinyi Zou, Cen Chen, Kui Jia, Jun Huang

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1697] arXiv:2309.05662 (cross-list from cs.RO) [pdf, other]: Title: ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion

Hongyu Li, Snehal Dikhale, Soshi Iba, Nawid Jamali

Comments: Accepted by RA-L

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1698] arXiv:2309.05665 (cross-list from cs.RO) [pdf, other]: Title: Robot Parkour Learning

Ziwen Zhuang, Zipeng Fu, Jianren Wang, Christopher Atkeson, Soeren Schwertfeger, Chelsea Finn, Hang Zhao

Comments: CoRL 2023 (Oral). Project website at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1699] arXiv:2309.05674 (cross-list from eess.IV) [pdf, other]: Title: ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation

Xian Lin, Zengqiang Yan, Xianbo Deng, Chuansheng Zheng, Li Yu

Comments: Accepted by MICCAI 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1700] arXiv:2309.05780 (cross-list from eess.IV) [pdf, other]: Title: LUNet: Deep Learning for the Segmentation of Arterioles and Venules in High Resolution Fundus Images

Jonathan Fhima, Jan Van Eijgen, Hana Kulenovic, Valérie Debeuf, Marie Vangilbergen, Marie-Isaline Billen, Heloïse Brackenier, Moti Freiman, Ingeborg Stalmans, Joachim A. Behar

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1701] arXiv:2309.05826 (cross-list from cs.LG) [pdf, other]: Title: KD-FixMatch: Knowledge Distillation Siamese Neural Networks

Chien-Chih Wang, Shaoyuan Xu, Jinmiao Fu, Yang Liu, Bryan Wang

Comments: 5 pages, 1 figure, 5 tables. To be published in ICIP 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1702] arXiv:2309.05857 (cross-list from eess.IV) [pdf, other]: Title: Radiomics Boosts Deep Learning Model for IPMN Classification

Lanhong Yao, Zheyuan Zhang, Ugur Demir, Elif Keles, Camila Vendrami, Emil Agarunov, Candice Bolan, Ivo Schoots, Marc Bruno, Rajesh Keswani, Frank Miller, Tamas Gonda, Cemal Yazici, Temel Tirkes, Michael Wallace, Concetto Spampinato, Ulas Bagci

Comments: 10 pages, MICCAI MLMI 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1703] arXiv:2309.05879 (cross-list from cs.CR) [pdf, other]: Title: Generalized Attacks on Face Verification Systems

Ehsan Nazari, Paula Branco, Guy-Vincent Jourdan

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1704] arXiv:2309.05919 (cross-list from eess.IV) [pdf, html, other]: Title: Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation

Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1705] arXiv:2309.05929 (cross-list from eess.IV) [pdf, other]: Title: Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation

Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1706] arXiv:2309.05950 (cross-list from cs.CL) [pdf, html, other]: Title: Language Models as Black-Box Optimizers for Vision-Language Models

Shihong Liu, Zhiqiu Lin, Samuel Yu, Ryan Lee, Tiffany Ling, Deepak Pathak, Deva Ramanan

Comments: Published at CVPR 2024. Project site: this https URL

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[1707] arXiv:2309.06046 (cross-list from cs.LG) [pdf, other]: Title: BatMan-CLR: Making Few-shots Meta-Learners Resilient Against Label Noise

Jeroen M. Galjaard, Robert Birke, Juan Perez, Lydia Y. Chen

Comments: 10 pages,3 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1708] arXiv:2309.06054 (cross-list from cs.LG) [pdf, html, other]: Title: Breaking through the learning plateaus of in-context learning in Transformer

Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1709] arXiv:2309.06062 (cross-list from cs.LG) [pdf, other]: Title: Selection of contributing factors for predicting landslide susceptibility using machine learning and deep learning models

Cheng Chen, Lei Fan

Comments: Stochastic Environmental Research and Risk Assessment

Journal-ref: Stochastic Environmental Research and Risk Assessment, 13 September 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[1710] arXiv:2309.06067 (cross-list from eess.IV) [pdf, html, other]: Title: Efficient MRI Parallel Imaging Reconstruction by K-Space Rendering via Generalized Implicit Neural Representation

Hao Li, Yusheng Zhou, Jianan Liu, Xiling Liu, Tao Huang, Zhihan Lyu, Weidong Cai, Wei Chen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1711] arXiv:2309.06075 (cross-list from eess.IV) [pdf, html, other]: Title: A2V: A Semi-Supervised Domain Adaptation Framework for Brain Vessel Segmentation via Two-Phase Training Angiography-to-Venography Translation

Francesco Galati, Daniele Falcetta, Rosa Cortese, Barbara Casolla, Ferran Prados, Ninon Burgos, Maria A. Zuluaga

Comments: Accepted at the 34th British Machine Vision Conference (BMVC)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1712] arXiv:2309.06086 (cross-list from cs.LG) [pdf, other]: Title: Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning

Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer

Comments: Accepted at WACV2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1713] arXiv:2309.06135 (cross-list from cs.CL) [pdf, html, other]: Title: Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts

Zhi-Yi Chin, Chieh-Ming Jiang, Ching-Chun Huang, Pin-Yu Chen, Wei-Chen Chiu

Comments: ICML 2024 main conference paper. The source code is available at this https URL

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1714] arXiv:2309.06143 (cross-list from eess.IV) [pdf, html, other]: Title: Improving Generalization Capability of Deep Learning-Based Nuclei Instance Segmentation by Non-deterministic Train Time and Deterministic Test Time Stain Normalization

Amirreza Mahbod, Georg Dorffner, Isabella Ellinger, Ramona Woitek, Sepideh Hatamikia

Comments: Accepted at Computational and Structural Biotechnology Journal

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1715] arXiv:2309.06166 (cross-list from cs.LG) [pdf, other]: Title: Certified Robust Models with Slack Control and Large Lipschitz Constants

Max Losch, David Stutz, Bernt Schiele, Mario Fritz

Comments: To be published at GCPR 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1716] arXiv:2309.06169 (cross-list from cs.LG) [pdf, other]: Title: Elucidating the solution space of extended reverse-time SDE for diffusion models

Qinpeng Cui, Xinyi Zhang, Qiqi Bao, Qingmin Liao

Comments: This paper has been accepted by WACV 2025 (Oral). The official version lacked proper attribution to the co-authors, and this version has been updated accordingly

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1717] arXiv:2309.06286 (cross-list from cs.AI) [pdf, other]: Title: Transferability analysis of data-driven additive manufacturing knowledge: a case study between powder bed fusion and directed energy deposition

Mutahar Safdar, Jiarui Xie, Hyunwoong Ko, Yan Lu, Guy Lamouche, Yaoyao Fiona Zhao

Comments: 11 pages, 7 figures. This paper has been accepted to be published in the proceedings of IDETC-CIE 2023

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1718] arXiv:2309.06380 (cross-list from cs.LG) [pdf, html, other]: Title: InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Xingchao Liu, Xiwen Zhang, Jianzhu Ma, Jian Peng, Qiang Liu

Comments: ICLR 2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1719] arXiv:2309.06386 (cross-list from eess.IV) [pdf, other]: Title: Lung Diseases Image Segmentation using Faster R-CNNs

Mihir Jain

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1720] arXiv:2309.06421 (cross-list from eess.IV) [pdf, other]: Title: AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer

Tao Ma, Chao Zhang, Min Lu, Lin Luo

Comments: BMVC 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1721] arXiv:2309.06440 (cross-list from cs.RO) [pdf, other]: Title: LEAP Hand: Low-Cost, Efficient, and Anthropomorphic Hand for Robot Learning

Kenneth Shaw, Ananye Agarwal, Deepak Pathak

Comments: Website at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1722] arXiv:2309.06612 (cross-list from cs.LG) [pdf, other]: Title: Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices

Mohamed Imed Eddine Ghebriout, Halima Bouzidi, Smail Niar, Hamza Ouarnoughi

Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML 2023)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1723] arXiv:2309.06652 (cross-list from cs.NE) [pdf, other]: Title: Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation

Ning Zhang, Timothy Shea, Arto Nurmikko

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[1724] arXiv:2309.06660 (cross-list from cs.LG) [pdf, other]: Title: Generalizable Neural Fields as Partially Observed Neural Processes

Jeffrey Gu, Kuan-Chieh Wang, Serena Yeung

Comments: To appear ICCV 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1725] arXiv:2309.06825 (cross-list from eess.IV) [pdf, other]: Title: Topology-inspired Cross-domain Network for Developmental Cervical Stenosis Quantification

Zhenxi Zhang, Yanyang Wang, Yao Wu, Weifei Wu

Comments: We have discovered that some authors' contributions have been overlooked. We need to spend some time confirming whether the authors adhere to the paper's authorship guidelines and whether their authorship order complies with the standards. After discussion with all co-authors, we decide to withdraw this paper

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1726] arXiv:2309.06882 (cross-list from cs.LG) [pdf, other]: Title: ProMap: Datasets for Product Mapping in E-commerce

Kateřina Macková, Martin Pilát

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[1727] arXiv:2309.06928 (cross-list from cs.CL) [pdf, other]: Title: Dynamic Causal Disentanglement Model for Dialogue Emotion Detection

Yuting Su, Yichen Wei, Weizhi Nie, Sicheng Zhao, Anan Liu

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1728] arXiv:2309.06948 (cross-list from eess.IV) [pdf, other]: Title: Limited-Angle Tomography Reconstruction via Deep End-To-End Learning on Synthetic Data

Thomas Germer, Jan Robine, Sebastian Konietzny, Stefan Harmeling, Tobias Uelwer

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1729] arXiv:2309.07085 (cross-list from cs.LG) [pdf, html, other]: Title: Mitigating Group Bias in Federated Learning for Heterogeneous Devices

Khotso Selialia, Yasra Chandio, Fatima M. Anwar

Journal-ref: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1730] arXiv:2309.07094 (cross-list from cs.RO) [pdf, other]: Title: RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline

Mirko Usuelli, Matteo Frosi, Paolo Cudrano, Simone Mentasti, Matteo Matteucci

Comments: 7 pages, 2 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1731] arXiv:2309.07096 (cross-list from q-bio.NC) [pdf, other]: Title: Computational limits to the legibility of the imaged human brain

James K Ruffle, Robert J Gray, Samia Mohinta, Guilherme Pombo, Chaitanya Kaul, Harpreet Hyare, Geraint Rees, Parashkev Nachev

Comments: 38 pages, 6 figures, 1 table, 2 supplementary figures, 1 supplementary table

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1732] arXiv:2309.07115 (cross-list from cs.SD) [pdf, html, other]: Title: Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification

Anith Selvakumar, Homa Fashandi

Comments: Accepted to INTERSPEECH 2024

Journal-ref: Proc. Interspeech 2024, 4728-4732

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1733] arXiv:2309.07117 (cross-list from cs.LG) [pdf, html, other]: Title: PILOT: A Pre-Trained Model-Based Continual Learning Toolbox

Hai-Long Sun, Da-Wei Zhou, De-Chuan Zhan, Han-Jia Ye

Comments: Accepted to SCIENCE CHINA Information Sciences. Code is available at this https URL

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1734] arXiv:2309.07120 (cross-list from cs.CL) [pdf, other]: Title: Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics

Haoqin Tu, Bingchen Zhao, Chen Wei, Cihang Xie

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1735] arXiv:2309.07173 (cross-list from cs.LG) [pdf, other]: Title: Using Unsupervised and Supervised Learning and Digital Twin for Deep Convective Ice Storm Classification

Jason Swope, Steve Chien, Emily Dunkel, Xavier Bosch-Lluis, Qing Yue, William Deal

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1736] arXiv:2309.07255 (cross-list from eess.IV) [pdf, other]: Title: Automated segmentation of rheumatoid arthritis immunohistochemistry stained synovial tissue

Amaya Gallagher-Syed, Abbas Khan, Felice Rivellese, Costantino Pitzalis, Myles J. Lewis, Gregory Slabaugh, Michael R. Barnes

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1737] arXiv:2309.07332 (cross-list from cs.LG) [pdf, other]: Title: Reliability-based cleaning of noisy training labels with inductive conformal prediction in multi-modal biomedical data mining

Xianghao Zhan, Qinmei Xu, Yuanning Zheng, Guangming Lu, Olivier Gevaert

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Applications (stat.AP); Machine Learning (stat.ML)
[1738] arXiv:2309.07387 (cross-list from cs.CL) [pdf, other]: Title: VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue

Yunshui Li, Binyuan Hui, Zhaochao Yin, Wanwei He, Run Luo, Yuxing Long, Min Yang, Fei Huang, Yongbin Li

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1739] arXiv:2309.07461 (cross-list from cs.CR) [pdf, other]: Title: Detecting Unknown Attacks in IoT Environments: An Open Set Classifier for Enhanced Network Intrusion Detection

Yasir Ali Farrukh, Syed Wali, Irfan Khan, Nathaniel D. Bastian

Comments: 6 Pages, 5 figures

Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1740] arXiv:2309.07510 (cross-list from cs.RO) [pdf, other]: Title: Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions

Kai Cheng, Ruihai Wu, Yan Shen, Chuanruo Ning, Guanqi Zhan, Hao Dong

Comments: In 37th Conference on Neural Information Processing Systems (NeurIPS 2023). Website at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1741] arXiv:2309.07609 (cross-list from cs.RO) [pdf, other]: Title: Learning Quasi-Static 3D Models of Markerless Deformable Linear Objects for Bimanual Robotic Manipulation

Piotr Kicki, Michał Bidziński, Krzysztof Walas

Comments: Under review for IEEE Robotics and Automation Letters

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1742] arXiv:2309.07778 (cross-list from eess.IV) [pdf, html, other]: Title: Virchow: A Million-Slide Digital Pathology Foundation Model

Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Siqi Liu, Kristen Severson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi, Philippe Mathieu, Alexander van Eck, Donghun Lee, Julian Viret, Eric Robert, Yi Kan Wang, Jeremy D. Kunz, Matthew C. H. Lee, Jan Bernhard, Ran A. Godrich, Gerard Oakley, Ewan Millar, Matthew Hanna, Juan Retamero, William A. Moye, Razik Yousfi, Christopher Kanan, David Klimstra, Brandon Rothrock, Thomas J. Fuchs

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[1743] arXiv:2309.07878 (cross-list from cs.SI) [pdf, other]: Title: Using network metrics to explore the community structure that underlies movement patterns

Anh Pham Thi Minh, Abhishek Kumar Singh, Soumya Snigdha Kundu

Comments: 6 pages excluding References

Subjects: Social and Information Networks (cs.SI); Computer Vision and Pattern Recognition (cs.CV)
[1744] arXiv:2309.07907 (cross-list from cs.RO) [pdf, other]: Title: Physically Plausible Full-Body Hand-Object Interaction Synthesis

Jona Braun, Sammy Christen, Muhammed Kocabas, Emre Aksan, Otmar Hilliges

Comments: Project page at this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1745] arXiv:2309.07909 (cross-list from cs.LG) [pdf, html, other]: Title: DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan.Z Li, Yang You

Comments: accepted by ICML24

Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Computer Vision and Pattern Recognition (cs.CV)
[1746] arXiv:2309.07915 (cross-list from cs.CL) [pdf, html, other]: Title: MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning

Haozhe Zhao, Zefan Cai, Shuzheng Si, Xiaojian Ma, Kaikai An, Liang Chen, Zixuan Liu, Sheng Wang, Wenjuan Han, Baobao Chang

Comments: Accepted by ICLR2024

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1747] arXiv:2309.07926 (cross-list from eess.IV) [pdf, other]: Title: COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability

Jongmin Park, Jooyoung Lee, Munchurl Kim

Comments: Accepted in ICCV 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1748] arXiv:2309.07970 (cross-list from cs.RO) [pdf, other]: Title: Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping

Adam Rashid, Satvik Sharma, Chung Min Kim, Justin Kerr, Lawrence Chen, Angjoo Kanazawa, Ken Goldberg

Comments: See the project website at: this http URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1749] arXiv:2309.07973 (cross-list from eess.IV) [pdf, other]: Title: M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations

Giada Zingarini, Davide Cozzolino, Riccardo Corvi, Giovanni Poggi, Luisa Verdoliva

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1750] arXiv:2309.08086 (cross-list from cs.RO) [pdf, other]: Title: Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM

Chenghao Shi, Xieyuanli Chen, Junhao Xiao, Bin Dai, Huimin Lu

Comments: 20 pages 10 figures 7 tables

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1751] arXiv:2309.08106 (cross-list from cs.RO) [pdf, other]: Title: Data-Driven Goal Recognition in Transhumeral Prostheses Using Process Mining Techniques

Zihang Su, Tianshi Yu, Nir Lipovetzky, Alireza Mohammadi, Denny Oetomo, Artem Polyvyanyy, Sebastian Sardina, Ying Tan, Nick van Beest

Comments: The 5th International Conference on Process Mining (ICPM 2023)

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1752] arXiv:2309.08129 (cross-list from eess.IV) [pdf, other]: Title: Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixer

Atsuya Nakata, Ryuto Miyazaki, Takao Yamanaka

Comments: This is a pre-print of an article in ACPR2023. The proceedings will be published in Lecture Notes in Computer Science (LNCS). The code is available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1753] arXiv:2309.08146 (cross-list from cs.SD) [pdf, other]: Title: Syn-Att: Synthetic Speech Attribution via Semi-Supervised Unknown Multi-Class Ensemble of CNNs

Md Awsafur Rahman, Bishmoy Paul, Najibul Haque Sarker, Zaber Ibn Abdul Hakim, Shaikh Anowarul Fattah, Mohammad Saquib

Comments: Winning Solution of IEEE SP Cup at ICASSP 2022

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1754] arXiv:2309.08160 (cross-list from eess.IV) [pdf, other]: Title: Cross-Modal Synthesis of Structural MRI and Functional Connectivity Networks via Conditional ViT-GANs

Yuda Bi, Anees Abrol, Jing Sui, Vince Calhoun

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1755] arXiv:2309.08197 (cross-list from eess.IV) [pdf, other]: Title: Hyperspectral Image Denoising via Self-Modulating Convolutional Neural Networks

Orhan Torun, Seniha Esen Yuksel, Erkut Erdem, Nevrez Imamoglu, Aykut Erdem

Journal-ref: Signal Processing, Volume 214, January 2024, 109248

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1756] arXiv:2309.08227 (cross-list from cs.LG) [pdf, other]: Title: VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference

Soumya Banerjee, Vinay K. Verma, Avideep Mukherjee, Deepak Gupta, Vinay P. Namboodiri, Piyush Rai

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1757] arXiv:2309.08234 (cross-list from eess.IV) [pdf, other]: Title: Efficient Polyp Segmentation Via Integrity Learning

Ziqiang Chen, Kang Wang, Yun Liu

Comments: submited to ICASSP 2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1758] arXiv:2309.08295 (cross-list from eess.AS) [pdf, other]: Title: A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism

Ilya Gurvich, Ido Leichter, Dharmendar Reddy Palle, Yossi Asher, Alon Vinnikov, Igor Abramovski, Vishak Gopal, Ross Cutler, Eyal Krupka

Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[1759] arXiv:2309.08381 (cross-list from eess.IV) [pdf, other]: Title: On undesired emergent behaviors in compound prostate cancer detection systems

Erlend Sortland Rolfsnes, Philip Thangngat, Trygve Eftestøl, Tobias Nordström, Fredrik Jäderling, Martin Eklund, Alvaro Fernandez-Quilez

Comments: Accepted in MICCAI 2025, CapTiON

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1760] arXiv:2309.08387 (cross-list from cs.GR) [pdf, other]: Title: Efficient Graphics Representation with Differentiable Indirection

Sayantan Datta, Carl Marshall, Derek Nowrouzezahrai, Zhao Dong, Zhengqin Li

Comments: Project website: this https URL

Journal-ref: SIGGRAPH Asia 2023 Conference Papers (SA Conference Papers '23), December 12--15, 2023, Sydney, NSW, Australia

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1761] arXiv:2309.08402 (cross-list from eess.IV) [pdf, html, other]: Title: 3D SA-UNet: 3D Spatial Attention UNet with 3D Atrous Spatial Pyramid Pooling for White Matter Hyperintensities Segmentation

Changlu Guo

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1762] arXiv:2309.08421 (cross-list from eess.IV) [pdf, html, other]: Title: MIML: Multiplex Image Machine Learning for High Precision Cell Classification via Mechanical Traits within Microfluidic Systems

Khayrul Islam, Ratul Paul, Shen Wang, Yuwen Zhao, Partho Adhikary, Qiying Li, Xiaochen Qin, Yaling Liu

Comments: major change

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1763] arXiv:2309.08434 (cross-list from eess.IV) [pdf, other]: Title: Segment Anything Model for Brain Tumor Segmentation

Peng Zhang, Yaping Wang

Comments: 9 pages, 60 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1764] arXiv:2309.08504 (cross-list from cs.RO) [pdf, html, other]: Title: OccupancyDETR: Using DETR for Mixed Dense-sparse 3D Occupancy Prediction

Yupeng Jia, Jie He, Runze Chen, Fang Zhao, Haiyong Luo

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1765] arXiv:2309.08511 (cross-list from eess.IV) [pdf, html, other]: Title: Generalised Diffusion Probabilistic Scale-Spaces

Pascal Peter

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1766] arXiv:2309.08607 (cross-list from cs.CY) [pdf, html, other]: Title: Monitoring of Urban Changes with multi-modal Sentinel 1 and 2 Data in Mariupol, Ukraine, in 2022/23

Georg Zitzlsberger, Michal Podhoranyi

Comments: Accepted for publication in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1767] arXiv:2309.08612 (cross-list from cs.AI) [pdf, other]: Title: Explaining Vision and Language through Graphs of Events in Space and Time

Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu

Comments: Accepted at IEEE International Conference on Computer Vision (ICCV) 2023 Workshops: 5th Workshop On Closing The Loop Between Vision And Language

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1768] arXiv:2309.08794 (cross-list from cs.AI) [pdf, other]: Title: Privacy-preserving Early Detection of Epileptic Seizures in Videos

Deval Mehta, Shobi Sivathamboo, Hugh Simpson, Patrick Kwan, Terence O`Brien, Zongyuan Ge

Comments: Accepted to MICCAI 2023

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1769] arXiv:2309.08798 (cross-list from cs.AI) [pdf, html, other]: Title: D3: Data Diversity Design for Systematic Generalization in Visual Question Answering

Amir Rahimi, Vanessa D'Amario, Moyuru Yamada, Kentaro Takemoto, Tomotake Sasaki, Xavier Boix

Comments: TMLR (this https URL)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1770] arXiv:2309.08931 (cross-list from cs.AI) [pdf, html, other]: Title: A Novel Neural-symbolic System under Statistical Relational Learning

Dongran Yu, Xueyan Liu, Shirui Pan, Anchen Li, Bo Yang

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1771] arXiv:2309.08989 (cross-list from cs.RO) [pdf, other]: Title: RMP: A Random Mask Pretrain Framework for Motion Prediction

Yi Yang, Qingwen Zhang, Thomas Gilles, Nazre Batool, John Folkesson

Comments: IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1772] arXiv:2309.08997 (cross-list from cs.RO) [pdf, other]: Title: OmniLRS: A Photorealistic Simulator for Lunar Robotics

Antoine Richard, Junnosuke Kamohara, Kentaro Uno, Shreya Santra, Dave van der Meer, Miguel Olivares-Mendez, Kazuya Yoshida

Comments: 7 pages, 4 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1773] arXiv:2309.09038 (cross-list from cs.AI) [pdf, other]: Title: A store-and-forward cloud-based telemonitoring system for automatic assessing dysarthria evolution in neurological diseases from video-recording analysis

Lucia Migliorelli, Daniele Berardini, Kevin Cela, Michela Coccia, Laura Villani, Emanuele Frontoni, Sara Moccia

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1774] arXiv:2309.09080 (cross-list from cs.RO) [pdf, other]: Title: Multi-camera Bird's Eye View Perception for Autonomous Driving

David Unger, Nikhil Gosala, Varun Ravi Kumar, Shubhankar Borse, Abhinav Valada, Senthil Yogamani

Comments: Taylor & Francis (CRC Press) book chapter. Book title: Computer Vision: Challenges, Trends, and Opportunities

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1775] arXiv:2309.09183 (cross-list from cs.RO) [pdf, other]: Title: CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation

Chen Jiang, Yuchen Yang, Martin Jagersand

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1776] arXiv:2309.09206 (cross-list from cs.RO) [pdf, other]: Title: Differentiable SLAM Helps Deep Learning-based LiDAR Perception Tasks

Prashant Kumar, Dheeraj Vattikonda, Vedang Bhupesh Shenvi Nadkarni, Erqun Dong, Sabyasachi Sahoo

Comments: 15 pages,6 Tables, 3 figures. Accepted at BMVC 2023

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1777] arXiv:2309.09215 (cross-list from physics.optics) [pdf, other]: Title: All-optical image denoising using a diffractive visual processor

Cagatay Isıl, Tianyi Gan, F. Onuralp Ardic, Koray Mentesoglu, Jagrit Digani, Huseyin Karaca, Hanlong Chen, Jingxi Li, Deniz Mengu, Mona Jarrahi, Kaan Akşit, Aydogan Ozcan

Comments: 21 Pages, 7 Figures

Journal-ref: Light: Science & Applications (2024)

Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Applied Physics (physics.app-ph)
[1778] arXiv:2309.09314 (cross-list from cs.GR) [pdf, other]: Title: MOVIN: Real-time Motion Capture using a Single LiDAR

Deok-Kyeong Jang, Dongseok Yang, Deok-Yun Jang, Byeoli Choi, Taeil Jin, Sung-Hee Lee

Journal-ref: Computer Graphics Forum 2023, presented at Pacific Graphics 2023

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1779] arXiv:2309.09392 (cross-list from eess.IV) [pdf, other]: Title: Deep conditional generative models for longitudinal single-slice abdominal computed tomography harmonization

Xin Yu, Qi Yang, Yucheng Tang, Riqiang Gao, Shunxing Bao, Leon Y. Cai, Ho Hin Lee, Yuankai Huo, Ann Zenobia Moore, Luigi Ferrucci, Bennett A. Landman

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1780] arXiv:2309.09405 (cross-list from cs.AI) [pdf, other]: Title: Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization

Yoonsoo Nam, Adam Lehavi, Daniel Yang, Digbalay Bose, Swabha Swayamdipta, Shrikanth Narayanan

Comments: \c{opyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1781] arXiv:2309.09410 (cross-list from eess.IV) [pdf, other]: Title: BRONCO: Automated modelling of the bronchovascular bundle using the Computed Tomography Images

Wojciech Prażuch, Marek Socha, Anna Mrukwa, Aleksandra Suwalska, Agata Durawa, Malgorzata Jelitto-Górska, Katarzyna Dziadziuszko, Edyta Szurowska, Pawel Bożek, Michal Marczyk, Witold Rzyman, Joanna Polanska

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1782] arXiv:2309.09426 (cross-list from eess.IV) [pdf, other]: Title: Joint Demosaicing and Denoising with Double Deep Image Priors

Taihui Li, Anish Lahiri, Yutong Dai, Owen Mayer

Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1783] arXiv:2309.09427 (cross-list from cs.RO) [pdf, other]: Title: TransTouch: Learning Transparent Objects Depth Sensing Through Sparse Touches

Liuyu Bian, Pengyang Shi, Weihang Chen, Jing Xu, Li Yi, Rui Chen

Comments: Accepted to the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1784] arXiv:2309.09483 (cross-list from eess.IV) [pdf, other]: Title: An Accurate and Efficient Neural Network for OCTA Vessel Segmentation and a New Dataset

Haojian Ning, Chengliang Wang, Xinrun Chen, Shiying Li

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1785] arXiv:2309.09490 (cross-list from eess.IV) [pdf, other]: Title: Self-supervised TransUNet for Ultrasound regional segmentation of the distal radius in children

Yuyue Zhou, Jessica Knight, Banafshe Felfeliyan, Christopher Keen, Abhilash Rakkunedeth Hareendranathan, Jacob L. Jaremko

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1786] arXiv:2309.09720 (cross-list from cs.LG) [pdf, other]: Title: Traffic Scene Similarity: a Graph-based Contrastive Learning Approach

Maximilian Zipfl, Moritz Jarosch, J. Marius Zöllner

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1787] arXiv:2309.09756 (cross-list from cs.RO) [pdf, other]: Title: Privileged to Predicted: Towards Sensorimotor Reinforcement Learning for Urban Driving

Ege Onat Özsüer, Barış Akgün, Fatma Güney

Comments: 7 pages

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1788] arXiv:2309.09774 (cross-list from cs.LG) [pdf, other]: Title: Towards Self-Adaptive Pseudo-Label Filtering for Semi-Supervised Learning

Lei Zhu, Zhanghan Ke, Rynson Lau

Comments: This paper was first submitted to NeurIPS 2021

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1789] arXiv:2309.09818 (cross-list from cs.RO) [pdf, other]: Title: Grasp-Anything: Large-scale Grasp Dataset from Foundation Models

An Dinh Vuong, Minh Nhat Vu, Hieu Le, Baoru Huang, Binh Huynh, Thieu Vo, Andreas Kugi, Anh Nguyen

Comments: Project page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1790] arXiv:2309.09844 (cross-list from cs.RO) [pdf, other]: Title: CC-SGG: Corner Case Scenario Generation using Learned Scene Graphs

George Drayson, Efimia Panagiotaki, Daniel Omeiza, Lars Kunze

Comments: The first two authors contributed equally to this work

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1791] arXiv:2309.09865 (cross-list from cs.RO) [pdf, html, other]: Title: Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight

Jiaxu Xing, Leonard Bauersfeld, Yunlong Song, Chunwei Xing, Davide Scaramuzza

Journal-ref: IEEE International Conference on Robotics and Automation (ICRA), 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1792] arXiv:2309.09875 (cross-list from cs.RO) [pdf, html, other]: Title: RaLF: Flow-based Global and Metric Radar Localization in LiDAR Maps

Abhijeet Nayak, Daniele Cattaneo, Abhinav Valada

Journal-ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1793] arXiv:2309.09907 (cross-list from quant-ph) [pdf, html, other]: Title: Quantum Vision Clustering

Xuan Bac Nguyen, Hugh Churchill, Khoa Luu, Samee U. Khan

Comments: arXiv admin note: text overlap with arXiv:2202.08837 by other authors

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[1794] arXiv:2309.09944 (cross-list from cs.LG) [pdf, other]: Title: DiffusionWorldViewer: Exposing and Broadening the Worldview Reflected by Generative Text-to-Image Models

Zoe De Simone, Angie Boggust, Arvind Satyanarayan, Ashia Wilson

Comments: 20 pages, 8 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1795] arXiv:2309.09954 (cross-list from eess.IV) [pdf, html, other]: Title: vSHARP: variable Splitting Half-quadratic Admm algorithm for Reconstruction of inverse-Problems

George Yiasemis, Nikita Moriakov, Jan-Jakob Sonke, Jonas Teuwen

Comments: 22 pages, 9 figures, 5 tables

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1796] arXiv:2309.09979 (cross-list from cs.RO) [pdf, other]: Title: General In-Hand Object Rotation with Vision and Touch

Haozhi Qi, Brent Yi, Sudharshan Suresh, Mike Lambeta, Yi Ma, Roberto Calandra, Jitendra Malik

Comments: CoRL 2023; Website: this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1797] arXiv:2309.09983 (cross-list from q-bio.NC) [pdf, other]: Title: Exploration and Comparison of Deep Learning Architectures to Predict Brain Response to Realistic Pictures

Riccardo Chimisso, Sathya Buršić, Paolo Marocco, Giuseppe Vizzari, Dimitri Ognibene

Comments: Submitted to The Algonauts Project 2023 - Exploration and Comparison of Deep Learning Architectures to Predict Brain Response to Realistic Pictures - this http URL

Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1798] arXiv:2309.09987 (cross-list from cs.LG) [pdf, other]: Title: TCGF: A unified tensorized consensus graph framework for multi-view representation learning

Xiangzhu Meng, Wei Wei, Qiang Liu, Shu Wu, Liang Wang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1799] arXiv:2309.10012 (cross-list from cs.LG) [pdf, other]: Title: Looking through the past: better knowledge retention for generative replay in continual learning

Valeriya Khan, Sebastian Cygert, Kamil Deja, Tomasz Trzciński, Bartłomiej Twardowski

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1800] arXiv:2309.10131 (cross-list from cs.LG) [pdf, other]: Title: Deep Prompt Tuning for Graph Transformers

Reza Shirkavand, Heng Huang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1801] arXiv:2309.10153 (cross-list from eess.IV) [pdf, html, other]: Title: Preserving Tumor Volumes for Unsupervised Medical Image Registration

Qihua Dong, Hao Du, Ying Song, Yan Xu, Jing Liao

Comments: ICCV 2023 Poster

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1802] arXiv:2309.10172 (cross-list from physics.flu-dyn) [pdf, html, other]: Title: Enhancing wind field resolution in complex terrain through a knowledge-driven machine learning approach

Jacob Wulff Wold, Florian Stadtmann, Adil Rasheed, Mandar Tabib, Omer San, Jan-Tore Horn

Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV)
[1803] arXiv:2309.10210 (cross-list from eess.IV) [pdf, other]: Title: ProtoKD: Learning from Extremely Scarce Data for Parasite Ova Recognition

Shubham Trehan, Udhav Ramachandran, Ruth Scimeca, Sathyanarayanan N. Aakur

Comments: To Appear at IEEE ICMLA 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1804] arXiv:2309.10227 (cross-list from eess.IV) [pdf, other]: Title: Learning Dynamic MRI Reconstruction with Convolutional Network Assisted Reconstruction Swin Transformer

Di Xu, Hengjie Liu, Dan Ruan, Ke Sheng

Comments: MICCAI 2023 Workshop

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1805] arXiv:2309.10266 (cross-list from physics.flu-dyn) [pdf, other]: Title: Correlation between morphological evolution of splashing drop and exerted impact force revealed by interpretation of explainable artificial intelligence

Jingzu Yee, Daichi Igarashi, Pradipto, Akinori Yamanaka, Yoshiyuki Tagawa

Comments: 23 pages, 13 figures

Subjects: Fluid Dynamics (physics.flu-dyn); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1806] arXiv:2309.10302 (cross-list from cs.LG) [pdf, html, other]: Title: Decoupled Training: Return of Frustratingly Easy Multi-Domain Learning

Ximei Wang, Junwei Pan, Xingzhuo Guo, Dapeng Liu, Jie Jiang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[1807] arXiv:2309.10314 (cross-list from cs.RO) [pdf, html, other]: Title: Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration

Hongbo Zhao, Yikang Zhang, Qijun Chen, Rui Fan

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1808] arXiv:2309.10329 (cross-list from cs.GR) [pdf, other]: Title: Learning based 2D Irregular Shape Packing

Zeshi Yang, Zherong Pan, Manyi Li, Kui Wu, Xifeng Gao

Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1809] arXiv:2309.10348 (cross-list from cs.LG) [pdf, other]: Title: Language Guided Adversarial Purification

Himanshu Singh, A V Subramanyam

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1810] arXiv:2309.10625 (cross-list from cs.AI) [pdf, html, other]: Title: NoisyNN: Exploring the Impact of Information Entropy Change in Learning Systems

Xiaowei Yu, Zhe Huang, Minheng Chen, Lu Zhang, Tianming Liu, Dajiang Zhu

Comments: Task Entropy, ViT, CNN

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1811] arXiv:2309.10646 (cross-list from eess.IV) [pdf, other]: Title: Self-Supervised Super-Resolution Approach for Isotropic Reconstruction of 3D Electron Microscopy Images from Anisotropic Acquisition

Mohammad Khateri, Morteza Ghahremani, Alejandra Sierra, Jussi Tohka

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1812] arXiv:2309.10784 (cross-list from eess.IV) [pdf, other]: Title: Context-Aware Neural Video Compression on Solar Dynamics Observatory

Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Nasser M. Nasrabadi, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva

Comments: Accepted to IEEE 22$^{nd}$ International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation

Subjects: Image and Video Processing (eess.IV); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
[1813] arXiv:2309.10787 (cross-list from eess.AS) [pdf, html, other]: Title: AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee

Comments: Accepted to ICASSP 2024; Evaluation Code: this https URL Submission Platform: this https URL

Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[1814] arXiv:2309.10790 (cross-list from cs.LG) [pdf, other]: Title: Guide Your Agent with Adaptive Multimodal Rewards

Changyeon Kim, Younggyo Seo, Hao Liu, Lisa Lee, Jinwoo Shin, Honglak Lee, Kimin Lee

Comments: Accepted to NeurIPS 2023. Project webpage: this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1815] arXiv:2309.10791 (cross-list from eess.IV) [pdf, other]: Title: Multi-spectral Entropy Constrained Neural Compression of Solar Imagery

Ali Zafari, Atefeh Khoshkhahtinat, Piyush M. Mehta, Nasser M. Nasrabadi, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva

Comments: Accepted to IEEE 22$^{nd}$ International Conference on Machine Learning and Applications 2023 (ICMLA)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1816] arXiv:2309.10799 (cross-list from eess.IV) [pdf, other]: Title: Multi-Context Dual Hyper-Prior Neural Image Compression

Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Mohammad Akyash, Hossein Kashiani, Nasser M. Nasrabadi

Comments: Accepted to IEEE 22$^nd$ International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1817] arXiv:2309.10817 (cross-list from eess.IV) [pdf, other]: Title: Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context

Rucha Deshpande, Muzaffer Özbey, Hua Li, Mark A. Anastasio, Frank J. Brooks

Comments: This paper is under consideration at IEEE TMI

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1818] arXiv:2309.10829 (cross-list from eess.IV) [pdf, other]: Title: Comparative study of Deep Learning Models for Binary Classification on Combined Pulmonary Chest X-ray Dataset

Shabbir Ahmed Shuvo, Md Aminul Islam, Md. Mozammel Hoque, Rejwan Bin Sulaiman

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1819] arXiv:2309.10834 (cross-list from cs.LG) [pdf, html, other]: Title: Communication-Efficient Federated Learning via Regularized Sparse Random Networks

Mohamad Mestoukirdi, Omid Esrafilian, David Gesbert, Qianrui Li, Nicolas Gresset

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Data Structures and Algorithms (cs.DS)
[1820] arXiv:2309.10835 (cross-list from eess.IV) [pdf, other]: Title: Analysing race and sex bias in brain age prediction

Carolina Piçarra, Ben Glocker

Comments: MICCAI Workshop on Fairness of AI in Medical Imaging (FAIMI 2023)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1821] arXiv:2309.10878 (cross-list from cs.LG) [pdf, other]: Title: DeepliteRT: Computer Vision at the Edge

Saad Ashfaq, Alexander Hoffman, Saptarshi Mitra, Sudhakar Sah, MohammadHossein AskariHemmat, Ehsan Saboori

Comments: Accepted at British Machine Vision Conference (BMVC) 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1822] arXiv:2309.10885 (cross-list from cs.RO) [pdf, other]: Title: GelSight Svelte: A Human Finger-shaped Single-camera Tactile Robot Finger with Large Sensing Coverage and Proprioceptive Sensing

Jialiang Zhao, Edward H. Adelson

Comments: Submitted and accepted to 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1823] arXiv:2309.10900 (cross-list from cs.RO) [pdf, other]: Title: Incremental Multimodal Surface Mapping via Self-Organizing Gaussian Mixture Models

Kshitij Goel, Wennie Tabib

Comments: 8 pages, 7 figures, published in IEEE Robotics and Automation Letters

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1824] arXiv:2309.10948 (cross-list from cs.LG) [pdf, other]: Title: A Novel Deep Neural Network for Trajectory Prediction in Automated Vehicles Using Velocity Vector Field

MReza Alipour Sormoli, Amir Samadi, Sajjad Mozaffari, Konstantinos Koufos, Mehrdad Dianati, Roger Woodman

Comments: This paper has been accepted and nominated as the best student paper at the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1825] arXiv:2309.10987 (cross-list from cs.NE) [pdf, html, other]: Title: SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World

Xingting Yao, Qinghao Hu, Fei Zhou, Tielong Liu, Zitao Mo, Zeyu Zhu, Zhengyang Zhuge, Jian Cheng

Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1826] arXiv:2309.11006 (cross-list from cs.RO) [pdf, other]: Title: STARNet: Sensor Trustworthiness and Anomaly Recognition via Approximated Likelihood Regret for Robust Edge Autonomy

Nastaran Darabi, Sina Tayebati, Sureshkumar S., Sathya Ravi, Theja Tulabandhula, Amit R. Trivedi

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1827] arXiv:2309.11018 (cross-list from cs.LG) [pdf, other]: Title: Conformalized Multimodal Uncertainty Regression and Reasoning

Domenico Parente, Nastaran Darabi, Alex C. Stutts, Theja Tulabandhula, Amit Ranjan Trivedi

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1828] arXiv:2309.11038 (cross-list from cs.RO) [pdf, html, other]: Title: CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration

A. Abdullah, T. Barua, R. Tibbetts, Z. Chen, M. J. Islam, I. Rekleitis

Comments: Accepted in ICRA 2024. 10 pages, 9 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1829] arXiv:2309.11139 (cross-list from eess.IV) [pdf, other]: Title: More complex encoder is not all you need

Weibin Yang, Longwei Xu, Pengwei Wang, Dehua Geng, Yusong Li, Mingyuan Xu, Zhiqi Dong

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1830] arXiv:2309.11148 (cross-list from cs.RO) [pdf, html, other]: Title: Online Calibration of a Single-Track Ground Vehicle Dynamics Model by Tight Fusion with Visual-Inertial Odometry

Haolong Li, Joerg Stueckler

Comments: Accepted for publication in IEEE International Conference on Robotics and Automation (ICRA), 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1831] arXiv:2309.11252 (cross-list from cs.CL) [pdf, other]: Title: The Scenario Refiner: Grounding subjects in images at the morphological level

Claudia Tagliaferri, Sofia Axioti, Albert Gatt, Denis Paperno

Comments: presented at the LIMO workshop (Linguistic Insights from and for Multimodal Language Processing @KONVENS 2023)

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1832] arXiv:2309.11258 (cross-list from cs.GR) [pdf, other]: Title: TwinTex: Geometry-aware Texture Generation for Abstracted 3D Architectural Models

Weidan Xiong, Hongqian Zhang, Botao Peng, Ziyu Hu, Yongli Wu, Jianwei Guo, Hui Huang

Comments: Accepted to SIGGRAPH ASIA 2023

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1833] arXiv:2309.11382 (cross-list from cs.RO) [pdf, other]: Title: Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions

Yuxing Long, Xiaoqi Li, Wenzhe Cai, Hao Dong

Comments: Submitted to ICRA 2024

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1834] arXiv:2309.11413 (cross-list from cs.RO) [pdf, other]: Title: Enhancing motion trajectory segmentation of rigid bodies using a novel screw-based trajectory-shape representation

Arno Verduyn, Maxim Vochten, Joris De Schutter

Comments: This work has been submitted to the IEEE International Conference on Robotics and Automation (ICRA) for possible publication

Journal-ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1835] arXiv:2309.11419 (cross-list from cs.CL) [pdf, html, other]: Title: KOSMOS-2.5: A Multimodal Literate Model

Tengchao Lv, Yupan Huang, Jingye Chen, Yuzhong Zhao, Yilin Jia, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1836] arXiv:2309.11421 (cross-list from eess.IV) [pdf, other]: Title: CalibFPA: A Focal Plane Array Imaging System based on Online Deep-Learning Calibration

Alper Güngör, M. Umut Bahceci, Yasin Ergen, Ahmet Sözak, O. Oner Ekiz, Tolga Yelboga, Tolga Çukur

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1837] arXiv:2309.11446 (cross-list from cs.LG) [pdf, other]: Title: Weight Averaging Improves Knowledge Distillation under Domain Shift

Valeriy Berezovskiy, Nikita Morozov

Comments: ICCV 2023 Workshop on Out-of-Distribution Generalization in Computer Vision (OOD-CV)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1838] arXiv:2309.11500 (cross-list from cs.SD) [pdf, html, other]: Title: Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning

Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie

Comments: Accepted by ACM MM 2024

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1839] arXiv:2309.11510 (cross-list from cs.IR) [pdf, other]: Title: When is a Foundation Model a Foundation Model

Saghir Alfasly, Peyman Nejat, Sobhan Hemati, Jibran Khan, Isaiah Lahr, Areej Alsaafin, Abubakr Shafique, Nneka Comfere, Dennis Murphree, Chady Meroueh, Saba Yasir, Aaron Mangold, Lisa Boardman, Vijay Shah, Joaquin J. Garcia, H.R. Tizhoosh

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1840] arXiv:2309.11705 (cross-list from cs.LG) [pdf, other]: Title: Meta OOD Learning for Continuously Adaptive OOD Detection

Xinheng Wu, Jie Lu, Zhen Fang, Guangquan Zhang

Comments: Accepted by ICCV 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1841] arXiv:2309.11710 (cross-list from cs.CL) [pdf, other]: Title: ContextRef: Evaluating Referenceless Metrics For Image Description Generation

Elisa Kreiss, Eric Zelikman, Christopher Potts, Nick Haber

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1842] arXiv:2309.11745 (cross-list from eess.IV) [pdf, other]: Title: PIE: Simulating Disease Progression via Progressive Image Editing

Kaizhao Liang, Xu Cao, Kuei-Da Liao, Tianren Gao, Wenqian Ye, Zhengyu Chen, Jianguo Cao, Tejas Nama, Jimeng Sun

Comments: Code and checkpoints for replicating our results can be found at this https URL and this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1843] arXiv:2309.11766 (cross-list from cs.CR) [pdf, html, other]: Title: Dictionary Attack on IMU-based Gait Authentication

Rajesh Kumar, Can Isik, Chilukuri K. Mohan

Comments: 12 pages, 9 figures, accepted at AISec23 colocated with ACM CCS, November 30, 2023, Copenhagen, Denmark

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1844] arXiv:2309.11820 (cross-list from eess.IV) [pdf, html, other]: Title: Automatic Endoscopic Ultrasound Station Recognition with Limited Data

Abhijit Ramesh, Anantha Nandanan, Nikhil Boggavarapu, Priya Nair MD, Gilad Gressel

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1845] arXiv:2309.11891 (cross-list from eess.IV) [pdf, other]: Title: Heart Rate Detection Using an Event Camera

Aniket Jagtap, RamaKrishna Venkatesh Saripalli, Joe Lemley, Waseem Shariff, Alan F. Smeaton

Comments: Dataset available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1846] arXiv:2309.11913 (cross-list from eess.IV) [pdf, other]: Title: Spatial-Temporal Transformer based Video Compression Framework

Yanbo Gao, Wenjia Huang, Shuai Li, Hui Yuan, Mao Ye, Siwei Ma

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1847] arXiv:2309.11930 (cross-list from cs.LG) [pdf, html, other]: Title: Bridging the Gap: Learning Pace Synchronization for Open-World Semi-Supervised Learning

Bo Ye, Kai Gan, Tong Wei, Min-Ling Zhang

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1848] arXiv:2309.11989 (cross-list from cs.RO) [pdf, html, other]: Title: A Vision-Based Navigation System for Arable Fields

Rajitha de Silva, Grzegorz Cielniak, Junfeng Gao

Comments: Submitted to Journal of Field Robotics

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1849] arXiv:2309.11993 (cross-list from cs.GR) [pdf, other]: Title: Neural Stochastic Screened Poisson Reconstruction

Silvia Sellán, Alec Jacobson

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1850] arXiv:2309.11995 (cross-list from eess.IV) [pdf, other]: Title: Identification of pneumonia on chest x-ray images through machine learning

Eduardo Augusto Roeder

Comments: In Brazilian Portuguese, 30 pages, 16 figures. This thesis was elaborated by the guidance of Prof. Dr. Akihito Inca Atahualpa Urdiales

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1851] arXiv:2309.12010 (cross-list from eess.IV) [pdf, other]: Title: Convolution and Attention Mixer for Synthetic Aperture Radar Image Change Detection

Haopeng Zhang, Zijing Lin, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

Comments: Accepted by IEEE GRSL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1852] arXiv:2309.12022 (cross-list from cs.AI) [pdf, html, other]: Title: Demystifying Visual Features of Movie Posters for Multi-Label Genre Identification

Utsav Kumar Nareti, Chandranath Adak, Soumi Chattopadhyay

Comments: IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (Accepted)

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1853] arXiv:2309.12095 (cross-list from stat.ML) [pdf, other]: Title: Bayesian sparsification for deep neural networks with Bayesian model reduction

Dimitrije Marković, Karl J. Friston, Stefan J. Kiebel

Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1854] arXiv:2309.12114 (cross-list from eess.IV) [pdf, other]: Title: AutoPET Challenge 2023: Sliding Window-based Optimization of U-Net

Matthias Hadlich, Zdravko Marinov, Rainer Stiefelhagen

Comments: 9 pages, 1 figure, MICCAI 2023 - AutoPET Challenge Submission Version 2: Added all results on the preliminary test set

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1855] arXiv:2309.12159 (cross-list from cs.CR) [pdf, other]: Title: Information Forensics and Security: A quarter-century-long journey

Mauro Barni, Patrizio Campisi, Edward J. Delp, Gwenael Doërr, Jessica Fridrich, Nasir Memon, Fernando Pérez-González, Anderson Rocha, Luisa Verdoliva, Min Wu

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1856] arXiv:2309.12188 (cross-list from cs.RO) [pdf, html, other]: Title: SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs

Guangyao Zhai, Xiaoni Cai, Dianye Huang, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam

Comments: ICRA 2024 accepted. Project website: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1857] arXiv:2309.12193 (cross-list from eess.IV) [pdf, other]: Title: Brain Tumor Detection Using Deep Learning Approaches

Razia Sultana Misu

Comments: Bachelor's thesis. Supervisor: Nushrat Jahan Ria

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1858] arXiv:2309.12245 (cross-list from eess.IV) [pdf, html, other]: Title: Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images

Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

Comments: Submitted to the Elsevier Journal

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1859] arXiv:2309.12300 (cross-list from cs.RO) [pdf, other]: Title: See to Touch: Learning Tactile Dexterity through Visual Incentives

Irmak Guzey, Yinlong Dai, Ben Evans, Soumith Chintala, Lerrel Pinto

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1860] arXiv:2309.12301 (cross-list from cs.LG) [pdf, other]: Title: Environment-biased Feature Ranking for Novelty Detection Robustness

Stefan Smeu, Elena Burceanu, Emanuela Haller, Andrei Liviu Nicolicioiu

Comments: The updated, long version of the paper is available at arXiv:2310.03738

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1861] arXiv:2309.12312 (cross-list from cs.RO) [pdf, other]: Title: ForceSight: Text-Guided Mobile Manipulation with Visual-Force Goals

Jeremy A. Collins, Cody Houff, You Liang Tan, Charles C. Kemp

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1862] arXiv:2309.12325 (cross-list from cs.CY) [pdf, other]: Title: FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare

Karim Lekadir, Aasa Feragen, Abdul Joseph Fofanah, Alejandro F Frangi, Alena Buyx, Anais Emelie, Andrea Lara, Antonio R Porras, An-Wen Chan, Arcadi Navarro, Ben Glocker, Benard O Botwe, Bishesh Khanal, Brigit Beger, Carol C Wu, Celia Cintas, Curtis P Langlotz, Daniel Rueckert, Deogratias Mzurikwao, Dimitrios I Fotiadis, Doszhan Zhussupov, Enzo Ferrante, Erik Meijering, Eva Weicken, Fabio A González, Folkert W Asselbergs, Fred Prior, Gabriel P Krestin, Gary Collins, Geletaw S Tegenaw, Georgios Kaissis, Gianluca Misuraca, Gianna Tsakou, Girish Dwivedi, Haridimos Kondylakis, Harsha Jayakody, Henry C Woodruf, Horst Joachim Mayer, Hugo JWL Aerts, Ian Walsh, Ioanna Chouvarda, Irène Buvat, Isabell Tributsch, Islem Rekik, James Duncan, Jayashree Kalpathy-Cramer, Jihad Zahir, Jinah Park, John Mongan, Judy W Gichoya, Julia A Schnabel, Kaisar Kushibar, Katrine Riklund, Kensaku Mori, Kostas Marias, Lameck M Amugongo, Lauren A Fromont, Lena Maier-Hein, Leonor Cerdá Alberich, Leticia Rittner, Lighton Phiri, Linda Marrakchi-Kacem, Lluís Donoso-Bach, Luis Martí-Bonmatí, M Jorge Cardoso, Maciej Bobowicz, Mahsa Shabani, Manolis Tsiknakis, Maria A Zuluaga, Maria Bielikova, Marie-Christine Fritzsche, Marina Camacho, Marius George Linguraru, Markus Wenzel, Marleen De Bruijne, Martin G Tolsgaard, Marzyeh Ghassemi, Md Ashrafuzzaman, Melanie Goisauf, Mohammad Yaqub, Mónica Cano Abadía, Mukhtar M E Mahmoud, Mustafa Elattar, Nicola Rieke, Nikolaos Papanikolaou, Noussair Lazrak, Oliver Díaz, Olivier Salvado, Oriol Pujol, Ousmane Sall, Pamela Guevara, Peter Gordebeke, Philippe Lambin, Pieta Brown, Purang Abolmaesumi, Qi Dou, Qinghua Lu, Richard Osuala, Rose Nakasi, S Kevin Zhou

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1863] arXiv:2309.12397 (cross-list from cs.RO) [pdf, html, other]: Title: POLAR-Sim: Augmenting NASA's POLAR Dataset for Data-Driven Lunar Perception and Rover Simulation

Bo-Hsun Chen, Peter Negrut, Thomas Liang, Nevindu Batagoda, Harry Zhang, Dan Negrut

Comments: 11 pages, 9 figures. This work has been submitted to the IEEE for possible publication

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1864] arXiv:2309.12443 (cross-list from cs.CL) [pdf, html, other]: Title: Active Learning for Multilingual Fingerspelling Corpora

Shuai Wang, Eric Nalisnick

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1865] arXiv:2309.12460 (cross-list from cs.LG) [pdf, other]: Title: Multimodal Deep Learning for Scientific Imaging Interpretation

Abdulelah S. Alshehri, Franklin L. Lee, Shihu Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1866] arXiv:2309.12559 (cross-list from cs.LG) [pdf, html, other]: Title: Invariant Learning via Probability of Sufficient and Necessary Causes

Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1867] arXiv:2309.12572 (cross-list from eess.IV) [pdf, other]: Title: Interpretable 3D Multi-Modal Residual Convolutional Neural Network for Mild Traumatic Brain Injury Diagnosis

Hanem Ellethy, Viktor Vegh, Shekhar S. Chandra

Comments: Accepted by the Australasian Joint Conference on Artificial Intelligence 2023 (AJCAI 2023). 12 pages and 5 Figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1868] arXiv:2309.12593 (cross-list from cs.LG) [pdf, other]: Title: Improving Machine Learning Robustness via Adversarial Training

Long Dang, Thushari Hapuarachchi, Kaiqi Xiong, Jing Lin

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1869] arXiv:2309.12634 (cross-list from cs.RO) [pdf, other]: Title: Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor

Robin Göransson, Volker Krueger

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1870] arXiv:2309.12638 (cross-list from eess.IV) [pdf, other]: Title: Auto-Lesion Segmentation with a Novel Intensity Dark Channel Prior for COVID-19 Detection

Basma Jumaa Saleh, Zaid Omar, Vikrant Bhateja, Lila Iznita Izhar

Comments: The study requires withdrawal due to technical inconsistencies in the reported data that affect the conclusions. We apologize for any inconvenience

Journal-ref: Journal of Physics: Conference Series 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Metric Geometry (math.MG); Optimization and Control (math.OC)
[1871] arXiv:2309.12673 (cross-list from cs.LG) [pdf, other]: Title: On Sparse Modern Hopfield Model

Jerry Yao-Chieh Hu, Donglin Yang, Dennis Wu, Chenwei Xu, Bo-Yu Chen, Han Liu

Comments: 37 pages, accepted at NeurIPS 2023. [v2] updated to match with camera-ready version. Code is available at this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1872] arXiv:2309.12675 (cross-list from cs.AI) [pdf, other]: Title: Vision Transformers for Computer Go

Amani Sagri, Tristan Cazenave, Jérôme Arjonilla, Abdallah Saffidine

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1873] arXiv:2309.12685 (cross-list from cs.RO) [pdf, html, other]: Title: eWand: A calibration framework for wide baseline frame-based and event-based camera systems

Thomas Gossard, Andreas Ziegler, Levin Kolmar, Jonas Tebbe, Andreas Zell

Comments: Accepted for 2024 IEEE International Conference on Robotics and Automation (ICRA 2024). Project web page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1874] arXiv:2309.12706 (cross-list from cs.LG) [pdf, other]: Title: Multi-Label Noise Transition Matrix Estimation with Label Correlations: Theory and Algorithm

Shikun Li, Xiaobo Xia, Hansong Zhang, Shiming Ge, Tongliang Liu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1875] arXiv:2309.12805 (cross-list from eess.IV) [pdf, other]: Title: Automatic view plane prescription for cardiac magnetic resonance imaging via supervision by spatial relationship between views

Dong Wei, Yawen Huang, Donghuan Lu, Yuexiang Li, Yefeng Zheng

Comments: Medical Physics. arXiv admin note: text overlap with arXiv:2109.11715

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1876] arXiv:2309.12855 (cross-list from eess.IV) [pdf, other]: Title: Cross-Modal Translation and Alignment for Survival Analysis

Fengtao Zhou, Hao Chen

Comments: Accepted by ICCV2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1877] arXiv:2309.12862 (cross-list from cs.LG) [pdf, html, other]: Title: Associative Transformer

Yuwei Sun, Hideya Ochiai, Zhirong Wu, Stephen Lin, Ryota Kanai

Comments: Accepted for CVPR 2025

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1878] arXiv:2309.12953 (cross-list from eess.IV) [pdf, other]: Title: Inter-vendor harmonization of Computed Tomography (CT) reconstruction kernels using unpaired image translation

Aravind R. Krishnan, Kaiwen Xu, Thomas Li, Chenyu Gao, Lucas W. Remedios, Praitayini Kanakaraj, Ho Hin Lee, Shunxing Bao, Kim L. Sandler, Fabien Maldonado, Ivana Isgum, Bennett A. Landman

Comments: 10 pages, 6 figures, 1 table, Submitted to SPIE Medical Imaging : Image Processing. San Diego, CA. February 2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1879] arXiv:2309.12955 (cross-list from cs.CR) [pdf, other]: Title: On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures

Qingzhao Zhang, Shuowei Jin, Ruiyang Zhu, Jiachen Sun, Xumiao Zhang, Qi Alfred Chen, Z. Morley Mao

Comments: 18 pages, 24 figures, accepted by Usenix Security 2024

Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1880] arXiv:2309.12970 (cross-list from eess.IV) [pdf, other]: Title: PI-RADS v2 Compliant Automated Segmentation of Prostate Zones Using co-training Motivated Multi-task Dual-Path CNN

Arnab Das, Suhita Ghosh, Sebastian Stober

Comments: Authors Arnab Das and Suhita Ghosh contributed equally. Submitted in ISBI 2022

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1881] arXiv:2309.12996 (cross-list from cs.LG) [pdf, other]: Title: Point Cloud Network: An Order of Magnitude Improvement in Linear Layer Parameter Count

Charles Hetterich

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1882] arXiv:2309.13013 (cross-list from eess.IV) [pdf, other]: Title: Performance Analysis of UNet and Variants for Medical Image Segmentation

Walid Ehab, Yongmin Li

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1883] arXiv:2309.13041 (cross-list from cs.RO) [pdf, other]: Title: Robotic Offline RL from Internet Videos via Value-Function Pre-Training

Chethan Bhateja, Derek Guo, Dibya Ghosh, Anikait Singh, Manan Tomar, Quan Vuong, Yevgen Chebotar, Sergey Levine, Aviral Kumar

Comments: First three authors contributed equally

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1884] arXiv:2309.13150 (cross-list from cs.LG) [pdf, html, other]: Title: Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations

Hanjiang Hu, Zuxin Liu, Linyi Li, Jiacheng Zhu, Ding Zhao

Comments: Camera-ready version of AISTATS 2024, 30 pages, 5 figures, 13 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1885] arXiv:2309.13160 (cross-list from cs.LG) [pdf, html, other]: Title: How to train your VAE

Mariano Rivera

Comments: 5 pages, 3 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1886] arXiv:2309.13167 (cross-list from cs.LG) [pdf, other]: Title: Flow Factorized Representation Learning

Yue Song, T. Anderson Keller, Nicu Sebe, Max Welling

Comments: NeurIPS23

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1887] arXiv:2309.13181 (cross-list from cs.LG) [pdf, other]: Title: Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning

Lakshmi Narasimhan Govindarajan, Rex G Liu, Drew Linsley, Alekh Karkada Ashok, Max Reuter, Michael J Frank, Thomas Serre

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1888] arXiv:2309.13190 (cross-list from cs.LG) [pdf, other]: Title: Spatial-frequency channels, shape bias, and adversarial robustness

Ajay Subramanian, Elena Sizikova, Najib J. Majaj, Denis G. Pelli

Comments: Neural Information Processing Systems (NeurIPS) 2023 (Oral Presentation). Camera-ready version

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1889] arXiv:2309.13302 (cross-list from cs.NE) [pdf, html, other]: Title: Gaining the Sparse Rewards by Exploring Lottery Tickets in Spiking Neural Network

Hao Cheng, Jiahang Cao, Erjia Xiao, Mengshu Sun, Renjing Xu

Comments: This paper is accepted by IROS 2024

Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[1890] arXiv:2309.13303 (cross-list from cs.LG) [pdf, other]: Title: C$^2$VAE: Gaussian Copula-based VAE Differing Disentangled from Coupled Representations with Contrastive Posterior

Zhangkai Wu, Longbing Cao

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1891] arXiv:2309.13385 (cross-list from eess.IV) [pdf, other]: Title: Cine cardiac MRI reconstruction using a convolutional recurrent network with refinement

Yuyang Xue, Yuning Du, Gianluca Carloni, Eva Pachetti, Connor Jordan, Sotirios A. Tsaftaris

Comments: MICCAI STACOM workshop 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1892] arXiv:2309.13398 (cross-list from eess.IV) [pdf, other]: Title: A mirror-Unet architecture for PET/CT lesion segmentation

Yamila Rotstein Habarnau, Mauro Namías

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1893] arXiv:2309.13404 (cross-list from eess.IV) [pdf, html, other]: Title: Weakly Supervised YOLO Network for Surgical Instrument Localization in Endoscopic Videos

Rongfeng Wei, Jinlin Wu, Xuexue Bai, Ming Feng, Zhen Lei, Hongbin Liu, Zhen Chen

Comments: Accepted by ICRA 2024 Workshop on C4 Surgical Robotic Systems in the Embodied AI Era; Surgical Tool Localization in Endoscopic Videos Challenge of MICCAI2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1894] arXiv:2309.13411 (cross-list from cs.LG) [pdf, other]: Title: Towards Attributions of Input Variables in a Coalition

Xinhao Zheng, Huiqi Deng, Bo Fan, Quanshi Zhang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1895] arXiv:2309.13415 (cross-list from cs.LG) [pdf, other]: Title: Dream the Impossible: Outlier Imagination with Diffusion Models

Xuefeng Du, Yiyou Sun, Xiaojin Zhu, Yixuan Li

Comments: NeurIPS 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1896] arXiv:2309.13430 (cross-list from cs.CL) [pdf, other]: Title: Resolving References in Visually-Grounded Dialogue via Text Generation

Bram Willemsen, Livia Qian, Gabriel Skantze

Comments: Published at SIGDIAL 2023

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1897] arXiv:2309.13457 (cross-list from cs.LG) [pdf, other]: Title: Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data

Wai Tong Chung, Bassem Akoush, Pushan Sharma, Alex Tamkin, Ki Sung Jung, Jacqueline H. Chen, Jack Guo, Davy Brouzet, Mohsen Talei, Bruno Savard, Alexei Y. Poludnenko, Matthias Ihme

Comments: Accepted in Adv. in Neural Information Processing Systems 36 (NeurIPS 2023). Link: this https URL . 55 pages, 21 figures. Keywords: Super-resolution, 3D, Neural Scaling, Physics-informed Loss, Computational Fluid Dynamics, Partial Differential Equations, Turbulent Reacting Flows, Direct Numerical Simulation, Fluid Mechanics, Combustion, Computer Vision

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computational Physics (physics.comp-ph); Fluid Dynamics (physics.flu-dyn)
[1898] arXiv:2309.13475 (cross-list from cs.RO) [pdf, html, other]: Title: Detecting and Mitigating System-Level Anomalies of Vision-Based Controllers

Aryaman Gupta, Kaustav Chakraborty, Somil Bansal

Journal-ref: 2024/5/13 Conference 2024 IEEE International Conference on Robotics and Automation (ICRA) Pages 9953-9959 Publisher 2024 IEEE International Conference on Robotics and Automation (ICRA)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1899] arXiv:2309.13549 (cross-list from cs.RO) [pdf, other]: Title: Towards Robust Robot 3D Perception in Urban Environments: The UT Campus Object Dataset

Arthur Zhang, Chaitanya Eranki, Christina Zhang, Ji-Hwan Park, Raymond Hong, Pranav Kalyani, Lochana Kalyanaraman, Arsh Gamare, Arnav Bagad, Maria Esteva, Joydeep Biswas

Comments: 19 pages, 18 figures, 12 tables

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1900] arXiv:2309.13553 (cross-list from eess.IV) [pdf, other]: Title: Generalized Dice Focal Loss trained 3D Residual UNet for Automated Lesion Segmentation in Whole-Body FDG PET/CT Images

Shadab Ahamed, Arman Rahmim

Comments: AutoPET-II challenge (2023)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1901] arXiv:2309.13571 (cross-list from eess.IV) [pdf, other]: Title: Matrix Completion-Informed Deep Unfolded Equilibrium Models for Self-Supervised k-Space Interpolation in MRI

Chen Luo, Huayu Wang, Taofeng Xie, Qiyu Jin, Guoqing Chen, Zhuo-Xu Cui, Dong Liang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1902] arXiv:2309.13584 (cross-list from eess.IV) [pdf, other]: Title: Solving Low-Dose CT Reconstruction via GAN with Local Coherence

Wenjie Liu

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1903] arXiv:2309.13587 (cross-list from eess.IV) [pdf, other]: Title: Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape Reconstruction

Mahesh Shakya, Bishesh Khanal

Comments: accepted to NeurIPS 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1904] arXiv:2309.13742 (cross-list from cs.GR) [pdf, other]: Title: DROP: Dynamics Responses from Human Motion Prior and Projective Dynamics

Yifeng Jiang, Jungdam Won, Yuting Ye, C. Karen Liu

Comments: SIGGRAPH Asia 2023, Video this https URL, Website: this https URL

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1905] arXiv:2309.13745 (cross-list from cs.RO) [pdf, html, other]: Title: Overview of Computer Vision Techniques in Robotized Wire Harness Assembly: Current State and Future Opportunities

Hao Wang, Omkar Salunkhe, Walter Quadrini, Dan Lämkull, Fredrik Ore, Björn Johansson, Johan Stahre

Comments: Presented at the 56th CIRP Conference on Manufacturing Systems (CIRP CMS 2023), Cape Town, South Africa, 24-26 October 2023. Published in Procedia CIRP

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1906] arXiv:2309.13746 (cross-list from cs.RO) [pdf, other]: Title: Deep Learning-Based Connector Detection for Robotized Assembly of Automotive Wire Harnesses

Hao Wang, Björn Johansson

Comments: This paper has been accepted by IEEE CASE 2023 and has been presented on the conference. The information of the published version will be updated later

Journal-ref: 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), Auckland, New Zealand, 2023, pp. 1-8

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1907] arXiv:2309.13747 (cross-list from eess.IV) [pdf, html, other]: Title: Look Ma, no code: fine tuning nnU-Net for the AutoPET II challenge by only adjusting its JSON plans

Fabian Isensee, Klaus H.Maier-Hein

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1908] arXiv:2309.13770 (cross-list from cs.LG) [pdf, other]: Title: Devil in the Number: Towards Robust Multi-modality Data Filter

Yichen Xu, Zihan Xu, Wenhao Chai, Zhonghan Zhao, Enxin Song, Gaoang Wang

Comments: ICCV 2023 Workshop: TNGCV-DataComp

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1909] arXiv:2309.13773 (cross-list from cs.LG) [pdf, other]: Title: GHN-QAT: Training Graph Hypernetworks to Predict Quantization-Robust Parameters of Unseen Limited Precision Neural Networks

Stone Yun, Alexander Wong

Comments: Poster and extended abstract to be presented at the Workshop for Low Bit Quantized Neural Networks (LQBNN) @ ICCV 2023

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1910] arXiv:2309.13777 (cross-list from eess.IV) [pdf, other]: Title: Diffeomorphic Multi-Resolution Deep Learning Registration for Applications in Breast MRI

Matthew G. French, Gonzalo D. Maso Talou, Thiranja P. Babarenda Gamage, Martyn P. Nash, Poul M. Nielsen, Anthony J. Doyle, Juan Eugenio Iglesias, Yaël Balbastre, Sean I. Young

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1911] arXiv:2309.13817 (cross-list from eess.IV) [pdf, other]: Title: MMA-Net: Multiple Morphology-Aware Network for Automated Cobb Angle Measurement

Zhengxuan Qiu, Jie Yang, Jiankun Wang

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1912] arXiv:2309.13835 (cross-list from eess.IV) [pdf, html, other]: Title: IBVC: Interpolation-driven B-frame Video Compression

Chenming Xu, Meiqin Liu, Chao Yao, Weisi Lin, Yao Zhao

Comments: Submitted to Pattern Recognition

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1913] arXiv:2309.13839 (cross-list from eess.IV) [pdf, other]: Title: Fill the K-Space and Refine the Image: Prompting for Dynamic and Multi-Contrast MRI Reconstruction

Bingyu Xin, Meng Ye, Leon Axel, Dimitris N. Metaxas

Comments: STACOM 2023; Code is available at this https URL

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1914] arXiv:2309.13842 (cross-list from cs.RO) [pdf, other]: Title: Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory

Xin Zheng, Jianke Zhu

Comments: Video this https URL and Project site this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1915] arXiv:2309.13866 (cross-list from cs.LG) [pdf, other]: Title: On Calibration of Modern Quantized Efficient Neural Networks

Joey Kuang, Alexander Wong

Comments: Accepted as an extended abstract at the ICCV 2023 Workshop on Low-Bit Quantized Neural Networks. Corrected some typos

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1916] arXiv:2309.13872 (cross-list from eess.IV) [pdf, other]: Title: Attention and Pooling based Sigmoid Colon Segmentation in 3D CT images

Md Akizur Rahman, Sonit Singh, Kuruparan Shanmugalingam, Sankaran Iyer, Alan Blair, Praveen Ravindran, Arcot Sowmya

Comments: 8 Pages, 6 figures, Accepted at IEEE DICTA 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1917] arXiv:2309.13885 (cross-list from cs.LG) [pdf, html, other]: Title: TouchUp-G: Improving Feature Representation through Graph-Centric Finetuning

Jing Zhu, Xiang Song, Vassilis N. Ioannidis, Danai Koutra, Christos Faloutsos

Comments: SIGIR 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
[1918] arXiv:2309.13893 (cross-list from cs.RO) [pdf, html, other]: Title: Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments

Bernard Lange, Jiachen Li, Mykel J. Kochenderfer

Comments: Accepted to 2024 IEEE International Conference on Robotics and Automation (ICRA)

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1919] arXiv:2309.13980 (cross-list from eess.IV) [pdf, other]: Title: Better Generalization of White Matter Tract Segmentation to Arbitrary Datasets with Scaled Residual Bootstrap

Wan Liu, Chuyang Ye

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1920] arXiv:2309.14054 (cross-list from cs.LG) [pdf, html, other]: Title: Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks

Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A.P

Comments: Accepted at Transactions on Machine Learning Research (TMLR)

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1921] arXiv:2309.14068 (cross-list from cs.LG) [pdf, html, other]: Title: Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models

Yangming Li, Boris van Breugel, Mihaela van der Schaar

Comments: Accepted by ICLR-2024

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1922] arXiv:2309.14090 (cross-list from cs.LG) [pdf, other]: Title: Convolutional autoencoder-based multimodal one-class classification

Firas Laakom, Fahad Sohrab, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj

Comments: 5 pages, 1 figure, 4 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1923] arXiv:2309.14198 (cross-list from cs.LG) [pdf, other]: Title: (Predictable) Performance Bias in Unsupervised Anomaly Detection

Felix Meissen, Svenja Breuer, Moritz Knolle, Alena Buyx, Ruth Müller, Georgios Kaissis, Benedikt Wiestler, Daniel Rückert

Comments: 11 pages, 5 Figures, 1 panel

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Image and Video Processing (eess.IV)
[1924] arXiv:2309.14211 (cross-list from cs.RO) [pdf, other]: Title: QuadricsNet: Learning Concise Representation for Geometric Primitives in Point Clouds

Ji Wu, Huai Yu, Wen Yang, Gui-Song Xia

Comments: Submitted to ICRA 2024. 7 pages

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1925] arXiv:2309.14236 (cross-list from cs.RO) [pdf, html, other]: Title: MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation

Patrick Lancaster, Nicklas Hansen, Aravind Rajeswaran, Vikash Kumar

Comments: 10 pages, 8 figures

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1926] arXiv:2309.14265 (cross-list from cs.RO) [pdf, html, other]: Title: Industrial Application of 6D Pose Estimation for Robotic Manipulation in Automotive Internal Logistics

Philipp Quentin, Dino Knoll, Daniel Goehring

Comments: Accepted for publication at IEEE International Conference on Automation Science and Engineering (CASE 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1927] arXiv:2309.14306 (cross-list from eess.IV) [pdf, other]: Title: DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning

Qingjie Meng, Wenjia Bai, Declan P O'Regan, and Daniel Rueckert

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1928] arXiv:2309.14329 (cross-list from cs.HC) [pdf, other]: Title: Innovative Digital Storytelling with AIGC: Exploration and Discussion of Recent Advances

Rongzhang Gu, Hui Li, Changyue Su, Wayne Wu

Comments: Project page: this https URL

Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[1929] arXiv:2309.14341 (cross-list from cs.RO) [pdf, other]: Title: Extreme Parkour with Legged Robots

Xuxin Cheng, Kexin Shi, Ananye Agarwal, Deepak Pathak

Comments: Website and videos at this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1930] arXiv:2309.14356 (cross-list from cs.LG) [pdf, other]: Title: COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs

Tiep Le, Vasudev Lal, Phillip Howard

Comments: Accepted to NeurIPS 2023 Datasets and Benchmarks Track

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1931] arXiv:2309.14360 (cross-list from cs.LG) [pdf, other]: Title: Domain-Guided Conditional Diffusion Model for Unsupervised Domain Adaptation

Yulong Zhang, Shuhao Chen, Weisen Jiang, Yu Zhang, Jiangang Lu, James T. Kwok

Comments: Work in progress

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1932] arXiv:2309.14392 (cross-list from eess.IV) [pdf, other]: Title: Unveiling Fairness Biases in Deep Learning-Based Brain MRI Reconstruction

Yuning Du, Yuyang Xue, Rohan Dharmakumar, Sotirios A. Tsaftaris

Comments: Accepted for publication at FAIMI 2023 (Fairness of AI in Medical Imaging) at MICCAI

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1933] arXiv:2309.14425 (cross-list from cs.RO) [pdf, other]: Title: Self-Recovery Prompting: Promptable General Purpose Service Robot System with Foundation Models and Self-Recovery

Mimo Shirasaka, Tatsuya Matsushima, Soshi Tsunashima, Yuya Ikeda, Aoi Horo, So Ikoma, Chikaha Tsuji, Hikaru Wada, Tsunekazu Omija, Dai Komukai, Yutaka Matsuo Yusuke Iwasawa

Comments: Website: this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1934] arXiv:2309.14474 (cross-list from eess.IV) [pdf, other]: Title: Gastro-Intestinal Tract Segmentation Using an Explainable 3D Unet

Kai Li, Jonathan Chan

Comments: 5 pages, 8 figures, 13th Joint Symposium on Computational Intelligence (JSCI13)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1935] arXiv:2309.14483 (cross-list from astro-ph.SR) [pdf, other]: Title: Unveiling the Potential of Deep Learning Models for Solar Flare Prediction in Near-Limb Regions

Chetraj Pandey, Rafal A. Angryk, Berkay Aydin

Comments: This is a preprint accepted at the 22nd International Conference on Machine Learning and Applications (ICMLA), 2023. 7 Pages, 6 Figures

Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1936] arXiv:2309.14492 (cross-list from eess.IV) [pdf, other]: Title: AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers

Alex Ranne, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena

Comments: This work has been submitted to the IEEE for possible publication

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1937] arXiv:2309.14540 (cross-list from cs.LG) [pdf, other]: Title: Effect of roundabout design on the behavior of road users: A case study of roundabouts with application of Unsupervised Machine Learning

Tasnim M. Dwekat, Ayda A. Almsre, Huthaifa I. Ashqar

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1938] arXiv:2309.14550 (cross-list from eess.IV) [pdf, html, other]: Title: MEMO: Dataset and Methods for Robust Multimodal Retinal Image Registration with Large or Small Vessel Density Differences

Chiao-Yi Wang, Faranguisse Kakhi Sadrieh, Yi-Ting Shen, Shih-En Chen, Sarah Kim, Victoria Chen, Achyut Raghavendra, Dongyi Wang, Osamah Saeedi, Yang Tao

Comments: Biomedical Optics Express

Journal-ref: Biomed. Opt. Express 15, 3457-3479 (2024)

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1939] arXiv:2309.14580 (cross-list from cs.LG) [pdf, other]: Title: CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss

Rakshith Sharma Srinivasa, Jaejin Cho, Chouchang Yang, Yashas Malur Saidutta, Ching-Hua Lee, Yilin Shen, Hongxia Jin

Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2023 conference

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1940] arXiv:2309.14586 (cross-list from cs.SD) [pdf, other]: Title: Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer

Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo

Comments: MICCAI 2023 (Oral presentation)

Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[1941] arXiv:2309.14591 (cross-list from eess.IV) [pdf, other]: Title: Applications of Sequential Learning for Medical Image Classification

Sohaib Naim, Brian Caffo, Haris I Sair, Craig K Jones

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1942] arXiv:2309.14630 (cross-list from econ.EM) [pdf, html, other]: Title: Free Discontinuity Regression: With an Application to the Economic Effects of Internet Shutdowns

Florian Gunsilius, David Van Dijcke

Comments: 24 pages, 3 figures, 2 tables; authors listed alphabetically; code available at this https URL

Subjects: Econometrics (econ.EM); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Applications (stat.AP); Methodology (stat.ME)
[1943] arXiv:2309.14655 (cross-list from cs.RO) [pdf, html, other]: Title: Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter

Hsu-kuang Chiu, Chien-Yi Wang, Min-Hung Chen, Stephen F. Smith

Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA), 2024. Code: this https URL Video: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1944] arXiv:2309.14685 (cross-list from cs.RO) [pdf, html, other]: Title: DriveSceneGen: Generating Diverse and Realistic Driving Scenarios from Scratch

Shuo Sun, Zekai Gu, Tianchen Sun, Jiawei Sun, Chengran Yuan, Yuhang Han, Dongen Li, Marcelo H. Ang Jr

Comments: 8 pages, 5 figures, 2 tables

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1945] arXiv:2309.14737 (cross-list from cs.RO) [pdf, html, other]: Title: Volumetric Semantically Consistent 3D Panoptic Mapping

Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath

Comments: 8 pages, 2 figures

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1946] arXiv:2309.14759 (cross-list from cs.GR) [pdf, other]: Title: Diffusion-based Holistic Texture Rectification and Synthesis

Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui

Comments: SIGGRAPH Asia 2023 Conference Paper

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1947] arXiv:2309.14774 (cross-list from cs.LG) [pdf, other]: Title: BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning

Ching-Yu Chiang, I-Hua Chang, Shih-Wei Liao

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1948] arXiv:2309.14816 (cross-list from cs.LG) [pdf, other]: Title: A Comparative Study of Population-Graph Construction Methods and Graph Neural Networks for Brain Age Regression

Kyriaki-Margarita Bintsi, Tamara T. Mueller, Sophie Starck, Vasileios Baltatzis, Alexander Hammers, Daniel Rueckert

Comments: Accepted at GRAIL, MICCAI 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1949] arXiv:2309.14949 (cross-list from cs.LG) [pdf, html, other]: Title: Towards Real-World Test-Time Adaptation: Tri-Net Self-Training with Balanced Normalization

Yongyi Su, Xun Xu, Kui Jia

Comments: Accepted by AAAI 2024. 19 pages, 7 figures and 22 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1950] arXiv:2309.15038 (cross-list from cs.LG) [pdf, html, other]: Title: HPCR: Holistic Proxy-based Contrastive Replay for Online Continual Learning

Huiwei Lin, Shanshan Feng, Baoquan Zhang, Xutao Li, Yunming Ye

Comments: 15 pages, 10 figures

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1951] arXiv:2309.15048 (cross-list from cs.LG) [pdf, html, other]: Title: Class Incremental Learning via Likelihood Ratio Based Task Prediction

Haowei Lin, Yijia Shao, Weinan Qian, Ningxin Pan, Yiduo Guo, Bing Liu

Journal-ref: ICLR 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1952] arXiv:2309.15065 (cross-list from cs.RO) [pdf, html, other]: Title: Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding

Christina Kassab, Matias Mattamala, Lintong Zhang, Maurice Fallon

Comments: Accepted at ICRA 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1953] arXiv:2309.15135 (cross-list from cs.LG) [pdf, html, other]: Title: Contrastive Continual Multi-view Clustering with Filtered Structural Fusion

Xinhang Wan, Jiyuan Liu, Hao Yu, Ao Li, Xinwang Liu, Ke Liang, Zhibin Dong, En Zhu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1954] arXiv:2309.15216 (cross-list from cs.LG) [pdf, other]: Title: A Comparative Study of Filters and Deep Learning Models to predict Diabetic Retinopathy

Roshan Vasu Muddaluru, Sharvaani Ravikumar Thoguluva, Shruti Prabha, Tanuja Konda Reddy, Suja Palaniswamy

Comments: 6 pages, 5 figures, I2CT , 2 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1955] arXiv:2309.15243 (cross-list from eess.IV) [pdf, other]: Title: APIS: A paired CT-MRI dataset for ischemic stroke segmentation challenge

Santiago Gómez, Daniel Mantilla, Gustavo Garzón, Edgar Rangel, Andrés Ortiz, Franklin Sierra-Jerez, Fabio Martínez

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[1956] arXiv:2309.15245 (cross-list from cs.AI) [pdf, other]: Title: SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets

Daria Reshetova, Swetava Ganguli, C. V. Krishnakumar Iyer, Vipul Pandey

Comments: Extended version of the accepted research track paper at the 31st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2023), Hamburg, Germany. 11 pages, 8 figures, 6 tables

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1957] arXiv:2309.15259 (cross-list from quant-ph) [pdf, other]: Title: SLIQ: Quantum Image Similarity Networks on Noisy Quantum Computers

Daniel Silver, Tirthak Patel, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari

Journal-ref: Vol. 37 No. 8: AAAI-2023 Technical Tracks 8

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1958] arXiv:2309.15268 (cross-list from cs.RO) [pdf, other]: Title: ObVi-SLAM: Long-Term Object-Visual SLAM

Amanda Adkins, Taijing Chen, Joydeep Biswas

Comments: 8 pages, 7 figures, 1 table plus appendix with 4 figures and 1 table

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1959] arXiv:2309.15278 (cross-list from cs.RO) [pdf, html, other]: Title: Out of Sight, Still in Mind: Reasoning and Planning about Unobserved Objects with Video Tracking Enabled Memory Models

Yixuan Huang, Jialin Yuan, Chanho Kim, Pupul Pradhan, Bryan Chen, Li Fuxin, Tucker Hermans

Comments: Presented at IEEE Conference on Robotics and Automation (ICRA) 2024. Website: this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1960] arXiv:2309.15302 (cross-list from cs.RO) [pdf, other]: Title: STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience

Haresh Karnan, Elvin Yang, Daniel Farkash, Garrett Warnell, Joydeep Biswas, Peter Stone

Comments: Project website: this https URL

Journal-ref: Conference on Robot Learning (CoRL 2023)

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1961] arXiv:2309.15314 (cross-list from physics.med-ph) [pdf, other]: Title: Conversion of single-energy computed tomography to parametric maps of dual-energy computed tomography using convolutional neural network

Sangwook Kim, Jimin Lee, Jungye Kim, Bitbyeol Kim, Chang Heon Choi, Seongmoon Jung

Comments: 29 pages, 17 figures

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1962] arXiv:2309.15332 (cross-list from cs.RO) [pdf, other]: Title: Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms

Hanzhe Teng, Yipeng Wang, Xiaoao Song, Konstantinos Karydis

Comments: Accepted to the 18th International Symposium on Visual Computing (ISVC 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1963] arXiv:2309.15420 (cross-list from cs.LG) [pdf, other]: Title: The Triad of Failure Modes and a Possible Way Out

Emanuele Sansone

Comments: Some sentences in the Background Section are overlapping with Section 2 in arXiv:2304.11357 However, the main technical content and all other sections are different

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1964] arXiv:2309.15459 (cross-list from cs.RO) [pdf, html, other]: Title: GAMMA: Graspability-Aware Mobile MAnipulation Policy Learning based on Online Grasping Pose Fusion

Jiazhao Zhang, Nandiraju Gireesh, Jilong Wang, Xiaomeng Fang, Chaoyi Xu, Weiguang Chen, Liu Dai, He Wang

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1965] arXiv:2309.15477 (cross-list from cs.GR) [pdf, other]: Title: A Tutorial on Uniform B-Spline

Yi Zhou

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1966] arXiv:2309.15485 (cross-list from eess.IV) [pdf, other]: Title: Style Transfer and Self-Supervised Learning Powered Myocardium Infarction Super-Resolution Segmentation

Lichao Wang, Jiahao Huang, Xiaodan Xing, Yinzhe Wu, Ramyah Rajakulasingam, Andrew D. Scott, Pedro F Ferreira, Ranil De Silva, Sonia Nielles-Vallespin, Guang Yang

Comments: 6 pages, 8 figures, conference, accepted by SIPAIM2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1967] arXiv:2309.15516 (cross-list from cs.CL) [pdf, html, other]: Title: Teaching Text-to-Image Models to Communicate in Dialog

Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1968] arXiv:2309.15520 (cross-list from cs.LG) [pdf, other]: Title: SAF-Net: Self-Attention Fusion Network for Myocardial Infarction Detection using Multi-View Echocardiography

Ilke Adalioglu, Mete Ahishali, Aysen Degerli, Serkan Kiranyaz, Moncef Gabbouj

Comments: 4 pages, 3 figures, Computing in Cardiology (CinC) 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1969] arXiv:2309.15521 (cross-list from cs.LG) [pdf, other]: Title: MLOps for Scarce Image Data: A Use Case in Microscopic Image Analysis

Angelo Yamachui Sitcheu, Nils Friederich, Simon Baeuerle, Oliver Neumann, Markus Reischl, Ralf Mikut

Comments: 21 pages, 5 figures , 33. Workshop on Computational Intelligence Berlin Germany

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1970] arXiv:2309.15529 (cross-list from eess.IV) [pdf, other]: Title: Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data

Muyu Wang, Shiyu Fan, Yichen Li, Hui Chen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1971] arXiv:2309.15551 (cross-list from cs.LG) [pdf, html, other]: Title: DeepRepViz: Identifying Confounders in Deep Learning Model Predictions

Roshan Prakash Rane, JiHoon Kim, Arjun Umesha, Didem Stark, Marc-André Schulz, Kerstin Ritter

Journal-ref: MICCAI 2024. Lecture Notes in Computer Science, vol 15010. pp 186 to 196. Springer, Cham

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1972] arXiv:2309.15564 (cross-list from cs.LG) [pdf, other]: Title: Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1973] arXiv:2309.15573 (cross-list from cs.CG) [pdf, other]: Title: The Maximum Cover with Rotating Field of View

Igor Potapov, Jason Ralph, Theofilos Triommatis

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Algebraic Geometry (math.AG)
[1974] arXiv:2309.15596 (cross-list from cs.RO) [pdf, other]: Title: PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Shizhe Chen, Ricardo Garcia, Cordelia Schmid, Ivan Laptev

Comments: Accepted to CoRL 2023. Project website: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1975] arXiv:2309.15608 (cross-list from eess.IV) [pdf, html, other]: Title: NoSENSE: Learned unrolled cardiac MRI reconstruction without explicit sensitivity maps

Felix Frederik Zimmermann, Andreas Kofler

Comments: Accepted at MICCAI STACOM 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1976] arXiv:2309.15638 (cross-list from eess.IV) [pdf, html, other]: Title: RSF-Conv: Rotation-and-Scale Equivariant Fourier Parameterized Convolution for Retinal Vessel Segmentation

Zihong Sun, Hong Wang, Qi Xie, Yefeng Zheng, Deyu Meng

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1977] arXiv:2309.15696 (cross-list from cs.LG) [pdf, other]: Title: A Unified View of Differentially Private Deep Generative Modeling

Dingfan Chen, Raouf Kerkouche, Mario Fritz

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1978] arXiv:2309.15750 (cross-list from eess.IV) [pdf, other]: Title: Automated CT Lung Cancer Screening Workflow using 3D Camera

Brian Teixeira, Vivek Singh, Birgi Tamersoy, Andreas Prokein, Ankur Kapoor

Comments: Accepted at MICCAI 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1979] arXiv:2309.15792 (cross-list from quant-ph) [pdf, html, other]: Title: Quantum Block-Matching Algorithm using Dissimilarity Measure

M. Martínez-Felipe, J. Montiel-Pérez, V. Onofre, A. Maldonado-Romo, Ricky Young

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[1980] arXiv:2309.15889 (cross-list from eess.IV) [pdf, html, other]: Title: High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models

Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz

Comments: 6 pages, 5 figures. Published at INFOCOM 2024 Workshops

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[1981] arXiv:2309.15940 (cross-list from cs.RO) [pdf, other]: Title: Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs

Haonan Chang, Kowndinya Boyalakuntla, Shiyang Lu, Siwei Cai, Eric Jing, Shreesh Keskar, Shijie Geng, Adeeb Abbas, Lifeng Zhou, Kostas Bekris, Abdeslam Boularias

Comments: The code and dataset used for evaluation can be found at this https URL}{this https URL. This paper has been accepted by CoRL2023

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1982] arXiv:2309.15977 (cross-list from cs.SD) [pdf, other]: Title: Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields

Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1983] arXiv:2309.16053 (cross-list from eess.IV) [pdf, other]: Title: Diagnosis of Helicobacter pylori using AutoEncoders for the Detection of Anomalous Staining Patterns in Immunohistochemistry Images

Pau Cano, Álvaro Caravaca, Debora Gil, Eva Musulen

Comments: 9 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1984] arXiv:2309.16058 (cross-list from cs.LG) [pdf, other]: Title: AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1985] arXiv:2309.16118 (cross-list from cs.RO) [pdf, html, other]: Title: D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement

Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li

Comments: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1986] arXiv:2309.16140 (cross-list from cs.MM) [pdf, other]: Title: CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting

Shaoxiang Guo, Qing Cai, Lin Qi, Junyu Dong

Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1987] arXiv:2309.16143 (cross-list from cs.LG) [pdf, other]: Title: Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples

Shin'ya Yamaguchi

Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1988] arXiv:2309.16164 (cross-list from cs.RO) [pdf, other]: Title: Learning to Terminate in Object Navigation

Yuhang Song, Anh Nguyen, Chun-Yi Lee

Comments: 16 pages

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1989] arXiv:2309.16206 (cross-list from eess.IV) [pdf, other]: Title: Alzheimer's Disease Prediction via Brain Structural-Functional Deep Fusing Network

Qiankun Zuo, Junren Pan, Shuqiang Wang

Comments: 10 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1990] arXiv:2309.16210 (cross-list from eess.IV) [pdf, other]: Title: Abdominal multi-organ segmentation in CT using Swinunter

Mingjin Chen, Yongkang He, Yongyi Lu

Comments: 8pages. arXiv admin note: text overlap with arXiv:2201.01266 by other authors

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1991] arXiv:2309.16221 (cross-list from cs.RO) [pdf, other]: Title: Off-the-shelf bin picking workcell with visual pose estimation: A case study on the world robot summit 2018 kitting task

Frederik Hagelskjær, Kasper Høj Lorenzen, Dirk Kraft

Comments: 7 pages, 7 figures, 2 tables

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1992] arXiv:2309.16264 (cross-list from cs.RO) [pdf, html, other]: Title: GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects

Qiaojun Yu, Junbo Wang, Wenhai Liu, Ce Hao, Liu Liu, Lin Shao, Weiming Wang, Cewu Lu

Comments: 8 pages, 5 figures, ICRA 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1993] arXiv:2309.16354 (cross-list from cs.LG) [pdf, html, other]: Title: Transformer-VQ: Linear-Time Transformers via Vector Quantization

Lucas D. Lingle

Comments: ICLR 2024 camera-ready

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1994] arXiv:2309.16536 (cross-list from eess.IV) [pdf, other]: Title: Uncertainty Quantification for Eosinophil Segmentation

Kevin Lin, Donald Brown, Sana Syed, Adam Greene

Comments: Preprint, Final Article Submitted to ICBRA 2023 and will be published in the International Conference Proceedings by ACM, Association for Computing Machinery (ISBN: 979-8-4007-0815-2), which will be archived in ACM Digital Library, indexed by Ei Compendex and Scopus

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1995] arXiv:2309.16569 (cross-list from cs.SD) [pdf, other]: Title: Audio-Visual Speaker Verification via Joint Cross-Attention

R. Gnana Praveen, Jahangir Alam

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1996] arXiv:2309.16627 (cross-list from eess.IV) [pdf, other]: Title: Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images

Shreyas H Ramananda, Vaanathi Sundaresan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1997] arXiv:2309.16633 (cross-list from cs.LG) [pdf, html, other]: Title: SupReMix: Supervised Contrastive Learning for Medical Imaging Regression with Mixup

Yilei Wu, Zijian Dong, Chongyao Chen, Wangchunshu Zhou, Juan Helen Zhou

Comments: The first two authors equally contributed to this work. Previously titled "Mixup Your Own Pair", content extended and revised

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1998] arXiv:2309.16650 (cross-list from cs.RO) [pdf, other]: Title: ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning

Qiao Gu, Alihusein Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull

Comments: Project page: this https URL Explainer video: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1999] arXiv:2309.16702 (cross-list from cs.AI) [pdf, other]: Title: Prediction and Interpretation of Vehicle Trajectories in the Graph Spectral Domain

Marion Neumeier, Sebastian Dorn, Michael Botsch, Wolfgang Utschick

Comments: Accepted as a conference paper for IEEE ITSC 2023, Bilbao, Spain

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2000] arXiv:2309.16704 (cross-list from q-bio.NC) [pdf, other]: Title: Memories in the Making: Predicting Video Memorability with Encoding Phase EEG

Lorin Sweeney, Graham Healy, Alan F. Smeaton

Comments: Content-Based Multimedia Indexing, CBMI, September 20-22, Orleans, France, 2023

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)

Total of 2022 entries : 1-500 501-1000 1001-1500 1501-2000 2001-2022

Showing up to 500 entries per page: fewer | more | all