Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries : 1-500 501-1000 1001-1500 1501-2000 2001-2022
Showing up to 500 entries per page: fewer | more | all
[1501] arXiv:2309.16949 [pdf, other]
Title: CrossZoom: Simultaneously Motion Deblurring and Event Super-Resolving
Chi Zhang, Xiang Zhang, Mingyuan Lin, Cheng Li, Chu He, Wen Yang, Gui-Song Xia, Lei Yu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1502] arXiv:2309.16956 [pdf, other]
Title: Model2Scene: Learning 3D Scene Representation via Contrastive Language-CAD Models Pre-training
Runnan Chen, Xinge Zhu, Nenglun Chen, Dawei Wang, Wei Li, Yuexin Ma, Ruigang Yang, Tongliang Liu, Wenping Wang
Comments: arXiv admin note: substantial text overlap with arXiv:2203.10546
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1503] arXiv:2309.16959 [pdf, other]
Title: COMNet: Co-Occurrent Matching for Weakly Supervised Semantic Segmentation
Yukun Su, Jingliang Deng, Zonghan Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1504] arXiv:2309.16964 [pdf, html, other]
Title: AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi
Yunjiao Zhou, Jianfei Yang, He Huang, Lihua Xie
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1505] arXiv:2309.16967 [pdf, html, other]
Title: nnSAM: Plug-and-play Segment Anything Model Improves nnUNet Performance
Yunxiang Li, Bowen Jing, Zihan Li, Jing Wang, You Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1506] arXiv:2309.16968 [pdf, other]
Title: Synthetic Data Generation and Deep Learning for the Topological Analysis of 3D Data
Dylan Peek, Matt P. Skerritt, Stephan Chalup
Comments: 8 pages, 7 figures, Dicta 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1507] arXiv:2309.16975 [pdf, other]
Title: Perceptual Tone Mapping Model for High Dynamic Range Imaging
Imran Mehmood, Xinye Shi, M. Usman Khan, Ming Ronnier Luo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1508] arXiv:2309.16987 [pdf, other]
Title: SpikeMOT: Event-based Multi-Object Tracking with Sparse Motion Features
Song Wang, Zhu Wang, Can Li, Xiaojuan Qi, Hayden Kwok-Hay So
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1509] arXiv:2309.16992 [pdf, html, other]
Title: Segment Anything Model is a Good Teacher for Local Feature Learning
Jingqian Wu, Rongtao Xu, Zach Wood-Doughty, Changwei Wang, Shibiao Xu, Edmund Y. Lam
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1510] arXiv:2309.17024 [pdf, other]
Title: HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel Joshi, Marc Pollefeys
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1511] arXiv:2309.17031 [pdf, other]
Title: Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process
Zhuo Zheng, Shiqi Tian, Ailong Ma, Liangpei Zhang, Yanfei Zhong
Comments: ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1512] arXiv:2309.17033 [pdf, other]
Title: Unveiling Document Structures with YOLOv5 Layout Detection
Herman Sugiharto, Yorissa Silviana, Yani Siti Nurpazrin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1513] arXiv:2309.17051 [pdf, other]
Title: On Uniform Scalar Quantization for Learned Image Compression
Haotian Zhang, Li Li, Dong Liu
Comments: 30 pages, 19 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1514] arXiv:2309.17054 [pdf, other]
Title: A 5-Point Minimal Solver for Event Camera Relative Motion Estimation
Ling Gao, Hang Su, Daniel Gehrig, Marco Cannici, Davide Scaramuzza, Laurent Kneip
Journal-ref: IEEE/CVF International Conference on Computer Vision (ICCV), 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1515] arXiv:2309.17058 [pdf, other]
Title: Imagery Dataset for Condition Monitoring of Synthetic Fibre Ropes
Anju Rani, Daniel O. Arroyo, Petar Durdevic
Comments: 7 pages, 3 figures, database
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1516] arXiv:2309.17059 [pdf, html, other]
Title: GSDC Transformer: An Efficient and Effective Cue Fusion for Monocular Multi-Frame Depth Estimation
Naiyu Fang, Lemiao Qiu, Shuyou Zhang, Zili Wang, Zheyuan Zhou, Kerui Hu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1517] arXiv:2309.17074 [pdf, html, other]
Title: AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
Shengkun Tang, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1518] arXiv:2309.17080 [pdf, other]
Title: GAIA-1: A Generative World Model for Autonomous Driving
Anthony Hu, Lloyd Russell, Hudson Yeo, Zak Murez, George Fedoseev, Alex Kendall, Jamie Shotton, Gianluca Corrado
Comments: Technical Report
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
[1519] arXiv:2309.17083 [pdf, other]
Title: SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning
Risa Shinoda, Ryo Hayamizu, Kodai Nakashima, Nakamasa Inoue, Rio Yokota, Hirokatsu Kataoka
Comments: ICCV2023. Code: this https URL, Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1520] arXiv:2309.17093 [pdf, html, other]
Title: Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval
Hao Li, Jingkuan Song, Lianli Gao, Xiaosu Zhu, Heng Tao Shen
Comments: Accepted to NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1521] arXiv:2309.17102 [pdf, other]
Title: Guiding Instruction-based Image Editing via Multimodal Large Language Models
Tsu-Jui Fu, Wenze Hu, Xianzhi Du, William Yang Wang, Yinfei Yang, Zhe Gan
Comments: ICLR'24 (Spotlight) ; Project at this https URL ; Code at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1522] arXiv:2309.17104 [pdf, other]
Title: Prototype-guided Cross-modal Completion and Alignment for Incomplete Text-based Person Re-identification
Tiantian Gong, Guodong Du, Junsheng Wang, Yongkang Ding, Liyan Zhang
Comments: Sorry, some collaborators do not agree to publish it on Arxiv, so please withdraw this paper
Journal-ref: ACM International Conference on Multimedia 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1523] arXiv:2309.17105 [pdf, html, other]
Title: Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling
Yuan-Ming Li, Ling-An Zeng, Jing-Ke Meng, Wei-Shi Zheng
Comments: 16 pages, 8 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1524] arXiv:2309.17123 [pdf, other]
Title: Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining
Tianyu Han, Laura Žigutytė, Luisa Huck, Marc Huppertz, Robert Siepmann, Yossi Gandelsman, Christian Blüthgen, Firas Khader, Christiane Kuhl, Sven Nebelung, Jakob Kather, Daniel Truhn
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1525] arXiv:2309.17128 [pdf, other]
Title: HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field
Xiaochen Zhao, Lizhen Wang, Jingxiang Sun, Hongwen Zhang, Jinli Suo, Yebin Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1526] arXiv:2309.17143 [pdf, other]
Title: Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head
Qian Wu, Si Yong Yeo, Yufei Chen, Jun Liu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1527] arXiv:2309.17144 [pdf, other]
Title: Prototype Generation: Robust Feature Visualisation for Data Independent Interpretability
Arush Tagade, Jessica Rumbelow
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[1528] arXiv:2309.17162 [pdf, other]
Title: APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds
Weijie Wei, Martin R. Oswald, Fatemeh Karimi Nejadasl, Theo Gevers
Comments: Accepted by ICCV Workshop 2023 and selected as an oral
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1529] arXiv:2309.17164 [pdf, html, other]
Title: Retail-786k: a Large-Scale Dataset for Visual Entity Matching
Bianca Lamm (1 and 2), Janis Keuper (1) ((1) IMLA, Offenburg University, (2) Markant Services International GmbH)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1530] arXiv:2309.17166 [pdf, html, other]
Title: Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation
Zhan Xiong, Junling He, Pieter Valkema, Tri Q. Nguyen, Maarten Naesens, Jesper Kers, Fons J. Verbeek
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1531] arXiv:2309.17172 [pdf, other]
Title: Domain-Adaptive Learning: Unsupervised Adaptation for Histology Images with Improved Loss Function Combination
Ravi Kant Gupta, Shounak Das, Amit Sethi
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1532] arXiv:2309.17175 [pdf, html, other]
Title: TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Tianyu Huang, Yihan Zeng, Bowen Dong, Hang Xu, Songcen Xu, Rynson W.H. Lau, Wangmeng Zuo
Comments: Accepted by ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1533] arXiv:2309.17187 [pdf, html, other]
Title: TBD Pedestrian Data Collection: Towards Rich, Portable, and Large-Scale Natural Pedestrian Data
Allan Wang, Daisuke Sato, Yasser Corzo, Sonya Simkin, Abhijat Biswas, Aaron Steinfeld
Comments: This work has been accepted by IEEE ICRA 2024. arXiv admin note: substantial text overlap with arXiv:2203.01974
Subjects: Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Robotics (cs.RO)
[1534] arXiv:2309.17190 [pdf, other]
Title: PARF: Primitive-Aware Radiance Fusion for Indoor Scene Novel View Synthesis
Haiyang Ying, Baowei Jiang, Jinzhi Zhang, Di Xu, Tao Yu, Qionghai Dai, Lu Fang
Comments: Accepted to ICCV 2023; Project page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1535] arXiv:2309.17205 [pdf, other]
Title: Towards Complex-query Referring Image Segmentation: A Novel Benchmark
Wei Ji, Li Li, Hao Fei, Xiangyan Liu, Xun Yang, Juncheng Li, Roger Zimmermann
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1536] arXiv:2309.17211 [pdf, html, other]
Title: Data-Free Dynamic Compression of CNNs for Tractable Efficiency
Lukas Meiner, Jens Mehnert, Alexandru Paul Condurache
Comments: Accepted at VISAPP 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1537] arXiv:2309.17218 [pdf, other]
Title: When Epipolar Constraint Meets Non-local Operators in Multi-View Stereo
Tianqi Liu, Xinyi Ye, Weiyue Zhao, Zhiyu Pan, Min Shi, Zhiguo Cao
Comments: ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1538] arXiv:2309.17239 [pdf, other]
Title: EGVD: Event-Guided Video Deraining
Yueyi Zhang, Jin Wang, Wenming Weng, Xiaoyan Sun, Zhiwei Xiong
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1539] arXiv:2309.17257 [pdf, other]
Title: A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong, Manuel Martin, Michael Voit, Juergen Gall, Jürgen Beyerer
Comments: Submitted to TPAMI
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1540] arXiv:2309.17261 [pdf, html, other]
Title: Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors
Yukang Lin, Haonan Han, Chaoqun Gong, Zunnan Xu, Yachao Zhang, Xiu Li
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1541] arXiv:2309.17264 [pdf, html, other]
Title: A Foundation Model for General Moving Object Segmentation in Medical Images
Zhongnuo Yan, Tong Han, Yuhao Huang, Lian Liu, Han Zhou, Jiongquan Chen, Wenlong Shi, Yan Cao, Xin Yang, Dong Ni
Comments: 5 pages, 7 figures, 3 tables. This paper has been accepted by ISBI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[1542] arXiv:2309.17265 [pdf, other]
Title: Effect of structure-based training on 3D localization precision and quality
Armin Abdehkakha, Craig Snoeyink
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1543] arXiv:2309.17281 [pdf, html, other]
Title: Information Flow in Self-Supervised Learning
Zhiquan Tan, Jingqin Yang, Weiran Huang, Yang Yuan, Yifan Zhang
Comments: Published at ICML 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1544] arXiv:2309.17285 [pdf, other]
Title: Efficient Large Scale Medical Image Dataset Preparation for Machine Learning Applications
Stefan Denner, Jonas Scherer, Klaus Kades, Dimitrios Bounias, Philipp Schader, Lisa Kausch, Markus Bujotzek, Andreas Michael Bucher, Tobias Penzkofer, Klaus Maier-Hein
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1545] arXiv:2309.17327 [pdf, html, other]
Title: Telling Stories for Common Sense Zero-Shot Action Recognition
Shreyank N Gowda, Laura Sevilla-Lara
Comments: Accepted in ACCV 2024!
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1546] arXiv:2309.17329 [pdf, html, other]
Title: Efficient Anatomical Labeling of Pulmonary Tree Structures via Deep Point-Graph Representation-based Implicit Fields
Kangxian Xie, Jiancheng Yang, Donglai Wei, Ziqiao Weng, Pascal Fua
Comments: Accepted by Medical Image Analysis
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[1547] arXiv:2309.17336 [pdf, html, other]
Title: Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation
Jianning Deng, Gabriel Chan, Hantao Zhong, Chris Xiaoxuan Lu
Comments: Accepted to ICRA 2024. 8 pages, 4 figures. Equal contribution for Gabriel Chan and Hantao Zhong, listed randomly
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1548] arXiv:2309.17342 [pdf, other]
Title: Towards Free Data Selection with General-Purpose Models
Yichen Xie, Mingyu Ding, Masayoshi Tomizuka, Wei Zhan
Comments: accepted by NeurIPS 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1549] arXiv:2309.17361 [pdf, other]
Title: Network Memory Footprint Compression Through Jointly Learnable Codebooks and Mappings
Edouard Yvinec, Arnaud Dapogny, Kevin Bailly
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1550] arXiv:2309.17389 [pdf, html, other]
Title: Prompt-based test-time real image dehazing: a novel pipeline
Zixuan Chen, Zewei He, Ziqian Lu, Xuecheng Sun, Zhe-Ming Lu
Comments: Accepted by ECCV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1551] arXiv:2309.17390 [pdf, other]
Title: Forward Flow for Novel View Synthesis of Dynamic Scenes
Xiang Guo, Jiadai Sun, Yuchao Dai, Guanying Chen, Xiaoqing Ye, Xiao Tan, Errui Ding, Yumeng Zhang, Jingdong Wang
Comments: Accepted by ICCV2023 as oral. Project page: this https URL
Journal-ref: ICCV2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1552] arXiv:2309.17399 [pdf, other]
Title: IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images
Jiancheng Huang, Donghao Zhou, Shifeng Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1553] arXiv:2309.17400 [pdf, html, other]
Title: Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Kevin Clark, Paul Vicol, Kevin Swersky, David J Fleet
Comments: Published at ICLR 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1554] arXiv:2309.17421 [pdf, other]
Title: The Dawn of LMMs: Preliminary Explorations with GPT-4V(ision)
Zhengyuan Yang, Linjie Li, Kevin Lin, Jianfeng Wang, Chung-Ching Lin, Zicheng Liu, Lijuan Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL)
[1555] arXiv:2309.17426 [pdf, other]
Title: Classification of Potholes Based on Surface Area Using Pre-Trained Models of Convolutional Neural Network
Chauhdary Fazeel Ahmad, Abdullah Cheema, Waqas Qayyum, Rana Ehtisham, Muhammad Haroon Yousaf, Junaid Mir, Nasim Shakouri Mahmoudabadi, Afaq Ahmad
Comments: 24 Pages, 26 Figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
[1556] arXiv:2309.17430 [pdf, other]
Title: FACTS: First Amplify Correlations and Then Slice to Discover Bias
Sriram Yenamandra, Pratik Ramesh, Viraj Prabhu, Judy Hoffman
Comments: Accepted to ICCV 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1557] arXiv:2309.17444 [pdf, html, other]
Title: LLM-grounded Video Diffusion Models
Long Lian, Baifeng Shi, Adam Yala, Trevor Darrell, Boyi Li
Comments: ICLR 2024. Project Page: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
[1558] arXiv:2309.17448 [pdf, html, other]
Title: SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation
Zhongang Cai, Wanqi Yin, Ailing Zeng, Chen Wei, Qingping Sun, Yanjun Wang, Hui En Pang, Haiyi Mei, Mingyuan Zhang, Lei Zhang, Chen Change Loy, Lei Yang, Ziwei Liu
Comments: Homepage: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1559] arXiv:2309.17450 [pdf, other]
Title: Multi-task View Synthesis with Neural Radiance Fields
Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
Comments: ICCV 2023, Website: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV)
[1560] arXiv:2309.00006 (cross-list from eess.SP) [pdf, other]
Title: Dual Radar SAR Controller
Josiah Smith
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1561] arXiv:2309.00027 (cross-list from eess.IV) [pdf, other]
Title: A Sequential Framework for Detection and Classification of Abnormal Teeth in Panoramic X-rays
Tudor Dascalu, Shaqayeq Ramezanzade, Azam Bakhshandeh, Lars Bjorndal, Bulat Ibragimov
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1562] arXiv:2309.00140 (cross-list from cs.SD) [pdf, other]
Title: Improving vision-inspired keyword spotting using dynamic module skipping in streaming conformer encoder
Alexandre Bittar, Paul Dixon, Mohammad Samragh, Kumari Nishu, Devang Naik
Journal-ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1563] arXiv:2309.00147 (cross-list from eess.IV) [pdf, other]
Title: Optimized Deep Feature Selection for Pneumonia Detection: A Novel RegNet and XOR-Based PSO Approach
Fatemehsadat Ghanadi Ladani, Samaneh Hosseini Semnani
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1564] arXiv:2309.00187 (cross-list from eess.SY) [pdf, other]
Title: Vision-aided nonlinear control framework for shake table tests
Zhongwei Chen, T.Y. Yang, Yifei Xiao, Xiao Pan, Wanyan Yang
Comments: 10 pages, 7 figures, accepted in the Canadian Conference - Pacific Conference on Earthquake Engineering 2023, Vancouver, British Columbia
Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV)
[1565] arXiv:2309.00265 (cross-list from eess.IV) [pdf, other]
Title: Application of Machine Learning in Melanoma Detection and the Identification of 'Ugly Duckling' and Suspicious Naevi: A Review
Fatima Al Zegair, Nathasha Naranpanawa, Brigid Betz-Stablein, Monika Janda, H. Peter Soyer, Shekhar S. Chandra
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1566] arXiv:2309.00305 (cross-list from cs.LG) [pdf, other]
Title: Efficient Surrogate Models for Materials Science Simulations: Machine Learning-based Prediction of Microstructure Properties
Binh Duong Nguyen, Pavlo Potapenko, Aytekin Dermici, Kishan Govind, Sébastien Bompas, Stefan Sandfeld
Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Computer Vision and Pattern Recognition (cs.CV)
[1567] arXiv:2309.00347 (cross-list from cs.IR) [pdf, other]
Title: Towards Contrastive Learning in Music Video Domain
Karel Veldkamp, Mariya Hendriksen, Zoltán Szlávik, Alexander Keijser
Comments: 6 pages, 2 figures, 2 tables
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1568] arXiv:2309.00350 (cross-list from eess.IV) [pdf, other]
Title: How You Split Matters: Data Leakage and Subject Characteristics Studies in Longitudinal Brain MRI Analysis
Dewinda Julianensi Rumala
Comments: submitted to MICCAI FAIMI 2023
Journal-ref: MICCAI FAIMI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1569] arXiv:2309.00359 (cross-list from cs.CL) [pdf, html, other]
Title: Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior
Ashmit Khandelwal, Aditya Agrawal, Aanisha Bhattacharyya, Yaman K Singla, Somesh Singh, Uttaran Bhattacharya, Ishita Dasgupta, Stefano Petrangeli, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1570] arXiv:2309.00372 (cross-list from eess.IV) [pdf, other]
Title: On the Localization of Ultrasound Image Slices within Point Distribution Models
Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab
Comments: ShapeMI Workshop @ MICCAI 2023; 12 pages 2 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1571] arXiv:2309.00378 (cross-list from cs.CL) [pdf, html, other]
Title: Long-Term Ad Memorability: Understanding & Generating Memorable Ads
Harini SI, Somesh Singh, Yaman K Singla, Aanisha Bhattacharyya, Veeky Baths, Changyou Chen, Rajiv Ratn Shah, Balaji Krishnamurthy
Comments: Published in WACV-2025
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1572] arXiv:2309.00472 (cross-list from cs.IR) [pdf, other]
Title: General and Practical Tuning Method for Off-the-Shelf Graph-Based Index: SISAP Indexing Challenge Report by Team UTokyo
Yutaro Oguri, Yusuke Matsui
Comments: Accepted paper on 2nd place solution of SISAP 2023 Indexing Challenge Task A
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV); Databases (cs.DB)
[1573] arXiv:2309.00494 (cross-list from eess.IV) [pdf, html, other]
Title: Multi-stage Deep Learning Artifact Reduction for Pallel-beam Computed Tomography
Jiayang Shi, Daniel M. Pelt, K. Joost Batenburg
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1574] arXiv:2309.00569 (cross-list from eess.IV) [pdf, other]
Title: Amyloid-Beta Axial Plane PET Synthesis from Structural MRI: An Image Translation Approach for Screening Alzheimer's Disease
Fernando Vega, Abdoljalil Addeh, M. Ethan MacDonald
Comments: Abstract submitted and presented to the International Society of Magnetic Resonance in Medicine (ISMRM 2023), Toronto, Canada
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1575] arXiv:2309.00570 (cross-list from stat.ML) [pdf, other]
Title: Mechanism of feature learning in convolutional neural networks
Daniel Beaglehole, Adityanarayanan Radhakrishnan, Parthe Pandit, Mikhail Belkin
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1576] arXiv:2309.00727 (cross-list from eess.IV) [pdf, other]
Title: Deep learning in medical image registration: introduction and survey
Ahmad Hammoudeh, Stéphane Dupont
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1577] arXiv:2309.00769 (cross-list from eess.IV) [pdf, other]
Title: Full Reference Video Quality Assessment for Machine Learning-Based Video Codecs
Abrar Majeedi, Babak Naderi, Yasaman Hosseinkashi, Juhee Cho, Ruben Alvarez Martinez, Ross Cutler
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1578] arXiv:2309.00831 (cross-list from eess.IV) [pdf, other]
Title: Multi-scale, Data-driven and Anatomically Constrained Deep Learning Image Registration for Adult and Fetal Echocardiography
Md. Kamrul Hasan, Haobo Zhu, Guang Yang, Choon Hwai Yap
Comments: Our data-driven and anatomically constrained DLIR method's source code will be publicly available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1579] arXiv:2309.00853 (cross-list from eess.IV) [pdf, other]
Title: Correlated and Multi-frequency Diffusion Modeling for Highly Under-sampled MRI Reconstruction
Yu Guan, Chuanming Yu, Shiyu Lu, Zhuoxu Cui, Dong Liang, Qiegen Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1580] arXiv:2309.00864 (cross-list from cs.LG) [pdf, other]
Title: Equitable-FL: Federated Learning with Sparsity for Resource-Constrained Environment
Indrajeet Kumar Sinha, Shekhar Verma, Krishna Pratap Singh
Comments: 12 pages, 7 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC)
[1581] arXiv:2309.00885 (cross-list from eess.IV) [pdf, other]
Title: A Generic Fundus Image Enhancement Network Boosted by Frequency Self-supervised Representation Learning
Heng Li, Haofeng Liu, Huazhu Fu, Yanwu Xu, Hui Shu, Ke Niu, Yan Hu, Jiang Liu
Comments: Accepted by Medical Image Analysis in Auguest, 2023
Journal-ref: Medical Image Analysis, 2023, 90:102945
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1582] arXiv:2309.00911 (cross-list from eess.IV) [pdf, other]
Title: A novel framework employing deep multi-attention channels network for the autonomous detection of metastasizing cells through fluorescence microscopy
Michail Mamalakis, Sarah C. Macfarlane, Scott V. Notley, Annica K.B Gad, George Panoutsos
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1583] arXiv:2309.00962 (cross-list from cs.RO) [pdf, other]
Title: NTU4DRadLM: 4D Radar-centric Multi-Modal Dataset for Localization and Mapping
Jun Zhang, Huayang Zhuge, Yiyao Liu, Guohao Peng, Zhenyu Wu, Haoyuan Zhang, Qiyang Lyu, Heshan Li, Chunyang Zhao, Dogan Kircali, Sanat Mharolkar, Xun Yang, Su Yi, Yuanzhe Wang, Danwei Wang
Comments: 2023 IEEE International Intelligent Transportation Systems Conference (ITSC 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1584] arXiv:2309.00971 (cross-list from eess.IV) [pdf, other]
Title: AdLER: Adversarial Training with Label Error Rectification for One-Shot Medical Image Segmentation
Xiangyu Zhao, Sheng Wang, Zhiyun Song, Zhenrong Shen, Linlin Yao, Haolei Yuan, Qian Wang, Lichi Zhang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1585] arXiv:2309.00995 (cross-list from eess.IV) [pdf, other]
Title: Constrained CycleGAN for Effective Generation of Ultrasound Sector Images of Improved Spatial Resolution
Xiaofei Sun, He Li, Wei-Ning Lee
Journal-ref: Physics in Medicine & Biology 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1586] arXiv:2309.01007 (cross-list from eess.IV) [pdf, other]
Title: Comparative Analysis of Deep Learning Architectures for Breast Cancer Diagnosis Using the BreaKHis Dataset
İrem Sayın, Muhammed Ali Soydaş, Yunus Emre Mert, Arda Yarkataş, Berk Ergun, Selma Sözen Yeh, Hüseyin Üvet
Comments: 7 pages, 1 figure, 2 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1587] arXiv:2309.01072 (cross-list from eess.IV) [pdf, other]
Title: Channel Attention Separable Convolution Network for Skin Lesion Segmentation
Changlu Guo, Jiangyan Dai, Marton Szemenyei, Yugen Yi
Comments: Accepted by ICONIP 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1588] arXiv:2309.01077 (cross-list from cs.LG) [pdf, other]
Title: Robust Adversarial Defense by Tensor Factorization
Manish Bhattarai, Mehmet Cagri Kaymak, Ryan Barron, Ben Nebgen, Kim Rasmussen, Boian Alexandrov
Comments: Accepted at 2023 ICMLA Conference
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1589] arXiv:2309.01171 (cross-list from eess.IV) [pdf, html, other]
Title: Deep Unfolding Convolutional Dictionary Model for Multi-Contrast MRI Super-resolution and Reconstruction
Pengcheng Lei, Faming Fang, Guixu Zhang, Ming Xu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1590] arXiv:2309.01202 (cross-list from cs.GR) [pdf, other]
Title: MAGMA: Music Aligned Generative Motion Autodecoder
Sohan Anisetty, Amit Raj, James Hays
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1591] arXiv:2309.01207 (cross-list from eess.IV) [pdf, other]
Title: Spectral Adversarial MixUp for Few-Shot Unsupervised Domain Adaptation
Jiajin Zhang, Hanqing Chao, Amit Dhurandhar, Pin-Yu Chen, Ali Tajer, Yangyang Xu, Pingkun Yan
Comments: Accepted by MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1592] arXiv:2309.01235 (cross-list from eess.IV) [pdf, other]
Title: Generalizability and Application of the Skin Reflectance Estimate Based on Dichromatic Separation (SREDS)
Joseph Drahos, Richard Plesh, Keivan Bahmani, Mahesh Banavar, Stephanie Schuckers
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1593] arXiv:2309.01312 (cross-list from eess.IV) [pdf, other]
Title: Enhancing Automated and Early Detection of Alzheimer's Disease Using Out-Of-Distribution Detection
Audrey Paleczny, Shubham Parab, Maxwell Zhang
Comments: 10 pages, 8 figures, 3 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1594] arXiv:2309.01322 (cross-list from eess.IV) [pdf, other]
Title: FAU-Net: An Attention U-Net Extension with Feature Pyramid Attention for Prostate Cancer Segmentation
Pablo Cesar Quihui-Rubio, Daniel Flores-Araiza, Miguel Gonzalez-Mendoza, Christian Mata, Gilberto Ochoa-Ruiz
Comments: This paper has been accepted at the 22nd Mexican International Conference on Artificial Intelligence (MICAI 2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1595] arXiv:2309.01339 (cross-list from cs.CL) [pdf, other]
Title: UniSA: Unified Generative Framework for Sentiment Analysis
Zaijing Li, Ting-En Lin, Yuchuan Wu, Meng Liu, Fengxiao Tang, Ming Zhao, Yongbin Li
Comments: Accepted to ACM MM 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1596] arXiv:2309.01340 (cross-list from cs.SD) [pdf, other]
Title: MDSC: Towards Evaluating the Style Consistency Between Music and Dance
Zixiang Zhou, Weiyuan Li, Baoyuan Wang
Comments: 19 pages, 19 figure
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1597] arXiv:2309.01361 (cross-list from cs.ET) [pdf, other]
Title: High Frequency, High Accuracy Pointing onboard Nanosats using Neuromorphic Event Sensing and Piezoelectric Actuation
Yasir Latif, Peter Anastasiou, Yonhon Ng, Zebb Prime, Tien-Fu Lu, Matthew Tetlow, Robert Mahony, Tat-Jun Chin
Subjects: Emerging Technologies (cs.ET); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1598] arXiv:2309.01446 (cross-list from cs.CL) [pdf, html, other]
Title: Open Sesame! Universal Black Box Jailbreaking of Large Language Models
Raz Lapid, Ron Langberg, Moshe Sipper
Comments: Accepted at SeT-LLM @ ICLR 2024
Journal-ref: ICLR 2024 Workshop on Secure and Trustworthy Large Language Models
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1599] arXiv:2309.01532 (cross-list from cs.LG) [pdf, other]
Title: Are We Using Autoencoders in a Wrong Way?
Gabriele Martino, Davide Moroni, Massimo Martinelli
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1600] arXiv:2309.01587 (cross-list from cs.AR) [pdf, other]
Title: SATAY: A Streaming Architecture Toolflow for Accelerating YOLO Models on FPGA Devices
Alexander Montgomerie-Corcoran, Petros Toupas, Zhewen Yu, Christos-Savvas Bouganis
Subjects: Hardware Architecture (cs.AR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1601] arXiv:2309.01590 (cross-list from cs.LG) [pdf, other]
Title: Probabilistic Precision and Recall Towards Reliable Evaluation of Generative Models
Dogyun Park, Suhyun Kim
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1602] arXiv:2309.01646 (cross-list from cs.RO) [pdf, other]
Title: ReLoc-PDR: Visual Relocalization Enhanced Pedestrian Dead Reckoning via Graph Optimization
Zongyang Chen, Xianfei Pan, Changhao Chen
Comments: 11 pages, 14 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1603] arXiv:2309.01729 (cross-list from cs.LG) [pdf, other]
Title: Softmax Bias Correction for Quantized Generative Models
Nilesh Prasad Pandey, Marios Fournarakis, Chirag Patel, Markus Nagel
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1604] arXiv:2309.01740 (cross-list from eess.IV) [pdf, other]
Title: An Empirical Analysis for Zero-Shot Multi-Label Classification on COVID-19 CT Scans and Uncurated Reports
Ethan Dack, Lorenzo Brigato, Matthew McMurray, Matthias Fontanellaz, Thomas Frauenfelder, Hanno Hoppe, Aristomenis Exadaktylos, Thomas Geiser, Manuela Funke-Chambour, Andreas Christe, Lukas Ebner, Stavroula Mougiakakou
Comments: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops 2023
Subjects: Image and Video Processing (eess.IV); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1605] arXiv:2309.01751 (cross-list from eess.IV) [pdf, html, other]
Title: Multispectral Indices for Wildfire Management
Afonso Oliveira, João P. Matos-Carvalho, Filipe Moutinho, Nuno Fachada
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[1606] arXiv:2309.01823 (cross-list from eess.IV) [pdf, other]
Title: Multi-dimension unified Swin Transformer for 3D Lesion Segmentation in Multiple Anatomical Locations
Shaoyan Pan, Yiqiao Liu, Sarah Halek, Michal Tomaszewski, Shubing Wang, Richard Baumgartner, Jianda Yuan, Gregory Goldmacher, Antong Chen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1607] arXiv:2309.01904 (cross-list from cs.RO) [pdf, other]
Title: Improving Drone Imagery For Computer Vision/Machine Learning in Wilderness Search and Rescue
Robin Murphy, Thomas Manzini
Comments: 6 pages, 4 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1608] arXiv:2309.02007 (cross-list from eess.IV) [pdf, other]
Title: Logarithmic Mathematical Morphology: theory and applications
Guillaume Noyel (LHC)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Functional Analysis (math.FA); Numerical Analysis (math.NA)
[1609] arXiv:2309.02020 (cross-list from eess.IV) [pdf, other]
Title: RawHDR: High Dynamic Range Image Reconstruction from a Single Raw Image
Yunhao Zou, Chenggang Yan, Ying Fu
Comments: ICCV 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1610] arXiv:2309.02022 (cross-list from cs.LG) [pdf, other]
Title: Dynamic Early Exiting Predictive Coding Neural Networks
Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1611] arXiv:2309.02140 (cross-list from eess.IV) [pdf, other]
Title: A Lightweight, Rapid and Efficient Deep Convolutional Network for Chest X-Ray Tuberculosis Detection
Daniel Capellán-Martín, Juan J. Gómez-Valverde, David Bermejo-Peláez, María J. Ledesma-Carbayo
Comments: 5 pages, 3 figures, 3 tables. This paper has been accepted at ISBI 2023
Journal-ref: 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia, 2023
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1612] arXiv:2309.02147 (cross-list from eess.IV) [pdf, other]
Title: INCEPTNET: Precise And Early Disease Detection Application For Medical Images Analyses
Amirhossein Sajedi, Mohammad Javad Fadaeieslam
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1613] arXiv:2309.02159 (cross-list from cs.CR) [pdf, other]
Title: The Adversarial Implications of Variable-Time Inference
Dudi Biton, Aditi Misra, Efrat Levy, Jaidip Kotak, Ron Bitton, Roei Schuster, Nicolas Papernot, Yuval Elovici, Ben Nassi
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1614] arXiv:2309.02179 (cross-list from eess.IV) [pdf, other]
Title: High-resolution 3D Maps of Left Atrial Displacements using an Unsupervised Image Registration Neural Network
Christoforos Galazis, Anil Anthony Bharath, Marta Varela
Journal-ref: Medical Imaging with Deep Learning, short paper track, 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1615] arXiv:2309.02335 (cross-list from eess.IV) [pdf, other]
Title: DEEPBEAS3D: Deep Learning and B-Spline Explicit Active Surfaces
Helena Williams, João Pedrosa, Muhammad Asad, Laura Cattani, Tom Vercauteren, Jan Deprest, Jan D'hooge
Comments: 4 pages, 3 figures, 1 table, conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1616] arXiv:2309.02404 (cross-list from cs.SD) [pdf, other]
Title: Voice Morphing: Two Identities in One Voice
Sushanta K. Pani, Anurag Chowdhury, Morgan Sandler, Arun Ross
Comments: Accepted oral paper at BIOSIG 2023
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1617] arXiv:2309.02435 (cross-list from cs.LG) [pdf, other]
Title: Efficient RL via Disentangled Environment and Agent Representations
Kevin Gmelin, Shikhar Bahl, Russell Mendonca, Deepak Pathak
Comments: ICML 2023. Website at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO)
[1618] arXiv:2309.02555 (cross-list from cs.LG) [pdf, other]
Title: A Survey of the Impact of Self-Supervised Pretraining for Diagnostic Tasks with Radiological Images
Blake VanBerlo, Jesse Hoey, Alexander Wong
Comments: 32 pages, 6 figures, a literature survey submitted to BMC Medical Imaging
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1619] arXiv:2309.02561 (cross-list from cs.RO) [pdf, html, other]
Title: Physically Grounded Vision-Language Models for Robotic Manipulation
Jensen Gao, Bidipta Sarkar, Fei Xia, Ted Xiao, Jiajun Wu, Brian Ichter, Anirudha Majumdar, Dorsa Sadigh
Comments: Updated version for ICRA 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1620] arXiv:2309.02563 (cross-list from eess.IV) [pdf, other]
Title: Evaluation Kidney Layer Segmentation on Whole Slide Imaging using Convolutional Neural Networks and Transformers
Muhao Liu, Chenyang Qi, Shunxing Bao, Quan Liu, Ruining Deng, Yu Wang, Shilin Zhao, Haichun Yang, Yuankai Huo
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1621] arXiv:2309.02576 (cross-list from eess.IV) [pdf, other]
Title: Emphysema Subtyping on Thoracic Computed Tomography Scans using Deep Neural Networks
Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Dirk Jan Slebos, Bram van Ginneken
Journal-ref: Sci Rep. 2023 Aug 29;13(1):14147
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1622] arXiv:2309.02591 (cross-list from cs.LG) [pdf, other]
Title: Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1623] arXiv:2309.02670 (cross-list from eess.IV) [pdf, other]
Title: Progressive Attention Guidance for Whole Slide Vulvovaginal Candidiasis Screening
Jiangdong Cai, Honglin Xiong, Maosong Cao, Luyan Liu, Lichi Zhang, Qian Wang
Comments: Accepted in the main conference MICCAI 2023
Journal-ref: 26th International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI 2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1624] arXiv:2309.02681 (cross-list from eess.IV) [pdf, other]
Title: Improving Image Classification of Knee Radiographs: An Automated Image Labeling Approach
Jikai Zhang, Carlos Santos, Christine Park, Maciej Mazurowski, Roy Colglazier
Comments: This is the preprint version
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1625] arXiv:2309.02691 (cross-list from cs.CL) [pdf, other]
Title: A Joint Study of Phrase Grounding and Task Performance in Vision and Language Models
Noriyuki Kojima, Hadar Averbuch-Elor, Yoav Artzi
Comments: This was published in TMLR in 2024, on January 24th
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1626] arXiv:2309.02783 (cross-list from eess.IV) [pdf, other]
Title: Improving diagnosis and prognosis of lung cancer using vision transformers: A scoping review
Hazrat Ali, Farida Mohsen, Zubair Shah
Comments: submitted to BMC Medical Imaging journal
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1627] arXiv:2309.02841 (cross-list from cs.IT) [pdf, other]
Title: Adjacency-hopping de Bruijn Sequences for Non-repetitive Coding
Bin Chen, Zhenglin Liang, Shiqian Wu
Subjects: Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Discrete Mathematics (cs.DM)
[1628] arXiv:2309.02898 (cross-list from cs.LG) [pdf, other]
Title: A Unified Framework for Discovering Discrete Symmetries
Pavan Karjol, Rohan Kashyap, Aditya Gopalan, Prathosh A.P
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1629] arXiv:2309.02959 (cross-list from eess.IV) [pdf, html, other]
Title: A Non-Invasive Interpretable NAFLD Diagnostic Method Combining TCM Tongue Features
Shan Cao, Qunsheng Ruan, Qingfeng Wu, Weiqiang Lin
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1630] arXiv:2309.02961 (cross-list from eess.SP) [pdf, html, other]
Title: LuViRA Dataset Validation and Discussion: Comparing Vision, Radio, and Audio Sensors for Indoor Localization
Ilayda Yaman, Guoda Tian, Erik Tegler, Jens Gulin, Nikhil Challa, Fredrik Tufvesson, Ove Edfors, Kalle Astrom, Steffen Malkowsky, Liang Liu
Comments: 10 pages, 11 figures
Journal-ref: IEEE Journal of Indoor and Seamless Positioning and Navigation (2024) 1-11
Subjects: Signal Processing (eess.SP); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1631] arXiv:2309.03064 (cross-list from cs.CL) [pdf, other]
Title: A Multimodal Analysis of Influencer Content on Twitter
Danae Sánchez Villegas, Catalina Goanta, Nikolaos Aletras
Comments: Accepted at AACL 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1632] arXiv:2309.03113 (cross-list from cs.LG) [pdf, other]
Title: Detecting Manufacturing Defects in PCBs via Data-Centric Machine Learning on Solder Paste Inspection Features
Jubilee Prasad-Rao, Roohollah Heidary, Jesse Williams
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1633] arXiv:2309.03177 (cross-list from eess.SY) [pdf, other]
Title: 3D Object Positioning Using Differentiable Multimodal Learning
Sean Zanyk-McLean, Krishna Kumar, Paul Navratil
Comments: 7 pages, 8 figures
Subjects: Systems and Control (eess.SY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
[1634] arXiv:2309.03183 (cross-list from eess.IV) [pdf, other]
Title: 3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia
Huy-Dung Nguyen, Michaël Clément, Boris Mansencal, Pierrick Coupé
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1635] arXiv:2309.03215 (cross-list from cs.AI) [pdf, other]
Title: Explainable and Trustworthy Traffic Sign Detection for Safe Autonomous Driving: An Inductive Logic Programming Approach
Zahra Chaghazardi (University of Surrey), Saber Fallah (University of Surrey), Alireza Tamaddoni-Nezhad (University of Surrey)
Comments: In Proceedings ICLP 2023, arXiv:2308.14898
Journal-ref: EPTCS 385, 2023, pp. 201-212
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Logic in Computer Science (cs.LO)
[1636] arXiv:2309.03232 (cross-list from cs.LG) [pdf, other]
Title: Retail store customer behavior analysis system: Design and Implementation
Tuan Dinh Nguyen, Keisuke Hihara, Tung Cao Hoang, Yumeka Utada, Akihiko Torii, Naoki Izumi, Nguyen Thanh Thuy, Long Quoc Tran
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1637] arXiv:2309.03244 (cross-list from eess.IV) [pdf, html, other]
Title: EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation
Nikolai Körber, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn Schuller
Comments: ECCV 2024 Camera Ready
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1638] arXiv:2309.03320 (cross-list from eess.IV) [pdf, html, other]
Title: CoNeS: Conditional neural fields with shift modulation for multi-sequence MRI translation
Yunjie Chen, Marius Staring, Olaf M. Neve, Stephan R. Romeijn, Erik F. Hensen, Berit M. Verbist, Jelmer M. Wolterink, Qian Tao
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL
Journal-ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1639] arXiv:2309.03383 (cross-list from eess.IV) [pdf, other]
Title: Kidney abnormality segmentation in thorax-abdomen CT scans
Gabriel Efrain Humpire Mamani, Nikolas Lessmann, Ernst Th. Scholten, Mathias Prokop, Colin Jacobs, Bram van Ginneken
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1640] arXiv:2309.03440 (cross-list from eess.IV) [pdf, other]
Title: Punctate White Matter Lesion Segmentation in Preterm Infants Powered by Counterfactually Generative Learning
Zehua Ren, Yongheng Sun, Miaomiao Wang, Yuying Feng, Xianjun Li, Chao Jin, Jian Yang, Chunfeng Lian, Fan Wang
Comments: 10 pages, 3 figures, Medical Image Computing and Computer Assisted Intervention(MICCAI)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1641] arXiv:2309.03469 (cross-list from cs.LG) [pdf, other]
Title: Fast FixMatch: Faster Semi-Supervised Learning with Curriculum Batch Size
John Chen, Chen Dun, Anastasios Kyrillidis
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1642] arXiv:2309.03477 (cross-list from eess.IV) [pdf, other]
Title: TSI-Net: A Timing Sequence Image Segmentation Network for Intracranial Artery Segmentation in Digital Subtraction Angiography
Lemeng Wang, Wentao Liu, Weijin Xu, Haoyuan Li, Huihua Yang, Feng Gao
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1643] arXiv:2309.03493 (cross-list from eess.IV) [pdf, html, other]
Title: SAM3D: Segment Anything Model in Volumetric Medical Images
Nhat-Tan Bui, Dinh-Hieu Hoang, Minh-Triet Tran, Gianfranco Doretto, Donald Adjeroh, Brijesh Patel, Arabinda Choudhary, Ngan Le
Comments: Accepted at ISBI 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1644] arXiv:2309.03494 (cross-list from eess.IV) [pdf, other]
Title: Evaluating Deep Learning-based Melanoma Classification using Immunohistochemistry and Routine Histology: A Three Center Study
Christoph Wies, Lucas Schneider, Sarah Haggenmueller, Tabea-Clara Bucher, Sarah Hobelsberger, Markus V. Heppt, Gerardo Ferrara, Eva I. Krieghoff-Henning, Titus J. Brinker
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[1645] arXiv:2309.03535 (cross-list from eess.IV) [pdf, other]
Title: Feature Enhancer Segmentation Network (FES-Net) for Vessel Segmentation
Tariq M. Khan, Muhammad Arsalan, Shahzaib Iqbal, Imran Razzak, Erik Meijering
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1646] arXiv:2309.03569 (cross-list from cs.LG) [pdf, other]
Title: Sparse Federated Training of Object Detection in the Internet of Vehicles
Luping Rao, Chuan Ma, Ming Ding, Yuwen Qian, Lu Zhou, Zhe Liu
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1647] arXiv:2309.03590 (cross-list from eess.IV) [pdf, other]
Title: Spatial encoding of BOLD fMRI time series for categorizing static images across visual datasets: A pilot study on human vision
Vamshi K. Kancharala, Debanjali Bhattacharya, Neelam Sinha
Comments: This paper is accepted for publication in IEEE Region 10 Technical conference, TENCON 2023, to be held in Chiang Mai, Thailand from 31 October - 3 November, 2023
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[1648] arXiv:2309.03641 (cross-list from cs.SD) [pdf, html, other]
Title: Spiking Structured State Space Model for Monaural Speech Enhancement
Yu Du, Xu Liu, Yansong Chua
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1649] arXiv:2309.03652 (cross-list from eess.IV) [pdf, other]
Title: Anatomy-informed Data Augmentation for Enhanced Prostate Cancer Detection
Balint Kovacs, Nils Netzer, Michael Baumgartner, Carolin Eith, Dimitrios Bounias, Clara Meinzer, Paul F. Jaeger, Kevin S. Zhang, Ralf Floca, Adrian Schrader, Fabian Isensee, Regula Gnirs, Magdalena Goertz, Viktoria Schuetz, Albrecht Stenzinger, Markus Hohenfellner, Heinz-Peter Schlemmer, Ivo Wolf, David Bonekamp, Klaus H. Maier-Hein
Comments: Accepted at MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1650] arXiv:2309.03686 (cross-list from eess.IV) [pdf, other]
Title: MS-UNet-v2: Adaptive Denoising Method and Training Strategy for Medical Image Segmentation with Small Training Data
Haoyuan Chen, Yufei Han, Pin Xu, Yanyi Li, Kuan Li, Jianping Yin
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1651] arXiv:2309.03702 (cross-list from cs.LG) [pdf, other]
Title: DiffDefense: Defending against Adversarial Attacks via Diffusion Models
Hondamunige Prasanna Silva, Lorenzo Seidenari, Alberto Del Bimbo
Comments: Paper published at ICIAP23
Journal-ref: ICIAP 2023
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1652] arXiv:2309.03744 (cross-list from eess.IV) [pdf, html, other]
Title: Label-efficient Contrastive Learning-based model for nuclei detection and classification in 3D Cardiovascular Immunofluorescent Images
Nazanin Moradinasab, Rebecca A. Deaton, Laura S. Shankman, Gary K. Owens, Donald E. Brown
Comments: 11 pages, 5 figures, MICCAI Workshop Conference 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1653] arXiv:2309.03759 (cross-list from eess.IV) [pdf, other]
Title: M(otion)-mode Based Prediction of Ejection Fraction using Echocardiograms
Ece Ozkan, Thomas M. Sutter, Yurong Hu, Sebastian Balzer, Julia E. Vogt
Comments: Accepted at GCPR 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1654] arXiv:2309.03774 (cross-list from cs.LG) [pdf, html, other]
Title: Deep Learning Safety Concerns in Automated Driving Perception
Stephanie Abrecht, Alexander Hirsch, Shervin Raafatnia, Matthias Woehrle
Comments: Added note regarding accepted version at IEEE Transactions on Intelligent Vehicles with DOI
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Systems and Control (eess.SY)
[1655] arXiv:2309.03851 (cross-list from cs.LG) [pdf, html, other]
Title: CenTime: Event-Conditional Modelling of Censoring in Survival Analysis
Ahmed H. Shahin, An Zhao, Alexander C. Whitehead, Daniel C. Alexander, Joseph Jacob, David Barber
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1656] arXiv:2309.03879 (cross-list from cs.LG) [pdf, other]
Title: Better Practices for Domain Adaptation
Linus Ericsson, Da Li, Timothy M. Hospedales
Comments: AutoML 2023 (Best paper award)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1657] arXiv:2309.03891 (cross-list from cs.RO) [pdf, html, other]
Title: ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation
Hui Zhang, Sammy Christen, Zicong Fan, Luocheng Zheng, Jemin Hwangbo, Jie Song, Otmar Hilliges
Comments: 3DV-2024 camera ready. Project page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1658] arXiv:2309.03900 (cross-list from eess.IV) [pdf, other]
Title: Learning Continuous Exposure Value Representations for Single-Image HDR Reconstruction
Su-Kai Chen, Hung-Lin Yen, Yu-Lun Liu, Min-Hung Chen, Hou-Ning Hu, Wen-Hsiao Peng, Yen-Yu Lin
Comments: ICCV 2023. Project page: this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1659] arXiv:2309.03905 (cross-list from cs.MM) [pdf, other]
Title: ImageBind-LLM: Multi-modality Instruction Tuning
Jiaming Han, Renrui Zhang, Wenqi Shao, Peng Gao, Peng Xu, Han Xiao, Kaipeng Zhang, Chris Liu, Song Wen, Ziyu Guo, Xudong Lu, Shuai Ren, Yafei Wen, Xiaoxin Chen, Xiangyu Yue, Hongsheng Li, Yu Qiao
Comments: Code is available at this https URL
Subjects: Multimedia (cs.MM); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[1660] arXiv:2309.03906 (cross-list from eess.IV) [pdf, other]
Title: A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation
Ziyan Huang, Zhongying Deng, Jin Ye, Haoyu Wang, Yanzhou Su, Tianbin Li, Hui Sun, Junlong Cheng, Jianpin Chen, Junjun He, Yun Gu, Shaoting Zhang, Lixu Gu, Yu Qiao
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1661] arXiv:2309.03964 (cross-list from cs.LG) [pdf, other]
Title: REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation
Skyler Seto, Barry-John Theobald, Federico Danieli, Navdeep Jaitly, Dan Busbridge
Comments: Accepted at WACV 2024, 17 pages, 7 figures, 11 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1662] arXiv:2309.03965 (cross-list from cs.LG) [pdf, other]
Title: Improving Resnet-9 Generalization Trained on Small Datasets
Omar Mohamed Awad, Habib Hajimolahoseini, Michael Lim, Gurpreet Gosal, Walid Ahmed, Yang Liu, Gordon Deng
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1663] arXiv:2309.04028 (cross-list from math.AG) [pdf, other]
Title: Algebra and Geometry of Camera Resectioning
Erin Connelly, Timothy Duff, Jessie Loucks-Tavitas
Comments: 27 pages
Subjects: Algebraic Geometry (math.AG); Computer Vision and Pattern Recognition (cs.CV); Commutative Algebra (math.AC)
[1664] arXiv:2309.04071 (cross-list from eess.IV) [pdf, html, other]
Title: Enhancing Hierarchical Transformers for Whole Brain Segmentation with Intracranial Measurements Integration
Xin Yu, Yucheng Tang, Qi Yang, Ho Hin Lee, Shunxing Bao, Yuankai Huo, Bennett A. Landman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1665] arXiv:2309.04081 (cross-list from cs.LG) [pdf, other]
Title: UER: A Heuristic Bias Addressing Approach for Online Continual Learning
Huiwei Lin, Shanshan Feng, Baoquan Zhang, Hongliang Qiao, Xutao Li, Yunming Ye
Comments: 9 pages, 12 figures, ACM MM2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1666] arXiv:2309.04190 (cross-list from eess.IV) [pdf, html, other]
Title: SegmentAnything helps microscopy images based automatic and quantitative organoid detection and analysis
Xiaodan Xing, Chunling Tang, Yunzhe Guo, Nicholas Kurniawan, Guang Yang
Comments: Replace Figure 4 with the correct version. The original version is wrong due to a column name mismatch
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1667] arXiv:2309.04293 (cross-list from eess.IV) [pdf, other]
Title: How Can We Tame the Long-Tail of Chest X-ray Datasets?
Arsh Verma
Comments: Extended Abstract presented at Computer Vision for Automated Medical Diagnosis Workshop at the International Conference on Computer Vision 2023, October 2nd 2023, Paris, France, & Virtual, this https URL, 7 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1668] arXiv:2309.04342 (cross-list from physics.optics) [pdf, other]
Title: Revealing the preference for correcting separated aberrations in joint optic-image design
Jingwen Zhou, Shiqi Chen, Zheng Ren, Wenguan Zhang, Jiapu Yan, Huajun Feng, Qi Li, Yueting Chen
Comments: 19 pages
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV)
[1669] arXiv:2309.04441 (cross-list from cs.RO) [pdf, other]
Title: Comparative Study of Visual SLAM-Based Mobile Robot Localization Using Fiducial Markers
Jongwon Lee, Su Yeon Choi, David Hanley, Timothy Bretl
Comments: IEEE 2023 IROS Workshop "Closing the Loop on Localization". For more information, see this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1670] arXiv:2309.04461 (cross-list from cs.CL) [pdf, html, other]
Title: Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models
Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran
Comments: NAACL 2024 Main Conference. The data is released at this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1671] arXiv:2309.04509 (cross-list from cs.SD) [pdf, other]
Title: The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion
Yujin Jeong, Wonjeong Ryoo, Seunghyun Lee, Dabin Seo, Wonmin Byeon, Sangpil Kim, Jinkyu Kim
Comments: ICCV2023
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
[1672] arXiv:2309.04511 (cross-list from eess.IV) [pdf, other]
Title: Systematic Review of Techniques in Brain Image Synthesis using Deep Learning
Shubham Singh, Ammar Ranapurwala, Mrunal Bewoor, Sheetal Patil, Satyam Rai
Comments: 8 pages
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1673] arXiv:2309.04581 (cross-list from cs.GR) [pdf, other]
Title: Dynamic Mesh-Aware Radiance Fields
Yi-Ling Qiao, Alexander Gao, Yiran Xu, Yue Feng, Jia-Bin Huang, Ming C. Lin
Comments: ICCV 2023
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1674] arXiv:2309.04631 (cross-list from q-bio.TO) [pdf, other]
Title: Open and reusable deep learning for pathology with WSInfer and QuPath
Jakub R. Kaczmarzyk, Alan O'Callaghan, Fiona Inglis, Tahsin Kurc, Rajarsi Gupta, Erich Bremer, Peter Bankhead, Joel H. Saltz
Subjects: Tissues and Organs (q-bio.TO); Computer Vision and Pattern Recognition (cs.CV)
[1675] arXiv:2309.04651 (cross-list from eess.IV) [pdf, other]
Title: Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis
Nikhil J. Dhinagar, Amit Singh, Saket Ozarkar, Ketaki Buwa, Sophia I. Thomopoulos, Conor Owens-Walton, Emily Laltoo, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Paul M. Thompson
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1676] arXiv:2309.04672 (cross-list from eess.IV) [pdf, html, other]
Title: SSHNN: Semi-Supervised Hybrid NAS Network for Echocardiographic Image Segmentation
Renqi Chen, Jingjing Luo, Fan Nian, Yuhui Cen, Yiheng Peng, Zekuan Yu
Comments: Accepted by ICASSP2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1677] arXiv:2309.04710 (cross-list from cs.RO) [pdf, other]
Title: Jade: A Differentiable Physics Engine for Articulated Rigid Bodies with Intersection-Free Frictional Contact
Gang Yang, Siyuan Luo, Lin Shao
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Systems and Control (eess.SY)
[1678] arXiv:2309.04760 (cross-list from cs.LG) [pdf, other]
Title: RR-CP: Reliable-Region-Based Conformal Prediction for Trustworthy Medical Image Classification
Yizhe Zhang, Shuo Wang, Yejia Zhang, Danny Z. Chen
Comments: UNSURE2023 (Uncertainty for Safe Utilization of Machine Learning in Medical Imaging) at MICCAI2023; Spotlight
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1679] arXiv:2309.04762 (cross-list from cs.SD) [pdf, other]
Title: AudRandAug: Random Image Augmentations for Audio Classification
Teerath Kumar, Muhammad Turab, Alessandra Mileo, Malika Bendechache, Takfarinas Saber
Comments: Paper has accepted at 25th Irish Machine Vision and Image Processing Conference
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
[1680] arXiv:2309.04777 (cross-list from cs.CR) [pdf, other]
Title: Towards Robust Model Watermark via Reducing Parametric Vulnerability
Guanhao Gan, Yiming Li, Dongxian Wu, Shu-Tao Xia
Comments: This paper is accepted by ICCV 2023
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1681] arXiv:2309.04946 (cross-list from cs.SD) [pdf, other]
Title: Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation
Yuan Gan, Zongxin Yang, Xihang Yue, Lingyun Sun, Yi Yang
Comments: Accepted to ICCV 2023. Project page: this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
[1682] arXiv:2309.04956 (cross-list from eess.IV) [pdf, other]
Title: Anatomy Completor: A Multi-class Completion Framework for 3D Anatomy Reconstruction
Jianning Li, Antonio Pepe, Gijs Luijten, Christina Schwarz-Gsaxner, Jens Kleesiek, Jan Egger
Comments: 15 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1683] arXiv:2309.04960 (cross-list from eess.IV) [pdf, other]
Title: SdCT-GAN: Reconstructing CT from Biplanar X-Rays with Self-driven Generative Adversarial Networks
Shuangqin Cheng, Qingliang Chen, Qiyi Zhang, Ming Li, Yamuhanmode Alike, Kaile Su, Pengcheng Wen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1684] arXiv:2309.04961 (cross-list from cs.IR) [pdf, other]
Title: Multi-modal Extreme Classification
Anshul Mittal, Kunal Dahiya, Shreya Malani, Janani Ramaswamy, Seba Kuruvilla, Jitendra Ajmera, Keng-hao Chang, Sumeet Agarwal, Purushottam Kar, Manik Varma
Journal-ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022
Subjects: Information Retrieval (cs.IR); Computer Vision and Pattern Recognition (cs.CV)
[1685] arXiv:2309.05036 (cross-list from cs.RO) [pdf, other]
Title: What Is Near?: Room Locality Learning for Enhanced Robot Vision-Language-Navigation in Indoor Living Environments
Muraleekrishna Gopinathan, Jumana Abu-Khalaf, David Suter, Sidike Paheding, Nathir A. Rawashdeh
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1686] arXiv:2309.05071 (cross-list from math.AP) [pdf, other]
Title: Super-Resolution Surface Reconstruction from Few Low-Resolution Slices
Yiyao Zhang, Ke Chen, Shang-Hua Yang
Comments: 33 pages, 25 figures
Journal-ref: AIMS Journal Inverse Problems and Imaging (IPI) 2023
Subjects: Analysis of PDEs (math.AP); Computer Vision and Pattern Recognition (cs.CV)
[1687] arXiv:2309.05162 (cross-list from cs.CL) [pdf, other]
Title: Collecting Visually-Grounded Dialogue with A Game Of Sorts
Bram Willemsen, Dmytro Kalpakchi, Gabriel Skantze
Comments: Published at LREC 2022
Journal-ref: Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC 2022), pages 2257-2268, Marseille, France. European Language Resources Association
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1688] arXiv:2309.05173 (cross-list from cs.CL) [pdf, other]
Title: DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning
Zhengxiang Shi, Aldo Lipani
Comments: ICLR 2024. Code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1689] arXiv:2309.05197 (cross-list from cs.RO) [pdf, other]
Title: Learning Sequential Acquisition Policies for Robot-Assisted Feeding
Priya Sundaresan, Jiajun Wu, Dorsa Sadigh
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1690] arXiv:2309.05271 (cross-list from eess.IV) [pdf, other]
Title: AutoFuse: Automatic Fusion Networks for Deformable Medical Image Registration
Mingyuan Meng, Michael Fulham, Dagan Feng, Lei Bi, Jinman Kim
Comments: Published at Pattern Recognition
Journal-ref: Pattern Recognition, vol. 161, p. 111338, 2025
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1691] arXiv:2309.05339 (cross-list from cs.RO) [pdf, other]
Title: PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics
Claus Smitt, Michael Halstead, Patrick Zimmer, Thomas Läbe, Esra Guclu, Cyrill Stachniss, Chris McCool
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1692] arXiv:2309.05346 (cross-list from cs.LG) [pdf, other]
Title: Learning Geometric Representations of Objects via Interaction
Alfredo Reichlin, Giovanni Luca Marchetti, Hang Yin, Anastasiia Varava, Danica Kragic
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1693] arXiv:2309.05405 (cross-list from eess.IV) [pdf, other]
Title: Two-Stage Hybrid Supervision Framework for Fast, Low-resource, and Accurate Organ and Pan-cancer Segmentation in Abdomen CT
Wentao Liu, Tong Tian, Weijin Xu, Lemeng Wang, Haoyuan Li, Huihua Yang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1694] arXiv:2309.05406 (cross-list from eess.IV) [pdf, html, other]
Title: Treatment-aware Diffusion Probabilistic Model for Longitudinal MRI Generation and Diffuse Glioma Growth Prediction
Qinghui Liu, Elies Fuster-Garcia, Ivar Thokle Hovden, Bradley J MacIntosh, Edvard Grødem, Petter Brandal, Carles Lopez-Mateu, Donatas Sederevicius, Karoline Skogen, Till Schellhorn, Atle Bjørnerud, Kyrre Eeg Emblem
Comments: preprints in IEEE-TMI, 14 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1695] arXiv:2309.05446 (cross-list from eess.IV) [pdf, other]
Title: A Localization-to-Segmentation Framework for Automatic Tumor Segmentation in Whole-Body PET/CT Images
Linghan Cai, Jianhao Huang, Zihang Zhu, Jinpeng Lu, Yongbing Zhang
Comments: 7 pages,3 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1696] arXiv:2309.05534 (cross-list from cs.CL) [pdf, other]
Title: PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud
Chengyu Wang, Zhongjie Duan, Bingyan Liu, Xinyi Zou, Cen Chen, Kui Jia, Jun Huang
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1697] arXiv:2309.05662 (cross-list from cs.RO) [pdf, other]
Title: ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion
Hongyu Li, Snehal Dikhale, Soshi Iba, Nawid Jamali
Comments: Accepted by RA-L
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1698] arXiv:2309.05665 (cross-list from cs.RO) [pdf, other]
Title: Robot Parkour Learning
Ziwen Zhuang, Zipeng Fu, Jianren Wang, Christopher Atkeson, Soeren Schwertfeger, Chelsea Finn, Hang Zhao
Comments: CoRL 2023 (Oral). Project website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1699] arXiv:2309.05674 (cross-list from eess.IV) [pdf, other]
Title: ConvFormer: Plug-and-Play CNN-Style Transformers for Improving Medical Image Segmentation
Xian Lin, Zengqiang Yan, Xianbo Deng, Chuansheng Zheng, Li Yu
Comments: Accepted by MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1700] arXiv:2309.05780 (cross-list from eess.IV) [pdf, other]
Title: LUNet: Deep Learning for the Segmentation of Arterioles and Venules in High Resolution Fundus Images
Jonathan Fhima, Jan Van Eijgen, Hana Kulenovic, Valérie Debeuf, Marie Vangilbergen, Marie-Isaline Billen, Heloïse Brackenier, Moti Freiman, Ingeborg Stalmans, Joachim A. Behar
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1701] arXiv:2309.05826 (cross-list from cs.LG) [pdf, other]
Title: KD-FixMatch: Knowledge Distillation Siamese Neural Networks
Chien-Chih Wang, Shaoyuan Xu, Jinmiao Fu, Yang Liu, Bryan Wang
Comments: 5 pages, 1 figure, 5 tables. To be published in ICIP 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1702] arXiv:2309.05857 (cross-list from eess.IV) [pdf, other]
Title: Radiomics Boosts Deep Learning Model for IPMN Classification
Lanhong Yao, Zheyuan Zhang, Ugur Demir, Elif Keles, Camila Vendrami, Emil Agarunov, Candice Bolan, Ivo Schoots, Marc Bruno, Rajesh Keswani, Frank Miller, Tamas Gonda, Cemal Yazici, Temel Tirkes, Michael Wallace, Concetto Spampinato, Ulas Bagci
Comments: 10 pages, MICCAI MLMI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1703] arXiv:2309.05879 (cross-list from cs.CR) [pdf, other]
Title: Generalized Attacks on Face Verification Systems
Ehsan Nazari, Paula Branco, Guy-Vincent Jourdan
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1704] arXiv:2309.05919 (cross-list from eess.IV) [pdf, html, other]
Title: Deep evidential fusion with uncertainty quantification and contextual discounting for multimodal medical image segmentation
Ling Huang, Su Ruan, Pierre Decazes, Thierry Denoeux
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1705] arXiv:2309.05929 (cross-list from eess.IV) [pdf, other]
Title: Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation
Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1706] arXiv:2309.05950 (cross-list from cs.CL) [pdf, html, other]
Title: Language Models as Black-Box Optimizers for Vision-Language Models
Shihong Liu, Zhiqiu Lin, Samuel Yu, Ryan Lee, Tiffany Ling, Deepak Pathak, Deva Ramanan
Comments: Published at CVPR 2024. Project site: this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM)
[1707] arXiv:2309.06046 (cross-list from cs.LG) [pdf, other]
Title: BatMan-CLR: Making Few-shots Meta-Learners Resilient Against Label Noise
Jeroen M. Galjaard, Robert Birke, Juan Perez, Lydia Y. Chen
Comments: 10 pages,3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1708] arXiv:2309.06054 (cross-list from cs.LG) [pdf, html, other]
Title: Breaking through the learning plateaus of in-context learning in Transformer
Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1709] arXiv:2309.06062 (cross-list from cs.LG) [pdf, other]
Title: Selection of contributing factors for predicting landslide susceptibility using machine learning and deep learning models
Cheng Chen, Lei Fan
Comments: Stochastic Environmental Research and Risk Assessment
Journal-ref: Stochastic Environmental Research and Risk Assessment, 13 September 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Geophysics (physics.geo-ph)
[1710] arXiv:2309.06067 (cross-list from eess.IV) [pdf, html, other]
Title: Efficient MRI Parallel Imaging Reconstruction by K-Space Rendering via Generalized Implicit Neural Representation
Hao Li, Yusheng Zhou, Jianan Liu, Xiling Liu, Tao Huang, Zhihan Lyu, Weidong Cai, Wei Chen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1711] arXiv:2309.06075 (cross-list from eess.IV) [pdf, html, other]
Title: A2V: A Semi-Supervised Domain Adaptation Framework for Brain Vessel Segmentation via Two-Phase Training Angiography-to-Venography Translation
Francesco Galati, Daniele Falcetta, Rosa Cortese, Barbara Casolla, Ferran Prados, Ninon Burgos, Maria A. Zuluaga
Comments: Accepted at the 34th British Machine Vision Conference (BMVC)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1712] arXiv:2309.06086 (cross-list from cs.LG) [pdf, other]
Title: Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning
Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer
Comments: Accepted at WACV2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1713] arXiv:2309.06135 (cross-list from cs.CL) [pdf, html, other]
Title: Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
Zhi-Yi Chin, Chieh-Ming Jiang, Ching-Chun Huang, Pin-Yu Chen, Wei-Chen Chiu
Comments: ICML 2024 main conference paper. The source code is available at this https URL
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1714] arXiv:2309.06143 (cross-list from eess.IV) [pdf, html, other]
Title: Improving Generalization Capability of Deep Learning-Based Nuclei Instance Segmentation by Non-deterministic Train Time and Deterministic Test Time Stain Normalization
Amirreza Mahbod, Georg Dorffner, Isabella Ellinger, Ramona Woitek, Sepideh Hatamikia
Comments: Accepted at Computational and Structural Biotechnology Journal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1715] arXiv:2309.06166 (cross-list from cs.LG) [pdf, other]
Title: Certified Robust Models with Slack Control and Large Lipschitz Constants
Max Losch, David Stutz, Bernt Schiele, Mario Fritz
Comments: To be published at GCPR 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1716] arXiv:2309.06169 (cross-list from cs.LG) [pdf, other]
Title: Elucidating the solution space of extended reverse-time SDE for diffusion models
Qinpeng Cui, Xinyi Zhang, Qiqi Bao, Qingmin Liao
Comments: This paper has been accepted by WACV 2025 (Oral). The official version lacked proper attribution to the co-authors, and this version has been updated accordingly
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1717] arXiv:2309.06286 (cross-list from cs.AI) [pdf, other]
Title: Transferability analysis of data-driven additive manufacturing knowledge: a case study between powder bed fusion and directed energy deposition
Mutahar Safdar, Jiarui Xie, Hyunwoong Ko, Yan Lu, Guy Lamouche, Yaoyao Fiona Zhao
Comments: 11 pages, 7 figures. This paper has been accepted to be published in the proceedings of IDETC-CIE 2023
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1718] arXiv:2309.06380 (cross-list from cs.LG) [pdf, html, other]
Title: InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Xingchao Liu, Xiwen Zhang, Jianzhu Ma, Jian Peng, Qiang Liu
Comments: ICLR 2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1719] arXiv:2309.06386 (cross-list from eess.IV) [pdf, other]
Title: Lung Diseases Image Segmentation using Faster R-CNNs
Mihir Jain
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1720] arXiv:2309.06421 (cross-list from eess.IV) [pdf, other]
Title: AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer
Tao Ma, Chao Zhang, Min Lu, Lin Luo
Comments: BMVC 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1721] arXiv:2309.06440 (cross-list from cs.RO) [pdf, other]
Title: LEAP Hand: Low-Cost, Efficient, and Anthropomorphic Hand for Robot Learning
Kenneth Shaw, Ananye Agarwal, Deepak Pathak
Comments: Website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1722] arXiv:2309.06612 (cross-list from cs.LG) [pdf, other]
Title: Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices
Mohamed Imed Eddine Ghebriout, Halima Bouzidi, Smail Niar, Hamza Ouarnoughi
Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML 2023)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1723] arXiv:2309.06652 (cross-list from cs.NE) [pdf, other]
Title: Event-Driven Imaging in Turbid Media: A Confluence of Optoelectronics and Neuromorphic Computation
Ning Zhang, Timothy Shea, Arto Nurmikko
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[1724] arXiv:2309.06660 (cross-list from cs.LG) [pdf, other]
Title: Generalizable Neural Fields as Partially Observed Neural Processes
Jeffrey Gu, Kuan-Chieh Wang, Serena Yeung
Comments: To appear ICCV 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1725] arXiv:2309.06825 (cross-list from eess.IV) [pdf, other]
Title: Topology-inspired Cross-domain Network for Developmental Cervical Stenosis Quantification
Zhenxi Zhang, Yanyang Wang, Yao Wu, Weifei Wu
Comments: We have discovered that some authors' contributions have been overlooked. We need to spend some time confirming whether the authors adhere to the paper's authorship guidelines and whether their authorship order complies with the standards. After discussion with all co-authors, we decide to withdraw this paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1726] arXiv:2309.06882 (cross-list from cs.LG) [pdf, other]
Title: ProMap: Datasets for Product Mapping in E-commerce
Kateřina Macková, Martin Pilát
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[1727] arXiv:2309.06928 (cross-list from cs.CL) [pdf, other]
Title: Dynamic Causal Disentanglement Model for Dialogue Emotion Detection
Yuting Su, Yichen Wei, Weizhi Nie, Sicheng Zhao, Anan Liu
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1728] arXiv:2309.06948 (cross-list from eess.IV) [pdf, other]
Title: Limited-Angle Tomography Reconstruction via Deep End-To-End Learning on Synthetic Data
Thomas Germer, Jan Robine, Sebastian Konietzny, Stefan Harmeling, Tobias Uelwer
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1729] arXiv:2309.07085 (cross-list from cs.LG) [pdf, html, other]
Title: Mitigating Group Bias in Federated Learning for Heterogeneous Devices
Khotso Selialia, Yasra Chandio, Fatima M. Anwar
Journal-ref: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1730] arXiv:2309.07094 (cross-list from cs.RO) [pdf, other]
Title: RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline
Mirko Usuelli, Matteo Frosi, Paolo Cudrano, Simone Mentasti, Matteo Matteucci
Comments: 7 pages, 2 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1731] arXiv:2309.07096 (cross-list from q-bio.NC) [pdf, other]
Title: Computational limits to the legibility of the imaged human brain
James K Ruffle, Robert J Gray, Samia Mohinta, Guilherme Pombo, Chaitanya Kaul, Harpreet Hyare, Geraint Rees, Parashkev Nachev
Comments: 38 pages, 6 figures, 1 table, 2 supplementary figures, 1 supplementary table
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1732] arXiv:2309.07115 (cross-list from cs.SD) [pdf, html, other]
Title: Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification
Anith Selvakumar, Homa Fashandi
Comments: Accepted to INTERSPEECH 2024
Journal-ref: Proc. Interspeech 2024, 4728-4732
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1733] arXiv:2309.07117 (cross-list from cs.LG) [pdf, html, other]
Title: PILOT: A Pre-Trained Model-Based Continual Learning Toolbox
Hai-Long Sun, Da-Wei Zhou, De-Chuan Zhan, Han-Jia Ye
Comments: Accepted to SCIENCE CHINA Information Sciences. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1734] arXiv:2309.07120 (cross-list from cs.CL) [pdf, other]
Title: Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics
Haoqin Tu, Bingchen Zhao, Chen Wei, Cihang Xie
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[1735] arXiv:2309.07173 (cross-list from cs.LG) [pdf, other]
Title: Using Unsupervised and Supervised Learning and Digital Twin for Deep Convective Ice Storm Classification
Jason Swope, Steve Chien, Emily Dunkel, Xavier Bosch-Lluis, Qing Yue, William Deal
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1736] arXiv:2309.07255 (cross-list from eess.IV) [pdf, other]
Title: Automated segmentation of rheumatoid arthritis immunohistochemistry stained synovial tissue
Amaya Gallagher-Syed, Abbas Khan, Felice Rivellese, Costantino Pitzalis, Myles J. Lewis, Gregory Slabaugh, Michael R. Barnes
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[1737] arXiv:2309.07332 (cross-list from cs.LG) [pdf, other]
Title: Reliability-based cleaning of noisy training labels with inductive conformal prediction in multi-modal biomedical data mining
Xianghao Zhan, Qinmei Xu, Yuanning Zheng, Guangming Lu, Olivier Gevaert
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Applications (stat.AP); Machine Learning (stat.ML)
[1738] arXiv:2309.07387 (cross-list from cs.CL) [pdf, other]
Title: VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue
Yunshui Li, Binyuan Hui, Zhaochao Yin, Wanwei He, Run Luo, Yuxing Long, Min Yang, Fei Huang, Yongbin Li
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1739] arXiv:2309.07461 (cross-list from cs.CR) [pdf, other]
Title: Detecting Unknown Attacks in IoT Environments: An Open Set Classifier for Enhanced Network Intrusion Detection
Yasir Ali Farrukh, Syed Wali, Irfan Khan, Nathaniel D. Bastian
Comments: 6 Pages, 5 figures
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1740] arXiv:2309.07510 (cross-list from cs.RO) [pdf, other]
Title: Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions
Kai Cheng, Ruihai Wu, Yan Shen, Chuanruo Ning, Guanqi Zhan, Hao Dong
Comments: In 37th Conference on Neural Information Processing Systems (NeurIPS 2023). Website at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1741] arXiv:2309.07609 (cross-list from cs.RO) [pdf, other]
Title: Learning Quasi-Static 3D Models of Markerless Deformable Linear Objects for Bimanual Robotic Manipulation
Piotr Kicki, Michał Bidziński, Krzysztof Walas
Comments: Under review for IEEE Robotics and Automation Letters
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1742] arXiv:2309.07778 (cross-list from eess.IV) [pdf, html, other]
Title: Virchow: A Million-Slide Digital Pathology Foundation Model
Eugene Vorontsov, Alican Bozkurt, Adam Casson, George Shaikovski, Michal Zelechowski, Siqi Liu, Kristen Severson, Eric Zimmermann, James Hall, Neil Tenenholtz, Nicolo Fusi, Philippe Mathieu, Alexander van Eck, Donghun Lee, Julian Viret, Eric Robert, Yi Kan Wang, Jeremy D. Kunz, Matthew C. H. Lee, Jan Bernhard, Ran A. Godrich, Gerard Oakley, Ewan Millar, Matthew Hanna, Juan Retamero, William A. Moye, Razik Yousfi, Christopher Kanan, David Klimstra, Brandon Rothrock, Thomas J. Fuchs
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Tissues and Organs (q-bio.TO)
[1743] arXiv:2309.07878 (cross-list from cs.SI) [pdf, other]
Title: Using network metrics to explore the community structure that underlies movement patterns
Anh Pham Thi Minh, Abhishek Kumar Singh, Soumya Snigdha Kundu
Comments: 6 pages excluding References
Subjects: Social and Information Networks (cs.SI); Computer Vision and Pattern Recognition (cs.CV)
[1744] arXiv:2309.07907 (cross-list from cs.RO) [pdf, other]
Title: Physically Plausible Full-Body Hand-Object Interaction Synthesis
Jona Braun, Sammy Christen, Muhammed Kocabas, Emre Aksan, Otmar Hilliges
Comments: Project page at this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1745] arXiv:2309.07909 (cross-list from cs.LG) [pdf, html, other]
Title: DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation
Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan.Z Li, Yang You
Comments: accepted by ICML24
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Computer Vision and Pattern Recognition (cs.CV)
[1746] arXiv:2309.07915 (cross-list from cs.CL) [pdf, html, other]
Title: MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
Haozhe Zhao, Zefan Cai, Shuzheng Si, Xiaojian Ma, Kaikai An, Liang Chen, Zixuan Liu, Sheng Wang, Wenjuan Han, Baobao Chang
Comments: Accepted by ICLR2024
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1747] arXiv:2309.07926 (cross-list from eess.IV) [pdf, other]
Title: COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability
Jongmin Park, Jooyoung Lee, Munchurl Kim
Comments: Accepted in ICCV 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1748] arXiv:2309.07970 (cross-list from cs.RO) [pdf, other]
Title: Language Embedded Radiance Fields for Zero-Shot Task-Oriented Grasping
Adam Rashid, Satvik Sharma, Chung Min Kim, Justin Kerr, Lawrence Chen, Angjoo Kanazawa, Ken Goldberg
Comments: See the project website at: this http URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1749] arXiv:2309.07973 (cross-list from eess.IV) [pdf, other]
Title: M3Dsynth: A dataset of medical 3D images with AI-generated local manipulations
Giada Zingarini, Davide Cozzolino, Riccardo Corvi, Giovanni Poggi, Luisa Verdoliva
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1750] arXiv:2309.08086 (cross-list from cs.RO) [pdf, other]
Title: Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM
Chenghao Shi, Xieyuanli Chen, Junhao Xiao, Bin Dai, Huimin Lu
Comments: 20 pages 10 figures 7 tables
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1751] arXiv:2309.08106 (cross-list from cs.RO) [pdf, other]
Title: Data-Driven Goal Recognition in Transhumeral Prostheses Using Process Mining Techniques
Zihang Su, Tianshi Yu, Nir Lipovetzky, Alireza Mohammadi, Denny Oetomo, Artem Polyvyanyy, Sebastian Sardina, Ying Tan, Nick van Beest
Comments: The 5th International Conference on Process Mining (ICPM 2023)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1752] arXiv:2309.08129 (cross-list from eess.IV) [pdf, other]
Title: Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixer
Atsuya Nakata, Ryuto Miyazaki, Takao Yamanaka
Comments: This is a pre-print of an article in ACPR2023. The proceedings will be published in Lecture Notes in Computer Science (LNCS). The code is available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1753] arXiv:2309.08146 (cross-list from cs.SD) [pdf, other]
Title: Syn-Att: Synthetic Speech Attribution via Semi-Supervised Unknown Multi-Class Ensemble of CNNs
Md Awsafur Rahman, Bishmoy Paul, Najibul Haque Sarker, Zaber Ibn Abdul Hakim, Shaikh Anowarul Fattah, Mohammad Saquib
Comments: Winning Solution of IEEE SP Cup at ICASSP 2022
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1754] arXiv:2309.08160 (cross-list from eess.IV) [pdf, other]
Title: Cross-Modal Synthesis of Structural MRI and Functional Connectivity Networks via Conditional ViT-GANs
Yuda Bi, Anees Abrol, Jing Sui, Vince Calhoun
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1755] arXiv:2309.08197 (cross-list from eess.IV) [pdf, other]
Title: Hyperspectral Image Denoising via Self-Modulating Convolutional Neural Networks
Orhan Torun, Seniha Esen Yuksel, Erkut Erdem, Nevrez Imamoglu, Aykut Erdem
Journal-ref: Signal Processing, Volume 214, January 2024, 109248
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1756] arXiv:2309.08227 (cross-list from cs.LG) [pdf, other]
Title: VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference
Soumya Banerjee, Vinay K. Verma, Avideep Mukherjee, Deepak Gupta, Vinay P. Namboodiri, Piyush Rai
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1757] arXiv:2309.08234 (cross-list from eess.IV) [pdf, other]
Title: Efficient Polyp Segmentation Via Integrity Learning
Ziqiang Chen, Kang Wang, Yun Liu
Comments: submited to ICASSP 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1758] arXiv:2309.08295 (cross-list from eess.AS) [pdf, other]
Title: A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism
Ilya Gurvich, Ido Leichter, Dharmendar Reddy Palle, Yossi Asher, Alon Vinnikov, Igor Abramovski, Vishak Gopal, Ross Cutler, Eyal Krupka
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD)
[1759] arXiv:2309.08381 (cross-list from eess.IV) [pdf, other]
Title: On undesired emergent behaviors in compound prostate cancer detection systems
Erlend Sortland Rolfsnes, Philip Thangngat, Trygve Eftestøl, Tobias Nordström, Fredrik Jäderling, Martin Eklund, Alvaro Fernandez-Quilez
Comments: Accepted in MICCAI 2025, CapTiON
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1760] arXiv:2309.08387 (cross-list from cs.GR) [pdf, other]
Title: Efficient Graphics Representation with Differentiable Indirection
Sayantan Datta, Carl Marshall, Derek Nowrouzezahrai, Zhao Dong, Zhengqin Li
Comments: Project website: this https URL
Journal-ref: SIGGRAPH Asia 2023 Conference Papers (SA Conference Papers '23), December 12--15, 2023, Sydney, NSW, Australia
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1761] arXiv:2309.08402 (cross-list from eess.IV) [pdf, html, other]
Title: 3D SA-UNet: 3D Spatial Attention UNet with 3D Atrous Spatial Pyramid Pooling for White Matter Hyperintensities Segmentation
Changlu Guo
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1762] arXiv:2309.08421 (cross-list from eess.IV) [pdf, html, other]
Title: MIML: Multiplex Image Machine Learning for High Precision Cell Classification via Mechanical Traits within Microfluidic Systems
Khayrul Islam, Ratul Paul, Shen Wang, Yuwen Zhao, Partho Adhikary, Qiying Li, Xiaochen Qin, Yaling Liu
Comments: major change
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Quantitative Methods (q-bio.QM)
[1763] arXiv:2309.08434 (cross-list from eess.IV) [pdf, other]
Title: Segment Anything Model for Brain Tumor Segmentation
Peng Zhang, Yaping Wang
Comments: 9 pages, 60 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1764] arXiv:2309.08504 (cross-list from cs.RO) [pdf, html, other]
Title: OccupancyDETR: Using DETR for Mixed Dense-sparse 3D Occupancy Prediction
Yupeng Jia, Jie He, Runze Chen, Fang Zhao, Haiyong Luo
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1765] arXiv:2309.08511 (cross-list from eess.IV) [pdf, html, other]
Title: Generalised Diffusion Probabilistic Scale-Spaces
Pascal Peter
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1766] arXiv:2309.08607 (cross-list from cs.CY) [pdf, html, other]
Title: Monitoring of Urban Changes with multi-modal Sentinel 1 and 2 Data in Mariupol, Ukraine, in 2022/23
Georg Zitzlsberger, Michal Podhoranyi
Comments: Accepted for publication in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects: Computers and Society (cs.CY); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1767] arXiv:2309.08612 (cross-list from cs.AI) [pdf, other]
Title: Explaining Vision and Language through Graphs of Events in Space and Time
Mihai Masala, Nicolae Cudlenco, Traian Rebedea, Marius Leordeanu
Comments: Accepted at IEEE International Conference on Computer Vision (ICCV) 2023 Workshops: 5th Workshop On Closing The Loop Between Vision And Language
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1768] arXiv:2309.08794 (cross-list from cs.AI) [pdf, other]
Title: Privacy-preserving Early Detection of Epileptic Seizures in Videos
Deval Mehta, Shobi Sivathamboo, Hugh Simpson, Patrick Kwan, Terence O`Brien, Zongyuan Ge
Comments: Accepted to MICCAI 2023
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1769] arXiv:2309.08798 (cross-list from cs.AI) [pdf, html, other]
Title: D3: Data Diversity Design for Systematic Generalization in Visual Question Answering
Amir Rahimi, Vanessa D'Amario, Moyuru Yamada, Kentaro Takemoto, Tomotake Sasaki, Xavier Boix
Comments: TMLR (this https URL)
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1770] arXiv:2309.08931 (cross-list from cs.AI) [pdf, html, other]
Title: A Novel Neural-symbolic System under Statistical Relational Learning
Dongran Yu, Xueyan Liu, Shirui Pan, Anchen Li, Bo Yang
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1771] arXiv:2309.08989 (cross-list from cs.RO) [pdf, other]
Title: RMP: A Random Mask Pretrain Framework for Motion Prediction
Yi Yang, Qingwen Zhang, Thomas Gilles, Nazre Batool, John Folkesson
Comments: IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1772] arXiv:2309.08997 (cross-list from cs.RO) [pdf, other]
Title: OmniLRS: A Photorealistic Simulator for Lunar Robotics
Antoine Richard, Junnosuke Kamohara, Kentaro Uno, Shreya Santra, Dave van der Meer, Miguel Olivares-Mendez, Kazuya Yoshida
Comments: 7 pages, 4 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1773] arXiv:2309.09038 (cross-list from cs.AI) [pdf, other]
Title: A store-and-forward cloud-based telemonitoring system for automatic assessing dysarthria evolution in neurological diseases from video-recording analysis
Lucia Migliorelli, Daniele Berardini, Kevin Cela, Michela Coccia, Laura Villani, Emanuele Frontoni, Sara Moccia
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1774] arXiv:2309.09080 (cross-list from cs.RO) [pdf, other]
Title: Multi-camera Bird's Eye View Perception for Autonomous Driving
David Unger, Nikhil Gosala, Varun Ravi Kumar, Shubhankar Borse, Abhinav Valada, Senthil Yogamani
Comments: Taylor & Francis (CRC Press) book chapter. Book title: Computer Vision: Challenges, Trends, and Opportunities
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1775] arXiv:2309.09183 (cross-list from cs.RO) [pdf, other]
Title: CLIPUNetr: Assisting Human-robot Interface for Uncalibrated Visual Servoing Control with CLIP-driven Referring Expression Segmentation
Chen Jiang, Yuchen Yang, Martin Jagersand
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1776] arXiv:2309.09206 (cross-list from cs.RO) [pdf, other]
Title: Differentiable SLAM Helps Deep Learning-based LiDAR Perception Tasks
Prashant Kumar, Dheeraj Vattikonda, Vedang Bhupesh Shenvi Nadkarni, Erqun Dong, Sabyasachi Sahoo
Comments: 15 pages,6 Tables, 3 figures. Accepted at BMVC 2023
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1777] arXiv:2309.09215 (cross-list from physics.optics) [pdf, other]
Title: All-optical image denoising using a diffractive visual processor
Cagatay Isıl, Tianyi Gan, F. Onuralp Ardic, Koray Mentesoglu, Jagrit Digani, Huseyin Karaca, Hanlong Chen, Jingxi Li, Deniz Mengu, Mona Jarrahi, Kaan Akşit, Aydogan Ozcan
Comments: 21 Pages, 7 Figures
Journal-ref: Light: Science & Applications (2024)
Subjects: Optics (physics.optics); Computer Vision and Pattern Recognition (cs.CV); Applied Physics (physics.app-ph)
[1778] arXiv:2309.09314 (cross-list from cs.GR) [pdf, other]
Title: MOVIN: Real-time Motion Capture using a Single LiDAR
Deok-Kyeong Jang, Dongseok Yang, Deok-Yun Jang, Byeoli Choi, Taeil Jin, Sung-Hee Lee
Journal-ref: Computer Graphics Forum 2023, presented at Pacific Graphics 2023
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1779] arXiv:2309.09392 (cross-list from eess.IV) [pdf, other]
Title: Deep conditional generative models for longitudinal single-slice abdominal computed tomography harmonization
Xin Yu, Qi Yang, Yucheng Tang, Riqiang Gao, Shunxing Bao, Leon Y. Cai, Ho Hin Lee, Yuankai Huo, Ann Zenobia Moore, Luigi Ferrucci, Bennett A. Landman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1780] arXiv:2309.09405 (cross-list from cs.AI) [pdf, other]
Title: Does Video Summarization Require Videos? Quantifying the Effectiveness of Language in Video Summarization
Yoonsoo Nam, Adam Lehavi, Daniel Yang, Digbalay Bose, Swabha Swayamdipta, Shrikanth Narayanan
Comments: \c{opyright} 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Subjects: Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1781] arXiv:2309.09410 (cross-list from eess.IV) [pdf, other]
Title: BRONCO: Automated modelling of the bronchovascular bundle using the Computed Tomography Images
Wojciech Prażuch, Marek Socha, Anna Mrukwa, Aleksandra Suwalska, Agata Durawa, Malgorzata Jelitto-Górska, Katarzyna Dziadziuszko, Edyta Szurowska, Pawel Bożek, Michal Marczyk, Witold Rzyman, Joanna Polanska
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1782] arXiv:2309.09426 (cross-list from eess.IV) [pdf, other]
Title: Joint Demosaicing and Denoising with Double Deep Image Priors
Taihui Li, Anish Lahiri, Yutong Dai, Owen Mayer
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1783] arXiv:2309.09427 (cross-list from cs.RO) [pdf, other]
Title: TransTouch: Learning Transparent Objects Depth Sensing Through Sparse Touches
Liuyu Bian, Pengyang Shi, Weihang Chen, Jing Xu, Li Yi, Rui Chen
Comments: Accepted to the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1784] arXiv:2309.09483 (cross-list from eess.IV) [pdf, other]
Title: An Accurate and Efficient Neural Network for OCTA Vessel Segmentation and a New Dataset
Haojian Ning, Chengliang Wang, Xinrun Chen, Shiying Li
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1785] arXiv:2309.09490 (cross-list from eess.IV) [pdf, other]
Title: Self-supervised TransUNet for Ultrasound regional segmentation of the distal radius in children
Yuyue Zhou, Jessica Knight, Banafshe Felfeliyan, Christopher Keen, Abhilash Rakkunedeth Hareendranathan, Jacob L. Jaremko
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1786] arXiv:2309.09720 (cross-list from cs.LG) [pdf, other]
Title: Traffic Scene Similarity: a Graph-based Contrastive Learning Approach
Maximilian Zipfl, Moritz Jarosch, J. Marius Zöllner
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1787] arXiv:2309.09756 (cross-list from cs.RO) [pdf, other]
Title: Privileged to Predicted: Towards Sensorimotor Reinforcement Learning for Urban Driving
Ege Onat Özsüer, Barış Akgün, Fatma Güney
Comments: 7 pages
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1788] arXiv:2309.09774 (cross-list from cs.LG) [pdf, other]
Title: Towards Self-Adaptive Pseudo-Label Filtering for Semi-Supervised Learning
Lei Zhu, Zhanghan Ke, Rynson Lau
Comments: This paper was first submitted to NeurIPS 2021
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1789] arXiv:2309.09818 (cross-list from cs.RO) [pdf, other]
Title: Grasp-Anything: Large-scale Grasp Dataset from Foundation Models
An Dinh Vuong, Minh Nhat Vu, Hieu Le, Baoru Huang, Binh Huynh, Thieu Vo, Andreas Kugi, Anh Nguyen
Comments: Project page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1790] arXiv:2309.09844 (cross-list from cs.RO) [pdf, other]
Title: CC-SGG: Corner Case Scenario Generation using Learned Scene Graphs
George Drayson, Efimia Panagiotaki, Daniel Omeiza, Lars Kunze
Comments: The first two authors contributed equally to this work
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1791] arXiv:2309.09865 (cross-list from cs.RO) [pdf, html, other]
Title: Contrastive Learning for Enhancing Robust Scene Transfer in Vision-based Agile Flight
Jiaxu Xing, Leonard Bauersfeld, Yunlong Song, Chunwei Xing, Davide Scaramuzza
Journal-ref: IEEE International Conference on Robotics and Automation (ICRA), 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1792] arXiv:2309.09875 (cross-list from cs.RO) [pdf, html, other]
Title: RaLF: Flow-based Global and Metric Radar Localization in LiDAR Maps
Abhijeet Nayak, Daniele Cattaneo, Abhinav Valada
Journal-ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1793] arXiv:2309.09907 (cross-list from quant-ph) [pdf, html, other]
Title: Quantum Vision Clustering
Xuan Bac Nguyen, Hugh Churchill, Khoa Luu, Samee U. Khan
Comments: arXiv admin note: text overlap with arXiv:2202.08837 by other authors
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[1794] arXiv:2309.09944 (cross-list from cs.LG) [pdf, other]
Title: DiffusionWorldViewer: Exposing and Broadening the Worldview Reflected by Generative Text-to-Image Models
Zoe De Simone, Angie Boggust, Arvind Satyanarayan, Ashia Wilson
Comments: 20 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1795] arXiv:2309.09954 (cross-list from eess.IV) [pdf, html, other]
Title: vSHARP: variable Splitting Half-quadratic Admm algorithm for Reconstruction of inverse-Problems
George Yiasemis, Nikita Moriakov, Jan-Jakob Sonke, Jonas Teuwen
Comments: 22 pages, 9 figures, 5 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[1796] arXiv:2309.09979 (cross-list from cs.RO) [pdf, other]
Title: General In-Hand Object Rotation with Vision and Touch
Haozhi Qi, Brent Yi, Sudharshan Suresh, Mike Lambeta, Yi Ma, Roberto Calandra, Jitendra Malik
Comments: CoRL 2023; Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1797] arXiv:2309.09983 (cross-list from q-bio.NC) [pdf, other]
Title: Exploration and Comparison of Deep Learning Architectures to Predict Brain Response to Realistic Pictures
Riccardo Chimisso, Sathya Buršić, Paolo Marocco, Giuseppe Vizzari, Dimitri Ognibene
Comments: Submitted to The Algonauts Project 2023 - Exploration and Comparison of Deep Learning Architectures to Predict Brain Response to Realistic Pictures - this http URL
Subjects: Neurons and Cognition (q-bio.NC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1798] arXiv:2309.09987 (cross-list from cs.LG) [pdf, other]
Title: TCGF: A unified tensorized consensus graph framework for multi-view representation learning
Xiangzhu Meng, Wei Wei, Qiang Liu, Shu Wu, Liang Wang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1799] arXiv:2309.10012 (cross-list from cs.LG) [pdf, other]
Title: Looking through the past: better knowledge retention for generative replay in continual learning
Valeriya Khan, Sebastian Cygert, Kamil Deja, Tomasz Trzciński, Bartłomiej Twardowski
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1800] arXiv:2309.10131 (cross-list from cs.LG) [pdf, other]
Title: Deep Prompt Tuning for Graph Transformers
Reza Shirkavand, Heng Huang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1801] arXiv:2309.10153 (cross-list from eess.IV) [pdf, html, other]
Title: Preserving Tumor Volumes for Unsupervised Medical Image Registration
Qihua Dong, Hao Du, Ying Song, Yan Xu, Jing Liao
Comments: ICCV 2023 Poster
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1802] arXiv:2309.10172 (cross-list from physics.flu-dyn) [pdf, html, other]
Title: Enhancing wind field resolution in complex terrain through a knowledge-driven machine learning approach
Jacob Wulff Wold, Florian Stadtmann, Adil Rasheed, Mandar Tabib, Omer San, Jan-Tore Horn
Subjects: Fluid Dynamics (physics.flu-dyn); Computer Vision and Pattern Recognition (cs.CV)
[1803] arXiv:2309.10210 (cross-list from eess.IV) [pdf, other]
Title: ProtoKD: Learning from Extremely Scarce Data for Parasite Ova Recognition
Shubham Trehan, Udhav Ramachandran, Ruth Scimeca, Sathyanarayanan N. Aakur
Comments: To Appear at IEEE ICMLA 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1804] arXiv:2309.10227 (cross-list from eess.IV) [pdf, other]
Title: Learning Dynamic MRI Reconstruction with Convolutional Network Assisted Reconstruction Swin Transformer
Di Xu, Hengjie Liu, Dan Ruan, Ke Sheng
Comments: MICCAI 2023 Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1805] arXiv:2309.10266 (cross-list from physics.flu-dyn) [pdf, other]
Title: Correlation between morphological evolution of splashing drop and exerted impact force revealed by interpretation of explainable artificial intelligence
Jingzu Yee, Daichi Igarashi, Pradipto, Akinori Yamanaka, Yoshiyuki Tagawa
Comments: 23 pages, 13 figures
Subjects: Fluid Dynamics (physics.flu-dyn); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1806] arXiv:2309.10302 (cross-list from cs.LG) [pdf, html, other]
Title: Decoupled Training: Return of Frustratingly Easy Multi-Domain Learning
Ximei Wang, Junwei Pan, Xingzhuo Guo, Dapeng Liu, Jie Jiang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[1807] arXiv:2309.10314 (cross-list from cs.RO) [pdf, html, other]
Title: Dive Deeper into Rectifying Homography for Stereo Camera Online Self-Calibration
Hongbo Zhao, Yikang Zhang, Qijun Chen, Rui Fan
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1808] arXiv:2309.10329 (cross-list from cs.GR) [pdf, other]
Title: Learning based 2D Irregular Shape Packing
Zeshi Yang, Zherong Pan, Manyi Li, Kui Wu, Xifeng Gao
Subjects: Graphics (cs.GR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1809] arXiv:2309.10348 (cross-list from cs.LG) [pdf, other]
Title: Language Guided Adversarial Purification
Himanshu Singh, A V Subramanyam
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1810] arXiv:2309.10625 (cross-list from cs.AI) [pdf, html, other]
Title: NoisyNN: Exploring the Impact of Information Entropy Change in Learning Systems
Xiaowei Yu, Zhe Huang, Minheng Chen, Lu Zhang, Tianming Liu, Dajiang Zhu
Comments: Task Entropy, ViT, CNN
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1811] arXiv:2309.10646 (cross-list from eess.IV) [pdf, other]
Title: Self-Supervised Super-Resolution Approach for Isotropic Reconstruction of 3D Electron Microscopy Images from Anisotropic Acquisition
Mohammad Khateri, Morteza Ghahremani, Alejandra Sierra, Jussi Tohka
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1812] arXiv:2309.10784 (cross-list from eess.IV) [pdf, other]
Title: Context-Aware Neural Video Compression on Solar Dynamics Observatory
Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Nasser M. Nasrabadi, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva
Comments: Accepted to IEEE 22$^{nd}$ International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation
Subjects: Image and Video Processing (eess.IV); Solar and Stellar Astrophysics (astro-ph.SR); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG)
[1813] arXiv:2309.10787 (cross-list from eess.AS) [pdf, html, other]
Title: AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee
Comments: Accepted to ICASSP 2024; Evaluation Code: this https URL Submission Platform: this https URL
Subjects: Audio and Speech Processing (eess.AS); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Sound (cs.SD)
[1814] arXiv:2309.10790 (cross-list from cs.LG) [pdf, other]
Title: Guide Your Agent with Adaptive Multimodal Rewards
Changyeon Kim, Younggyo Seo, Hao Liu, Lisa Lee, Jinwoo Shin, Honglak Lee, Kimin Lee
Comments: Accepted to NeurIPS 2023. Project webpage: this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1815] arXiv:2309.10791 (cross-list from eess.IV) [pdf, other]
Title: Multi-spectral Entropy Constrained Neural Compression of Solar Imagery
Ali Zafari, Atefeh Khoshkhahtinat, Piyush M. Mehta, Nasser M. Nasrabadi, Barbara J. Thompson, Michael S. F. Kirk, Daniel da Silva
Comments: Accepted to IEEE 22$^{nd}$ International Conference on Machine Learning and Applications 2023 (ICMLA)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT)
[1816] arXiv:2309.10799 (cross-list from eess.IV) [pdf, other]
Title: Multi-Context Dual Hyper-Prior Neural Image Compression
Atefeh Khoshkhahtinat, Ali Zafari, Piyush M. Mehta, Mohammad Akyash, Hossein Kashiani, Nasser M. Nasrabadi
Comments: Accepted to IEEE 22$^nd$ International Conference on Machine Learning and Applications 2023 (ICMLA) - Selected for Oral Presentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1817] arXiv:2309.10817 (cross-list from eess.IV) [pdf, other]
Title: Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context
Rucha Deshpande, Muzaffer Özbey, Hua Li, Mark A. Anastasio, Frank J. Brooks
Comments: This paper is under consideration at IEEE TMI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[1818] arXiv:2309.10829 (cross-list from eess.IV) [pdf, other]
Title: Comparative study of Deep Learning Models for Binary Classification on Combined Pulmonary Chest X-ray Dataset
Shabbir Ahmed Shuvo, Md Aminul Islam, Md. Mozammel Hoque, Rejwan Bin Sulaiman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1819] arXiv:2309.10834 (cross-list from cs.LG) [pdf, html, other]
Title: Communication-Efficient Federated Learning via Regularized Sparse Random Networks
Mohamad Mestoukirdi, Omid Esrafilian, David Gesbert, Qianrui Li, Nicolas Gresset
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Data Structures and Algorithms (cs.DS)
[1820] arXiv:2309.10835 (cross-list from eess.IV) [pdf, other]
Title: Analysing race and sex bias in brain age prediction
Carolina Piçarra, Ben Glocker
Comments: MICCAI Workshop on Fairness of AI in Medical Imaging (FAIMI 2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1821] arXiv:2309.10878 (cross-list from cs.LG) [pdf, other]
Title: DeepliteRT: Computer Vision at the Edge
Saad Ashfaq, Alexander Hoffman, Saptarshi Mitra, Sudhakar Sah, MohammadHossein AskariHemmat, Ehsan Saboori
Comments: Accepted at British Machine Vision Conference (BMVC) 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1822] arXiv:2309.10885 (cross-list from cs.RO) [pdf, other]
Title: GelSight Svelte: A Human Finger-shaped Single-camera Tactile Robot Finger with Large Sensing Coverage and Proprioceptive Sensing
Jialiang Zhao, Edward H. Adelson
Comments: Submitted and accepted to 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1823] arXiv:2309.10900 (cross-list from cs.RO) [pdf, other]
Title: Incremental Multimodal Surface Mapping via Self-Organizing Gaussian Mixture Models
Kshitij Goel, Wennie Tabib
Comments: 8 pages, 7 figures, published in IEEE Robotics and Automation Letters
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1824] arXiv:2309.10948 (cross-list from cs.LG) [pdf, other]
Title: A Novel Deep Neural Network for Trajectory Prediction in Automated Vehicles Using Velocity Vector Field
MReza Alipour Sormoli, Amir Samadi, Sajjad Mozaffari, Konstantinos Koufos, Mehrdad Dianati, Roger Woodman
Comments: This paper has been accepted and nominated as the best student paper at the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1825] arXiv:2309.10987 (cross-list from cs.NE) [pdf, html, other]
Title: SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World
Xingting Yao, Qinghao Hu, Fei Zhou, Tielong Liu, Zitao Mo, Zeyu Zhu, Zhengyang Zhuge, Jian Cheng
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1826] arXiv:2309.11006 (cross-list from cs.RO) [pdf, other]
Title: STARNet: Sensor Trustworthiness and Anomaly Recognition via Approximated Likelihood Regret for Robust Edge Autonomy
Nastaran Darabi, Sina Tayebati, Sureshkumar S., Sathya Ravi, Theja Tulabandhula, Amit R. Trivedi
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1827] arXiv:2309.11018 (cross-list from cs.LG) [pdf, other]
Title: Conformalized Multimodal Uncertainty Regression and Reasoning
Domenico Parente, Nastaran Darabi, Alex C. Stutts, Theja Tulabandhula, Amit Ranjan Trivedi
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1828] arXiv:2309.11038 (cross-list from cs.RO) [pdf, html, other]
Title: CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration
A. Abdullah, T. Barua, R. Tibbetts, Z. Chen, M. J. Islam, I. Rekleitis
Comments: Accepted in ICRA 2024. 10 pages, 9 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1829] arXiv:2309.11139 (cross-list from eess.IV) [pdf, other]
Title: More complex encoder is not all you need
Weibin Yang, Longwei Xu, Pengwei Wang, Dehua Geng, Yusong Li, Mingyuan Xu, Zhiqi Dong
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1830] arXiv:2309.11148 (cross-list from cs.RO) [pdf, html, other]
Title: Online Calibration of a Single-Track Ground Vehicle Dynamics Model by Tight Fusion with Visual-Inertial Odometry
Haolong Li, Joerg Stueckler
Comments: Accepted for publication in IEEE International Conference on Robotics and Automation (ICRA), 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1831] arXiv:2309.11252 (cross-list from cs.CL) [pdf, other]
Title: The Scenario Refiner: Grounding subjects in images at the morphological level
Claudia Tagliaferri, Sofia Axioti, Albert Gatt, Denis Paperno
Comments: presented at the LIMO workshop (Linguistic Insights from and for Multimodal Language Processing @KONVENS 2023)
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1832] arXiv:2309.11258 (cross-list from cs.GR) [pdf, other]
Title: TwinTex: Geometry-aware Texture Generation for Abstracted 3D Architectural Models
Weidan Xiong, Hongqian Zhang, Botao Peng, Ziyu Hu, Yongli Wu, Jianwei Guo, Hui Huang
Comments: Accepted to SIGGRAPH ASIA 2023
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1833] arXiv:2309.11382 (cross-list from cs.RO) [pdf, other]
Title: Discuss Before Moving: Visual Language Navigation via Multi-expert Discussions
Yuxing Long, Xiaoqi Li, Wenzhe Cai, Hao Dong
Comments: Submitted to ICRA 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1834] arXiv:2309.11413 (cross-list from cs.RO) [pdf, other]
Title: Enhancing motion trajectory segmentation of rigid bodies using a novel screw-based trajectory-shape representation
Arno Verduyn, Maxim Vochten, Joris De Schutter
Comments: This work has been submitted to the IEEE International Conference on Robotics and Automation (ICRA) for possible publication
Journal-ref: 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1835] arXiv:2309.11419 (cross-list from cs.CL) [pdf, html, other]
Title: KOSMOS-2.5: A Multimodal Literate Model
Tengchao Lv, Yupan Huang, Jingye Chen, Yuzhong Zhao, Yilin Jia, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1836] arXiv:2309.11421 (cross-list from eess.IV) [pdf, other]
Title: CalibFPA: A Focal Plane Array Imaging System based on Online Deep-Learning Calibration
Alper Güngör, M. Umut Bahceci, Yasin Ergen, Ahmet Sözak, O. Oner Ekiz, Tolga Yelboga, Tolga Çukur
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1837] arXiv:2309.11446 (cross-list from cs.LG) [pdf, other]
Title: Weight Averaging Improves Knowledge Distillation under Domain Shift
Valeriy Berezovskiy, Nikita Morozov
Comments: ICCV 2023 Workshop on Out-of-Distribution Generalization in Computer Vision (OOD-CV)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1838] arXiv:2309.11500 (cross-list from cs.SD) [pdf, html, other]
Title: Auto-ACD: A Large-scale Dataset for Audio-Language Representation Learning
Luoyi Sun, Xuenan Xu, Mengyue Wu, Weidi Xie
Comments: Accepted by ACM MM 2024
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1839] arXiv:2309.11510 (cross-list from cs.IR) [pdf, other]
Title: When is a Foundation Model a Foundation Model
Saghir Alfasly, Peyman Nejat, Sobhan Hemati, Jibran Khan, Isaiah Lahr, Areej Alsaafin, Abubakr Shafique, Nneka Comfere, Dennis Murphree, Chady Meroueh, Saba Yasir, Aaron Mangold, Lisa Boardman, Vijay Shah, Joaquin J. Garcia, H.R. Tizhoosh
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1840] arXiv:2309.11705 (cross-list from cs.LG) [pdf, other]
Title: Meta OOD Learning for Continuously Adaptive OOD Detection
Xinheng Wu, Jie Lu, Zhen Fang, Guangquan Zhang
Comments: Accepted by ICCV 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1841] arXiv:2309.11710 (cross-list from cs.CL) [pdf, other]
Title: ContextRef: Evaluating Referenceless Metrics For Image Description Generation
Elisa Kreiss, Eric Zelikman, Christopher Potts, Nick Haber
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1842] arXiv:2309.11745 (cross-list from eess.IV) [pdf, other]
Title: PIE: Simulating Disease Progression via Progressive Image Editing
Kaizhao Liang, Xu Cao, Kuei-Da Liao, Tianren Gao, Wenqian Ye, Zhengyu Chen, Jianguo Cao, Tejas Nama, Jimeng Sun
Comments: Code and checkpoints for replicating our results can be found at this https URL and this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1843] arXiv:2309.11766 (cross-list from cs.CR) [pdf, html, other]
Title: Dictionary Attack on IMU-based Gait Authentication
Rajesh Kumar, Can Isik, Chilukuri K. Mohan
Comments: 12 pages, 9 figures, accepted at AISec23 colocated with ACM CCS, November 30, 2023, Copenhagen, Denmark
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[1844] arXiv:2309.11820 (cross-list from eess.IV) [pdf, html, other]
Title: Automatic Endoscopic Ultrasound Station Recognition with Limited Data
Abhijit Ramesh, Anantha Nandanan, Nikhil Boggavarapu, Priya Nair MD, Gilad Gressel
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1845] arXiv:2309.11891 (cross-list from eess.IV) [pdf, other]
Title: Heart Rate Detection Using an Event Camera
Aniket Jagtap, RamaKrishna Venkatesh Saripalli, Joe Lemley, Waseem Shariff, Alan F. Smeaton
Comments: Dataset available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1846] arXiv:2309.11913 (cross-list from eess.IV) [pdf, other]
Title: Spatial-Temporal Transformer based Video Compression Framework
Yanbo Gao, Wenjia Huang, Shuai Li, Hui Yuan, Mao Ye, Siwei Ma
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[1847] arXiv:2309.11930 (cross-list from cs.LG) [pdf, html, other]
Title: Bridging the Gap: Learning Pace Synchronization for Open-World Semi-Supervised Learning
Bo Ye, Kai Gan, Tong Wei, Min-Ling Zhang
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1848] arXiv:2309.11989 (cross-list from cs.RO) [pdf, html, other]
Title: A Vision-Based Navigation System for Arable Fields
Rajitha de Silva, Grzegorz Cielniak, Junfeng Gao
Comments: Submitted to Journal of Field Robotics
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1849] arXiv:2309.11993 (cross-list from cs.GR) [pdf, other]
Title: Neural Stochastic Screened Poisson Reconstruction
Silvia Sellán, Alec Jacobson
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1850] arXiv:2309.11995 (cross-list from eess.IV) [pdf, other]
Title: Identification of pneumonia on chest x-ray images through machine learning
Eduardo Augusto Roeder
Comments: In Brazilian Portuguese, 30 pages, 16 figures. This thesis was elaborated by the guidance of Prof. Dr. Akihito Inca Atahualpa Urdiales
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1851] arXiv:2309.12010 (cross-list from eess.IV) [pdf, other]
Title: Convolution and Attention Mixer for Synthetic Aperture Radar Image Change Detection
Haopeng Zhang, Zijing Lin, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li
Comments: Accepted by IEEE GRSL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1852] arXiv:2309.12022 (cross-list from cs.AI) [pdf, html, other]
Title: Demystifying Visual Features of Movie Posters for Multi-Label Genre Identification
Utsav Kumar Nareti, Chandranath Adak, Soumi Chattopadhyay
Comments: IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS (Accepted)
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1853] arXiv:2309.12095 (cross-list from stat.ML) [pdf, other]
Title: Bayesian sparsification for deep neural networks with Bayesian model reduction
Dimitrije Marković, Karl J. Friston, Stefan J. Kiebel
Subjects: Machine Learning (stat.ML); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1854] arXiv:2309.12114 (cross-list from eess.IV) [pdf, other]
Title: AutoPET Challenge 2023: Sliding Window-based Optimization of U-Net
Matthias Hadlich, Zdravko Marinov, Rainer Stiefelhagen
Comments: 9 pages, 1 figure, MICCAI 2023 - AutoPET Challenge Submission Version 2: Added all results on the preliminary test set
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1855] arXiv:2309.12159 (cross-list from cs.CR) [pdf, other]
Title: Information Forensics and Security: A quarter-century-long journey
Mauro Barni, Patrizio Campisi, Edward J. Delp, Gwenael Doërr, Jessica Fridrich, Nasir Memon, Fernando Pérez-González, Anderson Rocha, Luisa Verdoliva, Min Wu
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1856] arXiv:2309.12188 (cross-list from cs.RO) [pdf, html, other]
Title: SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs
Guangyao Zhai, Xiaoni Cai, Dianye Huang, Yan Di, Fabian Manhardt, Federico Tombari, Nassir Navab, Benjamin Busam
Comments: ICRA 2024 accepted. Project website: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1857] arXiv:2309.12193 (cross-list from eess.IV) [pdf, other]
Title: Brain Tumor Detection Using Deep Learning Approaches
Razia Sultana Misu
Comments: Bachelor's thesis. Supervisor: Nushrat Jahan Ria
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1858] arXiv:2309.12245 (cross-list from eess.IV) [pdf, html, other]
Title: Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images
Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly
Comments: Submitted to the Elsevier Journal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1859] arXiv:2309.12300 (cross-list from cs.RO) [pdf, other]
Title: See to Touch: Learning Tactile Dexterity through Visual Incentives
Irmak Guzey, Yinlong Dai, Ben Evans, Soumith Chintala, Lerrel Pinto
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1860] arXiv:2309.12301 (cross-list from cs.LG) [pdf, other]
Title: Environment-biased Feature Ranking for Novelty Detection Robustness
Stefan Smeu, Elena Burceanu, Emanuela Haller, Andrei Liviu Nicolicioiu
Comments: The updated, long version of the paper is available at arXiv:2310.03738
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1861] arXiv:2309.12312 (cross-list from cs.RO) [pdf, other]
Title: ForceSight: Text-Guided Mobile Manipulation with Visual-Force Goals
Jeremy A. Collins, Cody Houff, You Liang Tan, Charles C. Kemp
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1862] arXiv:2309.12325 (cross-list from cs.CY) [pdf, other]
Title: FUTURE-AI: International consensus guideline for trustworthy and deployable artificial intelligence in healthcare
Karim Lekadir, Aasa Feragen, Abdul Joseph Fofanah, Alejandro F Frangi, Alena Buyx, Anais Emelie, Andrea Lara, Antonio R Porras, An-Wen Chan, Arcadi Navarro, Ben Glocker, Benard O Botwe, Bishesh Khanal, Brigit Beger, Carol C Wu, Celia Cintas, Curtis P Langlotz, Daniel Rueckert, Deogratias Mzurikwao, Dimitrios I Fotiadis, Doszhan Zhussupov, Enzo Ferrante, Erik Meijering, Eva Weicken, Fabio A González, Folkert W Asselbergs, Fred Prior, Gabriel P Krestin, Gary Collins, Geletaw S Tegenaw, Georgios Kaissis, Gianluca Misuraca, Gianna Tsakou, Girish Dwivedi, Haridimos Kondylakis, Harsha Jayakody, Henry C Woodruf, Horst Joachim Mayer, Hugo JWL Aerts, Ian Walsh, Ioanna Chouvarda, Irène Buvat, Isabell Tributsch, Islem Rekik, James Duncan, Jayashree Kalpathy-Cramer, Jihad Zahir, Jinah Park, John Mongan, Judy W Gichoya, Julia A Schnabel, Kaisar Kushibar, Katrine Riklund, Kensaku Mori, Kostas Marias, Lameck M Amugongo, Lauren A Fromont, Lena Maier-Hein, Leonor Cerdá Alberich, Leticia Rittner, Lighton Phiri, Linda Marrakchi-Kacem, Lluís Donoso-Bach, Luis Martí-Bonmatí, M Jorge Cardoso, Maciej Bobowicz, Mahsa Shabani, Manolis Tsiknakis, Maria A Zuluaga, Maria Bielikova, Marie-Christine Fritzsche, Marina Camacho, Marius George Linguraru, Markus Wenzel, Marleen De Bruijne, Martin G Tolsgaard, Marzyeh Ghassemi, Md Ashrafuzzaman, Melanie Goisauf, Mohammad Yaqub, Mónica Cano Abadía, Mukhtar M E Mahmoud, Mustafa Elattar, Nicola Rieke, Nikolaos Papanikolaou, Noussair Lazrak, Oliver Díaz, Olivier Salvado, Oriol Pujol, Ousmane Sall, Pamela Guevara, Peter Gordebeke, Philippe Lambin, Pieta Brown, Purang Abolmaesumi, Qi Dou, Qinghua Lu, Richard Osuala, Rose Nakasi, S Kevin Zhou
Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1863] arXiv:2309.12397 (cross-list from cs.RO) [pdf, html, other]
Title: POLAR-Sim: Augmenting NASA's POLAR Dataset for Data-Driven Lunar Perception and Rover Simulation
Bo-Hsun Chen, Peter Negrut, Thomas Liang, Nevindu Batagoda, Harry Zhang, Dan Negrut
Comments: 11 pages, 9 figures. This work has been submitted to the IEEE for possible publication
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1864] arXiv:2309.12443 (cross-list from cs.CL) [pdf, html, other]
Title: Active Learning for Multilingual Fingerspelling Corpora
Shuai Wang, Eric Nalisnick
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1865] arXiv:2309.12460 (cross-list from cs.LG) [pdf, other]
Title: Multimodal Deep Learning for Scientific Imaging Interpretation
Abdulelah S. Alshehri, Franklin L. Lee, Shihu Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1866] arXiv:2309.12559 (cross-list from cs.LG) [pdf, html, other]
Title: Invariant Learning via Probability of Sufficient and Necessary Causes
Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1867] arXiv:2309.12572 (cross-list from eess.IV) [pdf, other]
Title: Interpretable 3D Multi-Modal Residual Convolutional Neural Network for Mild Traumatic Brain Injury Diagnosis
Hanem Ellethy, Viktor Vegh, Shekhar S. Chandra
Comments: Accepted by the Australasian Joint Conference on Artificial Intelligence 2023 (AJCAI 2023). 12 pages and 5 Figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1868] arXiv:2309.12593 (cross-list from cs.LG) [pdf, other]
Title: Improving Machine Learning Robustness via Adversarial Training
Long Dang, Thushari Hapuarachchi, Kaiqi Xiong, Jing Lin
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1869] arXiv:2309.12634 (cross-list from cs.RO) [pdf, other]
Title: Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor
Robin Göransson, Volker Krueger
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1870] arXiv:2309.12638 (cross-list from eess.IV) [pdf, other]
Title: Auto-Lesion Segmentation with a Novel Intensity Dark Channel Prior for COVID-19 Detection
Basma Jumaa Saleh, Zaid Omar, Vikrant Bhateja, Lila Iznita Izhar
Comments: The study requires withdrawal due to technical inconsistencies in the reported data that affect the conclusions. We apologize for any inconvenience
Journal-ref: Journal of Physics: Conference Series 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Metric Geometry (math.MG); Optimization and Control (math.OC)
[1871] arXiv:2309.12673 (cross-list from cs.LG) [pdf, other]
Title: On Sparse Modern Hopfield Model
Jerry Yao-Chieh Hu, Donglin Yang, Dennis Wu, Chenwei Xu, Bo-Yu Chen, Han Liu
Comments: 37 pages, accepted at NeurIPS 2023. [v2] updated to match with camera-ready version. Code is available at this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1872] arXiv:2309.12675 (cross-list from cs.AI) [pdf, other]
Title: Vision Transformers for Computer Go
Amani Sagri, Tristan Cazenave, Jérôme Arjonilla, Abdallah Saffidine
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1873] arXiv:2309.12685 (cross-list from cs.RO) [pdf, html, other]
Title: eWand: A calibration framework for wide baseline frame-based and event-based camera systems
Thomas Gossard, Andreas Ziegler, Levin Kolmar, Jonas Tebbe, Andreas Zell
Comments: Accepted for 2024 IEEE International Conference on Robotics and Automation (ICRA 2024). Project web page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1874] arXiv:2309.12706 (cross-list from cs.LG) [pdf, other]
Title: Multi-Label Noise Transition Matrix Estimation with Label Correlations: Theory and Algorithm
Shikun Li, Xiaobo Xia, Hansong Zhang, Shiming Ge, Tongliang Liu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1875] arXiv:2309.12805 (cross-list from eess.IV) [pdf, other]
Title: Automatic view plane prescription for cardiac magnetic resonance imaging via supervision by spatial relationship between views
Dong Wei, Yawen Huang, Donghuan Lu, Yuexiang Li, Yefeng Zheng
Comments: Medical Physics. arXiv admin note: text overlap with arXiv:2109.11715
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1876] arXiv:2309.12855 (cross-list from eess.IV) [pdf, other]
Title: Cross-Modal Translation and Alignment for Survival Analysis
Fengtao Zhou, Hao Chen
Comments: Accepted by ICCV2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1877] arXiv:2309.12862 (cross-list from cs.LG) [pdf, html, other]
Title: Associative Transformer
Yuwei Sun, Hideya Ochiai, Zhirong Wu, Stephen Lin, Ryota Kanai
Comments: Accepted for CVPR 2025
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1878] arXiv:2309.12953 (cross-list from eess.IV) [pdf, other]
Title: Inter-vendor harmonization of Computed Tomography (CT) reconstruction kernels using unpaired image translation
Aravind R. Krishnan, Kaiwen Xu, Thomas Li, Chenyu Gao, Lucas W. Remedios, Praitayini Kanakaraj, Ho Hin Lee, Shunxing Bao, Kim L. Sandler, Fabien Maldonado, Ivana Isgum, Bennett A. Landman
Comments: 10 pages, 6 figures, 1 table, Submitted to SPIE Medical Imaging : Image Processing. San Diego, CA. February 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1879] arXiv:2309.12955 (cross-list from cs.CR) [pdf, other]
Title: On Data Fabrication in Collaborative Vehicular Perception: Attacks and Countermeasures
Qingzhao Zhang, Shuowei Jin, Ruiyang Zhu, Jiachen Sun, Xumiao Zhang, Qi Alfred Chen, Z. Morley Mao
Comments: 18 pages, 24 figures, accepted by Usenix Security 2024
Subjects: Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[1880] arXiv:2309.12970 (cross-list from eess.IV) [pdf, other]
Title: PI-RADS v2 Compliant Automated Segmentation of Prostate Zones Using co-training Motivated Multi-task Dual-Path CNN
Arnab Das, Suhita Ghosh, Sebastian Stober
Comments: Authors Arnab Das and Suhita Ghosh contributed equally. Submitted in ISBI 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1881] arXiv:2309.12996 (cross-list from cs.LG) [pdf, other]
Title: Point Cloud Network: An Order of Magnitude Improvement in Linear Layer Parameter Count
Charles Hetterich
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
[1882] arXiv:2309.13013 (cross-list from eess.IV) [pdf, other]
Title: Performance Analysis of UNet and Variants for Medical Image Segmentation
Walid Ehab, Yongmin Li
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1883] arXiv:2309.13041 (cross-list from cs.RO) [pdf, other]
Title: Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Chethan Bhateja, Derek Guo, Dibya Ghosh, Anikait Singh, Manan Tomar, Quan Vuong, Yevgen Chebotar, Sergey Levine, Aviral Kumar
Comments: First three authors contributed equally
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1884] arXiv:2309.13150 (cross-list from cs.LG) [pdf, html, other]
Title: Pixel-wise Smoothing for Certified Robustness against Camera Motion Perturbations
Hanjiang Hu, Zuxin Liu, Linyi Li, Jiacheng Zhu, Ding Zhao
Comments: Camera-ready version of AISTATS 2024, 30 pages, 5 figures, 13 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1885] arXiv:2309.13160 (cross-list from cs.LG) [pdf, html, other]
Title: How to train your VAE
Mariano Rivera
Comments: 5 pages, 3 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1886] arXiv:2309.13167 (cross-list from cs.LG) [pdf, other]
Title: Flow Factorized Representation Learning
Yue Song, T. Anderson Keller, Nicu Sebe, Max Welling
Comments: NeurIPS23
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1887] arXiv:2309.13181 (cross-list from cs.LG) [pdf, other]
Title: Diagnosing and exploiting the computational demands of videos games for deep reinforcement learning
Lakshmi Narasimhan Govindarajan, Rex G Liu, Drew Linsley, Alekh Karkada Ashok, Max Reuter, Michael J Frank, Thomas Serre
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1888] arXiv:2309.13190 (cross-list from cs.LG) [pdf, other]
Title: Spatial-frequency channels, shape bias, and adversarial robustness
Ajay Subramanian, Elena Sizikova, Najib J. Majaj, Denis G. Pelli
Comments: Neural Information Processing Systems (NeurIPS) 2023 (Oral Presentation). Camera-ready version
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1889] arXiv:2309.13302 (cross-list from cs.NE) [pdf, html, other]
Title: Gaining the Sparse Rewards by Exploring Lottery Tickets in Spiking Neural Network
Hao Cheng, Jiahang Cao, Erjia Xiao, Mengshu Sun, Renjing Xu
Comments: This paper is accepted by IROS 2024
Subjects: Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
[1890] arXiv:2309.13303 (cross-list from cs.LG) [pdf, other]
Title: C$^2$VAE: Gaussian Copula-based VAE Differing Disentangled from Coupled Representations with Contrastive Posterior
Zhangkai Wu, Longbing Cao
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1891] arXiv:2309.13385 (cross-list from eess.IV) [pdf, other]
Title: Cine cardiac MRI reconstruction using a convolutional recurrent network with refinement
Yuyang Xue, Yuning Du, Gianluca Carloni, Eva Pachetti, Connor Jordan, Sotirios A. Tsaftaris
Comments: MICCAI STACOM workshop 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1892] arXiv:2309.13398 (cross-list from eess.IV) [pdf, other]
Title: A mirror-Unet architecture for PET/CT lesion segmentation
Yamila Rotstein Habarnau, Mauro Namías
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1893] arXiv:2309.13404 (cross-list from eess.IV) [pdf, html, other]
Title: Weakly Supervised YOLO Network for Surgical Instrument Localization in Endoscopic Videos
Rongfeng Wei, Jinlin Wu, Xuexue Bai, Ming Feng, Zhen Lei, Hongbin Liu, Zhen Chen
Comments: Accepted by ICRA 2024 Workshop on C4 Surgical Robotic Systems in the Embodied AI Era; Surgical Tool Localization in Endoscopic Videos Challenge of MICCAI2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1894] arXiv:2309.13411 (cross-list from cs.LG) [pdf, other]
Title: Towards Attributions of Input Variables in a Coalition
Xinhao Zheng, Huiqi Deng, Bo Fan, Quanshi Zhang
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1895] arXiv:2309.13415 (cross-list from cs.LG) [pdf, other]
Title: Dream the Impossible: Outlier Imagination with Diffusion Models
Xuefeng Du, Yiyou Sun, Xiaojin Zhu, Yixuan Li
Comments: NeurIPS 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1896] arXiv:2309.13430 (cross-list from cs.CL) [pdf, other]
Title: Resolving References in Visually-Grounded Dialogue via Text Generation
Bram Willemsen, Livia Qian, Gabriel Skantze
Comments: Published at SIGDIAL 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1897] arXiv:2309.13457 (cross-list from cs.LG) [pdf, other]
Title: Turbulence in Focus: Benchmarking Scaling Behavior of 3D Volumetric Super-Resolution with BLASTNet 2.0 Data
Wai Tong Chung, Bassem Akoush, Pushan Sharma, Alex Tamkin, Ki Sung Jung, Jacqueline H. Chen, Jack Guo, Davy Brouzet, Mohsen Talei, Bruno Savard, Alexei Y. Poludnenko, Matthias Ihme
Comments: Accepted in Adv. in Neural Information Processing Systems 36 (NeurIPS 2023). Link: this https URL . 55 pages, 21 figures. Keywords: Super-resolution, 3D, Neural Scaling, Physics-informed Loss, Computational Fluid Dynamics, Partial Differential Equations, Turbulent Reacting Flows, Direct Numerical Simulation, Fluid Mechanics, Combustion, Computer Vision
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computational Physics (physics.comp-ph); Fluid Dynamics (physics.flu-dyn)
[1898] arXiv:2309.13475 (cross-list from cs.RO) [pdf, html, other]
Title: Detecting and Mitigating System-Level Anomalies of Vision-Based Controllers
Aryaman Gupta, Kaustav Chakraborty, Somil Bansal
Journal-ref: 2024/5/13 Conference 2024 IEEE International Conference on Robotics and Automation (ICRA) Pages 9953-9959 Publisher 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1899] arXiv:2309.13549 (cross-list from cs.RO) [pdf, other]
Title: Towards Robust Robot 3D Perception in Urban Environments: The UT Campus Object Dataset
Arthur Zhang, Chaitanya Eranki, Christina Zhang, Ji-Hwan Park, Raymond Hong, Pranav Kalyani, Lochana Kalyanaraman, Arsh Gamare, Arnav Bagad, Maria Esteva, Joydeep Biswas
Comments: 19 pages, 18 figures, 12 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1900] arXiv:2309.13553 (cross-list from eess.IV) [pdf, other]
Title: Generalized Dice Focal Loss trained 3D Residual UNet for Automated Lesion Segmentation in Whole-Body FDG PET/CT Images
Shadab Ahamed, Arman Rahmim
Comments: AutoPET-II challenge (2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1901] arXiv:2309.13571 (cross-list from eess.IV) [pdf, other]
Title: Matrix Completion-Informed Deep Unfolded Equilibrium Models for Self-Supervised k-Space Interpolation in MRI
Chen Luo, Huayu Wang, Taofeng Xie, Qiyu Jin, Guoqing Chen, Zhuo-Xu Cui, Dong Liang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1902] arXiv:2309.13584 (cross-list from eess.IV) [pdf, other]
Title: Solving Low-Dose CT Reconstruction via GAN with Local Coherence
Wenjie Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1903] arXiv:2309.13587 (cross-list from eess.IV) [pdf, other]
Title: Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape Reconstruction
Mahesh Shakya, Bishesh Khanal
Comments: accepted to NeurIPS 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1904] arXiv:2309.13742 (cross-list from cs.GR) [pdf, other]
Title: DROP: Dynamics Responses from Human Motion Prior and Projective Dynamics
Yifeng Jiang, Jungdam Won, Yuting Ye, C. Karen Liu
Comments: SIGGRAPH Asia 2023, Video this https URL, Website: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1905] arXiv:2309.13745 (cross-list from cs.RO) [pdf, html, other]
Title: Overview of Computer Vision Techniques in Robotized Wire Harness Assembly: Current State and Future Opportunities
Hao Wang, Omkar Salunkhe, Walter Quadrini, Dan Lämkull, Fredrik Ore, Björn Johansson, Johan Stahre
Comments: Presented at the 56th CIRP Conference on Manufacturing Systems (CIRP CMS 2023), Cape Town, South Africa, 24-26 October 2023. Published in Procedia CIRP
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1906] arXiv:2309.13746 (cross-list from cs.RO) [pdf, other]
Title: Deep Learning-Based Connector Detection for Robotized Assembly of Automotive Wire Harnesses
Hao Wang, Björn Johansson
Comments: This paper has been accepted by IEEE CASE 2023 and has been presented on the conference. The information of the published version will be updated later
Journal-ref: 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), Auckland, New Zealand, 2023, pp. 1-8
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1907] arXiv:2309.13747 (cross-list from eess.IV) [pdf, html, other]
Title: Look Ma, no code: fine tuning nnU-Net for the AutoPET II challenge by only adjusting its JSON plans
Fabian Isensee, Klaus H.Maier-Hein
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1908] arXiv:2309.13770 (cross-list from cs.LG) [pdf, other]
Title: Devil in the Number: Towards Robust Multi-modality Data Filter
Yichen Xu, Zihan Xu, Wenhao Chai, Zhonghan Zhao, Enxin Song, Gaoang Wang
Comments: ICCV 2023 Workshop: TNGCV-DataComp
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1909] arXiv:2309.13773 (cross-list from cs.LG) [pdf, other]
Title: GHN-QAT: Training Graph Hypernetworks to Predict Quantization-Robust Parameters of Unseen Limited Precision Neural Networks
Stone Yun, Alexander Wong
Comments: Poster and extended abstract to be presented at the Workshop for Low Bit Quantized Neural Networks (LQBNN) @ ICCV 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1910] arXiv:2309.13777 (cross-list from eess.IV) [pdf, other]
Title: Diffeomorphic Multi-Resolution Deep Learning Registration for Applications in Breast MRI
Matthew G. French, Gonzalo D. Maso Talou, Thiranja P. Babarenda Gamage, Martyn P. Nash, Poul M. Nielsen, Anthony J. Doyle, Juan Eugenio Iglesias, Yaël Balbastre, Sean I. Young
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1911] arXiv:2309.13817 (cross-list from eess.IV) [pdf, other]
Title: MMA-Net: Multiple Morphology-Aware Network for Automated Cobb Angle Measurement
Zhengxuan Qiu, Jie Yang, Jiankun Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1912] arXiv:2309.13835 (cross-list from eess.IV) [pdf, html, other]
Title: IBVC: Interpolation-driven B-frame Video Compression
Chenming Xu, Meiqin Liu, Chao Yao, Weisi Lin, Yao Zhao
Comments: Submitted to Pattern Recognition
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1913] arXiv:2309.13839 (cross-list from eess.IV) [pdf, other]
Title: Fill the K-Space and Refine the Image: Prompting for Dynamic and Multi-Contrast MRI Reconstruction
Bingyu Xin, Meng Ye, Leon Axel, Dimitris N. Metaxas
Comments: STACOM 2023; Code is available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1914] arXiv:2309.13842 (cross-list from cs.RO) [pdf, other]
Title: Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory
Xin Zheng, Jianke Zhu
Comments: Video this https URL and Project site this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1915] arXiv:2309.13866 (cross-list from cs.LG) [pdf, other]
Title: On Calibration of Modern Quantized Efficient Neural Networks
Joey Kuang, Alexander Wong
Comments: Accepted as an extended abstract at the ICCV 2023 Workshop on Low-Bit Quantized Neural Networks. Corrected some typos
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1916] arXiv:2309.13872 (cross-list from eess.IV) [pdf, other]
Title: Attention and Pooling based Sigmoid Colon Segmentation in 3D CT images
Md Akizur Rahman, Sonit Singh, Kuruparan Shanmugalingam, Sankaran Iyer, Alan Blair, Praveen Ravindran, Arcot Sowmya
Comments: 8 Pages, 6 figures, Accepted at IEEE DICTA 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1917] arXiv:2309.13885 (cross-list from cs.LG) [pdf, html, other]
Title: TouchUp-G: Improving Feature Representation through Graph-Centric Finetuning
Jing Zhu, Xiang Song, Vassilis N. Ioannidis, Danai Koutra, Christos Faloutsos
Comments: SIGIR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
[1918] arXiv:2309.13893 (cross-list from cs.RO) [pdf, html, other]
Title: Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments
Bernard Lange, Jiachen Li, Mykel J. Kochenderfer
Comments: Accepted to 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1919] arXiv:2309.13980 (cross-list from eess.IV) [pdf, other]
Title: Better Generalization of White Matter Tract Segmentation to Arbitrary Datasets with Scaled Residual Bootstrap
Wan Liu, Chuyang Ye
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1920] arXiv:2309.14054 (cross-list from cs.LG) [pdf, html, other]
Title: Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A.P
Comments: Accepted at Transactions on Machine Learning Research (TMLR)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1921] arXiv:2309.14068 (cross-list from cs.LG) [pdf, html, other]
Title: Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
Yangming Li, Boris van Breugel, Mihaela van der Schaar
Comments: Accepted by ICLR-2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1922] arXiv:2309.14090 (cross-list from cs.LG) [pdf, other]
Title: Convolutional autoencoder-based multimodal one-class classification
Firas Laakom, Fahad Sohrab, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj
Comments: 5 pages, 1 figure, 4 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1923] arXiv:2309.14198 (cross-list from cs.LG) [pdf, other]
Title: (Predictable) Performance Bias in Unsupervised Anomaly Detection
Felix Meissen, Svenja Breuer, Moritz Knolle, Alena Buyx, Ruth Müller, Georgios Kaissis, Benedikt Wiestler, Daniel Rückert
Comments: 11 pages, 5 Figures, 1 panel
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Image and Video Processing (eess.IV)
[1924] arXiv:2309.14211 (cross-list from cs.RO) [pdf, other]
Title: QuadricsNet: Learning Concise Representation for Geometric Primitives in Point Clouds
Ji Wu, Huai Yu, Wen Yang, Gui-Song Xia
Comments: Submitted to ICRA 2024. 7 pages
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1925] arXiv:2309.14236 (cross-list from cs.RO) [pdf, html, other]
Title: MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation
Patrick Lancaster, Nicklas Hansen, Aravind Rajeswaran, Vikash Kumar
Comments: 10 pages, 8 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1926] arXiv:2309.14265 (cross-list from cs.RO) [pdf, html, other]
Title: Industrial Application of 6D Pose Estimation for Robotic Manipulation in Automotive Internal Logistics
Philipp Quentin, Dino Knoll, Daniel Goehring
Comments: Accepted for publication at IEEE International Conference on Automation Science and Engineering (CASE 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1927] arXiv:2309.14306 (cross-list from eess.IV) [pdf, other]
Title: DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning
Qingjie Meng, Wenjia Bai, Declan P O'Regan, and Daniel Rueckert
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1928] arXiv:2309.14329 (cross-list from cs.HC) [pdf, other]
Title: Innovative Digital Storytelling with AIGC: Exploration and Discussion of Recent Advances
Rongzhang Gu, Hui Li, Changyue Su, Wayne Wu
Comments: Project page: this https URL
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[1929] arXiv:2309.14341 (cross-list from cs.RO) [pdf, other]
Title: Extreme Parkour with Legged Robots
Xuxin Cheng, Kexin Shi, Ananye Agarwal, Deepak Pathak
Comments: Website and videos at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1930] arXiv:2309.14356 (cross-list from cs.LG) [pdf, other]
Title: COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs
Tiep Le, Vasudev Lal, Phillip Howard
Comments: Accepted to NeurIPS 2023 Datasets and Benchmarks Track
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1931] arXiv:2309.14360 (cross-list from cs.LG) [pdf, other]
Title: Domain-Guided Conditional Diffusion Model for Unsupervised Domain Adaptation
Yulong Zhang, Shuhao Chen, Weisen Jiang, Yu Zhang, Jiangang Lu, James T. Kwok
Comments: Work in progress
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1932] arXiv:2309.14392 (cross-list from eess.IV) [pdf, other]
Title: Unveiling Fairness Biases in Deep Learning-Based Brain MRI Reconstruction
Yuning Du, Yuyang Xue, Rohan Dharmakumar, Sotirios A. Tsaftaris
Comments: Accepted for publication at FAIMI 2023 (Fairness of AI in Medical Imaging) at MICCAI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1933] arXiv:2309.14425 (cross-list from cs.RO) [pdf, other]
Title: Self-Recovery Prompting: Promptable General Purpose Service Robot System with Foundation Models and Self-Recovery
Mimo Shirasaka, Tatsuya Matsushima, Soshi Tsunashima, Yuya Ikeda, Aoi Horo, So Ikoma, Chikaha Tsuji, Hikaru Wada, Tsunekazu Omija, Dai Komukai, Yutaka Matsuo Yusuke Iwasawa
Comments: Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1934] arXiv:2309.14474 (cross-list from eess.IV) [pdf, other]
Title: Gastro-Intestinal Tract Segmentation Using an Explainable 3D Unet
Kai Li, Jonathan Chan
Comments: 5 pages, 8 figures, 13th Joint Symposium on Computational Intelligence (JSCI13)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1935] arXiv:2309.14483 (cross-list from astro-ph.SR) [pdf, other]
Title: Unveiling the Potential of Deep Learning Models for Solar Flare Prediction in Near-Limb Regions
Chetraj Pandey, Rafal A. Angryk, Berkay Aydin
Comments: This is a preprint accepted at the 22nd International Conference on Machine Learning and Applications (ICMLA), 2023. 7 Pages, 6 Figures
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1936] arXiv:2309.14492 (cross-list from eess.IV) [pdf, other]
Title: AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers
Alex Ranne, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1937] arXiv:2309.14540 (cross-list from cs.LG) [pdf, other]
Title: Effect of roundabout design on the behavior of road users: A case study of roundabouts with application of Unsupervised Machine Learning
Tasnim M. Dwekat, Ayda A. Almsre, Huthaifa I. Ashqar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1938] arXiv:2309.14550 (cross-list from eess.IV) [pdf, html, other]
Title: MEMO: Dataset and Methods for Robust Multimodal Retinal Image Registration with Large or Small Vessel Density Differences
Chiao-Yi Wang, Faranguisse Kakhi Sadrieh, Yi-Ting Shen, Shih-En Chen, Sarah Kim, Victoria Chen, Achyut Raghavendra, Dongyi Wang, Osamah Saeedi, Yang Tao
Comments: Biomedical Optics Express
Journal-ref: Biomed. Opt. Express 15, 3457-3479 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1939] arXiv:2309.14580 (cross-list from cs.LG) [pdf, other]
Title: CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
Rakshith Sharma Srinivasa, Jaejin Cho, Chouchang Yang, Yashas Malur Saidutta, Ching-Hua Lee, Yilin Shen, Hongxia Jin
Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2023 conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1940] arXiv:2309.14586 (cross-list from cs.SD) [pdf, other]
Title: Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer
Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo
Comments: MICCAI 2023 (Oral presentation)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[1941] arXiv:2309.14591 (cross-list from eess.IV) [pdf, other]
Title: Applications of Sequential Learning for Medical Image Classification
Sohaib Naim, Brian Caffo, Haris I Sair, Craig K Jones
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1942] arXiv:2309.14630 (cross-list from econ.EM) [pdf, html, other]
Title: Free Discontinuity Regression: With an Application to the Economic Effects of Internet Shutdowns
Florian Gunsilius, David Van Dijcke
Comments: 24 pages, 3 figures, 2 tables; authors listed alphabetically; code available at this https URL
Subjects: Econometrics (econ.EM); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Applications (stat.AP); Methodology (stat.ME)
[1943] arXiv:2309.14655 (cross-list from cs.RO) [pdf, html, other]
Title: Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter
Hsu-kuang Chiu, Chien-Yi Wang, Min-Hung Chen, Stephen F. Smith
Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA), 2024. Code: this https URL Video: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1944] arXiv:2309.14685 (cross-list from cs.RO) [pdf, html, other]
Title: DriveSceneGen: Generating Diverse and Realistic Driving Scenarios from Scratch
Shuo Sun, Zekai Gu, Tianchen Sun, Jiawei Sun, Chengran Yuan, Yuhang Han, Dongen Li, Marcelo H. Ang Jr
Comments: 8 pages, 5 figures, 2 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1945] arXiv:2309.14737 (cross-list from cs.RO) [pdf, html, other]
Title: Volumetric Semantically Consistent 3D Panoptic Mapping
Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath
Comments: 8 pages, 2 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1946] arXiv:2309.14759 (cross-list from cs.GR) [pdf, other]
Title: Diffusion-based Holistic Texture Rectification and Synthesis
Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui
Comments: SIGGRAPH Asia 2023 Conference Paper
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1947] arXiv:2309.14774 (cross-list from cs.LG) [pdf, other]
Title: BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning
Ching-Yu Chiang, I-Hua Chang, Shih-Wei Liao
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1948] arXiv:2309.14816 (cross-list from cs.LG) [pdf, other]
Title: A Comparative Study of Population-Graph Construction Methods and Graph Neural Networks for Brain Age Regression
Kyriaki-Margarita Bintsi, Tamara T. Mueller, Sophie Starck, Vasileios Baltatzis, Alexander Hammers, Daniel Rueckert
Comments: Accepted at GRAIL, MICCAI 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1949] arXiv:2309.14949 (cross-list from cs.LG) [pdf, html, other]
Title: Towards Real-World Test-Time Adaptation: Tri-Net Self-Training with Balanced Normalization
Yongyi Su, Xun Xu, Kui Jia
Comments: Accepted by AAAI 2024. 19 pages, 7 figures and 22 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1950] arXiv:2309.15038 (cross-list from cs.LG) [pdf, html, other]
Title: HPCR: Holistic Proxy-based Contrastive Replay for Online Continual Learning
Huiwei Lin, Shanshan Feng, Baoquan Zhang, Xutao Li, Yunming Ye
Comments: 15 pages, 10 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1951] arXiv:2309.15048 (cross-list from cs.LG) [pdf, html, other]
Title: Class Incremental Learning via Likelihood Ratio Based Task Prediction
Haowei Lin, Yijia Shao, Weinan Qian, Ningxin Pan, Yiduo Guo, Bing Liu
Journal-ref: ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1952] arXiv:2309.15065 (cross-list from cs.RO) [pdf, html, other]
Title: Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding
Christina Kassab, Matias Mattamala, Lintong Zhang, Maurice Fallon
Comments: Accepted at ICRA 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1953] arXiv:2309.15135 (cross-list from cs.LG) [pdf, html, other]
Title: Contrastive Continual Multi-view Clustering with Filtered Structural Fusion
Xinhang Wan, Jiyuan Liu, Hao Yu, Ao Li, Xinwang Liu, Ke Liang, Zhibin Dong, En Zhu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1954] arXiv:2309.15216 (cross-list from cs.LG) [pdf, other]
Title: A Comparative Study of Filters and Deep Learning Models to predict Diabetic Retinopathy
Roshan Vasu Muddaluru, Sharvaani Ravikumar Thoguluva, Shruti Prabha, Tanuja Konda Reddy, Suja Palaniswamy
Comments: 6 pages, 5 figures, I2CT , 2 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1955] arXiv:2309.15243 (cross-list from eess.IV) [pdf, other]
Title: APIS: A paired CT-MRI dataset for ischemic stroke segmentation challenge
Santiago Gómez, Daniel Mantilla, Gustavo Garzón, Edgar Rangel, Andrés Ortiz, Franklin Sierra-Jerez, Fabio Martínez
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[1956] arXiv:2309.15245 (cross-list from cs.AI) [pdf, other]
Title: SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets
Daria Reshetova, Swetava Ganguli, C. V. Krishnakumar Iyer, Vipul Pandey
Comments: Extended version of the accepted research track paper at the 31st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2023), Hamburg, Germany. 11 pages, 8 figures, 6 tables
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1957] arXiv:2309.15259 (cross-list from quant-ph) [pdf, other]
Title: SLIQ: Quantum Image Similarity Networks on Noisy Quantum Computers
Daniel Silver, Tirthak Patel, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari
Journal-ref: Vol. 37 No. 8: AAAI-2023 Technical Tracks 8
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1958] arXiv:2309.15268 (cross-list from cs.RO) [pdf, other]
Title: ObVi-SLAM: Long-Term Object-Visual SLAM
Amanda Adkins, Taijing Chen, Joydeep Biswas
Comments: 8 pages, 7 figures, 1 table plus appendix with 4 figures and 1 table
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1959] arXiv:2309.15278 (cross-list from cs.RO) [pdf, html, other]
Title: Out of Sight, Still in Mind: Reasoning and Planning about Unobserved Objects with Video Tracking Enabled Memory Models
Yixuan Huang, Jialin Yuan, Chanho Kim, Pupul Pradhan, Bryan Chen, Li Fuxin, Tucker Hermans
Comments: Presented at IEEE Conference on Robotics and Automation (ICRA) 2024. Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1960] arXiv:2309.15302 (cross-list from cs.RO) [pdf, other]
Title: STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience
Haresh Karnan, Elvin Yang, Daniel Farkash, Garrett Warnell, Joydeep Biswas, Peter Stone
Comments: Project website: this https URL
Journal-ref: Conference on Robot Learning (CoRL 2023)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1961] arXiv:2309.15314 (cross-list from physics.med-ph) [pdf, other]
Title: Conversion of single-energy computed tomography to parametric maps of dual-energy computed tomography using convolutional neural network
Sangwook Kim, Jimin Lee, Jungye Kim, Bitbyeol Kim, Chang Heon Choi, Seongmoon Jung
Comments: 29 pages, 17 figures
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1962] arXiv:2309.15332 (cross-list from cs.RO) [pdf, other]
Title: Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms
Hanzhe Teng, Yipeng Wang, Xiaoao Song, Konstantinos Karydis
Comments: Accepted to the 18th International Symposium on Visual Computing (ISVC 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1963] arXiv:2309.15420 (cross-list from cs.LG) [pdf, other]
Title: The Triad of Failure Modes and a Possible Way Out
Emanuele Sansone
Comments: Some sentences in the Background Section are overlapping with Section 2 in arXiv:2304.11357 However, the main technical content and all other sections are different
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1964] arXiv:2309.15459 (cross-list from cs.RO) [pdf, html, other]
Title: GAMMA: Graspability-Aware Mobile MAnipulation Policy Learning based on Online Grasping Pose Fusion
Jiazhao Zhang, Nandiraju Gireesh, Jilong Wang, Xiaomeng Fang, Chaoyi Xu, Weiguang Chen, Liu Dai, He Wang
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1965] arXiv:2309.15477 (cross-list from cs.GR) [pdf, other]
Title: A Tutorial on Uniform B-Spline
Yi Zhou
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1966] arXiv:2309.15485 (cross-list from eess.IV) [pdf, other]
Title: Style Transfer and Self-Supervised Learning Powered Myocardium Infarction Super-Resolution Segmentation
Lichao Wang, Jiahao Huang, Xiaodan Xing, Yinzhe Wu, Ramyah Rajakulasingam, Andrew D. Scott, Pedro F Ferreira, Ranil De Silva, Sonia Nielles-Vallespin, Guang Yang
Comments: 6 pages, 8 figures, conference, accepted by SIPAIM2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1967] arXiv:2309.15516 (cross-list from cs.CL) [pdf, html, other]
Title: Teaching Text-to-Image Models to Communicate in Dialog
Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1968] arXiv:2309.15520 (cross-list from cs.LG) [pdf, other]
Title: SAF-Net: Self-Attention Fusion Network for Myocardial Infarction Detection using Multi-View Echocardiography
Ilke Adalioglu, Mete Ahishali, Aysen Degerli, Serkan Kiranyaz, Moncef Gabbouj
Comments: 4 pages, 3 figures, Computing in Cardiology (CinC) 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1969] arXiv:2309.15521 (cross-list from cs.LG) [pdf, other]
Title: MLOps for Scarce Image Data: A Use Case in Microscopic Image Analysis
Angelo Yamachui Sitcheu, Nils Friederich, Simon Baeuerle, Oliver Neumann, Markus Reischl, Ralf Mikut
Comments: 21 pages, 5 figures , 33. Workshop on Computational Intelligence Berlin Germany
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1970] arXiv:2309.15529 (cross-list from eess.IV) [pdf, other]
Title: Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Muyu Wang, Shiyu Fan, Yichen Li, Hui Chen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1971] arXiv:2309.15551 (cross-list from cs.LG) [pdf, html, other]
Title: DeepRepViz: Identifying Confounders in Deep Learning Model Predictions
Roshan Prakash Rane, JiHoon Kim, Arjun Umesha, Didem Stark, Marc-André Schulz, Kerstin Ritter
Journal-ref: MICCAI 2024. Lecture Notes in Computer Science, vol 15010. pp 186 to 196. Springer, Cham
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1972] arXiv:2309.15564 (cross-list from cs.LG) [pdf, other]
Title: Jointly Training Large Autoregressive Multimodal Models
Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1973] arXiv:2309.15573 (cross-list from cs.CG) [pdf, other]
Title: The Maximum Cover with Rotating Field of View
Igor Potapov, Jason Ralph, Theofilos Triommatis
Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Algebraic Geometry (math.AG)
[1974] arXiv:2309.15596 (cross-list from cs.RO) [pdf, other]
Title: PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
Shizhe Chen, Ricardo Garcia, Cordelia Schmid, Ivan Laptev
Comments: Accepted to CoRL 2023. Project website: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1975] arXiv:2309.15608 (cross-list from eess.IV) [pdf, html, other]
Title: NoSENSE: Learned unrolled cardiac MRI reconstruction without explicit sensitivity maps
Felix Frederik Zimmermann, Andreas Kofler
Comments: Accepted at MICCAI STACOM 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1976] arXiv:2309.15638 (cross-list from eess.IV) [pdf, html, other]
Title: RSF-Conv: Rotation-and-Scale Equivariant Fourier Parameterized Convolution for Retinal Vessel Segmentation
Zihong Sun, Hong Wang, Qi Xie, Yefeng Zheng, Deyu Meng
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1977] arXiv:2309.15696 (cross-list from cs.LG) [pdf, other]
Title: A Unified View of Differentially Private Deep Generative Modeling
Dingfan Chen, Raouf Kerkouche, Mario Fritz
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1978] arXiv:2309.15750 (cross-list from eess.IV) [pdf, other]
Title: Automated CT Lung Cancer Screening Workflow using 3D Camera
Brian Teixeira, Vivek Singh, Birgi Tamersoy, Andreas Prokein, Ankur Kapoor
Comments: Accepted at MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1979] arXiv:2309.15792 (cross-list from quant-ph) [pdf, html, other]
Title: Quantum Block-Matching Algorithm using Dissimilarity Measure
M. Martínez-Felipe, J. Montiel-Pérez, V. Onofre, A. Maldonado-Romo, Ricky Young
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[1980] arXiv:2309.15889 (cross-list from eess.IV) [pdf, html, other]
Title: High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models
Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz
Comments: 6 pages, 5 figures. Published at INFOCOM 2024 Workshops
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[1981] arXiv:2309.15940 (cross-list from cs.RO) [pdf, other]
Title: Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Haonan Chang, Kowndinya Boyalakuntla, Shiyang Lu, Siwei Cai, Eric Jing, Shreesh Keskar, Shijie Geng, Adeeb Abbas, Lifeng Zhou, Kostas Bekris, Abdeslam Boularias
Comments: The code and dataset used for evaluation can be found at this https URL}{this https URL. This paper has been accepted by CoRL2023
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1982] arXiv:2309.15977 (cross-list from cs.SD) [pdf, other]
Title: Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields
Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1983] arXiv:2309.16053 (cross-list from eess.IV) [pdf, other]
Title: Diagnosis of Helicobacter pylori using AutoEncoders for the Detection of Anomalous Staining Patterns in Immunohistochemistry Images
Pau Cano, Álvaro Caravaca, Debora Gil, Eva Musulen
Comments: 9 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1984] arXiv:2309.16058 (cross-list from cs.LG) [pdf, other]
Title: AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1985] arXiv:2309.16118 (cross-list from cs.RO) [pdf, html, other]
Title: D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li
Comments: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1986] arXiv:2309.16140 (cross-list from cs.MM) [pdf, other]
Title: CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting
Shaoxiang Guo, Qing Cai, Lin Qi, Junyu Dong
Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1987] arXiv:2309.16143 (cross-list from cs.LG) [pdf, other]
Title: Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples
Shin'ya Yamaguchi
Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1988] arXiv:2309.16164 (cross-list from cs.RO) [pdf, other]
Title: Learning to Terminate in Object Navigation
Yuhang Song, Anh Nguyen, Chun-Yi Lee
Comments: 16 pages
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1989] arXiv:2309.16206 (cross-list from eess.IV) [pdf, other]
Title: Alzheimer's Disease Prediction via Brain Structural-Functional Deep Fusing Network
Qiankun Zuo, Junren Pan, Shuqiang Wang
Comments: 10 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1990] arXiv:2309.16210 (cross-list from eess.IV) [pdf, other]
Title: Abdominal multi-organ segmentation in CT using Swinunter
Mingjin Chen, Yongkang He, Yongyi Lu
Comments: 8pages. arXiv admin note: text overlap with arXiv:2201.01266 by other authors
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1991] arXiv:2309.16221 (cross-list from cs.RO) [pdf, other]
Title: Off-the-shelf bin picking workcell with visual pose estimation: A case study on the world robot summit 2018 kitting task
Frederik Hagelskjær, Kasper Høj Lorenzen, Dirk Kraft
Comments: 7 pages, 7 figures, 2 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1992] arXiv:2309.16264 (cross-list from cs.RO) [pdf, html, other]
Title: GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects
Qiaojun Yu, Junbo Wang, Wenhai Liu, Ce Hao, Liu Liu, Lin Shao, Weiming Wang, Cewu Lu
Comments: 8 pages, 5 figures, ICRA 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1993] arXiv:2309.16354 (cross-list from cs.LG) [pdf, html, other]
Title: Transformer-VQ: Linear-Time Transformers via Vector Quantization
Lucas D. Lingle
Comments: ICLR 2024 camera-ready
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1994] arXiv:2309.16536 (cross-list from eess.IV) [pdf, other]
Title: Uncertainty Quantification for Eosinophil Segmentation
Kevin Lin, Donald Brown, Sana Syed, Adam Greene
Comments: Preprint, Final Article Submitted to ICBRA 2023 and will be published in the International Conference Proceedings by ACM, Association for Computing Machinery (ISBN: 979-8-4007-0815-2), which will be archived in ACM Digital Library, indexed by Ei Compendex and Scopus
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1995] arXiv:2309.16569 (cross-list from cs.SD) [pdf, other]
Title: Audio-Visual Speaker Verification via Joint Cross-Attention
R. Gnana Praveen, Jahangir Alam
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1996] arXiv:2309.16627 (cross-list from eess.IV) [pdf, other]
Title: Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images
Shreyas H Ramananda, Vaanathi Sundaresan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1997] arXiv:2309.16633 (cross-list from cs.LG) [pdf, html, other]
Title: SupReMix: Supervised Contrastive Learning for Medical Imaging Regression with Mixup
Yilei Wu, Zijian Dong, Chongyao Chen, Wangchunshu Zhou, Juan Helen Zhou
Comments: The first two authors equally contributed to this work. Previously titled "Mixup Your Own Pair", content extended and revised
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1998] arXiv:2309.16650 (cross-list from cs.RO) [pdf, other]
Title: ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Qiao Gu, Alihusein Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull
Comments: Project page: this https URL Explainer video: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1999] arXiv:2309.16702 (cross-list from cs.AI) [pdf, other]
Title: Prediction and Interpretation of Vehicle Trajectories in the Graph Spectral Domain
Marion Neumeier, Sebastian Dorn, Michael Botsch, Wolfgang Utschick
Comments: Accepted as a conference paper for IEEE ITSC 2023, Bilbao, Spain
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2000] arXiv:2309.16704 (cross-list from q-bio.NC) [pdf, other]
Title: Memories in the Making: Predicting Video Memorability with Encoding Phase EEG
Lorin Sweeney, Graham Healy, Alan F. Smeaton
Comments: Content-Based Multimedia Indexing, CBMI, September 20-22, Orleans, France, 2023
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
Total of 2022 entries : 1-500 501-1000 1001-1500 1501-2000 2001-2022
Showing up to 500 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack