Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries : 1951-2022 2001-2022

Showing up to 2000 entries per page: fewer | more | all

[1951] arXiv:2309.15048 (cross-list from cs.LG) [pdf, html, other]: Title: Class Incremental Learning via Likelihood Ratio Based Task Prediction

Haowei Lin, Yijia Shao, Weinan Qian, Ningxin Pan, Yiduo Guo, Bing Liu

Journal-ref: ICLR 2024

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1952] arXiv:2309.15065 (cross-list from cs.RO) [pdf, html, other]: Title: Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding

Christina Kassab, Matias Mattamala, Lintong Zhang, Maurice Fallon

Comments: Accepted at ICRA 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1953] arXiv:2309.15135 (cross-list from cs.LG) [pdf, html, other]: Title: Contrastive Continual Multi-view Clustering with Filtered Structural Fusion

Xinhang Wan, Jiyuan Liu, Hao Yu, Ao Li, Xinwang Liu, Ke Liang, Zhibin Dong, En Zhu

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1954] arXiv:2309.15216 (cross-list from cs.LG) [pdf, other]: Title: A Comparative Study of Filters and Deep Learning Models to predict Diabetic Retinopathy

Roshan Vasu Muddaluru, Sharvaani Ravikumar Thoguluva, Shruti Prabha, Tanuja Konda Reddy, Suja Palaniswamy

Comments: 6 pages, 5 figures, I2CT , 2 tables

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1955] arXiv:2309.15243 (cross-list from eess.IV) [pdf, other]: Title: APIS: A paired CT-MRI dataset for ischemic stroke segmentation challenge

Santiago Gómez, Daniel Mantilla, Gustavo Garzón, Edgar Rangel, Andrés Ortiz, Franklin Sierra-Jerez, Fabio Martínez

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[1956] arXiv:2309.15245 (cross-list from cs.AI) [pdf, other]: Title: SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets

Daria Reshetova, Swetava Ganguli, C. V. Krishnakumar Iyer, Vipul Pandey

Comments: Extended version of the accepted research track paper at the 31st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2023), Hamburg, Germany. 11 pages, 8 figures, 6 tables

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1957] arXiv:2309.15259 (cross-list from quant-ph) [pdf, other]: Title: SLIQ: Quantum Image Similarity Networks on Noisy Quantum Computers

Daniel Silver, Tirthak Patel, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari

Journal-ref: Vol. 37 No. 8: AAAI-2023 Technical Tracks 8

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1958] arXiv:2309.15268 (cross-list from cs.RO) [pdf, other]: Title: ObVi-SLAM: Long-Term Object-Visual SLAM

Amanda Adkins, Taijing Chen, Joydeep Biswas

Comments: 8 pages, 7 figures, 1 table plus appendix with 4 figures and 1 table

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1959] arXiv:2309.15278 (cross-list from cs.RO) [pdf, html, other]: Title: Out of Sight, Still in Mind: Reasoning and Planning about Unobserved Objects with Video Tracking Enabled Memory Models

Yixuan Huang, Jialin Yuan, Chanho Kim, Pupul Pradhan, Bryan Chen, Li Fuxin, Tucker Hermans

Comments: Presented at IEEE Conference on Robotics and Automation (ICRA) 2024. Website: this https URL

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1960] arXiv:2309.15302 (cross-list from cs.RO) [pdf, other]: Title: STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience

Haresh Karnan, Elvin Yang, Daniel Farkash, Garrett Warnell, Joydeep Biswas, Peter Stone

Comments: Project website: this https URL

Journal-ref: Conference on Robot Learning (CoRL 2023)

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1961] arXiv:2309.15314 (cross-list from physics.med-ph) [pdf, other]: Title: Conversion of single-energy computed tomography to parametric maps of dual-energy computed tomography using convolutional neural network

Sangwook Kim, Jimin Lee, Jungye Kim, Bitbyeol Kim, Chang Heon Choi, Seongmoon Jung

Comments: 29 pages, 17 figures

Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1962] arXiv:2309.15332 (cross-list from cs.RO) [pdf, other]: Title: Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms

Hanzhe Teng, Yipeng Wang, Xiaoao Song, Konstantinos Karydis

Comments: Accepted to the 18th International Symposium on Visual Computing (ISVC 2023)

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1963] arXiv:2309.15420 (cross-list from cs.LG) [pdf, other]: Title: The Triad of Failure Modes and a Possible Way Out

Emanuele Sansone

Comments: Some sentences in the Background Section are overlapping with Section 2 in arXiv:2304.11357 However, the main technical content and all other sections are different

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1964] arXiv:2309.15459 (cross-list from cs.RO) [pdf, html, other]: Title: GAMMA: Graspability-Aware Mobile MAnipulation Policy Learning based on Online Grasping Pose Fusion

Jiazhao Zhang, Nandiraju Gireesh, Jilong Wang, Xiaomeng Fang, Chaoyi Xu, Weiguang Chen, Liu Dai, He Wang

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1965] arXiv:2309.15477 (cross-list from cs.GR) [pdf, other]: Title: A Tutorial on Uniform B-Spline

Yi Zhou

Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1966] arXiv:2309.15485 (cross-list from eess.IV) [pdf, other]: Title: Style Transfer and Self-Supervised Learning Powered Myocardium Infarction Super-Resolution Segmentation

Lichao Wang, Jiahao Huang, Xiaodan Xing, Yinzhe Wu, Ramyah Rajakulasingam, Andrew D. Scott, Pedro F Ferreira, Ranil De Silva, Sonia Nielles-Vallespin, Guang Yang

Comments: 6 pages, 8 figures, conference, accepted by SIPAIM2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1967] arXiv:2309.15516 (cross-list from cs.CL) [pdf, html, other]: Title: Teaching Text-to-Image Models to Communicate in Dialog

Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao

Comments: Work in progress

Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1968] arXiv:2309.15520 (cross-list from cs.LG) [pdf, other]: Title: SAF-Net: Self-Attention Fusion Network for Myocardial Infarction Detection using Multi-View Echocardiography

Ilke Adalioglu, Mete Ahishali, Aysen Degerli, Serkan Kiranyaz, Moncef Gabbouj

Comments: 4 pages, 3 figures, Computing in Cardiology (CinC) 2023

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1969] arXiv:2309.15521 (cross-list from cs.LG) [pdf, other]: Title: MLOps for Scarce Image Data: A Use Case in Microscopic Image Analysis

Angelo Yamachui Sitcheu, Nils Friederich, Simon Baeuerle, Oliver Neumann, Markus Reischl, Ralf Mikut

Comments: 21 pages, 5 figures , 33. Workshop on Computational Intelligence Berlin Germany

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1970] arXiv:2309.15529 (cross-list from eess.IV) [pdf, other]: Title: Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data

Muyu Wang, Shiyu Fan, Yichen Li, Hui Chen

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1971] arXiv:2309.15551 (cross-list from cs.LG) [pdf, html, other]: Title: DeepRepViz: Identifying Confounders in Deep Learning Model Predictions

Roshan Prakash Rane, JiHoon Kim, Arjun Umesha, Didem Stark, Marc-André Schulz, Kerstin Ritter

Journal-ref: MICCAI 2024. Lecture Notes in Computer Science, vol 15010. pp 186 to 196. Springer, Cham

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1972] arXiv:2309.15564 (cross-list from cs.LG) [pdf, other]: Title: Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1973] arXiv:2309.15573 (cross-list from cs.CG) [pdf, other]: Title: The Maximum Cover with Rotating Field of View

Igor Potapov, Jason Ralph, Theofilos Triommatis

Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Algebraic Geometry (math.AG)
[1974] arXiv:2309.15596 (cross-list from cs.RO) [pdf, other]: Title: PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

Shizhe Chen, Ricardo Garcia, Cordelia Schmid, Ivan Laptev

Comments: Accepted to CoRL 2023. Project website: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1975] arXiv:2309.15608 (cross-list from eess.IV) [pdf, html, other]: Title: NoSENSE: Learned unrolled cardiac MRI reconstruction without explicit sensitivity maps

Felix Frederik Zimmermann, Andreas Kofler

Comments: Accepted at MICCAI STACOM 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1976] arXiv:2309.15638 (cross-list from eess.IV) [pdf, html, other]: Title: RSF-Conv: Rotation-and-Scale Equivariant Fourier Parameterized Convolution for Retinal Vessel Segmentation

Zihong Sun, Hong Wang, Qi Xie, Yefeng Zheng, Deyu Meng

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1977] arXiv:2309.15696 (cross-list from cs.LG) [pdf, other]: Title: A Unified View of Differentially Private Deep Generative Modeling

Dingfan Chen, Raouf Kerkouche, Mario Fritz

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1978] arXiv:2309.15750 (cross-list from eess.IV) [pdf, other]: Title: Automated CT Lung Cancer Screening Workflow using 3D Camera

Brian Teixeira, Vivek Singh, Birgi Tamersoy, Andreas Prokein, Ankur Kapoor

Comments: Accepted at MICCAI 2023

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1979] arXiv:2309.15792 (cross-list from quant-ph) [pdf, html, other]: Title: Quantum Block-Matching Algorithm using Dissimilarity Measure

M. Martínez-Felipe, J. Montiel-Pérez, V. Onofre, A. Maldonado-Romo, Ricky Young

Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[1980] arXiv:2309.15889 (cross-list from eess.IV) [pdf, html, other]: Title: High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models

Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz

Comments: 6 pages, 5 figures. Published at INFOCOM 2024 Workshops

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[1981] arXiv:2309.15940 (cross-list from cs.RO) [pdf, other]: Title: Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs

Haonan Chang, Kowndinya Boyalakuntla, Shiyang Lu, Siwei Cai, Eric Jing, Shreesh Keskar, Shijie Geng, Adeeb Abbas, Lifeng Zhou, Kostas Bekris, Abdeslam Boularias

Comments: The code and dataset used for evaluation can be found at this https URL}{this https URL. This paper has been accepted by CoRL2023

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1982] arXiv:2309.15977 (cross-list from cs.SD) [pdf, other]: Title: Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields

Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1983] arXiv:2309.16053 (cross-list from eess.IV) [pdf, other]: Title: Diagnosis of Helicobacter pylori using AutoEncoders for the Detection of Anomalous Staining Patterns in Immunohistochemistry Images

Pau Cano, Álvaro Caravaca, Debora Gil, Eva Musulen

Comments: 9 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1984] arXiv:2309.16058 (cross-list from cs.LG) [pdf, other]: Title: AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1985] arXiv:2309.16118 (cross-list from cs.RO) [pdf, html, other]: Title: D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement

Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li

Comments: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1986] arXiv:2309.16140 (cross-list from cs.MM) [pdf, other]: Title: CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting

Shaoxiang Guo, Qing Cai, Lin Qi, Junyu Dong

Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1987] arXiv:2309.16143 (cross-list from cs.LG) [pdf, other]: Title: Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples

Shin'ya Yamaguchi

Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1988] arXiv:2309.16164 (cross-list from cs.RO) [pdf, other]: Title: Learning to Terminate in Object Navigation

Yuhang Song, Anh Nguyen, Chun-Yi Lee

Comments: 16 pages

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1989] arXiv:2309.16206 (cross-list from eess.IV) [pdf, other]: Title: Alzheimer's Disease Prediction via Brain Structural-Functional Deep Fusing Network

Qiankun Zuo, Junren Pan, Shuqiang Wang

Comments: 10 pages

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1990] arXiv:2309.16210 (cross-list from eess.IV) [pdf, other]: Title: Abdominal multi-organ segmentation in CT using Swinunter

Mingjin Chen, Yongkang He, Yongyi Lu

Comments: 8pages. arXiv admin note: text overlap with arXiv:2201.01266 by other authors

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1991] arXiv:2309.16221 (cross-list from cs.RO) [pdf, other]: Title: Off-the-shelf bin picking workcell with visual pose estimation: A case study on the world robot summit 2018 kitting task

Frederik Hagelskjær, Kasper Høj Lorenzen, Dirk Kraft

Comments: 7 pages, 7 figures, 2 tables

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1992] arXiv:2309.16264 (cross-list from cs.RO) [pdf, html, other]: Title: GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects

Qiaojun Yu, Junbo Wang, Wenhai Liu, Ce Hao, Liu Liu, Lin Shao, Weiming Wang, Cewu Lu

Comments: 8 pages, 5 figures, ICRA 2024

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1993] arXiv:2309.16354 (cross-list from cs.LG) [pdf, html, other]: Title: Transformer-VQ: Linear-Time Transformers via Vector Quantization

Lucas D. Lingle

Comments: ICLR 2024 camera-ready

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1994] arXiv:2309.16536 (cross-list from eess.IV) [pdf, other]: Title: Uncertainty Quantification for Eosinophil Segmentation

Kevin Lin, Donald Brown, Sana Syed, Adam Greene

Comments: Preprint, Final Article Submitted to ICBRA 2023 and will be published in the International Conference Proceedings by ACM, Association for Computing Machinery (ISBN: 979-8-4007-0815-2), which will be archived in ACM Digital Library, indexed by Ei Compendex and Scopus

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1995] arXiv:2309.16569 (cross-list from cs.SD) [pdf, other]: Title: Audio-Visual Speaker Verification via Joint Cross-Attention

R. Gnana Praveen, Jahangir Alam

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1996] arXiv:2309.16627 (cross-list from eess.IV) [pdf, other]: Title: Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images

Shreyas H Ramananda, Vaanathi Sundaresan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1997] arXiv:2309.16633 (cross-list from cs.LG) [pdf, html, other]: Title: SupReMix: Supervised Contrastive Learning for Medical Imaging Regression with Mixup

Yilei Wu, Zijian Dong, Chongyao Chen, Wangchunshu Zhou, Juan Helen Zhou

Comments: The first two authors equally contributed to this work. Previously titled "Mixup Your Own Pair", content extended and revised

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1998] arXiv:2309.16650 (cross-list from cs.RO) [pdf, other]: Title: ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning

Qiao Gu, Alihusein Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull

Comments: Project page: this https URL Explainer video: this https URL

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1999] arXiv:2309.16702 (cross-list from cs.AI) [pdf, other]: Title: Prediction and Interpretation of Vehicle Trajectories in the Graph Spectral Domain

Marion Neumeier, Sebastian Dorn, Michael Botsch, Wolfgang Utschick

Comments: Accepted as a conference paper for IEEE ITSC 2023, Bilbao, Spain

Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2000] arXiv:2309.16704 (cross-list from q-bio.NC) [pdf, other]: Title: Memories in the Making: Predicting Video Memorability with Encoding Phase EEG

Lorin Sweeney, Graham Healy, Alan F. Smeaton

Comments: Content-Based Multimedia Indexing, CBMI, September 20-22, Orleans, France, 2023

Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[2001] arXiv:2309.16773 (cross-list from cs.LG) [pdf, other]: Title: Neural scaling laws for phenotypic drug discovery

Drew Linsley, John Griffin, Jason Parker Brown, Adam N Roose, Michael Frank, Peter Linsley, Steven Finkbeiner, Jeremy Linsley

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[2002] arXiv:2309.16818 (cross-list from cs.RO) [pdf, other]: Title: MEM: Multi-Modal Elevation Mapping for Robotics and Learning

Gian Erni, Jonas Frey, Takahiro Miki, Matias Mattamala, Marco Hutter

Comments: Accapted for IROS2023. This work has been submitted to the IEEE for possible publication

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2003] arXiv:2309.16878 (cross-list from cs.LG) [pdf, other]: Title: Investigating Human-Identifiable Features Hidden in Adversarial Perturbations

Dennis Y. Menn, Tzu-hsun Feng, Sriram Vishwanath, Hung-yi Lee

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2004] arXiv:2309.16898 (cross-list from cs.RO) [pdf, other]: Title: A Sign Language Recognition System with Pepper, Lightweight-Transformer, and LLM

JongYoon Lim, Inkyu Sa, Bruce MacDonald, Ho Seok Ahn

Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[2005] arXiv:2309.16916 (cross-list from cs.LG) [pdf, other]: Title: ONNXExplainer: an ONNX Based Generic Framework to Explain Neural Networks Using Shapley Values

Yong Zhao, Runxin He, Nicholas Kersting, Can Liu, Shubham Agrawal, Chiranjeet Chetia, Yu Gu

Comments: 11 pages, 11 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2006] arXiv:2309.17002 (cross-list from cs.LG) [pdf, html, other]: Title: Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

Comments: ICLR 2024 Spotlight

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2007] arXiv:2309.17036 (cross-list from cs.RO) [pdf, other]: Title: UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling

Linghao Yang, Yanmin Wu, Yu Deng, Rui Tian, Xinggang Hu, Tiefeng Ma

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2008] arXiv:2309.17076 (cross-list from eess.IV) [pdf, other]: Title: Benefits of mirror weight symmetry for 3D mesh segmentation in biomedical applications

Vladislav Dordiuk, Maksim Dzhigil, Konstantin Ushenin

Comments: was sent to IEEE conference

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2009] arXiv:2309.17133 (cross-list from cs.CL) [pdf, other]: Title: Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering

Weizhe Lin, Jinghong Chen, Jingbiao Mei, Alexandru Coca, Bill Byrne

Comments: To appear at NeurIPS 2023. This is the camera-ready version. We fixed some numbers and added more experiments to address reviewers' comments

Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2010] arXiv:2309.17160 (cross-list from cs.MM) [pdf, other]: Title: Redistributing the Precision and Content in 3D-LUT-based Inverse Tone-mapping for HDR/WCG Display

Cheng Guo, Leidong Fan, Qian Zhang, Hanyuan Liu, Kanglin Liu, Xiuhua Jiang

Comments: Accepted in CVMP2023 (the 20th ACM SIGGRAPH European Conference on Visual Media Production)

Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[2011] arXiv:2309.17170 (cross-list from cs.RO) [pdf, html, other]: Title: Robotic Grasping of Harvested Tomato Trusses Using Vision and Online Learning

Luuk van den Bent, Tomás Coleman, Robert Babuška

Comments: 7 pages, 7 figures

Journal-ref: Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2024, IEEE, pages 13947-13953

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2012] arXiv:2309.17189 (cross-list from cs.SD) [pdf, html, other]: Title: RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation

Samuel Pegg, Kai Li, Xiaolin Hu

Comments: Accepted by The Twelfth International Conference on Learning Representations (ICLR) 2024, see this https URL

Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[2013] arXiv:2309.17192 (cross-list from cs.LG) [pdf, other]: Title: A Survey of Incremental Transfer Learning: Combining Peer-to-Peer Federated Learning and Domain Incremental Learning for Multicenter Collaboration

Yixing Huang, Christoph Bert, Ahmed Gomaa, Rainer Fietkau, Andreas Maier, Florian Putz

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2014] arXiv:2309.17197 (cross-list from cs.LG) [pdf, other]: Title: An Investigation Into Race Bias in Random Forest Models Based on Breast DCE-MRI Derived Radiomics Features

Mohamed Huti, Tiarna Lee, Elinor Sawyer, Andrew P. King

Comments: Accepted for publication at the MICCAI Workshop on Fairness of AI in Medical Imaging (FAIMI) 2023

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2015] arXiv:2309.17209 (cross-list from cs.RO) [pdf, other]: Title: Robots That Can See: Leveraging Human Pose for Trajectory Prediction

Tim Salzmann, Lewis Chiang, Markus Ryll, Dorsa Sadigh, Carolina Parada, Alex Bewley

Comments: Project page: this https URL

Journal-ref: IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 7090-7097, Nov. 2023

Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[2016] arXiv:2309.17223 (cross-list from eess.IV) [pdf, other]: Title: Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study

Vladimir Despotovic, Sang-Yoon Kim, Ann-Christin Hau, Aliaksandra Kakoichankava, Gilbert Georg Klamminger, Felix Bruno Kleine Borgmann, Katrin B. M. Frauenknecht, Michel Mittelbronn, Petr V. Nazarov

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2017] arXiv:2309.17269 (cross-list from eess.IV) [pdf, html, other]: Title: Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN

Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen

Comments: 11 pages, 10 figures, in IEEE J-BHI, 2024

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2018] arXiv:2309.17320 (cross-list from eess.IV) [pdf, other]: Title: Development of a Deep Learning Method to Identify Acute Ischemic Stroke Lesions on Brain CT

Alessandro Fontanella, Wenwen Li, Grant Mair, Antreas Antoniou, Eleanor Platt, Paul Armitage, Emanuele Trucco, Joanna Wardlaw, Amos Storkey

Comments: 12 pages, 5 figures

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[2019] arXiv:2309.17334 (cross-list from eess.IV) [pdf, html, other]: Title: Multi-Depth Branch Network for Efficient Image Super-Resolution

Huiyuan Tian, Li Zhang, Shijian Li, Min Yao, Gang Pan

Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2020] arXiv:2309.17338 (cross-list from cs.RO) [pdf, other]: Title: Improving Trajectory Prediction in Dynamic Multi-Agent Environment by Dropping Waypoints

Pranav Singh Chib, Pravendra Singh

Comments: Under Review

Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2021] arXiv:2309.17341 (cross-list from cs.LG) [pdf, other]: Title: MixQuant: Mixed Precision Quantization with a Bit-width Optimization Search

Eliska Kloberdanz, Wei Le

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2022] arXiv:2309.17343 (cross-list from physics.optics) [pdf, other]: Title: Neural Lithography: Close the Design-to-Manufacturing Gap in Computational Optics with a 'Real2Sim' Learned Photolithography Simulator

Cheng Zheng, Guangyuan Zhao, Peter T.C. So

Comments: The paper, titled "Close the Design-to-Manufacturing Gap in Computational Optics with a 'Real2Sim' Learned Two-Photon Neural Lithography Simulator," has been accepted for presentation at SIGGRAPH Asia 2023. This version offers a more comprehensive and accessible read. Project page: this https URL

Subjects: Optics (physics.optics); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)

Total of 2022 entries : 1951-2022 2001-2022

Showing up to 2000 entries per page: fewer | more | all