Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computer Vision and Pattern Recognition

Authors and titles for September 2023

Total of 2022 entries : 1901-2022 2001-2022
Showing up to 2000 entries per page: fewer | more | all
[1901] arXiv:2309.13571 (cross-list from eess.IV) [pdf, other]
Title: Matrix Completion-Informed Deep Unfolded Equilibrium Models for Self-Supervised k-Space Interpolation in MRI
Chen Luo, Huayu Wang, Taofeng Xie, Qiyu Jin, Guoqing Chen, Zhuo-Xu Cui, Dong Liang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1902] arXiv:2309.13584 (cross-list from eess.IV) [pdf, other]
Title: Solving Low-Dose CT Reconstruction via GAN with Local Coherence
Wenjie Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1903] arXiv:2309.13587 (cross-list from eess.IV) [pdf, other]
Title: Benchmarking Encoder-Decoder Architectures for Biplanar X-ray to 3D Shape Reconstruction
Mahesh Shakya, Bishesh Khanal
Comments: accepted to NeurIPS 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1904] arXiv:2309.13742 (cross-list from cs.GR) [pdf, other]
Title: DROP: Dynamics Responses from Human Motion Prior and Projective Dynamics
Yifeng Jiang, Jungdam Won, Yuting Ye, C. Karen Liu
Comments: SIGGRAPH Asia 2023, Video this https URL, Website: this https URL
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1905] arXiv:2309.13745 (cross-list from cs.RO) [pdf, html, other]
Title: Overview of Computer Vision Techniques in Robotized Wire Harness Assembly: Current State and Future Opportunities
Hao Wang, Omkar Salunkhe, Walter Quadrini, Dan Lämkull, Fredrik Ore, Björn Johansson, Johan Stahre
Comments: Presented at the 56th CIRP Conference on Manufacturing Systems (CIRP CMS 2023), Cape Town, South Africa, 24-26 October 2023. Published in Procedia CIRP
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1906] arXiv:2309.13746 (cross-list from cs.RO) [pdf, other]
Title: Deep Learning-Based Connector Detection for Robotized Assembly of Automotive Wire Harnesses
Hao Wang, Björn Johansson
Comments: This paper has been accepted by IEEE CASE 2023 and has been presented on the conference. The information of the published version will be updated later
Journal-ref: 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), Auckland, New Zealand, 2023, pp. 1-8
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1907] arXiv:2309.13747 (cross-list from eess.IV) [pdf, html, other]
Title: Look Ma, no code: fine tuning nnU-Net for the AutoPET II challenge by only adjusting its JSON plans
Fabian Isensee, Klaus H.Maier-Hein
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1908] arXiv:2309.13770 (cross-list from cs.LG) [pdf, other]
Title: Devil in the Number: Towards Robust Multi-modality Data Filter
Yichen Xu, Zihan Xu, Wenhao Chai, Zhonghan Zhao, Enxin Song, Gaoang Wang
Comments: ICCV 2023 Workshop: TNGCV-DataComp
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1909] arXiv:2309.13773 (cross-list from cs.LG) [pdf, other]
Title: GHN-QAT: Training Graph Hypernetworks to Predict Quantization-Robust Parameters of Unseen Limited Precision Neural Networks
Stone Yun, Alexander Wong
Comments: Poster and extended abstract to be presented at the Workshop for Low Bit Quantized Neural Networks (LQBNN) @ ICCV 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1910] arXiv:2309.13777 (cross-list from eess.IV) [pdf, other]
Title: Diffeomorphic Multi-Resolution Deep Learning Registration for Applications in Breast MRI
Matthew G. French, Gonzalo D. Maso Talou, Thiranja P. Babarenda Gamage, Martyn P. Nash, Poul M. Nielsen, Anthony J. Doyle, Juan Eugenio Iglesias, Yaël Balbastre, Sean I. Young
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1911] arXiv:2309.13817 (cross-list from eess.IV) [pdf, other]
Title: MMA-Net: Multiple Morphology-Aware Network for Automated Cobb Angle Measurement
Zhengxuan Qiu, Jie Yang, Jiankun Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1912] arXiv:2309.13835 (cross-list from eess.IV) [pdf, html, other]
Title: IBVC: Interpolation-driven B-frame Video Compression
Chenming Xu, Meiqin Liu, Chao Yao, Weisi Lin, Yao Zhao
Comments: Submitted to Pattern Recognition
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1913] arXiv:2309.13839 (cross-list from eess.IV) [pdf, other]
Title: Fill the K-Space and Refine the Image: Prompting for Dynamic and Multi-Contrast MRI Reconstruction
Bingyu Xin, Meng Ye, Leon Axel, Dimitris N. Metaxas
Comments: STACOM 2023; Code is available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1914] arXiv:2309.13842 (cross-list from cs.RO) [pdf, other]
Title: Traj-LO: In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory
Xin Zheng, Jianke Zhu
Comments: Video this https URL and Project site this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1915] arXiv:2309.13866 (cross-list from cs.LG) [pdf, other]
Title: On Calibration of Modern Quantized Efficient Neural Networks
Joey Kuang, Alexander Wong
Comments: Accepted as an extended abstract at the ICCV 2023 Workshop on Low-Bit Quantized Neural Networks. Corrected some typos
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1916] arXiv:2309.13872 (cross-list from eess.IV) [pdf, other]
Title: Attention and Pooling based Sigmoid Colon Segmentation in 3D CT images
Md Akizur Rahman, Sonit Singh, Kuruparan Shanmugalingam, Sankaran Iyer, Alan Blair, Praveen Ravindran, Arcot Sowmya
Comments: 8 Pages, 6 figures, Accepted at IEEE DICTA 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1917] arXiv:2309.13885 (cross-list from cs.LG) [pdf, html, other]
Title: TouchUp-G: Improving Feature Representation through Graph-Centric Finetuning
Jing Zhu, Xiang Song, Vassilis N. Ioannidis, Danai Koutra, Christos Faloutsos
Comments: SIGIR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Social and Information Networks (cs.SI)
[1918] arXiv:2309.13893 (cross-list from cs.RO) [pdf, html, other]
Title: Scene Informer: Anchor-based Occlusion Inference and Trajectory Prediction in Partially Observable Environments
Bernard Lange, Jiachen Li, Mykel J. Kochenderfer
Comments: Accepted to 2024 IEEE International Conference on Robotics and Automation (ICRA)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1919] arXiv:2309.13980 (cross-list from eess.IV) [pdf, other]
Title: Better Generalization of White Matter Tract Segmentation to Arbitrary Datasets with Scaled Residual Bootstrap
Wan Liu, Chuyang Ye
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1920] arXiv:2309.14054 (cross-list from cs.LG) [pdf, html, other]
Title: Adapt then Unlearn: Exploring Parameter Space Semantics for Unlearning in Generative Adversarial Networks
Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A.P
Comments: Accepted at Transactions on Machine Learning Research (TMLR)
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1921] arXiv:2309.14068 (cross-list from cs.LG) [pdf, html, other]
Title: Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
Yangming Li, Boris van Breugel, Mihaela van der Schaar
Comments: Accepted by ICLR-2024
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1922] arXiv:2309.14090 (cross-list from cs.LG) [pdf, other]
Title: Convolutional autoencoder-based multimodal one-class classification
Firas Laakom, Fahad Sohrab, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj
Comments: 5 pages, 1 figure, 4 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1923] arXiv:2309.14198 (cross-list from cs.LG) [pdf, other]
Title: (Predictable) Performance Bias in Unsupervised Anomaly Detection
Felix Meissen, Svenja Breuer, Moritz Knolle, Alena Buyx, Ruth Müller, Georgios Kaissis, Benedikt Wiestler, Daniel Rückert
Comments: 11 pages, 5 Figures, 1 panel
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Image and Video Processing (eess.IV)
[1924] arXiv:2309.14211 (cross-list from cs.RO) [pdf, other]
Title: QuadricsNet: Learning Concise Representation for Geometric Primitives in Point Clouds
Ji Wu, Huai Yu, Wen Yang, Gui-Song Xia
Comments: Submitted to ICRA 2024. 7 pages
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1925] arXiv:2309.14236 (cross-list from cs.RO) [pdf, html, other]
Title: MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation
Patrick Lancaster, Nicklas Hansen, Aravind Rajeswaran, Vikash Kumar
Comments: 10 pages, 8 figures
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1926] arXiv:2309.14265 (cross-list from cs.RO) [pdf, html, other]
Title: Industrial Application of 6D Pose Estimation for Robotic Manipulation in Automotive Internal Logistics
Philipp Quentin, Dino Knoll, Daniel Goehring
Comments: Accepted for publication at IEEE International Conference on Automation Science and Engineering (CASE 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1927] arXiv:2309.14306 (cross-list from eess.IV) [pdf, other]
Title: DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning
Qingjie Meng, Wenjia Bai, Declan P O'Regan, and Daniel Rueckert
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1928] arXiv:2309.14329 (cross-list from cs.HC) [pdf, other]
Title: Innovative Digital Storytelling with AIGC: Exploration and Discussion of Recent Advances
Rongzhang Gu, Hui Li, Changyue Su, Wayne Wu
Comments: Project page: this https URL
Subjects: Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Multimedia (cs.MM)
[1929] arXiv:2309.14341 (cross-list from cs.RO) [pdf, other]
Title: Extreme Parkour with Legged Robots
Xuxin Cheng, Kexin Shi, Ananye Agarwal, Deepak Pathak
Comments: Website and videos at this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1930] arXiv:2309.14356 (cross-list from cs.LG) [pdf, other]
Title: COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs
Tiep Le, Vasudev Lal, Phillip Howard
Comments: Accepted to NeurIPS 2023 Datasets and Benchmarks Track
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1931] arXiv:2309.14360 (cross-list from cs.LG) [pdf, other]
Title: Domain-Guided Conditional Diffusion Model for Unsupervised Domain Adaptation
Yulong Zhang, Shuhao Chen, Weisen Jiang, Yu Zhang, Jiangang Lu, James T. Kwok
Comments: Work in progress
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1932] arXiv:2309.14392 (cross-list from eess.IV) [pdf, other]
Title: Unveiling Fairness Biases in Deep Learning-Based Brain MRI Reconstruction
Yuning Du, Yuyang Xue, Rohan Dharmakumar, Sotirios A. Tsaftaris
Comments: Accepted for publication at FAIMI 2023 (Fairness of AI in Medical Imaging) at MICCAI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1933] arXiv:2309.14425 (cross-list from cs.RO) [pdf, other]
Title: Self-Recovery Prompting: Promptable General Purpose Service Robot System with Foundation Models and Self-Recovery
Mimo Shirasaka, Tatsuya Matsushima, Soshi Tsunashima, Yuya Ikeda, Aoi Horo, So Ikoma, Chikaha Tsuji, Hikaru Wada, Tsunekazu Omija, Dai Komukai, Yutaka Matsuo Yusuke Iwasawa
Comments: Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Systems and Control (eess.SY)
[1934] arXiv:2309.14474 (cross-list from eess.IV) [pdf, other]
Title: Gastro-Intestinal Tract Segmentation Using an Explainable 3D Unet
Kai Li, Jonathan Chan
Comments: 5 pages, 8 figures, 13th Joint Symposium on Computational Intelligence (JSCI13)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1935] arXiv:2309.14483 (cross-list from astro-ph.SR) [pdf, other]
Title: Unveiling the Potential of Deep Learning Models for Solar Flare Prediction in Near-Limb Regions
Chetraj Pandey, Rafal A. Angryk, Berkay Aydin
Comments: This is a preprint accepted at the 22nd International Conference on Machine Learning and Applications (ICMLA), 2023. 7 Pages, 6 Figures
Subjects: Solar and Stellar Astrophysics (astro-ph.SR); Instrumentation and Methods for Astrophysics (astro-ph.IM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1936] arXiv:2309.14492 (cross-list from eess.IV) [pdf, other]
Title: AiAReSeg: Catheter Detection and Segmentation in Interventional Ultrasound using Transformers
Alex Ranne, Yordanka Velikova, Nassir Navab, Ferdinando Rodriguez y Baena
Comments: This work has been submitted to the IEEE for possible publication
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1937] arXiv:2309.14540 (cross-list from cs.LG) [pdf, other]
Title: Effect of roundabout design on the behavior of road users: A case study of roundabouts with application of Unsupervised Machine Learning
Tasnim M. Dwekat, Ayda A. Almsre, Huthaifa I. Ashqar
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
[1938] arXiv:2309.14550 (cross-list from eess.IV) [pdf, html, other]
Title: MEMO: Dataset and Methods for Robust Multimodal Retinal Image Registration with Large or Small Vessel Density Differences
Chiao-Yi Wang, Faranguisse Kakhi Sadrieh, Yi-Ting Shen, Shih-En Chen, Sarah Kim, Victoria Chen, Achyut Raghavendra, Dongyi Wang, Osamah Saeedi, Yang Tao
Comments: Biomedical Optics Express
Journal-ref: Biomed. Opt. Express 15, 3457-3479 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1939] arXiv:2309.14580 (cross-list from cs.LG) [pdf, other]
Title: CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
Rakshith Sharma Srinivasa, Jaejin Cho, Chouchang Yang, Yashas Malur Saidutta, Ching-Hua Lee, Yilin Shen, Hongxia Jin
Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2023 conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1940] arXiv:2309.14586 (cross-list from cs.SD) [pdf, other]
Title: Speech Audio Synthesis from Tagged MRI and Non-Negative Matrix Factorization via Plastic Transformer
Xiaofeng Liu, Fangxu Xing, Maureen Stone, Jiachen Zhuo, Sidney Fels, Jerry L. Prince, Georges El Fakhri, Jonghye Woo
Comments: MICCAI 2023 (Oral presentation)
Subjects: Sound (cs.SD); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
[1941] arXiv:2309.14591 (cross-list from eess.IV) [pdf, other]
Title: Applications of Sequential Learning for Medical Image Classification
Sohaib Naim, Brian Caffo, Haris I Sair, Craig K Jones
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1942] arXiv:2309.14630 (cross-list from econ.EM) [pdf, html, other]
Title: Free Discontinuity Regression: With an Application to the Economic Effects of Internet Shutdowns
Florian Gunsilius, David Van Dijcke
Comments: 24 pages, 3 figures, 2 tables; authors listed alphabetically; code available at this https URL
Subjects: Econometrics (econ.EM); Computer Vision and Pattern Recognition (cs.CV); Statistics Theory (math.ST); Applications (stat.AP); Methodology (stat.ME)
[1943] arXiv:2309.14655 (cross-list from cs.RO) [pdf, html, other]
Title: Probabilistic 3D Multi-Object Cooperative Tracking for Autonomous Driving via Differentiable Multi-Sensor Kalman Filter
Hsu-kuang Chiu, Chien-Yi Wang, Min-Hung Chen, Stephen F. Smith
Comments: Accepted by IEEE International Conference on Robotics and Automation (ICRA), 2024. Code: this https URL Video: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1944] arXiv:2309.14685 (cross-list from cs.RO) [pdf, html, other]
Title: DriveSceneGen: Generating Diverse and Realistic Driving Scenarios from Scratch
Shuo Sun, Zekai Gu, Tianchen Sun, Jiawei Sun, Chengran Yuan, Yuhang Han, Dongen Li, Marcelo H. Ang Jr
Comments: 8 pages, 5 figures, 2 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1945] arXiv:2309.14737 (cross-list from cs.RO) [pdf, html, other]
Title: Volumetric Semantically Consistent 3D Panoptic Mapping
Yang Miao, Iro Armeni, Marc Pollefeys, Daniel Barath
Comments: 8 pages, 2 figures
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1946] arXiv:2309.14759 (cross-list from cs.GR) [pdf, other]
Title: Diffusion-based Holistic Texture Rectification and Synthesis
Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui
Comments: SIGGRAPH Asia 2023 Conference Paper
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV)
[1947] arXiv:2309.14774 (cross-list from cs.LG) [pdf, other]
Title: BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning
Ching-Yu Chiang, I-Hua Chang, Shih-Wei Liao
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[1948] arXiv:2309.14816 (cross-list from cs.LG) [pdf, other]
Title: A Comparative Study of Population-Graph Construction Methods and Graph Neural Networks for Brain Age Regression
Kyriaki-Margarita Bintsi, Tamara T. Mueller, Sophie Starck, Vasileios Baltatzis, Alexander Hammers, Daniel Rueckert
Comments: Accepted at GRAIL, MICCAI 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1949] arXiv:2309.14949 (cross-list from cs.LG) [pdf, html, other]
Title: Towards Real-World Test-Time Adaptation: Tri-Net Self-Training with Balanced Normalization
Yongyi Su, Xun Xu, Kui Jia
Comments: Accepted by AAAI 2024. 19 pages, 7 figures and 22 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1950] arXiv:2309.15038 (cross-list from cs.LG) [pdf, html, other]
Title: HPCR: Holistic Proxy-based Contrastive Replay for Online Continual Learning
Huiwei Lin, Shanshan Feng, Baoquan Zhang, Xutao Li, Yunming Ye
Comments: 15 pages, 10 figures
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1951] arXiv:2309.15048 (cross-list from cs.LG) [pdf, html, other]
Title: Class Incremental Learning via Likelihood Ratio Based Task Prediction
Haowei Lin, Yijia Shao, Weinan Qian, Ningxin Pan, Yiduo Guo, Bing Liu
Journal-ref: ICLR 2024
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1952] arXiv:2309.15065 (cross-list from cs.RO) [pdf, html, other]
Title: Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding
Christina Kassab, Matias Mattamala, Lintong Zhang, Maurice Fallon
Comments: Accepted at ICRA 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1953] arXiv:2309.15135 (cross-list from cs.LG) [pdf, html, other]
Title: Contrastive Continual Multi-view Clustering with Filtered Structural Fusion
Xinhang Wan, Jiyuan Liu, Hao Yu, Ao Li, Xinwang Liu, Ke Liang, Zhibin Dong, En Zhu
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1954] arXiv:2309.15216 (cross-list from cs.LG) [pdf, other]
Title: A Comparative Study of Filters and Deep Learning Models to predict Diabetic Retinopathy
Roshan Vasu Muddaluru, Sharvaani Ravikumar Thoguluva, Shruti Prabha, Tanuja Konda Reddy, Suja Palaniswamy
Comments: 6 pages, 5 figures, I2CT , 2 tables
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1955] arXiv:2309.15243 (cross-list from eess.IV) [pdf, other]
Title: APIS: A paired CT-MRI dataset for ischemic stroke segmentation challenge
Santiago Gómez, Daniel Mantilla, Gustavo Garzón, Edgar Rangel, Andrés Ortiz, Franklin Sierra-Jerez, Fabio Martínez
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[1956] arXiv:2309.15245 (cross-list from cs.AI) [pdf, other]
Title: SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets
Daria Reshetova, Swetava Ganguli, C. V. Krishnakumar Iyer, Vipul Pandey
Comments: Extended version of the accepted research track paper at the 31st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2023), Hamburg, Germany. 11 pages, 8 figures, 6 tables
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1957] arXiv:2309.15259 (cross-list from quant-ph) [pdf, other]
Title: SLIQ: Quantum Image Similarity Networks on Noisy Quantum Computers
Daniel Silver, Tirthak Patel, Aditya Ranjan, Harshitta Gandhi, William Cutler, Devesh Tiwari
Journal-ref: Vol. 37 No. 8: AAAI-2023 Technical Tracks 8
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1958] arXiv:2309.15268 (cross-list from cs.RO) [pdf, other]
Title: ObVi-SLAM: Long-Term Object-Visual SLAM
Amanda Adkins, Taijing Chen, Joydeep Biswas
Comments: 8 pages, 7 figures, 1 table plus appendix with 4 figures and 1 table
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1959] arXiv:2309.15278 (cross-list from cs.RO) [pdf, html, other]
Title: Out of Sight, Still in Mind: Reasoning and Planning about Unobserved Objects with Video Tracking Enabled Memory Models
Yixuan Huang, Jialin Yuan, Chanho Kim, Pupul Pradhan, Bryan Chen, Li Fuxin, Tucker Hermans
Comments: Presented at IEEE Conference on Robotics and Automation (ICRA) 2024. Website: this https URL
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1960] arXiv:2309.15302 (cross-list from cs.RO) [pdf, other]
Title: STERLING: Self-Supervised Terrain Representation Learning from Unconstrained Robot Experience
Haresh Karnan, Elvin Yang, Daniel Farkash, Garrett Warnell, Joydeep Biswas, Peter Stone
Comments: Project website: this https URL
Journal-ref: Conference on Robot Learning (CoRL 2023)
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1961] arXiv:2309.15314 (cross-list from physics.med-ph) [pdf, other]
Title: Conversion of single-energy computed tomography to parametric maps of dual-energy computed tomography using convolutional neural network
Sangwook Kim, Jimin Lee, Jungye Kim, Bitbyeol Kim, Chang Heon Choi, Seongmoon Jung
Comments: 29 pages, 17 figures
Subjects: Medical Physics (physics.med-ph); Computer Vision and Pattern Recognition (cs.CV)
[1962] arXiv:2309.15332 (cross-list from cs.RO) [pdf, other]
Title: Multimodal Dataset for Localization, Mapping and Crop Monitoring in Citrus Tree Farms
Hanzhe Teng, Yipeng Wang, Xiaoao Song, Konstantinos Karydis
Comments: Accepted to the 18th International Symposium on Visual Computing (ISVC 2023)
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1963] arXiv:2309.15420 (cross-list from cs.LG) [pdf, other]
Title: The Triad of Failure Modes and a Possible Way Out
Emanuele Sansone
Comments: Some sentences in the Background Section are overlapping with Section 2 in arXiv:2304.11357 However, the main technical content and all other sections are different
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1964] arXiv:2309.15459 (cross-list from cs.RO) [pdf, html, other]
Title: GAMMA: Graspability-Aware Mobile MAnipulation Policy Learning based on Online Grasping Pose Fusion
Jiazhao Zhang, Nandiraju Gireesh, Jilong Wang, Xiaomeng Fang, Chaoyi Xu, Weiguang Chen, Liu Dai, He Wang
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1965] arXiv:2309.15477 (cross-list from cs.GR) [pdf, other]
Title: A Tutorial on Uniform B-Spline
Yi Zhou
Subjects: Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[1966] arXiv:2309.15485 (cross-list from eess.IV) [pdf, other]
Title: Style Transfer and Self-Supervised Learning Powered Myocardium Infarction Super-Resolution Segmentation
Lichao Wang, Jiahao Huang, Xiaodan Xing, Yinzhe Wu, Ramyah Rajakulasingam, Andrew D. Scott, Pedro F Ferreira, Ranil De Silva, Sonia Nielles-Vallespin, Guang Yang
Comments: 6 pages, 8 figures, conference, accepted by SIPAIM2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1967] arXiv:2309.15516 (cross-list from cs.CL) [pdf, html, other]
Title: Teaching Text-to-Image Models to Communicate in Dialog
Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao
Comments: Work in progress
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1968] arXiv:2309.15520 (cross-list from cs.LG) [pdf, other]
Title: SAF-Net: Self-Attention Fusion Network for Myocardial Infarction Detection using Multi-View Echocardiography
Ilke Adalioglu, Mete Ahishali, Aysen Degerli, Serkan Kiranyaz, Moncef Gabbouj
Comments: 4 pages, 3 figures, Computing in Cardiology (CinC) 2023
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1969] arXiv:2309.15521 (cross-list from cs.LG) [pdf, other]
Title: MLOps for Scarce Image Data: A Use Case in Microscopic Image Analysis
Angelo Yamachui Sitcheu, Nils Friederich, Simon Baeuerle, Oliver Neumann, Markus Reischl, Ralf Mikut
Comments: 21 pages, 5 figures , 33. Workshop on Computational Intelligence Berlin Germany
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[1970] arXiv:2309.15529 (cross-list from eess.IV) [pdf, other]
Title: Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Muyu Wang, Shiyu Fan, Yichen Li, Hui Chen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1971] arXiv:2309.15551 (cross-list from cs.LG) [pdf, html, other]
Title: DeepRepViz: Identifying Confounders in Deep Learning Model Predictions
Roshan Prakash Rane, JiHoon Kim, Arjun Umesha, Didem Stark, Marc-André Schulz, Kerstin Ritter
Journal-ref: MICCAI 2024. Lecture Notes in Computer Science, vol 15010. pp 186 to 196. Springer, Cham
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1972] arXiv:2309.15564 (cross-list from cs.LG) [pdf, other]
Title: Jointly Training Large Autoregressive Multimodal Models
Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1973] arXiv:2309.15573 (cross-list from cs.CG) [pdf, other]
Title: The Maximum Cover with Rotating Field of View
Igor Potapov, Jason Ralph, Theofilos Triommatis
Subjects: Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Algebraic Geometry (math.AG)
[1974] arXiv:2309.15596 (cross-list from cs.RO) [pdf, other]
Title: PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
Shizhe Chen, Ricardo Garcia, Cordelia Schmid, Ivan Laptev
Comments: Accepted to CoRL 2023. Project website: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1975] arXiv:2309.15608 (cross-list from eess.IV) [pdf, html, other]
Title: NoSENSE: Learned unrolled cardiac MRI reconstruction without explicit sensitivity maps
Felix Frederik Zimmermann, Andreas Kofler
Comments: Accepted at MICCAI STACOM 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[1976] arXiv:2309.15638 (cross-list from eess.IV) [pdf, html, other]
Title: RSF-Conv: Rotation-and-Scale Equivariant Fourier Parameterized Convolution for Retinal Vessel Segmentation
Zihong Sun, Hong Wang, Qi Xie, Yefeng Zheng, Deyu Meng
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1977] arXiv:2309.15696 (cross-list from cs.LG) [pdf, other]
Title: A Unified View of Differentially Private Deep Generative Modeling
Dingfan Chen, Raouf Kerkouche, Mario Fritz
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[1978] arXiv:2309.15750 (cross-list from eess.IV) [pdf, other]
Title: Automated CT Lung Cancer Screening Workflow using 3D Camera
Brian Teixeira, Vivek Singh, Birgi Tamersoy, Andreas Prokein, Ankur Kapoor
Comments: Accepted at MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1979] arXiv:2309.15792 (cross-list from quant-ph) [pdf, html, other]
Title: Quantum Block-Matching Algorithm using Dissimilarity Measure
M. Martínez-Felipe, J. Montiel-Pérez, V. Onofre, A. Maldonado-Romo, Ricky Young
Subjects: Quantum Physics (quant-ph); Computer Vision and Pattern Recognition (cs.CV)
[1980] arXiv:2309.15889 (cross-list from eess.IV) [pdf, html, other]
Title: High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models
Selim F. Yilmaz, Xueyan Niu, Bo Bai, Wei Han, Lei Deng, Deniz Gunduz
Comments: 6 pages, 5 figures. Published at INFOCOM 2024 Workshops
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Machine Learning (cs.LG); Multimedia (cs.MM)
[1981] arXiv:2309.15940 (cross-list from cs.RO) [pdf, other]
Title: Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs
Haonan Chang, Kowndinya Boyalakuntla, Shiyang Lu, Siwei Cai, Eric Jing, Shreesh Keskar, Shijie Geng, Adeeb Abbas, Lifeng Zhou, Kostas Bekris, Abdeslam Boularias
Comments: The code and dataset used for evaluation can be found at this https URL}{this https URL. This paper has been accepted by CoRL2023
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1982] arXiv:2309.15977 (cross-list from cs.SD) [pdf, other]
Title: Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields
Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[1983] arXiv:2309.16053 (cross-list from eess.IV) [pdf, other]
Title: Diagnosis of Helicobacter pylori using AutoEncoders for the Detection of Anomalous Staining Patterns in Immunohistochemistry Images
Pau Cano, Álvaro Caravaca, Debora Gil, Eva Musulen
Comments: 9 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1984] arXiv:2309.16058 (cross-list from cs.LG) [pdf, other]
Title: AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model
Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1985] arXiv:2309.16118 (cross-list from cs.RO) [pdf, html, other]
Title: D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
Yixuan Wang, Mingtong Zhang, Zhuoran Li, Tarik Kelestemur, Katherine Driggs-Campbell, Jiajun Wu, Li Fei-Fei, Yunzhu Li
Comments: Accepted to Conference on Robot Learning (CoRL 2024) as Oral Presentation. The first three authors contributed equally. Project Page: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1986] arXiv:2309.16140 (cross-list from cs.MM) [pdf, other]
Title: CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting
Shaoxiang Guo, Qing Cai, Lin Qi, Junyu Dong
Comments: Accepted In Proceedings of the 31st ACM International Conference on Multimedia (MM' 23)
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[1987] arXiv:2309.16143 (cross-list from cs.LG) [pdf, other]
Title: Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples
Shin'ya Yamaguchi
Comments: Accepted to the 15th Asian Conference on Machine Learning (ACML2023); a preprint of the camera-ready version
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[1988] arXiv:2309.16164 (cross-list from cs.RO) [pdf, other]
Title: Learning to Terminate in Object Navigation
Yuhang Song, Anh Nguyen, Chun-Yi Lee
Comments: 16 pages
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1989] arXiv:2309.16206 (cross-list from eess.IV) [pdf, other]
Title: Alzheimer's Disease Prediction via Brain Structural-Functional Deep Fusing Network
Qiankun Zuo, Junren Pan, Shuqiang Wang
Comments: 10 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1990] arXiv:2309.16210 (cross-list from eess.IV) [pdf, other]
Title: Abdominal multi-organ segmentation in CT using Swinunter
Mingjin Chen, Yongkang He, Yongyi Lu
Comments: 8pages. arXiv admin note: text overlap with arXiv:2201.01266 by other authors
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1991] arXiv:2309.16221 (cross-list from cs.RO) [pdf, other]
Title: Off-the-shelf bin picking workcell with visual pose estimation: A case study on the world robot summit 2018 kitting task
Frederik Hagelskjær, Kasper Høj Lorenzen, Dirk Kraft
Comments: 7 pages, 7 figures, 2 tables
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1992] arXiv:2309.16264 (cross-list from cs.RO) [pdf, html, other]
Title: GAMMA: Generalizable Articulation Modeling and Manipulation for Articulated Objects
Qiaojun Yu, Junbo Wang, Wenhai Liu, Ce Hao, Liu Liu, Lin Shao, Weiming Wang, Cewu Lu
Comments: 8 pages, 5 figures, ICRA 2024
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1993] arXiv:2309.16354 (cross-list from cs.LG) [pdf, html, other]
Title: Transformer-VQ: Linear-Time Transformers via Vector Quantization
Lucas D. Lingle
Comments: ICLR 2024 camera-ready
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[1994] arXiv:2309.16536 (cross-list from eess.IV) [pdf, other]
Title: Uncertainty Quantification for Eosinophil Segmentation
Kevin Lin, Donald Brown, Sana Syed, Adam Greene
Comments: Preprint, Final Article Submitted to ICBRA 2023 and will be published in the International Conference Proceedings by ACM, Association for Computing Machinery (ISBN: 979-8-4007-0815-2), which will be archived in ACM Digital Library, indexed by Ei Compendex and Scopus
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[1995] arXiv:2309.16569 (cross-list from cs.SD) [pdf, other]
Title: Audio-Visual Speaker Verification via Joint Cross-Attention
R. Gnana Praveen, Jahangir Alam
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Audio and Speech Processing (eess.AS)
[1996] arXiv:2309.16627 (cross-list from eess.IV) [pdf, other]
Title: Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images
Shreyas H Ramananda, Vaanathi Sundaresan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[1997] arXiv:2309.16633 (cross-list from cs.LG) [pdf, html, other]
Title: SupReMix: Supervised Contrastive Learning for Medical Imaging Regression with Mixup
Yilei Wu, Zijian Dong, Chongyao Chen, Wangchunshu Zhou, Juan Helen Zhou
Comments: The first two authors equally contributed to this work. Previously titled "Mixup Your Own Pair", content extended and revised
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[1998] arXiv:2309.16650 (cross-list from cs.RO) [pdf, other]
Title: ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning
Qiao Gu, Alihusein Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull
Comments: Project page: this https URL Explainer video: this https URL
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[1999] arXiv:2309.16702 (cross-list from cs.AI) [pdf, other]
Title: Prediction and Interpretation of Vehicle Trajectories in the Graph Spectral Domain
Marion Neumeier, Sebastian Dorn, Michael Botsch, Wolfgang Utschick
Comments: Accepted as a conference paper for IEEE ITSC 2023, Bilbao, Spain
Subjects: Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2000] arXiv:2309.16704 (cross-list from q-bio.NC) [pdf, other]
Title: Memories in the Making: Predicting Video Memorability with Encoding Phase EEG
Lorin Sweeney, Graham Healy, Alan F. Smeaton
Comments: Content-Based Multimedia Indexing, CBMI, September 20-22, Orleans, France, 2023
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Signal Processing (eess.SP)
[2001] arXiv:2309.16773 (cross-list from cs.LG) [pdf, other]
Title: Neural scaling laws for phenotypic drug discovery
Drew Linsley, John Griffin, Jason Parker Brown, Adam N Roose, Michael Frank, Peter Linsley, Steven Finkbeiner, Jeremy Linsley
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[2002] arXiv:2309.16818 (cross-list from cs.RO) [pdf, other]
Title: MEM: Multi-Modal Elevation Mapping for Robotics and Learning
Gian Erni, Jonas Frey, Takahiro Miki, Matias Mattamala, Marco Hutter
Comments: Accapted for IROS2023. This work has been submitted to the IEEE for possible publication
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2003] arXiv:2309.16878 (cross-list from cs.LG) [pdf, other]
Title: Investigating Human-Identifiable Features Hidden in Adversarial Perturbations
Dennis Y. Menn, Tzu-hsun Feng, Sriram Vishwanath, Hung-yi Lee
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2004] arXiv:2309.16898 (cross-list from cs.RO) [pdf, other]
Title: A Sign Language Recognition System with Pepper, Lightweight-Transformer, and LLM
JongYoon Lim, Inkyu Sa, Bruce MacDonald, Ho Seok Ahn
Subjects: Robotics (cs.RO); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
[2005] arXiv:2309.16916 (cross-list from cs.LG) [pdf, other]
Title: ONNXExplainer: an ONNX Based Generic Framework to Explain Neural Networks Using Shapley Values
Yong Zhao, Runxin He, Nicholas Kersting, Can Liu, Shubham Agrawal, Chiranjeet Chetia, Yu Gu
Comments: 11 pages, 11 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2006] arXiv:2309.17002 (cross-list from cs.LG) [pdf, html, other]
Title: Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks
Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj
Comments: ICLR 2024 Spotlight
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2007] arXiv:2309.17036 (cross-list from cs.RO) [pdf, other]
Title: UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and Light-Weight Modeling
Linghao Yang, Yanmin Wu, Yu Deng, Rui Tian, Xinggang Hu, Tiefeng Ma
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
[2008] arXiv:2309.17076 (cross-list from eess.IV) [pdf, other]
Title: Benefits of mirror weight symmetry for 3D mesh segmentation in biomedical applications
Vladislav Dordiuk, Maksim Dzhigil, Konstantin Ushenin
Comments: was sent to IEEE conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2009] arXiv:2309.17133 (cross-list from cs.CL) [pdf, other]
Title: Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering
Weizhe Lin, Jinghong Chen, Jingbiao Mei, Alexandru Coca, Bill Byrne
Comments: To appear at NeurIPS 2023. This is the camera-ready version. We fixed some numbers and added more experiments to address reviewers' comments
Subjects: Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[2010] arXiv:2309.17160 (cross-list from cs.MM) [pdf, other]
Title: Redistributing the Precision and Content in 3D-LUT-based Inverse Tone-mapping for HDR/WCG Display
Cheng Guo, Leidong Fan, Qian Zhang, Hanyuan Liu, Kanglin Liu, Xiuhua Jiang
Comments: Accepted in CVMP2023 (the 20th ACM SIGGRAPH European Conference on Visual Media Production)
Subjects: Multimedia (cs.MM); Computer Vision and Pattern Recognition (cs.CV)
[2011] arXiv:2309.17170 (cross-list from cs.RO) [pdf, html, other]
Title: Robotic Grasping of Harvested Tomato Trusses Using Vision and Online Learning
Luuk van den Bent, Tomás Coleman, Robert Babuška
Comments: 7 pages, 7 figures
Journal-ref: Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2024, IEEE, pages 13947-13953
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2012] arXiv:2309.17189 (cross-list from cs.SD) [pdf, html, other]
Title: RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation
Samuel Pegg, Kai Li, Xiaolin Hu
Comments: Accepted by The Twelfth International Conference on Learning Representations (ICLR) 2024, see this https URL
Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Audio and Speech Processing (eess.AS)
[2013] arXiv:2309.17192 (cross-list from cs.LG) [pdf, other]
Title: A Survey of Incremental Transfer Learning: Combining Peer-to-Peer Federated Learning and Domain Incremental Learning for Multicenter Collaboration
Yixing Huang, Christoph Bert, Ahmed Gomaa, Rainer Fietkau, Andreas Maier, Florian Putz
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[2014] arXiv:2309.17197 (cross-list from cs.LG) [pdf, other]
Title: An Investigation Into Race Bias in Random Forest Models Based on Breast DCE-MRI Derived Radiomics Features
Mohamed Huti, Tiarna Lee, Elinor Sawyer, Andrew P. King
Comments: Accepted for publication at the MICCAI Workshop on Fairness of AI in Medical Imaging (FAIMI) 2023
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2015] arXiv:2309.17209 (cross-list from cs.RO) [pdf, other]
Title: Robots That Can See: Leveraging Human Pose for Trajectory Prediction
Tim Salzmann, Lewis Chiang, Markus Ryll, Dorsa Sadigh, Carolina Parada, Alex Bewley
Comments: Project page: this https URL
Journal-ref: IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 7090-7097, Nov. 2023
Subjects: Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
[2016] arXiv:2309.17223 (cross-list from eess.IV) [pdf, other]
Title: Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study
Vladimir Despotovic, Sang-Yoon Kim, Ann-Christin Hau, Aliaksandra Kakoichankava, Gilbert Georg Klamminger, Felix Bruno Kleine Borgmann, Katrin B. M. Frauenknecht, Michel Mittelbronn, Petr V. Nazarov
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2017] arXiv:2309.17269 (cross-list from eess.IV) [pdf, html, other]
Title: Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN
Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen
Comments: 11 pages, 10 figures, in IEEE J-BHI, 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2018] arXiv:2309.17320 (cross-list from eess.IV) [pdf, other]
Title: Development of a Deep Learning Method to Identify Acute Ischemic Stroke Lesions on Brain CT
Alessandro Fontanella, Wenwen Li, Grant Mair, Antreas Antoniou, Eleanor Platt, Paul Armitage, Emanuele Trucco, Joanna Wardlaw, Amos Storkey
Comments: 12 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[2019] arXiv:2309.17334 (cross-list from eess.IV) [pdf, html, other]
Title: Multi-Depth Branch Network for Efficient Image Super-Resolution
Huiyuan Tian, Li Zhang, Shijian Li, Min Yao, Gang Pan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[2020] arXiv:2309.17338 (cross-list from cs.RO) [pdf, other]
Title: Improving Trajectory Prediction in Dynamic Multi-Agent Environment by Dropping Waypoints
Pranav Singh Chib, Pravendra Singh
Comments: Under Review
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[2021] arXiv:2309.17341 (cross-list from cs.LG) [pdf, other]
Title: MixQuant: Mixed Precision Quantization with a Bit-width Optimization Search
Eliska Kloberdanz, Wei Le
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[2022] arXiv:2309.17343 (cross-list from physics.optics) [pdf, other]
Title: Neural Lithography: Close the Design-to-Manufacturing Gap in Computational Optics with a 'Real2Sim' Learned Photolithography Simulator
Cheng Zheng, Guangyuan Zhao, Peter T.C. So
Comments: The paper, titled "Close the Design-to-Manufacturing Gap in Computational Optics with a 'Real2Sim' Learned Two-Photon Neural Lithography Simulator," has been accepted for presentation at SIGGRAPH Asia 2023. This version offers a more comprehensive and accessible read. Project page: this https URL
Subjects: Optics (physics.optics); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR)
Total of 2022 entries : 1901-2022 2001-2022
Showing up to 2000 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack