Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > eess.IV

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Image and Video Processing

Authors and titles for January 2024

Total of 319 entries
Showing up to 2000 entries per page: fewer | more | all
[1] arXiv:2401.00008 [pdf, other]
Title: Multispectral palmprint recognition based on three descriptors: LBP, Shift LBP, and Multi Shift LBP with LDA classifier
Salwua Aqreerah, Alhaam Alariyibi, Wafa El-Tarhouni
Journal-ref: 2022 IEEE 2nd International Maghreb Meeting of the Conference on Sciences and Techniques of Automatic Control and Computer Engineering (MI-STA), pp. 506 - 510
Subjects: Image and Video Processing (eess.IV)
[2] arXiv:2401.00016 [pdf, html, other]
Title: Prototype-Based Approach for One-Shot Segmentation of Brain Tumors using Few-Shot Learning
Ahmed Ayman
Comments: Further Improvements
Subjects: Image and Video Processing (eess.IV)
[3] arXiv:2401.00023 [pdf, html, other]
Title: CycleGAN Models for MRI Image Translation
Cassandra Czobit, Reza Samavi
Comments: Accepted and presented in ACML PRHA 2023 workshop
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[4] arXiv:2401.00135 [pdf, other]
Title: Deep Radon Prior: A Fully Unsupervised Framework for Sparse-View CT Reconstruction
Shuo Xu, Yucheng Zhang, Gang Chen, Xincheng Xiang, Peng Cong, Yuewen Sun
Comments: 11 pages, 12 figures, Journal paper
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[5] arXiv:2401.00153 [pdf, html, other]
Title: USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis
Jing Jiao, Jin Zhou, Xiaokang Li, Menghua Xia, Yi Huang, Lihong Huang, Na Wang, Xiaofan Zhang, Shichong Zhou, Yuanyuan Wang, Yi Guo
Comments: Submit to MedIA, 17 pages, 11 figures
Subjects: Image and Video Processing (eess.IV)
[6] arXiv:2401.00159 [pdf, html, other]
Title: Automatic hip osteoarthritis grading with uncertainty estimation from computed tomography using digitally-reconstructed radiographs
Masachika Masuda, Mazen Soufi, Yoshito Otake, Keisuke Uemura, Sotaro Kono, Kazuma Takashima, Hidetoshi Hamada, Yi Gu, Masaki Takao, Seiji Okada, Nobuhiko Sugano, Yoshinobu Sato
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[7] arXiv:2401.00275 [pdf, html, other]
Title: An $\ell^1$-Plug-and-Play Approach for MPI Using a Zero Shot Denoiser with Evaluation on the 3D Open MPI Dataset
Vladyslav Gapyak, Corinna Rentschler, Thomas März, Andreas Weinmann
Comments: 24 pages, 7 figures, additional supplementary material (78 pages total)
Journal-ref: Phys. Med. Biol. (70) 025028 (2025)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Numerical Analysis (math.NA)
[8] arXiv:2401.00314 [pdf, html, other]
Title: GAN-GA: A Generative Model based on Genetic Algorithm for Medical Image Generation
M. AbdulRazek, G. Khoriba, M. Belal
Comments: 10 pages, 2 figures. Abstract published in Frontiers in Medical Technology, presented at the 27th Conference on Medical Image Understanding and Analysis 2023. DOI: https://doi.org/10.3389/978-2-8325-1231-9. URL: this https URL
Journal-ref: 27th Conference on Medical Image Understanding and Analysis 2023, Frontiers, 2023, pp. 30-39
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
[9] arXiv:2401.00523 [pdf, html, other]
Title: Compressing Deep Image Super-resolution Models
Yuxuan Jiang, Jakub Nawala, Fan Zhang, David Bull
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[10] arXiv:2401.00587 [pdf, other]
Title: Brain Tumor Segmentation Based on Deep Learning, Attention Mechanisms, and Energy-Based Uncertainty Prediction
Zachary Schwehr, Sriman Achanta
Comments: 11 pages, 6 figures, code available at this https URL, submitted to Computers in Biology and Medicine
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[11] arXiv:2401.00692 [pdf, html, other]
Title: Self-supervised learning for skin cancer diagnosis with limited training data
Hamish Haggerty, Rohitash Chandra
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[12] arXiv:2401.00728 [pdf, html, other]
Title: MultiFusionNet: Multilayer Multimodal Fusion of Deep Neural Networks for Chest X-Ray Image Classification
Saurabh Agarwal, K. V. Arya, Yogesh Kumar Meena
Comments: 19 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[13] arXiv:2401.00740 [pdf, html, other]
Title: Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution
Zeke Zexi Hu, Xiaoming Chen, Vera Yuk Ying Chung, Yiran Shen
Comments: Accepted by IEEE Transactions on Multimedia
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[14] arXiv:2401.00859 [pdf, html, other]
Title: Federated Multi-View Synthesizing for Metaverse
Yiyu Guo, Zhijin Qin, Xiaoming Tao, Geoffrey Ye Li
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[15] arXiv:2401.00875 [pdf, html, other]
Title: SASA: Saliency-Aware Self-Adaptive Snapshot Compressive Imaging
Yaping Zhao, Edmund Y. Lam
Comments: 5 pages, 4 figures
Subjects: Image and Video Processing (eess.IV)
[16] arXiv:2401.00877 [pdf, html, other]
Title: Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution
Lingchen Sun, Rongyuan Wu, Jie Liang, Zhengqiang Zhang, Hongwei Yong, Lei Zhang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[17] arXiv:2401.01160 [pdf, html, other]
Title: Train-Free Segmentation in MRI with Cubical Persistent Homology
Anton François, Raphaël Tinarrage
Comments: preprint, 17 pages, 19 figures
Subjects: Image and Video Processing (eess.IV); Computational Geometry (cs.CG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[18] arXiv:2401.01303 [pdf, html, other]
Title: Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images
Subin Sahayam, Umarani Jayaraman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[19] arXiv:2401.01386 [pdf, other]
Title: Tissue Artifact Segmentation and Severity Analysis for Automated Diagnosis Using Whole Slide Images
Galib Muhammad Shahriar Himel
Comments: Master's thesis, 60 pages, 21 figures, 16 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[20] arXiv:2401.01414 [pdf, html, other]
Title: VALD-MD: Visual Attribution via Latent Diffusion for Medical Diagnostics
Ammar A. Siddiqui (1), Santosh Tirunagari (1), Tehseen Zia (2), David Windridge (1) ((1) Middlesex University, London, UK, (2) COMSATS University, Islamabad, Pakistan)
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[21] arXiv:2401.01496 [pdf, html, other]
Title: From Pixel to Slide image: Polarization Modality-based Pathological Diagnosis Using Representation Learning
Jia Dong, Yao Yao, Yang Dong, Hui Ma
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[22] arXiv:2401.01539 [pdf, other]
Title: DDPM based X-ray Image Synthesizer
Praveen Mahaulpatha, Thulana Abeywardane, Tomson George
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[23] arXiv:2401.01553 [pdf, html, other]
Title: Multi-modal Learning with Missing Modality in Predicting Axillary Lymph Node Metastasis
Shichuan Zhang, Sunyi Zheng, Zhongyi Shui, Honglin Li, Lin Yang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[24] arXiv:2401.01654 [pdf, html, other]
Title: LESEN: Label-Efficient deep learning for Multi-parametric MRI-based Visual Pathway Segmentation
Alou Diakite (1 and 2), Cheng Li (1), Lei Xie (3), Yuanjing Feng (3), Hua Han (1 and 2), Shanshan Wang (1 and 4) ( (1) Paul C. Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, (2) University of Chinese Academy of Sciences, Beijing, China, (3) Zhejiang University of Technology, Hangzhou, China, (4) Peng Cheng Laboratory, Shenzhen, China)
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[25] arXiv:2401.01685 [pdf, other]
Title: Modality Exchange Network for Retinogeniculate Visual Pathway Segmentation
Hua Han (1 and 2), Cheng Li (1), Lei Xie (3), Yuanjing Feng (3), Alou Diakite (1 and 2), Shanshan Wang (1 and 4) ((1) Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, (2) University of Chinese Academy of Sciences, Beijing, China, (3) College of Information Engineering, Zhejiang University of Technology, Hangzhou, China, (4) Peng Cheng Laboratory, Shenzhen, China)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[26] arXiv:2401.02145 [pdf, html, other]
Title: ED: Perceptually tuned Enhanced Compression Model
Pierrick Philippe, Théo Ladune, Stéphane Davenet, Thomas Leguay
Comments: Challenge on Learned Image Compression (CLIC), DCC2024
Subjects: Image and Video Processing (eess.IV)
[27] arXiv:2401.02156 [pdf, html, other]
Title: Cool-Chic: Perceptually Tuned Low Complexity Overfitted Image Coder
Théo Ladune, Pierrick Philippe, Gordon Clare, Félix Henry, Thomas Leguay
Comments: Challenge on Learned Image Compression (CLIC), DCC2024
Subjects: Image and Video Processing (eess.IV)
[28] arXiv:2401.02192 [pdf, other]
Title: Nodule detection and generation on chest X-rays: NODE21 Challenge
Ecem Sogancioglu, Bram van Ginneken, Finn Behrendt, Marcel Bengs, Alexander Schlaefer, Miron Radu, Di Xu, Ke Sheng, Fabien Scalzo, Eric Marcus, Samuele Papa, Jonas Teuwen, Ernst Th. Scholten, Steven Schalekamp, Nils Hendrix, Colin Jacobs, Ward Hendrix, Clara I Sánchez, Keelin Murphy
Comments: 15 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[29] arXiv:2401.02358 [pdf, other]
Title: A novel method to enhance pneumonia detection via a model-level ensembling of CNN and vision transformer
Sandeep Angara, Nishith Reddy Mannuru, Aashrith Mannuru, Sharath Thirunagaru
Comments: NA
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[30] arXiv:2401.02537 [pdf, other]
Title: Using Singular Value Decomposition in a Convolutional Neural Network to Improve Brain Tumor Segmentation Accuracy
Pegah Ahadian, Maryam Babaei, Kourosh Parand
Journal-ref: International Journal of Computer Science and Information Technology 2022
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[31] arXiv:2401.02565 [pdf, html, other]
Title: Demonstration of an Adversarial Attack Against a Multimodal Vision Language Model for Pathology Imaging
Poojitha Thota, Jai Prakash Veerla, Partha Sai Guttikonda, Mohammad S. Nasr, Shirin Nilizadeh, Jacob M. Luber
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Tissues and Organs (q-bio.TO)
[32] arXiv:2401.02759 [pdf, other]
Title: Detection and Classification of Diabetic Retinopathy using Deep Learning Algorithms for Segmentation to Facilitate Referral Recommendation for Test and Treatment Prediction
Manoj S H, Arya A Bosale
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[33] arXiv:2401.02794 [pdf, html, other]
Title: Subjective and Objective Analysis of Indian Social Media Video Quality
Sandeep Mishra, Mukul Jha, Alan C. Bovik
Comments: Submitted to the IEEE Transactions on Image Processing
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[34] arXiv:2401.02884 [pdf, other]
Title: MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS)
Youhao Yu, Richard M. Dansereau
Comments: 15 pages, 8 figures, open access journal paper
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[35] arXiv:2401.02962 [pdf, other]
Title: Automated Localization of Blood Vessels in Retinal Images
Vahid Mohammadi Safarzadeh
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[36] arXiv:2401.03002 [pdf, html, other]
Title: Prompt-driven Latent Domain Generalization for Medical Image Classification
Siyuan Yan, Chi Liu, Zhen Yu, Lie Ju, Dwarikanath Mahapatra, Brigid Betz-Stablein, Victoria Mar, Monika Janda, Peter Soyer, Zongyuan Ge
Comments: 10 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[37] arXiv:2401.03060 [pdf, other]
Title: Super-resolution multi-contrast unbiased eye atlases with deep probabilistic refinement
Ho Hin Lee, Adam M. Saunders, Michael E. Kim, Samuel W. Remedios, Lucas W. Remedios, Yucheng Tang, Qi Yang, Xin Yu, Shunxing Bao, Chloe Cho, Louise A. Mawn, Tonia S. Rex, Kevin L. Schey, Blake E. Dewey, Jeffrey M. Spraggins, Jerry L. Prince, Yuankai Huo, Bennett A. Landman
Comments: Published in SPIE Journal of Medical Imaging (this https URL). 27 pages, 6 figures
Journal-ref: J. Med. Imag. 11(6), 064004 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[38] arXiv:2401.03132 [pdf, other]
Title: Vision Transformers and Bi-LSTM for Alzheimer's Disease Diagnosis from 3D MRI
Taymaz Akan, Sait Alp, Mohammad A. N Bhuiyanb
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[39] arXiv:2401.03150 [pdf, html, other]
Title: O-PRESS: Boosting OCT axial resolution with Prior guidance, Recurrence, and Equivariant Self-Supervision
Kaiyan Li, Jingyuan Yang, Wenxuan Liang, Xingde Li, Chenxi Zhang, Lulu Chen, Chan Wu, Xiao Zhang, Zhiyan Xu, Yuelin Wang, Lihui Meng, Yue Zhang, Youxin Chen, S.Kevin Zhou
Subjects: Image and Video Processing (eess.IV)
[40] arXiv:2401.03166 [pdf, html, other]
Title: Short-Time Fourier Transform for deblurring Variational Autoencoders
Vibhu Dalal
Comments: 9 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[41] arXiv:2401.03173 [pdf, html, other]
Title: UGGNet: Bridging U-Net and VGG for Advanced Breast Cancer Diagnosis
Tran Cao Minh, Nguyen Kim Quoc, Phan Cong Vinh, Dang Nhu Phu, Vuong Xuan Chi, Ha Minh Tan
Comments: Submitted to the journal "EAI Endorsed Transactions on Context-aware Systems and Applications" ,2 images, 5 data tables
Journal-ref: EAI Endorsed Transactions on Contex-aware Systems and Applications, 10(1), 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[42] arXiv:2401.03271 [pdf, html, other]
Title: Analysis and Validation of Image Search Engines in Histopathology
Isaiah Lahr, Saghir Alfasly, Peyman Nejat, Jibran Khan, Luke Kottom, Vaishnavi Kumbhar, Areej Alsaafin, Abubakr Shafique, Sobhan Hemati, Ghazal Alabtah, Nneka Comfere, Dennis Murphee, Aaron Mangold, Saba Yasir, Chady Meroueh, Lisa Boardman, Vijay H. Shah, Joaquin J. Garcia, H.R. Tizhoosh
Journal-ref: IEEE Reviews in Biomedical Engineering, 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR)
[43] arXiv:2401.03302 [pdf, html, other]
Title: Realism in Action: Anomaly-Aware Diagnosis of Brain Tumors from Medical Images Using YOLOv8 and DeiT
Seyed Mohammad Hossein Hashemi, Leila Safari, Mohsen Hooshmand, Amirhossein Dadashzadeh Taromi
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
[44] arXiv:2401.03495 [pdf, html, other]
Title: Segment Anything Model for Medical Image Segmentation: Current Applications and Future Directions
Yichi Zhang, Zhenrong Shen, Rushi Jiao
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[45] arXiv:2401.03615 [pdf, html, other]
Title: Automated Detection of Myopic Maculopathy in MMAC 2023: Achievements in Classification, Segmentation, and Spherical Equivalent Prediction
Yihao Li, Philippe Zhang, Yubo Tan, Jing Zhang, Zhihan Wang, Weili Jiang, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec, Mostafa El Habib Daho
Comments: 18 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[46] arXiv:2401.03621 [pdf, other]
Title: Machine Learning Applications in Traumatic Brain Injury: A Spotlight on Mild TBI
Hanem Ellethy, Shekhar S. Chandra, Viktor Vegh
Comments: The manuscript has 34 pages, 3 figures, and 4 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[47] arXiv:2401.03623 [pdf, other]
Title: A Video Coding Method Based on Neural Network for CLIC2024
Zhengang Li, Jingchi Zhang, Yonghua Wang, Xing Zeng, Zhen Zhang, Yunlin Long, Menghu Jia, Ning Wang
Subjects: Image and Video Processing (eess.IV)
[48] arXiv:2401.03664 [pdf, other]
Title: Dual-Channel Reliable Breast Ultrasound Image Classification Based on Explainable Attribution and Uncertainty Quantification
Shuge Lei, Haonan Hu, Dasheng Sun, Huabin Zhang, Kehong Yuan, Jian Dai, Jijun Tang, Yan Tong
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[49] arXiv:2401.03885 [pdf, html, other]
Title: Hyperspectral Image Denoising via Spatial-Spectral Recurrent Transformer
Guanyiman Fu, Fengchao Xiong, Jianfeng Lu, Jun Zhou, Jiantao Zhou, Yuntao Qian
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[50] arXiv:2401.03912 [pdf, other]
Title: Attention-Guided Erasing: A Novel Augmentation Method for Enhancing Downstream Breast Density Classification
Adarsh Bhandary Panambur, Hui Yu, Sheethal Bhat, Prathmesh Madhu, Siming Bayer, Andreas Maier
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[51] arXiv:2401.03922 [pdf, other]
Title: SNeurodCNN: Structure-focused Neurodegeneration Convolutional Neural Network for Modelling and Classification of Alzheimer's Disease
Simisola Odimayo, Chollette C. Olisah, Khadija Mohammed
Comments: 36 Pages, 10 figures, 4 tables
Journal-ref: Scientific Reports 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[52] arXiv:2401.04079 [pdf, html, other]
Title: RudolfV: A Foundation Model by Pathologists for Pathologists
Jonas Dippel, Barbara Feulner, Tobias Winterhoff, Timo Milbich, Stephan Tietz, Simon Schallenberg, Gabriel Dernbach, Andreas Kunft, Simon Heinke, Marie-Lisa Eich, Julika Ribbat-Idel, Rosemarie Krupar, Philipp Anders, Niklas Prenißl, Philipp Jurmeister, David Horst, Lukas Ruff, Klaus-Robert Müller, Frederick Klauschen, Maximilian Alber
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[53] arXiv:2401.04110 [pdf, other]
Title: Bridging Machine Learning and Clinical Diagnosis: An Explainable Biomarker for ß-Amyloid PET Imaging
Janos Barbero, Ana Franceschi, Luca Giliberto, Patrick Phuoc Do, David Petrover, Jack Nhat Truong, Sean Clouston, Nha Nguyen, Marc Gordon, An Vo
Comments: Submitted to the 2024 OHBM Annual Meeting
Subjects: Image and Video Processing (eess.IV)
[54] arXiv:2401.04244 [pdf, html, other]
Title: Spatio-Temporal Turbulence Mitigation: A Translational Perspective
Xingguang Zhang, Nicholas Chimitt, Yiheng Chi, Zhiyuan Mao, Stanley H. Chan
Comments: Accepted by CVPR 2024, project page this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[55] arXiv:2401.04393 [pdf, html, other]
Title: OrthoSeisnet: Seismic Inversion through Orthogonal Multi-scale Frequency Domain U-Net for Geophysical Exploration
Supriyo Chakraborty, Aurobinda Routray, Sanjay Bhargav Dharavath, Tanmoy Dam
Comments: Under review, once the paper is accepted, the copyright will be transferred to the corresponding journal
Subjects: Image and Video Processing (eess.IV)
[56] arXiv:2401.04412 [pdf, html, other]
Title: Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation
Linshan Wu, Ming Lu, Leyuan Fang
Comments: A paper accepted by TGRS
Subjects: Image and Video Processing (eess.IV)
[57] arXiv:2401.04570 [pdf, html, other]
Title: An Automatic Cascaded Model for Hemorrhagic Stroke Segmentation and Hemorrhagic Volume Estimation
Weijin Xu, Zhuang Sha, Huihua Yang, Rongcai Jiang, Zhanying Li, Wentao Liu, Ruisheng Su
Comments: Accepted by SWITCH2023: Stroke Workshop on Imaging and Treatment CHallenges, a workshop at MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2401.04722 [pdf, other]
Title: U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation
Jun Ma, Feifei Li, Bo Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[59] arXiv:2401.04740 [pdf, other]
Title: Segment anything model (SAM) for brain extraction in fMRI studies
Dwith Chenna, Suyash Bhogawar
Journal-ref: International Journal of Artificial Intelligence In Medicine (IJAIMED, Volume 1, Issue 01, Jan-Dec 2023, pp. 1-8
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[60] arXiv:2401.04746 [pdf, other]
Title: Skin Cancer Segmentation and Classification Using Vision Transformer for Automatic Analysis in Dermatoscopy-based Non-invasive Digital System
Galib Muhammad Shahriar Himel, Md. Masudul Islam, Kh Abdullah Al-Aff, Shams Ibne Karim, Md. Kabir Uddin Sikder
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[61] arXiv:2401.04953 [pdf, html, other]
Title: Adaptive-avg-pooling based Attention Vision Transformer for Face Anti-spoofing
Jichen Yang, Fangfan Chen, Rohan Kumar Das, Zhengyu Zhu, Shunsi Zhang
Comments: Accepted for Publication in IEEE ICASSP 2024
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[62] arXiv:2401.05030 [pdf, html, other]
Title: An event-based implementation of saliency-based visual attention for rapid scene analysis
Camille Simon Chane, Ernst Niebur, Ryad Benosman, Sio-Hoi Ieng
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[63] arXiv:2401.05137 [pdf, html, other]
Title: DISCOVER: 2-D Multiview Summarization of Optical Coherence Tomography Angiography for Automatic Diabetic Retinopathy Diagnosis
Mostafa El Habib Daho, Yihao Li, Rachid Zeghlache, Hugo Le Boité, Pierre Deman, Laurent Borderie, Hugang Ren, Niranchana Mannivanan, Capucine Lepicard, Béatrice Cochener, Aude Couturier, Ramin Tadayoni, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec
Journal-ref: Artificial Intelligence in Medicine 2024, 102803
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[64] arXiv:2401.05481 [pdf, html, other]
Title: Transformer-CNN Fused Architecture for Enhanced Skin Lesion Segmentation
Siddharth Tiwari
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[65] arXiv:2401.05682 [pdf, other]
Title: Adaptive Regularized Low-Rank Tensor Decomposition for Hyperspectral Image Denoising and Destriping
Dongyi Li, Dong Chu, Xiaobin Guan, Wei He, Huanfeng Shen
Subjects: Image and Video Processing (eess.IV)
[66] arXiv:2401.05915 [pdf, html, other]
Title: Neural Implicit Surface Reconstruction of Freehand 3D Ultrasound Volume with Geometric Constraints
Hongbo Chen, Logiraj Kumaralingam, Shuhang Zhang, Sheng Song, Fayi Zhang, Haibin Zhang, Thanh-Tu Pham, Edmond H. M. Lou, Kumaradevan Punithakumar, Yuyao Zhang, Lawrence H. Le, Rui Zheng
Comments: Preprint
Subjects: Image and Video Processing (eess.IV)
[67] arXiv:2401.06148 [pdf, html, other]
Title: Artificial Intelligence for Digital and Computational Pathology
Andrew H. Song, Guillaume Jaume, Drew F.K. Williamson, Ming Y. Lu, Anurag Vaidya, Tiffany R. Miller, Faisal Mahmood
Journal-ref: Nature Reviews Bioengineering 2023
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[68] arXiv:2401.06150 [pdf, html, other]
Title: D-STGCNT: A Dense Spatio-Temporal Graph Conv-GRU Network based on transformer for assessment of patient physical rehabilitation
Youssef Mourchid, Rim Slama
Comments: 15 pages, Computers in Biology and Medicine Journal
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69] arXiv:2401.06174 [pdf, other]
Title: Machine Learning Applications in Spine Biomechanics
Farshid Ghezelbash, Amir Hossein Eskandari, Xavier Robert-Lachaine, Frank Cao, Mehran Pesteie, Zhuohua Qiao, Aboulfazl Shirazi-Adl, Christian Larivière
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[70] arXiv:2401.06180 [pdf, other]
Title: Decentralized Gossip Mutual Learning (GML) for automatic head and neck tumor segmentation
Jingyun Chen, Yading Yuan
Comments: 6 pages, 1 figure, accepted to SPIE Medical Imaging 2024
Subjects: Image and Video Processing (eess.IV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[71] arXiv:2401.06224 [pdf, html, other]
Title: Leveraging Frequency Domain Learning in 3D Vessel Segmentation
Xinyuan Wang, Chengwei Pan, Hongming Dai, Gangming Zhao, Jinpeng Li, Xiao Zhang, Yizhou Yu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[72] arXiv:2401.06272 [pdf, other]
Title: Segmentation of Mediastinal Lymph Nodes in CT with Anatomical Priors
Tejas Sudharshan Mathai, Bohan Liu, Ronald M. Summers
Comments: Submitted to CARS 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2401.06349 [pdf, html, other]
Title: ADAPT: Alzheimer Diagnosis through Adaptive Profiling Transformers
Yifeng Wang, Ke Chen, Haohan Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[74] arXiv:2401.06499 [pdf, html, other]
Title: Fully Automated Tumor Segmentation for Brain MRI data using Multiplanner UNet
Sumit Pandey, Satyasaran Changdar, Mathias Perslev, Erik B Dam
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[75] arXiv:2401.06517 [pdf, html, other]
Title: LiDAR Depth Map Guided Image Compression Model
Alessandro Gnutti, Stefano Della Fiore, Mattia Savardi, Yi-Hsin Chen, Riccardo Leonardi, Wen-Hsiao Peng
Subjects: Image and Video Processing (eess.IV)
[76] arXiv:2401.06744 [pdf, html, other]
Title: Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting
Niklas Kämper, Vassillen Chizhov, Joachim Weickert
Subjects: Image and Video Processing (eess.IV); Numerical Analysis (math.NA)
[77] arXiv:2401.06747 [pdf, html, other]
Title: Efficient Parallel Data Optimization for Homogeneous Diffusion Inpainting of 4K Images
Niklas Kämper, Vassillen Chizhov, Joachim Weickert
Subjects: Image and Video Processing (eess.IV); Numerical Analysis (math.NA)
[78] arXiv:2401.06777 [pdf, html, other]
Title: Multimodal Neuroimaging Attention-Based architecture for Cognitive Decline Prediction
Jamie Vo, Naeha Sharif, Ghulam Mubashar Hassan
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[79] arXiv:2401.06780 [pdf, html, other]
Title: HA-HI: Synergising fMRI and DTI through Hierarchical Alignments and Hierarchical Interactions for Mild Cognitive Impairment Diagnosis
Xiongri Shen, Zhenxi Song, Linling Li, Min Zhang, Lingyan Liang Honghai Liu, Demao Deng, Zhiguo Zhang
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2401.06893 [pdf, html, other]
Title: Local Gamma Augmentation for Ischemic Stroke Lesion Segmentation on MRI
Jon Middleton, Marko Bauer, Kaining Sheng, Jacob Johansen, Mathias Perslev, Silvia Ingala, Mads Nielsen, Akshay Pai
Comments: Camera-ready version for Northern Lights Deep Learning Conference 2024, 7 pages, 2 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2401.07020 [pdf, other]
Title: Empowering Medical Imaging with Artificial Intelligence: A Review of Machine Learning Approaches for the Detection, and Segmentation of COVID-19 Using Radiographic and Tomographic Images
Sayed Amir Mousavi Mobarakeh, Kamran Kazemi, Ardalan Aarabi, Habibollah Danyal
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[82] arXiv:2401.07041 [pdf, html, other]
Title: An automated framework for brain vessel centerline extraction from CTA images
Sijie Liu, Ruisheng Su, Jianghang Su, Jingmin Xin, Jiayi Wu, Wim van Zwam, Pieter Jan van Doormaal, Aad van der Lugt, Wiro J. Niessen, Nanning Zheng, Theo van Walsum
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[83] arXiv:2401.07126 [pdf, html, other]
Title: IVIM-Morph: Motion-compensated quantitative Intra-voxel Incoherent Motion (IVIM) analysis for functional fetal lung maturity assessment from diffusion-weighted MRI data
Noga Kertes, Yael Zaffrani-Reznikov, Onur Afacan, Sila Kurugol, Simon K. Warfield, Moti Freiman
Comments: Accepted for publication in the journal: "Medical Image Analysis"
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[84] arXiv:2401.07326 [pdf, html, other]
Title: Beyond Traditional Approaches: Multi-Task Network for Breast Ultrasound Diagnosis
Dat T. Chung, Minh-Anh Dang, Mai-Anh Vu, Minh T. Nguyen, Thanh-Huy Nguyen, Vinh Q. Dinh
Comments: 7 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[85] arXiv:2401.07751 [pdf, other]
Title: DeepThalamus: A novel deep learning method for automatic segmentation of brain thalamic nuclei from multimodal ultra-high resolution MRI
Marina Ruiz-Perez, Sergio Morell-Ortega, Marien Gadea, Roberto Vivo-Hernando, Gregorio Rubio, Fernando Aparici, Mariam de la Iglesia-Vaya, Thomas Tourdias, Pierrick Coupé, José V. Manjón
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2401.07782 [pdf, html, other]
Title: Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing
Jakob Hackstein, Gencer Sumbul, Kai Norman Clasen, Begüm Demir
Comments: Accepted at the IEEE Transactions on Geoscience and Remote Sensing. Our code is available at this https URL
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2401.07957 [pdf, other]
Title: Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models
Dan Jacobellis, Daniel Cummings, Neeraja J. Yadwadkar
Comments: 10 pages; abridged version published in IEEE Data Compression Conference 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
[88] arXiv:2401.07990 [pdf, other]
Title: How does self-supervised pretraining improve robustness against noisy labels across various medical image classification datasets?
Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Cristian Linte
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[89] arXiv:2401.08098 [pdf, other]
Title: Attention-Based CNN-BiLSTM for Sleep State Classification of Spatiotemporal Wide-Field Calcium Imaging Data
Xiaohui Zhang, Eric C. Landsness, Hanyang Miao, Wei Chen, Michelle Tang, Lindsey M. Brier, Joseph P. Culver, Jin-Moo Lee, Mark A. Anastasio
Subjects: Image and Video Processing (eess.IV); Neurons and Cognition (q-bio.NC)
[90] arXiv:2401.08213 [pdf, other]
Title: Ship Detection in SAR Images with Human-in-the-Loop
Hecheng Jia, Feng Xu
Subjects: Image and Video Processing (eess.IV)
[91] arXiv:2401.08404 [pdf, other]
Title: Training and Comparison of nnU-Net and DeepMedic Methods for Autosegmentation of Pediatric Brain Tumors
Arastoo Vossough, Nastaran Khalili, Ariana M. Familiar, Deep Gandhi, Karthik Viswanathan, Wenxin Tu, Debanjan Haldar, Sina Bagheri, Hannah Anderson, Shuvanjan Haldar, Phillip B. Storm, Adam Resnick, Jeffrey B. Ware, Ali Nabavizadeh, Anahita Fathi Kazerooni
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[92] arXiv:2401.08409 [pdf, html, other]
Title: Faster ISNet for Background Bias Mitigation on Deep Neural Networks
Pedro R. A. S. Bassi, Sergio Decherchi, Andrea Cavalli
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (cs.LG)
[93] arXiv:2401.08469 [pdf, other]
Title: Explanations of Classifiers Enhance Medical Image Segmentation via End-to-end Pre-training
Jiamin Chen, Xuhong Li, Yanwu Xu, Mengnan Du, Haoyi Xiong
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94] arXiv:2401.08699 [pdf, other]
Title: On Image Search in Histopathology
H.R. Tizhoosh, Liron Pantanowitz
Comments: A chapter in the Book "Artificial INtelligence in Digital Pathology" by Cohen and Chauhan, 2024
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Information Retrieval (cs.IR); Quantitative Methods (q-bio.QM)
[95] arXiv:2401.08821 [pdf, other]
Title: Surface-Enhanced Raman Spectroscopy and Transfer Learning Toward Accurate Reconstruction of the Surgical Zone
Ashutosh Raman, Ren A. Odion, Kent K. Yamamoto, Weston Ross, Tuan Vo-Dinh, Patrick J. Codd
Comments: Accepted to Hamlyn Symposium on Medical Robotics, 2023
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Robotics (cs.RO)
[96] arXiv:2401.08847 [pdf, html, other]
Title: RIDGE: Reproducibility, Integrity, Dependability, Generalizability, and Efficiency Assessment of Medical Image Segmentation Models
Farhad Maleki, Linda Moy, Reza Forghani, Tapotosh Ghosh, Katie Ovens, Steve Langer, Pouria Rouzrokh, Bardia Khosravi, Ali Ganjizadeh, Daniel Warren, Roxana Daneshjou, Mana Moassefi, Atlas Haddadi Avval, Susan Sotardi, Neil Tenenholtz, Felipe Kitamura, Timothy Kline
Comments: 24 pages, 1 Figure, 2 Table
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[97] arXiv:2401.08920 [pdf, html, other]
Title: Idempotence and Perceptual Image Compression
Tongda Xu, Ziran Zhu, Dailan He, Yanghao Li, Lina Guo, Yuanyuan Wang, Zhe Wang, Hongwei Qin, Yan Wang, Jingjing Liu, Ya-Qin Zhang
Comments: ICLR 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[98] arXiv:2401.09019 [pdf, html, other]
Title: Change Detection Between Optical Remote Sensing Imagery and Map Data via Segment Anything Model (SAM)
Hongruixuan Chen, Jian Song, Naoto Yokoya
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[99] arXiv:2401.09283 [pdf, other]
Title: A gradient-based approach to fast and accurate head motion compensation in cone-beam CT
Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Goldmann, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Alexander Preuhs, Michael Manhart, Andreas Maier
Comments: ©2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works
Journal-ref: in IEEE Transactions on Medical Imaging (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[100] arXiv:2401.09336 [pdf, html, other]
Title: To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection
Luyi Han, Tao Tan, Tianyu Zhang, Yuan Gao, Xin Wang, Valentina Longo, Sofía Ventura-Díaz, Anna D'Angelo, Jonas Teuwen, Ritse Mann
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[101] arXiv:2401.09428 [pdf, html, other]
Title: Multispectral Stereo-Image Fusion for 3D Hyperspectral Scene Reconstruction
Eric L. Wisotzky, Jost Triller, Anna Hilsmann, Peter Eisert
Comments: VISAPP 2024 - 19th International Conference on Computer Vision Theory and Applications
Journal-ref: In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 3: VISAPP; ISBN 978-989-758-679-8, SciTePress, pages 88-99, 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[102] arXiv:2401.09460 [pdf, other]
Title: Image Restoration: A Comparative Analysis of Image De noising Using Different Spatial Filtering Techniques
E. G. Onyedinma, I. E. Onyenwe
Comments: 9 pages, 10 figures
Journal-ref: INTERNATIONAL JOURNAL OF LATEST TECHNOLOGY IN ENGINEERING, MANAGEMENT & APPLIED SCIENCE (IJLTEMAS) Volume XII, Issue XI, November 2023
Subjects: Image and Video Processing (eess.IV); Information Retrieval (cs.IR)
[103] arXiv:2401.09471 [pdf, other]
Title: Brain Tumor Radiogenomic Classification
Amr Mohamed, Mahmoud Rabea, Aya Sameh, Ehab Kamal
Comments: 6 Pages with 4 Tables, 4 Figures and 4 Images
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[104] arXiv:2401.09481 [pdf, other]
Title: 3DMASC: Accessible, explainable 3D point clouds classification. Application to Bi-spectral Topo-bathymetric lidar data
Mathilde Letard (GR), Dimitri Lague (GR), Arthur Le Guennec (GR), Sébastien Lefèvre (OBELIX), Baptiste Feldmann (GR), Paul Leroy (GR), Daniel Girardeau-Montaut, Thomas Corpetti (LETG - Rennes)
Journal-ref: ISPRS Journal of Photogrammetry and Remote Sensing, 2024, 207, pp.175-197
Subjects: Image and Video Processing (eess.IV)
[105] arXiv:2401.09508 [pdf, html, other]
Title: 4D-ONIX: A deep learning approach for reconstructing 3D movies from sparse X-ray projections
Yuhe Zhang, Zisheng Yao, Robert Klöfkorn, Tobias Ritschel, Pablo Villanueva-Perez
Subjects: Image and Video Processing (eess.IV); Data Analysis, Statistics and Probability (physics.data-an)
[106] arXiv:2401.09624 [pdf, html, other]
Title: MITS-GAN: Safeguarding Medical Imaging from Tampering with Generative Adversarial Networks
Giovanni Pasqualino, Luca Guarnera, Alessandro Ortis, Sebastiano Battiato
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[107] arXiv:2401.09627 [pdf, other]
Title: SymTC: A Symbiotic Transformer-CNN Net for Instance Segmentation of Lumbar Spine MRI
Jiasong Chen, Linchen Qian, Linhai Ma, Timur Urakov, Weiyong Gu, Liang Liang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[108] arXiv:2401.09630 [pdf, html, other]
Title: CT Liver Segmentation via PVT-based Encoding and Refined Decoding
Debesh Jha, Nikhil Kumar Tomar, Koushik Biswas, Gorkem Durak, Alpay Medetalibeyoglu, Matthew Antalek, Yury Velichko, Daniela Ladner, Amir Borhani, Ulas Bagci
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[109] arXiv:2401.09638 [pdf, html, other]
Title: Automatic 3D Multi-modal Ultrasound Segmentation of Human Placenta using Fusion Strategies and Deep Learning
Sonit Singh, Gordon Stevenson, Brendan Mein, Alec Welsh, Arcot Sowmya
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[110] arXiv:2401.09639 [pdf, other]
Title: Uncertainty Modeling in Ultrasound Image Segmentation for Precise Fetal Biometric Measurements
Shuge Lei
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2401.09791 [pdf, other]
Title: BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images
Negar Golestani, Aihui Wang, Gregory R Bean, Mirabela Rusu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[112] arXiv:2401.09797 [pdf, html, other]
Title: Memory Efficient Corner Detection for Event-driven Dynamic Vision Sensors
Pao-Sheng Vincent Sun, Arren Glover, Chiara Bartolozzi, Arindam Basu
Subjects: Image and Video Processing (eess.IV)
[113] arXiv:2401.09817 [pdf, html, other]
Title: Automatic Tuning of Denoising Algorithms Parameters Without Ground Truth
Arthur Floquet, Sayantan Dutta, Emmanuel Soubies, Duong Hung Pham, Denis Kouame
Subjects: Image and Video Processing (eess.IV)
[114] arXiv:2401.09833 [pdf, html, other]
Title: Slicer Networks
Hang Zhang, Xiang Chen, Rongguang Wang, Renjiu Hu, Dongdong Liu, Gaolei Li
Comments: 8 figures and 3 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[115] arXiv:2401.09980 [pdf, other]
Title: A Comparative Analysis of U-Net-based models for Segmentation of Cardiac MRI
Ketan Suhaas Saichandran
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[116] arXiv:2401.10128 [pdf, other]
Title: Sub2Full: split spectrum to boost OCT despeckling without clean data
Lingyun Wang, Jose A Sahel, Shaohua Pi
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[117] arXiv:2401.10129 [pdf, html, other]
Title: Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain Study
Alejandro Galán-Cuenca, Antonio Javier Gallego, Marcelo Saval-Calvo, Antonio Pertusa
Comments: Submited to Pattern Analysis and Applications
Journal-ref: Pattern Anal Applic 27, 69 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2401.10345 [pdf, html, other]
Title: Attack and Defense Analysis of Learned Image Compression
Tianyu Zhu, Heming Sun, Xiankui Xiong, Xuanpeng Zhu, Yong Gong, Minge jing, Yibo Fan
Subjects: Image and Video Processing (eess.IV)
[119] arXiv:2401.10373 [pdf, html, other]
Title: Harmonized Spatial and Spectral Learning for Robust and Generalized Medical Image Segmentation
Vandan Gorade, Sparsh Mittal, Debesh Jha, Rekha Singhal, Ulas Bagci
Comments: Early Accepted at ICPR-2024 for Oral Presentation
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120] arXiv:2401.10389 [pdf, html, other]
Title: Inverse Problem Approach to Aberration Correction for in vivo Transcranial Imaging Based on a Sparse Representation of Contrast-enhanced Ultrasound Data
Paul Xing, Antoine Malescot, Eric Martineau, Ravi Rungta, Jean Provost
Subjects: Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[121] arXiv:2401.10419 [pdf, other]
Title: M3BUNet: Mobile Mean Max UNet for Pancreas Segmentation on CT-Scans
Juwita juwita, Ghulam Mubashar Hassan, Naveed Akhtar, Amitava Datta
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[122] arXiv:2401.10561 [pdf, html, other]
Title: MAEDiff: Masked Autoencoder-enhanced Diffusion Models for Unsupervised Anomaly Detection in Brain Images
Rui Xu, Yunke Wang, Bo Du
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[123] arXiv:2401.10637 [pdf, html, other]
Title: Towards Universal Unsupervised Anomaly Detection in Medical Imaging
Cosmin I. Bercea, Benedikt Wiestler, Daniel Rueckert, Julia A. Schnabel
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[124] arXiv:2401.10709 [pdf, html, other]
Title: Dense 3D Reconstruction Through Lidar: A Comparative Study on Ex-vivo Porcine Tissue
Guido Caccianiga, Julian Nubert, Marco Hutter, Katherine J. Kuchenbecker
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
[125] arXiv:2401.10732 [pdf, html, other]
Title: Bridging the gap between image coding for machines and humans
Nam Le, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed Rezazadegan Tavakoli, Emre Aksu, Miska M. Hannuksela, Esa Rahtu
Journal-ref: IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 2022, pp. 3411-3415
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2401.10761 [pdf, html, other]
Title: NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines
Jukka I. Ahonen, Nam Le, Honglei Zhang, Antti Hallapuro, Francesco Cricri, Hamed Rezazadegan Tavakoli, Miska M. Hannuksela, Esa Rahtu
Comments: ISM 2023 Best paper award winner version
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[127] arXiv:2401.10793 [pdf, other]
Title: TDC-less Direct Time-of-Flight Imaging Using Spiking Neural Networks
Jack MacLean, Brian Stewart, Istvan Gyongy
Comments: 7 Pages, 9 Figures
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[128] arXiv:2401.10958 [pdf, html, other]
Title: Detection of Thermal Events by Semi-Supervised Learning for Tokamak First Wall Safety
Christian Staron (IRFM), Hervé Le Borgne (CEA, LIST (CEA)), Raphaël Mitteau (IRFM), Erwan Grelier (IRFM), Nicolas Allezard (DIASI)
Subjects: Image and Video Processing (eess.IV)
[129] arXiv:2401.10966 [pdf, html, other]
Title: HOPE: Hybrid-granularity Ordinal Prototype Learning for Progression Prediction of Mild Cognitive Impairment
Chenhui Wang, Yiming Lei, Tao Chen, Junping Zhang, Yuxin Li, Hongming Shan
Comments: IEEE Journal of Biomedical and Health Informatics, 2024
Journal-ref: IEEE Journal of Biomedical and Health Informatics, 2024
Subjects: Image and Video Processing (eess.IV)
[130] arXiv:2401.11224 [pdf, html, other]
Title: Susceptibility of Adversarial Attack on Medical Image Segmentation Models
Zhongxuan Wang, Leo Xu
Comments: 6 pages, 8 figures, presented at 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI) conference
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[131] arXiv:2401.11270 [pdf, html, other]
Title: RoTIR: Rotation-Equivariant Network and Transformers for Fish Scale Image Registration
Ruixiong Wang, Alin Achim, Renata Raele-Rolfe, Qiao Tong, Dylan Bergen, Chrissy Hammond, Stephen Cross
Comments: 7 pages, 4 figures, 2 tables
Subjects: Image and Video Processing (eess.IV)
[132] arXiv:2401.11413 [pdf, html, other]
Title: Image detection using combinatorial auction
Simon Anuk, Tamir Bendory, Amichai Painsky
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP); Optimization and Control (math.OC)
[133] arXiv:2401.11464 [pdf, other]
Title: Task-specific regularization loss towards model calibration for reliable lung cancer detection
Mehar Prateek Kalra, Mansi Singhal, Rohan Raju Dhanakashirur
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[134] arXiv:2401.11615 [pdf, html, other]
Title: Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding
Yichi Zhang, Zhihao Duan, Ming Lu, Dandan Ding, Fengqing Zhu, Zhan Ma
Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)
Subjects: Image and Video Processing (eess.IV)
[135] arXiv:2401.11671 [pdf, html, other]
Title: RTA-Former: Reverse Transformer Attention for Polyp Segmentation
Zhikai Li, Murong Yi, Ali Uneri, Sihan Niu, Craig Jones
Comments: The paper has been accepted by EMBC 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[136] arXiv:2401.11675 [pdf, html, other]
Title: ATFusion: An Alternate Cross-Attention Transformer Network for Infrared and Visible Image Fusion
Han Yan, Songlei Xiong, Long Wang, Lihua Jian, Gemine Vivone
Comments: v2 Update: Enhanced version with improved model stability (new augmentation/regularization), added benchmarks (SwinFuse/AEFusion/LRRNet, FMI/SSIM), and RGB-NIR dataset. Updated authorship (this http URL first, this http URL corresponding). Under review at Infrared Physics & Technology. Original: arXiv:2401.11675v1
Subjects: Image and Video Processing (eess.IV)
[137] arXiv:2401.11856 [pdf, html, other]
Title: MOSformer: Momentum encoder-based inter-slice fusion transformer for medical image segmentation
De-Xing Huang, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Zhen-Qiu Feng, Zeng-Guang Hou
Comments: Under Review
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[138] arXiv:2401.11859 [pdf, other]
Title: LKFormer: Large Kernel Transformer for Infrared Image Super-Resolution
Feiwei Qin, Kang Yan, Changmiao Wang, Ruiquan Ge, Yong Peng, Kai Zhang
Comments: 14 pages, 4 figures, accept Multimedia Tools and Applications
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2401.11902 [pdf, other]
Title: A Training-Free Defense Framework for Robust Learned Image Compression
Myungseo Song, Jinyoung Choi, Bohyung Han
Comments: 10 pages and 14 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[140] arXiv:2401.12004 [pdf, other]
Title: NLCG-Net: A Model-Based Zero-Shot Learning Framework for Undersampled Quantitative MRI Reconstruction
Xinrui Jiang, Yohan Jun, Jaejin Cho, Mengze Gao, Xingwang Yong, Berkin Bilgic
Comments: 8 pages, 5 figures, submitted to International Society for Magnetic Resonance in Medicine 2024
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Signal Processing (eess.SP)
[141] arXiv:2401.12074 [pdf, html, other]
Title: DeepCERES: A Deep learning method for cerebellar lobule segmentation using ultra-high resolution multimodal MRI
Sergio Morell-Ortega, Marina Ruiz-Perez, Marien Gadea, Roberto Vivo-Hernando, Gregorio Rubio, Fernando Aparici, Maria de la Iglesia-Vaya, Gwenaelle Catheline, Pierrick Coupé, José V. Manjón
Comments: 20 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[142] arXiv:2401.12167 [pdf, other]
Title: Dynamic Semantic Compression for CNN Inference in Multi-access Edge Computing: A Graph Reinforcement Learning-based Autoencoder
Nan Li, Alexandros Iosifidis, Qi Zhang
Comments: arXiv admin note: text overlap with arXiv:2211.13745
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[143] arXiv:2401.12438 [pdf, html, other]
Title: Secure Federated Learning Approaches to Diagnosing COVID-19
Rittika Adhikari, Christopher Settles
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
[144] arXiv:2401.12488 [pdf, other]
Title: An Automated Real-Time Approach for Image Processing and Segmentation of Fluoroscopic Images and Videos Using a Single Deep Learning Network
Viet Dung Nguyen, Michael T. LaCour, Richard D. Komistek
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[145] arXiv:2401.12587 [pdf, html, other]
Title: An Efficient Implicit Neural Representation Image Codec Based on Mixed Autoregressive Model for Low-Complexity Decoding
Xiang Liu, Jiahong Chen, Bin Chen, Zimo Liu, Baoyi An, Shu-Tao Xia, Zhi Wang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[146] arXiv:2401.12725 [pdf, html, other]
Title: Two-View Topogram-Based Anatomy-Guided CT Reconstruction for Prospective Risk Minimization
Chang Liu, Laura Klein, Yixing Huang, Edith Baader, Michael Lell, Marc Kachelrieß, Andreas Maier
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2401.12771 [pdf, other]
Title: Deep Learning-based Intraoperative MRI Reconstruction
Jon André Ottesen, Tryggve Storas, Svein Are Sirirud Vatnehol, Grethe Løvland, Einar O. Vik-Mo, Till Schellhorn, Karoline Skogen, Christopher Larsson, Atle Bjørnerud, Inge Rasmus Groote-Eindbaas, Matthan W.A. Caan
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Medical Physics (physics.med-ph)
[148] arXiv:2401.12932 [pdf, html, other]
Title: Segmentation of tibiofemoral joint tissues from knee MRI using MtRA-Unet and incorporating shape information: Data from the Osteoarthritis Initiative
Akshay Daydar, Alik Pramanick, Arijit Sur, Subramani Kanagaraj
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2401.12938 [pdf, html, other]
Title: Neural deformation fields for template-based reconstruction of cortical surfaces from MRI
Fabian Bongratz, Anne-Marie Rickmann, Christian Wachinger
Comments: To appear in Medical Image Analysis
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[150] arXiv:2401.12974 [pdf, html, other]
Title: SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI
Hanxue Gu, Roy Colglazier, Haoyu Dong, Jikai Zhang, Yaqian Chen, Zafer Yildiz, Yuwen Chen, Lin Li, Jichen Yang, Jay Willhite, Alex M. Meyer, Brian Guo, Yashvi Atul Shah, Emily Luo, Shipra Rajput, Sally Kuehn, Clark Bulleit, Kevin A. Wu, Jisoo Lee, Brandon Ramirez, Darui Lu, Jay M. Levin, Maciej A. Mazurowski
Comments: 15 pages, 15 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[151] arXiv:2401.13049 [pdf, html, other]
Title: CIS-UNet: Multi-Class Segmentation of the Aorta in Computed Tomography Angiography via Context-Aware Shifted Window Self-Attention
Muhammad Imran, Jonathan R Krebs, Veera Rajasekhar Reddy Gopu, Brian Fazzone, Vishal Balaji Sivaraman, Amarjeet Kumar, Chelsea Viscardi, Robert Evans Heithaus, Benjamin Shickel, Yuyin Zhou, Michol A Cooper, Wei Shao
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG)
[152] arXiv:2401.13114 [pdf, html, other]
Title: Viewport Prediction, Bitrate Selection, and Beamforming Design for THz-Enabled 360° Video Streaming
Mehdi Setayesh, Vincent W.S. Wong
Comments: 17 pages, 15 figures. This paper has been accepted for publication in IEEE Transactions on Wireless Communications
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[153] arXiv:2401.13140 [pdf, html, other]
Title: Dual-Domain Coarse-to-Fine Progressive Estimation Network for Simultaneous Denoising, Limited-View Reconstruction, and Attenuation Correction of Cardiac SPECT
Xiongchao Chen, Bo Zhou, Xueqi Guo, Huidong Xie, Qiong Liu, James S. Duncan, Albert J.Sinusas, Chi Liu
Comments: 11 Pages, 10 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[154] arXiv:2401.13147 [pdf, html, other]
Title: Deep Spatiotemporal Clutter Filtering of Transthoracic Echocardiographic Images: Leveraging Contextual Attention and Residual Learning
Mahdi Tabassian, Somayeh Akbari, Sandro Queirós, Jan D'hooge
Comments: 19 pages, 14 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[155] arXiv:2401.13197 [pdf, html, other]
Title: Predicting Mitral Valve mTEER Surgery Outcomes Using Machine Learning and Deep Learning Techniques
Tejas Vyas, Mohsena Chowdhury, Xiaojiao Xiao, Mathias Claeys, Géraldine Ong, Guanghui Wang
Comments: 5 pages, 1 figure
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[156] arXiv:2401.13220 [pdf, html, other]
Title: Segment Any Cell: A SAM-based Auto-prompting Fine-tuning Framework for Nuclei Segmentation
Saiyang Na, Yuzhi Guo, Feng Jiang, Hehuan Ma, Junzhou Huang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[157] arXiv:2401.13315 [pdf, html, other]
Title: Deep Learning for Improved Polyp Detection from Synthetic Narrow-Band Imaging
Mathias Ramm Haugland, Hemin Ali Qadir, Ilangko Balasingham
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[158] arXiv:2401.13403 [pdf, other]
Title: SEDNet: Shallow Encoder-Decoder Network for Brain Tumor Segmentation
Chollette C. Olisah, Sofie V. Cauter
Comments: 9 pages, 6 figures, 2 Tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[159] arXiv:2401.13472 [pdf, html, other]
Title: Segmenting Cardiac Muscle Z-disks with Deep Neural Networks
Mihaela Croitor Ibrahim, Nishant Ravikumar, Alistair Curd, Joanna Leng, Oliver Umney, Michelle Peckham
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[160] arXiv:2401.13511 [pdf, html, other]
Title: Tissue Cross-Section and Pen Marking Segmentation in Whole Slide Images
Ruben T. Lucassen, Willeke A. M. Blokx, Mitko Veta
Comments: 6 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[161] arXiv:2401.13616 [pdf, html, other]
Title: FLLIC: Functionally Lossless Image Compression
Xi Zhang, Xiaolin Wu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[162] arXiv:2401.13650 [pdf, html, other]
Title: Tyche: Stochastic In-Context Learning for Medical Image Segmentation
Marianne Rakic, Hallee E. Wong, Jose Javier Gonzalez Ortiz, Beth Cimini, John Guttag, Adrian V. Dalca
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[163] arXiv:2401.13959 [pdf, html, other]
Title: Conditional Neural Video Coding with Spatial-Temporal Super-Resolution
Henan Wang, Xiaohan Pan, Runsen Feng, Zongyu Guo, Zhibo Chen
Comments: Accepted by the 2024 Data Compression Conference (DCC) for presentation as a poster
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[164] arXiv:2401.13990 [pdf, html, other]
Title: Deep Learning Innovations in Diagnosing Diabetic Retinopathy: The Potential of Transfer Learning and the DiaCNN Model
Mohamed R. Shoaib, Heba M. Emara, Jun Zhao, Walid El-Shafai, Naglaa F. Soliman, Ahmed S. Mubarak, Osama A. Omer, Fathi E. Abd El-Samie, Hamada Esmaiel
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[165] arXiv:2401.13998 [pdf, html, other]
Title: WAL-Net: Weakly supervised auxiliary task learning network for carotid plaques classification
Haitao Gan, Lingchao Fu, Ran Zhou, Weiyan Gan, Furong Wang, Xiaoyan Wu, Zhi Yang, Zhongwei Huang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[166] arXiv:2401.14007 [pdf, html, other]
Title: Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression
Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu
Comments: Accepted by VCIP 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[167] arXiv:2401.14130 [pdf, other]
Title: Attention-based Efficient Classification for 3D MRI Image of Alzheimer's Disease
Yihao Lin, Ximeng Li, Yan Zhang, Jinshan Tang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[168] arXiv:2401.14171 [pdf, html, other]
Title: Predicting Hypoxia in Brain Tumors from Multiparametric MRI
Daniele Perlo, Georgia Kanli, Selma Boudissa, Olivier Keunen
Comments: 7 pages, 2 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[169] arXiv:2401.14193 [pdf, other]
Title: Clinical Melanoma Diagnosis with Artificial Intelligence: Insights from a Prospective Multicenter Study
Lukas Heinlein, Roman C. Maron, Achim Hekler, Sarah Haggenmüller, Christoph Wies, Jochen S. Utikal, Friedegund Meier, Sarah Hobelsberger, Frank F. Gellrich, Mildred Sergon, Axel Hauschild, Lars E. French, Lucie Heinzerling, Justin G. Schlager, Kamran Ghoreschi, Max Schlaak, Franz J. Hilke, Gabriela Poch, Sören Korsing, Carola Berking, Markus V. Heppt, Michael Erdmann, Sebastian Haferkamp, Konstantin Drexler, Dirk Schadendorf, Wiebke Sondermann, Matthias Goebeler, Bastian Schilling, Eva Krieghoff-Henning, Titus J. Brinker
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Applications (stat.AP)
[170] arXiv:2401.14206 [pdf, other]
Title: Exploiting Liver CT scans in Colorectal Carcinoma genomics mutation classification
Daniele Perlo, Luca Berton, Alessia Delpiano, Francesca Menchini, Stefano Tibaldi, Marco Grosso, Paolo Fonio
Journal-ref: 2022 IEEE International Conference on Big Data (Big Data)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[171] arXiv:2401.14220 [pdf, html, other]
Title: Effective stripe artefact removal by a variational method: application to light-sheet microscopy, FIB-SEM and remote sensing images
Niklas Rottmayer, Claudia Redenbach, Florian Fahrbach
Subjects: Image and Video Processing (eess.IV)
[172] arXiv:2401.14248 [pdf, other]
Title: On generalisability of segment anything model for nuclear instance segmentation in histology images
Kesi Xu, Lea Goetz, Nasir Rajpoot
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[173] arXiv:2401.14414 [pdf, other]
Title: Fuzzy Logic-Based System for Brain Tumour Detection and Classification
NVSL Narasimham, Keshav Kumar K
Comments: 14 pages, 9 figures
Journal-ref: Applications of Fuzzy Theory in Applied Sciences and Computer Applications-2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Optimization and Control (math.OC)
[174] arXiv:2401.14705 [pdf, html, other]
Title: Additional Look into GAN-based Augmentation for Deep Learning COVID-19 Image Classification
Oleksandr Fedoruk, Konrad Klimaszewski, Aleksander Ogonowski, Michał Kruk
Comments: Submitted to Machine Graphics & Vision. Version with updated acknowledgments
Journal-ref: Machine Graphics and Vision, 32(3/4), 107-124 (2023)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[175] arXiv:2401.14814 [pdf, other]
Title: Towards Robust Hyperspectral Anomaly Detection: Decomposing Background, Anomaly, and Mixed Noise via Convex Optimization
Koyo Sato, Shunsuke Ono
Comments: Submitted to IEEE Transactions on Geoscience and Remote Sensing
Subjects: Image and Video Processing (eess.IV)
[176] arXiv:2401.15022 [pdf, other]
Title: Applications of artificial intelligence in the analysis of histopathology images of gliomas: a review
Jan-Philipp Redlich, Friedrich Feuerhake, Joachim Weis, Nadine S. Schaadt, Sarah Teuber-Hanselmann, Christoph Buck, Sabine Luttmann, Andrea Eberle, Stefan Nikolin, Arno Appenzeller, Andreas Portmann, André Homeyer
Journal-ref: npj Imaging 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[177] arXiv:2401.15105 [pdf, html, other]
Title: Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery
Jialu Sui, Yiyang Ma, Wenhan Yang, Xiaokang Zhang, Man-On Pun, Jiaying Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[178] arXiv:2401.15111 [pdf, html, other]
Title: Improving Fairness of Automated Chest X-ray Diagnosis by Contrastive Learning
Mingquan Lin, Tianhao Li, Zhaoyi Sun, Gregory Holste, Ying Ding, Fei Wang, George Shih, Yifan Peng
Comments: 23 pages, 5 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[179] arXiv:2401.15235 [pdf, html, other]
Title: CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Amirhosein Ghasemabadi, Muhammad Kamran Janjua, Mohammad Salameh, Chunhua Zhou, Fengyu Sun, Di Niu
Comments: Published in Transactions on Machine Learning Research (TMLR), 2024. 20 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[180] arXiv:2401.15307 [pdf, html, other]
Title: ParaTransCNN: Parallelized TransCNN Encoder for Medical Image Segmentation
Hongkun Sun, Jing Xu, Yuping Duan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[181] arXiv:2401.15354 [pdf, html, other]
Title: DeepGI: An Automated Approach for Gastrointestinal Tract Segmentation in MRI Scans
Ye Zhang, Yulu Gong, Dongji Cui, Xinrui Li, Xinyu Shen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[182] arXiv:2401.15434 [pdf, other]
Title: Decentralized Gossip Mutual Learning (GML) for brain tumor segmentation on multi-parametric MRI
Jingyun Chen, Yading Yuan
Comments: 3 pages, 1 figure, accepted to IEEE EMBS 2023. arXiv admin note: text overlap with arXiv:2401.06180
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[183] arXiv:2401.15513 [pdf, other]
Title: MiTU-Net: A fine-tuned U-Net with SegFormer backbone for segmenting pubic symphysis-fetal head
Fangyijie Wang, Guenole Silvestre, Kathleen Curran
Comments: The 5th place in the Pubic Symphysis-Fetal Head Segmentation Challenge in MICCAI 2023
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[184] arXiv:2401.15613 [pdf, html, other]
Title: An efficient dual-branch framework via implicit self-texture enhancement for arbitrary-scale histopathology image super-resolution
Minghong Duan, Linhao Qu, Zhiwei Yang, Manning Wang, Chenxi Zhang, Zhijian Song
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[185] arXiv:2401.15663 [pdf, html, other]
Title: Low-resolution Prior Equilibrium Network for CT Reconstruction
Yijie Yang, Qifeng Gao, Yuping Duan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[186] arXiv:2401.15804 [pdf, html, other]
Title: Brain Tumor Diagnosis Using Quantum Convolutional Neural Networks
Muhammad Al-Zafar Khan, Abdullah Al Omar Galib, Nouhaila Innan, Mohamed Bennai
Comments: 10 pages, 8 figures
Subjects: Image and Video Processing (eess.IV); Quantum Physics (quant-ph)
[187] arXiv:2401.15913 [pdf, other]
Title: Vision-Informed Flow Image Super-Resolution with Quaternion Spatial Modeling and Dynamic Flow Convolution
Qinglong Cao, Zhengqin Xu, Chao Ma, Xiaokang Yang, Yuntian Chen
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Fluid Dynamics (physics.flu-dyn); Applications (stat.AP)
[188] arXiv:2401.15932 [pdf, other]
Title: Assessment of the area measurement on Cartosat-1 image
Joanna Pluto-Kossakowska, David Grandgirard (INTERACT), Rafal Zielinski (JRC), Simon Kay (JRC)
Journal-ref: International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences (XXIst ISPRS congress) : ''Silk Road for Information from Imagery'', Jul 2008, Benjing Pekin, China. pp.1315-1322
Subjects: Image and Video Processing (eess.IV)
[189] arXiv:2401.15984 [pdf, other]
Title: Choroidal thinning assessment through facial video analysis
Qinghua He, Yi Zhang, Mengxi Shen, Giovanni Gregori, Philip J. Rosenfeld, Ruikang K. Wang
Comments: 8 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[190] arXiv:2401.15990 [pdf, html, other]
Title: Gland Segmentation Via Dual Encoders and Boundary-Enhanced Attention
Huadeng Wang, Jiejiang Yu, Bingbing Li, Xipeng Pan, Zhenbing Liu, Rushi Lan, Xiaonan Luo
Comments: Published in: ICASSP 2024
Journal-ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, Republic of, 2024, pp. 2345-2349,
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[191] arXiv:2401.16039 [pdf, html, other]
Title: Data-Driven Filter Design in FBP: Transforming CT Reconstruction with Trainable Fourier Series
Yipeng Sun, Linda-Sophie Schneider, Fuxin Fan, Mareike Thies, Mingxuan Gu, Siyuan Mei, Yuzhong Zhou, Siming Bayer, Andreas Maier
Comments: accepted by 8th International Conference on Image Formation in X-Ray Computed Tomography, Bamberg, Germany
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[192] arXiv:2401.16067 [pdf, other]
Title: Encoding Time and Energy Model for SVT-AV1 based on Video Complexity
Lena Eichermüller, Gaurang Chaudhari, Ioannis Katsavounidis, Zhijun Lei, Hassene Tmar, Christian Herglotz, André Kaup
Comments: 5 pages, 1 figure, accepted for IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2024
Subjects: Image and Video Processing (eess.IV); Multimedia (cs.MM)
[193] arXiv:2401.16363 [pdf, other]
Title: Evaluation of pseudo-healthy image reconstruction for anomaly detection with deep generative models: Application to brain FDG PET
Ravi Hassanaly, Camille Brianceau, Maëlys Solal, Olivier Colliot, Ninon Burgos
Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) this https URL
Journal-ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[194] arXiv:2401.16714 [pdf, other]
Title: A Point Cloud Enhancement Method for 4D mmWave Radar Imagery
Qingmian Wan, Hongli Peng, Xing Liao, Kuayue Liu, Junfa Mao
Subjects: Image and Video Processing (eess.IV)
[195] arXiv:2401.16782 [pdf, other]
Title: A Literature Review on Fetus Brain Motion Correction in MRI
Haoran Zhang, Yun Wang
Comments: 8 pages, 2 figures
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[196] arXiv:2401.16847 [pdf, other]
Title: X-ray Image Generation as a Method of Performance Prediction for Real-Time Inspection: a Case Study
Vladyslav Andriiashen, Robert van Liere, Tristan van Leeuwen, K. Joost Batenburg
Subjects: Image and Video Processing (eess.IV)
[197] arXiv:2401.16928 [pdf, other]
Title: Dynamic MRI reconstruction using low-rank plus sparse decomposition with smoothness regularization
Chee-Ming Ting, Fuad Noman, Raphaël C.-W. Phan, Hernando Ombao
Comments: 9 pages
Journal-ref: 2024 IEEE International Conference on Image Processing (ICIP), Abu Dhabi, United Arab Emirates, 2024, pp. 2800-2806
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[198] arXiv:2401.17104 [pdf, html, other]
Title: H-SynEx: Using synthetic images and ultra-high resolution ex vivo MRI for hypothalamus subregion segmentation
Livia Rodrigues, Martina Bocchetta, Oula Puonti, Douglas Greve, Ana Carolina Londe, Marcondes França, Simone Appenzeller, Juan Eugenio Iglesias, Leticia Rittner
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[199] arXiv:2401.17246 [pdf, other]
Title: SLIC: A Learned Image Codec Using Structure and Color
Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup
Comments: Accepter paper for Data Compression Conference 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[200] arXiv:2401.17388 [pdf, other]
Title: A Spectral Library and Method for Sparse Unmixing of Hyperspectral Images in Fluorescence Guided Resection of Brain Tumors
David Black, Benoit Liquet, Sadahiro Kaneko, Antonio Di leva, Walter Stummer, Eric Suero Molina
Comments: 17 pages, 4 tables, 6 figures; Under review
Subjects: Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[201] arXiv:2401.17571 [pdf, html, other]
Title: Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?
Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince
Comments: Accepted to SPIE Medical Imaging 2024 (oral)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[202] arXiv:2401.17593 [pdf, html, other]
Title: Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model
Yafei Dong, Kuang Gong
Journal-ref: Phys Med Biol. 2024 Jul 16;69(15)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[203] arXiv:2401.18021 [pdf, other]
Title: A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024
Darren Ramsook, Anil Kokaram
Subjects: Image and Video Processing (eess.IV)
[204] arXiv:2401.00014 (cross-list from q-bio.QM) [pdf, html, other]
Title: Resource-Limited Automated Ki67 Index Estimation in Breast Cancer
J. Gliozzo, G. Marinò, A. Bonometti, M. Frasca, D. Malchiodi
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[205] arXiv:2401.00237 (cross-list from cs.CV) [pdf, html, other]
Title: A Novel Approach for Defect Detection of Wind Turbine Blade Using Virtual Reality and Deep Learning
Md Fazle Rabbi, Solayman Hossain Emon, Ehtesham Mahmud Nishat, Tzu-Liang (Bill)Tseng, Atira Ferdoushi, Chun-Che Huang, Md Fashiar Rahman
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[206] arXiv:2401.00247 (cross-list from cs.CV) [pdf, html, other]
Title: Probing the Limits and Capabilities of Diffusion Models for the Anatomic Editing of Digital Twins
Karim Kadry, Shreya Gupta, Farhad R. Nezami, Elazer R. Edelman
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[207] arXiv:2401.00370 (cross-list from cs.CV) [pdf, html, other]
Title: UGPNet: Universal Generative Prior for Image Restoration
Hwayoon Lee, Kyoungkook Kang, Hyeongmin Lee, Seung-Hwan Baek, Sunghyun Cho
Comments: Accepted to WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[208] arXiv:2401.00393 (cross-list from cs.CV) [pdf, other]
Title: Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect Detection
Rahatara Ferdousi, Chunsheng Yang, M. Anwar Hossain, Fedwa Laamarti, M. Shamim Hossain, Abdulmotaleb El Saddik
Comments: 26 pages, 13 figures, Springer Journal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[209] arXiv:2401.00440 (cross-list from cs.CV) [pdf, html, other]
Title: TSGAN: An Optical-to-SAR Dual Conditional GAN for Optical based SAR Temporal Shifting
Moien Rangzan, Sara Attarchi, Richard Gloaguen, Seyed Kazem Alavipanah
Comments: Comments: Added acknowledgments and corrected a typo. No changes to the main content
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[210] arXiv:2401.00708 (cross-list from cs.CV) [pdf, html, other]
Title: Revisiting Nonlocal Self-Similarity from Continuous Representation
Yisi Luo, Xile Zhao, Deyu Meng
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[211] arXiv:2401.00766 (cross-list from cs.CV) [pdf, html, other]
Title: Exposure Bracketing Is All You Need For A High-Quality Image
Zhilu Zhang, Shuohao Zhang, Renlong Wu, Zifei Yan, Wangmeng Zuo
Comments: ICLR 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[212] arXiv:2401.00816 (cross-list from cs.CV) [pdf, html, other]
Title: Glimpse: Generalized Locality for Scalable and Robust CT
AmirEhsan Khorashadizadeh, Valentin Debarnot, Tianlin Liu, Ivan Dokmanić
Comments: 21 pages, 14 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[213] arXiv:2401.00825 (cross-list from cs.CV) [pdf, html, other]
Title: Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior
Byeonghyeon Lee, Howoong Lee, Usman Ali, Eunbyung Park
Comments: Accepted to WACV 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[214] arXiv:2401.01003 (cross-list from cs.CV) [pdf, other]
Title: Rink-Agnostic Hockey Rink Registration
Jia Cheng Shang, Yuhao Chen, Mohammad Javad Shafiee, David A. Clausi
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[215] arXiv:2401.01117 (cross-list from cs.CV) [pdf, html, other]
Title: Q-Refine: A Perceptual Quality Refiner for AI-Generated Image
Chunyi Li, Haoning Wu, Zicheng Zhang, Hongkun Hao, Kaiwei Zhang, Lei Bai, Xiaohong Liu, Xiongkuo Min, Weisi Lin, Guangtao Zhai
Comments: 6 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[216] arXiv:2401.01180 (cross-list from cs.CV) [pdf, html, other]
Title: Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone Imagery
Asim Khan, Umair Nawaz, Anwaar Ulhaq, Iqbal Gondal, Sajid Javed
Comments: 8 Pages, 7 figures and 5 Tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[217] arXiv:2401.01520 (cross-list from cs.CV) [pdf, html, other]
Title: S$^{2}$-DMs:Skip-Step Diffusion Models
Yixuan Wang, Shuangyin Li
Comments: 12 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[218] arXiv:2401.01693 (cross-list from cs.CV) [pdf, html, other]
Title: AID-DTI: Accelerating High-fidelity Diffusion Tensor Imaging with Detail-Preserving Model-based Deep Learning
Wenxin Fan, Jian Cheng, Cheng Li, Xinrui Ma, Jing Yang, Juan Zou, Ruoyou Wu, Qiegen Liu, Shanshan Wang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[219] arXiv:2401.01813 (cross-list from cs.LG) [pdf, html, other]
Title: Signal Processing in the Retina: Interpretable Graph Classifier to Predict Ganglion Cell Responses
Yasaman Parhizkar, Gene Cheung, Andrew W. Eckford
Journal-ref: IEEE Open Journal of Signal Processing
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Neurons and Cognition (q-bio.NC); Quantitative Methods (q-bio.QM)
[220] arXiv:2401.01912 (cross-list from cs.CV) [pdf, other]
Title: Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network
Yongqi Ding, Lin Zuo, Mengmeng Jing, Pei He, Yongjun Xiao
Comments: Accepted by AAAI 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[221] arXiv:2401.02106 (cross-list from physics.ins-det) [pdf, other]
Title: Cadmium Zinc Telluride (CZT) photon counting detector Characterisation for soft tissue imaging
K. Hameed, Rafidah Zainon, Mahbubunnabi Tamal
Comments: 29 pages and 11 figures
Subjects: Instrumentation and Detectors (physics.ins-det); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[222] arXiv:2401.02394 (cross-list from physics.med-ph) [pdf, html, other]
Title: Image denoising and model-independent parameterization for improving IVIM MRI
Caleb Sample, Jonn Wu, Haley Clark
Subjects: Medical Physics (physics.med-ph); Image and Video Processing (eess.IV)
[223] arXiv:2401.02536 (cross-list from cs.LG) [pdf, html, other]
Title: Novel End-to-End Production-Ready Machine Learning Flow for Nanolithography Modeling and Correction
Mohamed S. E. Habib, Hossam A. H. Fahmy, Mohamed F. Abu-ElYazeed
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[224] arXiv:2401.02687 (cross-list from cs.CV) [pdf, html, other]
Title: PAHD: Perception-Action based Human Decision Making using Explainable Graph Neural Networks on SAR Images
Sasindu Wijeratne, Bingyi Zhang, Rajgopal Kannan, Viktor Prasanna, Carl Busart
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[225] arXiv:2401.02831 (cross-list from cs.CV) [pdf, html, other]
Title: Two-stage Progressive Residual Dense Attention Network for Image Denoising
Wencong Wu, An Ge, Guannan Lv, Yuelong Xia, Yungang Zhang, Wen Xiong
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[226] arXiv:2401.02961 (cross-list from cs.LG) [pdf, other]
Title: A Surrogate-Assisted Extended Generative Adversarial Network for Parameter Optimization in Free-Form Metasurface Design
Manna Dai, Yang Jiang, Feng Yang, Joyjit Chattoraj, Yingzhi Xia, Xinxing Xu, Weijiang Zhao, My Ha Dao, Yong Liu
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Optics (physics.optics)
[227] arXiv:2401.02995 (cross-list from cs.CL) [pdf, html, other]
Title: CANAMRF: An Attention-Based Model for Multimodal Depression Detection
Yuntao Wei, Yuzhe Zhang, Shuyang Zhang, Hong Zhang
Comments: 6 pages, 3 figures. Pacific Rim International Conference on Artificial Intelligence. Singapore: Springer Nature Singapore, 2023
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[228] arXiv:2401.03115 (cross-list from cs.CV) [pdf, html, other]
Title: Transferable Learned Image Compression-Resistant Adversarial Perturbations
Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
Comments: Accepted by BMVC 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM); Image and Video Processing (eess.IV)
[229] arXiv:2401.03122 (cross-list from cs.CV) [pdf, html, other]
Title: SAR Despeckling via Regional Denoising Diffusion Probabilistic Model
Xuran Hu, Ziqiang Xu, Zhihan Chen, Zhengpeng Feng, Mingzhe Zhu, LJubisa Stankovic
Comments: 5 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[230] arXiv:2401.03575 (cross-list from cs.CV) [pdf, html, other]
Title: Involution Fused ConvNet for Classifying Eye-Tracking Patterns of Children with Autism Spectrum Disorder
Md. Farhadul Islam, Meem Arafat Manab, Joyanta Jyoti Mondal, Sarah Zabeen, Fardin Bin Rahman, Md. Zahidul Hasan, Farig Sadeque, Jannatun Noor
Comments: 17 pages, 13 figures, Submitted to Engineering Applications of Artificial Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[231] arXiv:2401.03690 (cross-list from physics.med-ph) [pdf, other]
Title: So You Want to Image Myelin Using MRI: Magnetic Susceptibility Source Separation for Myelin Imaging
Jongho Lee, Sooyeon Ji, Se-Hong Oh
Comments: Can now be found in Magnetic Resonance in Medical Sciences this https URL
Subjects: Medical Physics (physics.med-ph); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[232] arXiv:2401.03800 (cross-list from cs.CV) [pdf, html, other]
Title: MvKSR: Multi-view Knowledge-guided Scene Recovery for Hazy and Rainy Degradation
Dong Yang, Wenyu Xu, Yuan Gao, Yuxu Lu, Jingming Zhang, Yu Guo
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[233] arXiv:2401.03829 (cross-list from physics.optics) [pdf, html, other]
Title: Sub-Rayleigh Ghost Imaging via Structured Speckle Illumination
Liming Li
Subjects: Optics (physics.optics); Image and Video Processing (eess.IV)
[234] arXiv:2401.03835 (cross-list from cs.CV) [pdf, html, other]
Title: Limitations of Data-Driven Spectral Reconstruction -- Optics-Aware Analysis and Mitigation
Qiang Fu, Matheus Souza, Eunsue Choi, Suhyun Shin, Seung-Hwan Baek, Wolfgang Heidrich
Comments: 13 pages, 7 figures, 9 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[235] arXiv:2401.04039 (cross-list from cs.MM) [pdf, other]
Title: Bjøntegaard Delta (BD): A Tutorial Overview of the Metric, Evolution, Challenges, and Recommendations
Nabajeet Barman, Maria G. Martini, Yuriy Reznik
Subjects: Multimedia (cs.MM); Information Theory (cs.IT); Image and Video Processing (eess.IV)
[236] arXiv:2401.04149 (cross-list from q-bio.OT) [pdf, html, other]
Title: Imágenes de Resonancia Magnética con Contraste en el Cáncer de Mama
Virginia del Campo, Iker Malaina
Comments: 9 pages, text in Spanish
Subjects: Other Quantitative Biology (q-bio.OT); Image and Video Processing (eess.IV); Medical Physics (physics.med-ph)
[237] arXiv:2401.04405 (cross-list from cs.MM) [pdf, html, other]
Title: Optimal Transcoding Resolution Prediction for Efficient Per-Title Bitrate Ladder Estimation
Jinhai Yang, Mengxi Guo, Shijie Zhao, Junlin Li, Li Zhang
Comments: Accepted by the 2024 Data Compression Conference (DCC) for presentation as a poster. This is the full paper
Subjects: Multimedia (cs.MM); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[238] arXiv:2401.04579 (cross-list from q-bio.QM) [pdf, other]
Title: A Deep Network for Explainable Prediction of Non-Imaging Phenotypes using Anatomical Multi-View Data
Yuxiang Wei, Yuqian Chen, Tengfei Xue, Leo Zekelman, Nikos Makris, Yogesh Rathi, Weidong Cai, Fan Zhang, Lauren J. O' Donnell
Comments: 2023 The Medical Image Computing and Computer Assisted Intervention Society workshop
Subjects: Quantitative Methods (q-bio.QM); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[239] arXiv:2401.04680 (cross-list from cs.CV) [pdf, other]
Title: CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks
Sunny Howard, Peter Norreys, Andreas Döpp
Journal-ref: BMVC 2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[240] arXiv:2401.04988 (cross-list from cs.CV) [pdf, html, other]
Title: Optimising Graph Representation for Hardware Implementation of Graph Convolutional Networks for Event-based Vision
Kamil Jeziorek, Piotr Wzorek, Krzysztof Blachut, Andrea Pinna, Tomasz Kryjak
Comments: Paper was accepted for the DASIP 2024 workshop in conjunction with HiPEAC 2024 (Munich, Germany)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[241] arXiv:2401.05153 (cross-list from cs.CV) [pdf, html, other]
Title: CrossDiff: Exploring Self-Supervised Representation of Pansharpening via Cross-Predictive Diffusion Model
Yinghui Xing, Litao Qu, Shizhou Zhang, Kai Zhang, Yanning Zhang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[242] arXiv:2401.05217 (cross-list from cs.CV) [pdf, html, other]
Title: Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method
Chenxi Yang, Yujia Liu, Dingquan Li, Tingting Jiang
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[243] arXiv:2401.05633 (cross-list from cs.CV) [pdf, html, other]
Title: Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach
Gang Wu, Junjun Jiang, Junpeng Jiang, Xianming Liu
Comments: Accepted by IEEE TIP
Journal-ref: IEEE Transactions on Image Processing 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[244] arXiv:2401.05844 (cross-list from physics.med-ph) [pdf, other]
Title: Self-navigated 3D diffusion MRI using an optimized CAIPI sampling and structured low-rank reconstruction
Ziyu Li, Karla L. Miller, Xi Chen, Mark Chiew, Wenchuan Wu
Comments: 10 pages, 11 figures, 2 tables. This work has been submitted to the IEEE for possible publication
Subjects: Medical Physics (physics.med-ph); Image and Video Processing (eess.IV)
[245] arXiv:2401.06009 (cross-list from cs.CV) [pdf, other]
Title: Sea ice detection using concurrent multispectral and synthetic aperture radar imagery
Martin S J Rogers, Maria Fox, Andrew Fleming, Louisa van Zeeland, Jeremy Wilkinson, J. Scott Hosking
Comments: 34 pages, 10 figures, 2 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[246] arXiv:2401.06149 (cross-list from cs.CV) [pdf, html, other]
Title: Image Classifier Based Generative Method for Planar Antenna Design
Yang Zhong, Weiping Dou, Andrew Cohen, Dia'a Bisharat, Yuandong Tian, Jiang Zhu, Qing Huo Liu
Comments: 13 pages, 18 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[247] arXiv:2401.06182 (cross-list from q-bio.QM) [pdf, html, other]
Title: Prediction of Cellular Identities from Trajectory and Cell Fate Information
Baiyang Dai, Jiamin Yang, Hari Shroff, Patrick La Riviere
Subjects: Quantitative Methods (q-bio.QM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[248] arXiv:2401.06528 (cross-list from cs.CV) [pdf, html, other]
Title: PCB-Vision: A Multiscene RGB-Hyperspectral Benchmark Dataset of Printed Circuit Boards
Elias Arbash, Margret Fuchs, Behnood Rasti, Sandra Lorenz, Pedram Ghamisi, Richard Gloaguen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[249] arXiv:2401.06798 (cross-list from q-bio.NC) [pdf, other]
Title: Evaluation of Mean Shift, ComBat, and CycleGAN for Harmonizing Brain Connectivity Matrices Across Sites
Hanliang Xu, Nancy R. Newlin, Michael E. Kim, Chenyu Gao, Praitayini Kanakaraj, Aravind R. Krishnan, Lucas W. Remedios, Nazirah Mohd Khairi, Kimberly Pechman, Derek Archer, Timothy J. Hohman, Angela L. Jefferson, The BIOCARD Study Team, Ivana Isgum, Yuankai Huo, Daniel Moyer, Kurt G. Schilling, Bennett A. Landman
Comments: 11 pages, 5 figures, to be published in SPIE Medical Imaging 2024: Image Processing
Subjects: Neurons and Cognition (q-bio.NC); Image and Video Processing (eess.IV)
[250] arXiv:2401.07124 (cross-list from cs.CV) [pdf, other]
Title: Concrete Surface Crack Detection with Convolutional-based Deep Learning Models
Sara Shomal Zadeh, Sina Aalipour birgani, Meisam Khorshidi, Farhad Kooban
Comments: 11 pages, 3 figures, Journal paper
Journal-ref: International Journal of Novel Research in Civil Structural and Earth Sciences, Vol. 10, Issue 3, (2023) pp: (25-35)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[251] arXiv:2401.07139 (cross-list from cs.CV) [pdf, html, other]
Title: Deep Blind Super-Resolution for Satellite Video
Yi Xiao, Qiangqiang Yuan, Qiang Zhang, Liangpei Zhang
Comments: Published in IEEE TGRS
Journal-ref: IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-16, 2023, Art no. 5516316
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[252] arXiv:2401.07200 (cross-list from cs.CV) [pdf, html, other]
Title: Exploring Compressed Image Representation as a Perceptual Proxy: A Study
Chen-Hsiu Huang, Ja-Ling Wu
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[253] arXiv:2401.07398 (cross-list from cs.CV) [pdf, html, other]
Title: Cross Domain Early Crop Mapping using CropSTGAN
Yiqun Wang, Hui Huang, Radu State
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[254] arXiv:2401.07528 (cross-list from astro-ph.EP) [pdf, other]
Title: Automatic characterization of boulders on planetary surfaces from high-resolution satellite images
Nils C. Prieur, Brian Amaro, Emiliano Gonzalez, Hannah Kerner, Sergei Medvedev, Lior Rubanenko, Stephanie C. Werner, Zhiyong Xiao8, Dmitry Zastrozhnov, Mathieu G. A. Lapôtre
Subjects: Earth and Planetary Astrophysics (astro-ph.EP); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[255] arXiv:2401.07746 (cross-list from cs.CV) [pdf, other]
Title: Sparsity-based background removal for STORM super-resolution images
Patris Valera, Josué Page Vizcaíno, Tobias Lasser
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[256] arXiv:2401.08105 (cross-list from cs.CV) [pdf, other]
Title: Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks
Austin Briley, Fatemeh Afghah
Comments: 6 pages, 7 figures, NETROBOTICS conference submission
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[257] arXiv:2401.08154 (cross-list from cs.CV) [pdf, html, other]
Title: TLIC: Learned Image Compression with ROI-Weighted Distortion and Bit Allocation
Wei Jiang, Yongqi Zhai, Hangyu Li, Ronggang Wang
Comments: 2nd Place in the Image Compression Track, CLIC 2024, DCC 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[258] arXiv:2401.08185 (cross-list from cs.CV) [pdf, other]
Title: DPAFNet:Dual Path Attention Fusion Network for Single Image Deraining
Bingcai Wei
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[259] arXiv:2401.08522 (cross-list from cs.CV) [pdf, other]
Title: Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy
Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[260] arXiv:2401.08584 (cross-list from cs.CV) [pdf, other]
Title: Nahid: AI-based Algorithm for operating fully-automatic surgery
Sina Saadati
Comments: 8 pages, 10 figures, 1 table
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO); Image and Video Processing (eess.IV)
[261] arXiv:2401.08837 (cross-list from cs.CV) [pdf, other]
Title: Image Fusion in Remote Sensing: An Overview and Meta Analysis
Hessah Albanwan, Rongjun Qin, Yang Tang
Comments: 21pages, 10 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[262] arXiv:2401.08865 (cross-list from cs.CV) [pdf, html, other]
Title: The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical Images
Nicholas Konz, Maciej A. Mazurowski
Comments: ICLR 2024. Code: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[263] arXiv:2401.08913 (cross-list from cs.CV) [pdf, html, other]
Title: Efficient Image Super-Resolution via Symmetric Visual Attention Network
Chengxu Wu, Qinrui Fan, Shu Hu, Xi Wu, Xin Wang, Jing Hu
Comments: 13 pages,4 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[264] arXiv:2401.08926 (cross-list from cs.CV) [pdf, html, other]
Title: Stochasticity-aware No-Reference Point Cloud Quality Assessment
Songlin Fan, Wei Gao, Zhineng Chen, Ge Li, Guoqing Liu, Qicheng Wang
Comments: Accepted to IJCAI 2025
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[265] arXiv:2401.09207 (cross-list from eess.SY) [pdf, html, other]
Title: An Energy-efficient Capacitive-RRAM Content Addressable Memory
Yihan Pan, Adrian Wheeldon, Mohammed Mughal, Shady Agwa, Themis Prodromakis, Alexantrou Serb
Comments: This work has been accepted by IEEE TCAS-I for publication
Journal-ref: IEEE Transactions on Circuits and Systems - Part I: Regular Papers (TCAS-I), 2024
Subjects: Systems and Control (eess.SY); Image and Video Processing (eess.IV)
[266] arXiv:2401.09467 (cross-list from cs.CV) [pdf, other]
Title: Offline Handwriting Signature Verification: A Transfer Learning and Feature Selection Approach
Fatih Ozyurt, Jafar Majidpour, Tarik A. Rashid, Canan Koc
Comments: 11 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[267] arXiv:2401.09472 (cross-list from cs.CV) [pdf, html, other]
Title: Plug-in for visualizing 3D tool tracking from videos of Minimally Invasive Surgeries
Shubhangi Nema, Abhishek Mathur, Leena Vachhani
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
[268] arXiv:2401.09517 (cross-list from cs.LG) [pdf, other]
Title: Dimensional Neuroimaging Endophenotypes: Neurobiological Representations of Disease Heterogeneity Through Machine Learning
Junhao Wen, Mathilde Antoniades, Zhijian Yang, Gyujoon Hwang, Ioanna Skampardoni, Rongguang Wang, Christos Davatzikos
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV); Quantitative Methods (q-bio.QM)
[269] arXiv:2401.09607 (cross-list from cs.CV) [pdf, html, other]
Title: Land Cover Image Classification
Antonio Rangel, Juan Terven, Diana M. Cordova-Esparza, E.A. Chavez-Urbiola
Comments: 7 pages, 4 figures, 1 table, published in conference
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[270] arXiv:2401.09673 (cross-list from cs.CV) [pdf, html, other]
Title: Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack
Zhongliang Guo, Junhao Dong, Yifei Qian, Kaixuan Wang, Weiye Li, Ziheng Guo, Yuheng Wang, Yanli Li, Ognjen Arandjelović, Lei Fang
Comments: 9 pages, 5 figures, 4 tables
Subjects: Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[271] arXiv:2401.09721 (cross-list from cs.CV) [pdf, html, other]
Title: Fast graph-based denoising for point cloud color information
Ryosuke Watanabe, Keisuke Nonaka, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega
Comments: Published in the proceeding of 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[272] arXiv:2401.10256 (cross-list from cs.CV) [pdf, html, other]
Title: Active headrest combined with a depth camera-based ear-positioning system
Yuteng Liu, Haowen Li, Haishan Zou, Jing Lu, Zhibin Lin
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[273] arXiv:2401.10598 (cross-list from physics.optics) [pdf, html, other]
Title: Active Axial Motion Compensation in Multiphoton-Excited Fluorescence Microscopy
Manuel Kunisch, Sascha Beutler, Christian Pilger, Friedemann Kiefer, Thomas Huser, Benedikt Wirth
Comments: 16 pages, 6 figures
Subjects: Optics (physics.optics); Image and Video Processing (eess.IV); Biological Physics (physics.bio-ph)
[274] arXiv:2401.10643 (cross-list from cs.CV) [pdf, other]
Title: A Comprehensive Survey on Deep-Learning-based Vehicle Re-Identification: Models, Data Sets and Challenges
Ali Amiri, Aydin Kaya, Ali Seydi Keceli
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[275] arXiv:2401.11313 (cross-list from cs.CV) [pdf, html, other]
Title: Weakly-Supervised Semantic Segmentation of Circular-Scan, Synthetic-Aperture-Sonar Imagery
Isaac J. Sledge, Dominic M. Byrne, Jonathan L. King, Steven H. Ostertag, Denton L. Woods, James L. Prater, Jermaine L. Kennedy, Timothy M. Marston, Jose C. Principe
Comments: Submitted to the IEEE Journal of Oceanic Engineering
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[276] arXiv:2401.11485 (cross-list from cs.CV) [pdf, html, other]
Title: ColorVideoVDP: A visual difference predictor for image, video and display distortions
Rafal K. Mantiuk, Param Hanji, Maliha Ashraf, Yuta Asano, Alexandre Chapiro
Comments: 28 pages
Journal-ref: SIGGRAPH 2024 Technical Papers, Article 129
Subjects: Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Image and Video Processing (eess.IV)
[277] arXiv:2401.11519 (cross-list from cs.CV) [pdf, html, other]
Title: CaBuAr: California Burned Areas dataset for delineation
Daniele Rege Cambrin, Luca Colomba, Paolo Garza
Comments: Accepted at the IEEE Geoscience and Remote Sensing Magazine
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[278] arXiv:2401.11582 (cross-list from cs.CV) [pdf, html, other]
Title: Thermal Image Calibration and Correction using Unpaired Cycle-Consistent Adversarial Networks
Hossein Rajoli, Pouya Afshin, Fatemeh Afghah
Comments: This paper has been accepted at the Asilomar 2023 Conference and will be published
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[279] arXiv:2401.11960 (cross-list from cs.CV) [pdf, other]
Title: Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method
Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[280] arXiv:2401.12132 (cross-list from cs.LG) [pdf, other]
Title: Evaluation of QCNN-LSTM for Disability Forecasting in Multiple Sclerosis Using Sequential Multisequence MRI
John D. Mayfield, Issam El Naqa
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Emerging Technologies (cs.ET); Image and Video Processing (eess.IV)
[281] arXiv:2401.12251 (cross-list from cs.LG) [pdf, html, other]
Title: Diffusion Representation for Asymmetric Kernels
Alvaro Almeida Gomez, Antonio Silva Neto, Jorge zubelli
Journal-ref: Applied Numerical Mathematics, 2021
Subjects: Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[282] arXiv:2401.12264 (cross-list from eess.AS) [pdf, html, other]
Title: CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing
Xianghu Yue, Xiaohai Tian, Lu Lu, Malu Zhang, Zhizheng Wu, Haizhou Li
Subjects: Audio and Speech Processing (eess.AS); Multimedia (cs.MM); Sound (cs.SD); Image and Video Processing (eess.IV)
[283] arXiv:2401.12340 (cross-list from cs.CV) [pdf, html, other]
Title: Contrastive Learning and Cycle Consistency-based Transductive Transfer Learning for Target Annotation
Shoaib Meraj Sami, Md Mahedi Hasan, Nasser M. Nasrabadi, Raghuveer Rao
Comments: This Paper is Accepted in IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS. This Arxiv version is an older version than the reviewed version
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Machine Learning (stat.ML)
[284] arXiv:2401.12609 (cross-list from cs.CV) [pdf, other]
Title: Fast Semisupervised Unmixing Using Nonconvex Optimization
Behnood Rasti (HZDR), Alexandre Zouaoui (Thoth), Julien Mairal (Thoth), Jocelyn Chanussot (Thoth)
Journal-ref: IEEE TGRS, 2024, 62
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[285] arXiv:2401.12826 (cross-list from cs.NI) [pdf, html, other]
Title: Digital Twin-Based Network Management for Better QoE in Multicast Short Video Streaming
Xinyu Huang, Shisheng Hu, Haojun Yang, Xinghan Wang, Yingying Pei, Xuemin Shen
Comments: 13 pages, 12 figures
Subjects: Networking and Internet Architecture (cs.NI); Image and Video Processing (eess.IV)
[286] arXiv:2401.12913 (cross-list from gr-qc) [pdf, html, other]
Title: Advancing Glitch Classification in Gravity Spy: Multi-view Fusion with Attention-based Machine Learning for Advanced LIGO's Fourth Observing Run
Yunan Wu, Michael Zevin, Christopher P. L. Berry, Kevin Crowston, Carsten Østerlund, Zoheyr Doctor, Sharan Banagiri, Corey B. Jackson, Vicky Kalogera, Aggelos K. Katsaggelos
Subjects: General Relativity and Quantum Cosmology (gr-qc); Instrumentation and Methods for Astrophysics (astro-ph.IM); Image and Video Processing (eess.IV)
[287] arXiv:2401.12972 (cross-list from cs.CV) [pdf, html, other]
Title: On the Efficacy of Text-Based Input Modalities for Action Anticipation
Apoorva Beedu, Harish Haresamudram, Karan Samel, Irfan Essa
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[288] arXiv:2401.13006 (cross-list from cs.AI) [pdf, html, other]
Title: CIMGEN: Controlled Image Manipulation by Finetuning Pretrained Generative Models on Limited Data
Chandrakanth Gudavalli, Erik Rosten, Lakshmanan Nataraj, Shivkumar Chandrasekaran, B. S. Manjunath
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[289] arXiv:2401.13051 (cross-list from cs.CV) [pdf, html, other]
Title: PA-SAM: Prompt Adapter SAM for High-Quality Image Segmentation
Zhaozhi Xie, Bochen Guan, Weihao Jiang, Muyang Yi, Yue Ding, Hongtao Lu, Lei Zhang
Comments: Code is available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[290] arXiv:2401.13980 (cross-list from cs.IT) [pdf, html, other]
Title: A Superposition Code-Based Semantic Communication Approach with Quantifiable and Controllable Security
Weixuan Chen, Shuo Shao, Qianqian Yang, Zhaoyang Zhang, Ping Zhang
Subjects: Information Theory (cs.IT); Image and Video Processing (eess.IV)
[291] arXiv:2401.14285 (cross-list from cs.CV) [pdf, html, other]
Title: POUR-Net: A Population-Prior-Aided Over-Under-Representation Network for Low-Count PET Attenuation Map Generation
Bo Zhou, Jun Hou, Tianqi Chen, Yinchi Zhou, Xiongchao Chen, Huidong Xie, Qiong Liu, Xueqi Guo, Yu-Jung Tsai, Vladimir Y. Panin, Takuya Toyonaga, James S. Duncan, Chi Liu
Comments: 10 pages, 5 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[292] arXiv:2401.14641 (cross-list from cs.CV) [pdf, html, other]
Title: Super Efficient Neural Network for Compression Artifacts Reduction and Super Resolution
Wen Ma, Qiuwen Lou, Arman Kazemi, Julian Faraone, Tariq Afzal
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[293] arXiv:2401.14786 (cross-list from cs.CV) [pdf, html, other]
Title: Study of the gOMP Algorithm for Recovery of Compressed Sensed Hyperspectral Images
Jon Alvarez Justo, Milica Orlandic
Comments: Hyperspectral Imaging, Compressive Sensing, Greedy Algorithms, Generalized Orthogonal Matching Pursuit (gOMP), Sparsity, Sparsification, IEEE-copyrighted material (2022), WHISPERS Workshop (13-16 September 2022)
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[294] arXiv:2401.14831 (cross-list from cs.RO) [pdf, html, other]
Title: The Machine Vision Iceberg Explained: Advancing Dynamic Testing by Considering Holistic Environmental Relations
Hubert Padusinski, Christian Steinhauser, Thilo Braun, Lennart Ries, Eric Sax
Comments: Submitted at IEEE ITSC 2024
Subjects: Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Software Engineering (cs.SE); Image and Video Processing (eess.IV)
[295] arXiv:2401.14970 (cross-list from eess.SP) [pdf, html, other]
Title: Microwave lymphedema assessment using deep learning with contour assisted backprojection
Yuyi Chang, Nithin Sugavanam, Emre Ertin
Comments: 6 pages, 6 figures, accepted RadarConf
Subjects: Signal Processing (eess.SP); Image and Video Processing (eess.IV)
[296] arXiv:2401.15183 (cross-list from q-bio.BM) [pdf, html, other]
Title: Moment-based metrics for molecules computable from cryo-EM images
Andy Zhang, Oscar Mickelin, Joe Kileel, Eric J. Verbeke, Nicholas F. Marshall, Marc Aurèle Gilles, Amit Singer
Comments: 21 Pages, 9 Figures, 2 Algorithms, and 3 Tables
Subjects: Biomolecules (q-bio.BM); Image and Video Processing (eess.IV)
[297] arXiv:2401.15204 (cross-list from cs.CV) [pdf, html, other]
Title: LYT-NET: Lightweight YUV Transformer-based Network for Low-light Image Enhancement
A. Brateanu, R. Balmez, A. Avram, C. Orhei, C. Ancuti
Comments: 5 pages
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[298] arXiv:2401.15366 (cross-list from cs.CV) [pdf, html, other]
Title: Face to Cartoon Incremental Super-Resolution using Knowledge Distillation
Trinetra Devkatte, Shiv Ram Dubey, Satish Kumar Singh, Abdenour Hadid
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[299] arXiv:2401.15636 (cross-list from cs.CV) [pdf, html, other]
Title: FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models
Feihong He, Gang Li, Fuhui Sun, Mengyuan Zhang, Lingyu Si, Xiaoyan Wang, Li Shen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[300] arXiv:2401.15647 (cross-list from cs.CV) [pdf, html, other]
Title: UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration
Nachuan Ma, Rui Fan, Lihua Xie
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Image and Video Processing (eess.IV)
[301] arXiv:2401.15864 (cross-list from cs.CV) [pdf, other]
Title: Spatial Decomposition and Temporal Fusion based Inter Prediction for Learned Video Compression
Xihua Sheng, Li Li, Dong Liu, Houqiang Li
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[302] arXiv:2401.16035 (cross-list from cs.CV) [pdf, other]
Title: Second Order Kinematic Surface Fitting in Anatomical Structures
Wilhelm Wimmer, Hervé Delingette
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[303] arXiv:2401.16087 (cross-list from cs.CV) [pdf, other]
Title: High Resolution Image Quality Database
Huang Huang, Qiang Wan, Jari Korhonen
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[304] arXiv:2401.16099 (cross-list from stat.ME) [pdf, other]
Title: A Ridgelet Approach to Poisson Denoising
Ali Dadras, Klara Leffler, Jun Yu
Comments: 11 pages, 8 figures
Subjects: Methodology (stat.ME); Image and Video Processing (eess.IV)
[305] arXiv:2401.16104 (cross-list from cs.CV) [pdf, other]
Title: A 2D Sinogram-Based Approach to Defect Localization in Computed Tomography
Yuzhong Zhou, Linda-Sophie Schneider, Fuxin Fan, Andreas Maier
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[306] arXiv:2401.16227 (cross-list from cs.CV) [pdf, other]
Title: A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification
Preeti Meena, Himanshu Kumar, Sandeep Yadav
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[307] arXiv:2401.16325 (cross-list from astro-ph.IM) [pdf, other]
Title: Making the unmodulated Pyramid wavefront sensor smart. Closed-loop demonstration of neural network wavefront reconstruction with MagAO-X
Rico Landman, Sebastiaan Haffert, Jared Males, Laird Close, Warren Foster, Kyle Van Gorkom, Olivier Guyon, Alex Hedglen, Maggie Kautz, Jay Kueny, Joseph Long, Jennifer Lumbres, Eden McEwen, Avalon McLeod, Lauren Schatz
Comments: Accepted for publication in A&A
Subjects: Instrumentation and Methods for Astrophysics (astro-ph.IM); Earth and Planetary Astrophysics (astro-ph.EP); Image and Video Processing (eess.IV)
[308] arXiv:2401.16407 (cross-list from stat.ML) [pdf, html, other]
Title: Is K-fold cross validation the best model selection method for Machine Learning?
Juan M Gorriz, R. Martin Clemente, F Segovia, J Ramirez, A Ortiz, J. Suckling
Comments: 40 pages, 24 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[309] arXiv:2401.16468 (cross-list from cs.CV) [pdf, html, other]
Title: InstructIR: High-Quality Image Restoration Following Human Instructions
Marcos V. Conde, Gregor Geigle, Radu Timofte
Comments: European Conference on Computer Vision (ECCV) 2024
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
[310] arXiv:2401.16569 (cross-list from cs.LG) [pdf, html, other]
Title: Autoencoder-Based Domain Learning for Semantic Communication with Conceptual Spaces
Dylan Wheeler, Balasubramaniam Natarajan
Comments: 6 pages, 5 figures. This work has been submitted to the IEEE for possible publication
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Image and Video Processing (eess.IV)
[311] arXiv:2401.16592 (cross-list from physics.med-ph) [pdf, other]
Title: A compact and cost-effective laser-powered speckle visibility spectroscopy (SVS) device for measuring cerebral blood flow
Yu Xi Huang, Simon Mahler, Maya Dickson, Aidin Abedi, Julian M. Tyszka, Jack Lo Yu Tung, Jonathan Russin, Charles Liu, Changhuei Yang
Subjects: Medical Physics (physics.med-ph); Image and Video Processing (eess.IV)
[312] arXiv:2401.16700 (cross-list from cs.CV) [pdf, html, other]
Title: Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers
Jianbin Jiao, Xina Cheng, Weijie Chen, Xiaoting Yin, Hao Shi, Kailun Yang
Comments: Accepted to IJCNN 2024. The source code will be available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[313] arXiv:2401.16712 (cross-list from cs.CV) [pdf, html, other]
Title: LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras
Fei Teng, Jiaming Zhang, Jiawei Liu, Kunyu Peng, Xina Cheng, Zhiyong Li, Kailun Yang
Comments: Accepted to ICPR 2024. The source code is publicly available at: this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[314] arXiv:2401.16830 (cross-list from cs.MM) [pdf, other]
Title: LATENTPATCH: A Non-Parametric Approach for Face Generation and Editing
Benjamin Samuth (UNICAEN, ENSICAEN), Julien Rabin (ENSICAEN, UNICAEN), David Tschumperlé (ENSICAEN, UNICAEN), Frédéric Jurie (ENSICAEN, UNICAEN)
Journal-ref: 2023 IEEE International Conference on Image Processing (ICIP), Oct 2023, Kuala Lumpur, Malaysia. pp.1790-1794
Subjects: Multimedia (cs.MM); Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[315] arXiv:2401.16923 (cross-list from cs.CV) [pdf, html, other]
Title: Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Ruiping Liu, Jiaming Zhang, Kunyu Peng, Yufan Chen, Ke Cao, Junwei Zheng, M. Saquib Sarfraz, Kailun Yang, Rainer Stiefelhagen
Comments: Accepted to IEEE IV 2024. The source code is publicly available at this https URL
Subjects: Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
[316] arXiv:2401.16972 (cross-list from cs.CV) [pdf, other]
Title: Deep 3D World Models for Multi-Image Super-Resolution Beyond Optical Flow
Luca Savant Aira, Diego Valsesia, Andrea Bordone Molini, Giulia Fracastoro, Enrico Magli, Andrea Mirabile
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
[317] arXiv:2401.17317 (cross-list from q-bio.NC) [pdf, other]
Title: Detection of Auditory Brainstem Response Peaks Using Image Processing Techniques in Infants with Normal Hearing Sensitivity
Amir Majidpour, Samer Kais Jameel, Jafar Majidpour, Houra Bagheri, Tarik A.Rashid, Ahmadreza Nazeri, Mahshid Moheb Aleaba
Subjects: Neurons and Cognition (q-bio.NC); Computer Vision and Pattern Recognition (cs.CV); Sound (cs.SD); Audio and Speech Processing (eess.AS); Image and Video Processing (eess.IV)
[318] arXiv:2401.17573 (cross-list from stat.ML) [pdf, other]
Title: Tensor-based process control and monitoring for semiconductor manufacturing with unstable disturbances
Yanrong Li, Juan Du, Fugee Tsung, Wei Jiang
Comments: 30 pages, 5 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Image and Video Processing (eess.IV); Systems and Control (eess.SY)
[319] arXiv:2401.17759 (cross-list from cs.CV) [pdf, other]
Title: Rapid post-disaster infrastructure damage characterisation enabled by remote sensing and deep learning technologies -- a tiered approach
Nadiia Kopiika, Andreas Karavias, Pavlos Krassakis, Zehao Ye, Jelena Ninic, Nataliya Shakhovska, Nikolaos Koukouzas, Sotirios Argyroudis, Stergios-Aristoteles Mitoulis
Comments: 43 pages; 20 figures
Subjects: Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Total of 319 entries
Showing up to 2000 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status
    Get status notifications via email or slack