Image and Video Processing

Authors and titles for August 2024

Total of 343 entries
[51] arXiv:2408.04227 [pdf, html, other]
Title: Physical prior guided cooperative learning framework for joint turbulence degradation estimation and infrared video restoration
Ziran Zhang, Yuhang Tang, Zhigang Wang, Yueting Chen, Bin Zhao
Comments: 21
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[52] arXiv:2408.04273 [pdf, html, other]
Title: SG-JND: Semantic-Guided Just Noticeable Distortion Predictor For Image Compression
Linhan Cao, Wei Sun, Xiongkuo Min, Jun Jia, Zicheng Zhang, Zijian Chen, Yucheng Zhu, Lizhou Liu, Qiubo Chen, Jing Chen, Guangtao Zhai
Comments: Accepted by ICIP 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[53] arXiv:2408.04290 [pdf, html, other]
Title: Efficient and Accurate Pneumonia Detection Using a Novel Multi-Scale Transformer Approach
Alireza Saber, Pouria Parhami, Alimohammad Siahkarzadeh, Mansoor Fateh, Amirreza Fateh
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[54] arXiv:2408.04300 [pdf, html, other]
Title: An Explainable Non-local Network for COVID-19 Diagnosis
Jingfu Yang, Peng Huang, Jing Hu, Shu Hu, Siwei Lyu, Xin Wang, Jun Guo, Xi Wu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[55] arXiv:2408.04318 [pdf, html, other]
Title: Deep Transfer Learning for Kidney Cancer Diagnosis
Yassine Habchi, Hamza Kheddar, Yassine Himeur, Mohamed Chahine Ghanem, Abdelkrim Boukabou, Shadi Atalla, Wathiq Mansoor, Hussain Al-Ahmad
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[56] arXiv:2408.04535 [pdf, html, other]
Title: Synchronous Multi-modal Semantic Communication System with Packet-level Coding
Yun Tian, Jingkai Ying, Zhijin Qin, Ye Jin, Xiaoming Tao
Comments: 12 pages, 9 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI)
[57] arXiv:2408.04610 [pdf, html, other]
Title: Quantifying the Impact of Population Shift Across Age and Sex for Abdominal Organ Segmentation
Kate Čevora, Ben Glocker, Wenjia Bai
Comments: This paper has been accepted for publication by the MICCAI 2024 Fairness of AI in Medical Imaging (FAIMI) Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[58] arXiv:2408.04723 [pdf, html, other]
Title: Survey: Transformer-based Models in Data Modality Conversion
Elyas Rashno, Amir Eskandari, Aman Anand, Farhana Zulkernine
Comments: Submitted to ACM Computing Surveys (CSUR)
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Signal Processing (eess.SP)
[59] arXiv:2408.04763 [pdf, html, other]
Title: Segmentation of Mental Foramen in Orthopantomographs: A Deep Learning Approach
Haider Raza, Mohsin Ali, Vishal Krishna Singh, Agustin Wahjuningrum, Rachel Sarig, Akhilanand Chaurasia
Comments: 9 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[60] arXiv:2408.04777 [pdf, other]
Title: Deep Learning-based Unsupervised Domain Adaptation via a Unified Model for Prostate Lesion Detection Using Multisite Bi-parametric MRI Datasets
Hao Li, Han Liu, Heinrich von Busch, Robert Grimm, Henkjan Huisman, Angela Tong, David Winkel, Tobias Penzkofer, Ivan Shabunin, Moon Hyung Choi, Qingsong Yang, Dieter Szolar, Steven Shea, Fergus Coakley, Mukesh Harisinghani, Ipek Oguz, Dorin Comaniciu, Ali Kamen, Bin Lou
Comments: Accepted at Radiology: Artificial Intelligence. Journal reference and external DOI will be added once published
Journal-ref: Radiology: Artificial Intelligence 2024;6(5):e230521
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[61] arXiv:2408.04805 [pdf, other]
Title: Improved Robustness for Deep Learning-based Segmentation of Multi-Center Myocardial Perfusion MRI Datasets Using Data Adaptive Uncertainty-guided Space-time Analysis
Dilek M. Yalcinkaya, Khalid Youssef, Bobak Heydari, Janet Wei, Noel Bairey Merz, Robert Judd, Rohan Dharmakumar, Orlando P. Simonetti, Jonathan W. Weinsaft, Subha V. Raman, Behzad Sharif
Comments: Accepted for publication in JCMR, 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Medical Physics (physics.med-ph)
[62] arXiv:2408.04826 [pdf, html, other]
Title: Geo-UNet: A Geometrically Constrained Neural Framework for Clinical-Grade Lumen Segmentation in Intravascular Ultrasound
Yiming Chen, Niharika S. D'Souza, Akshith Mandepally, Patrick Henninger, Satyananda Kashyap, Neerav Karani, Neel Dey, Marcos Zachary, Raed Rizq, Paul Chouinard, Polina Golland, Tanveer F. Syeda-Mahmood
Comments: Accepted into the 15th workshop on Machine Learning in Medical Imaging at MICCAI 2024. (* indicates equal contribution)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[63] arXiv:2408.04949 [pdf, html, other]
Title: CROCODILE: Causality aids RObustness via COntrastive DIsentangled LEarning
Gianluca Carloni, Sotirios A Tsaftaris, Sara Colantonio
Comments: MICCAI 2024 UNSURE Workshop, Accepted for presentation, Submitted Manuscript Version, 10 pages
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[64] arXiv:2408.05052 [pdf, html, other]
Title: Integrating Edge Information into Ground Truth for the Segmentation of the Optic Disc and Cup from Fundus Images
Yoga Sri Varshan V, Hitesh Gupta Kattamuri, Subin Sahayam, Umarani Jayaraman
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[65] arXiv:2408.05056 [pdf, html, other]
Title: Multi-dimensional Parameter Space Exploration for Streamline-specific Tractography
Ruben Vink, Anna Vilanova, Maxime Chamberland
Comments: Accepted at MICCAI 2024 International Workshop on Computational Diffusion MRI
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[66] arXiv:2408.05117 [pdf, html, other]
Title: Beyond the Eye: A Relational Model for Early Dementia Detection Using Retinal OCTA Images
Shouyue Liu, Ziyi Zhang, Yuanyuan Gu, Jinkui Hao, Yonghuai Liu, Huazhu Fu, Xinyu Guo, Hong Song, Shuting Zhang, Yitian Zhao
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[67] arXiv:2408.05372 [pdf, html, other]
Title: PRISM Lite: A lightweight model for interactive 3D placenta segmentation in ultrasound
Hao Li, Baris Oguz, Gabriel Arenas, Xing Yao, Jiacheng Wang, Alison Pouch, Brett Byram, Nadav Schwartz, Ipek Oguz
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[68] arXiv:2408.05645 [pdf, other]
Title: BeyondCT: A deep learning model for predicting pulmonary function from chest CT scans
Kaiwen Geng, Zhiyi Shi, Xiaoyan Zhao, Alaa Ali, Jing Wang, Joseph Leader, Jiantao Pu
Comments: 5 tables, 7 figures, 22 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[69] arXiv:2408.05697 [pdf, html, other]
Title: Evaluating BM3D and NBNet: A Comprehensive Study of Image Denoising Across Multiple Datasets
Ghazal Kaviani, Reza Marzban, Ghassan AlRegib
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[70] arXiv:2408.05705 [pdf, html, other]
Title: TC-KANRecon: High-Quality and Accelerated MRI Reconstruction via Adaptive KAN Mechanisms and Intelligent Feature Scaling
Ruiquan Ge, Xiao Yu, Yifei Chen, Guanyu Zhou, Fan Jia, Shenghao Zhu, Junhao Jia, Chenyan Zhang, Yifei Sun, Dong Zeng, Changmiao Wang, Qiegen Liu, Shanzhou Niu
Comments: 11 pages, 3 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[71] arXiv:2408.05803 [pdf, html, other]
Title: Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI
Lei Zhou, Yuzhong Zhang, Jiadong Zhang, Xuejun Qian, Chen Gong, Kun Sun, Zhongxiang Ding, Xing Wang, Zhenhui Li, Zaiyi Liu, Dinggang Shen
Journal-ref: 2024,IEEE Transactions on Medical Imaging
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[72] arXiv:2408.05839 [pdf, html, other]
Title: Deep Learning in Medical Image Registration: Magic or Mirage?
Rohit Jena, Deeksha Sethi, Pratik Chaudhari, James C. Gee
Comments: 16 pages; Accepted to NeurIPS 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[73] arXiv:2408.05877 [pdf, html, other]
Title: Towards pedestrian head tracking: A benchmark dataset and a multi-source data fusion network
Kailai Sun, Xinwei Wang, Shaobo Liu, Qianchuan Zhao, Gao Huang, Chang Liu
Comments: dataset:this https URL
Journal-ref: Engineering Applications of Artificial Intelligence, 158, 111265 (2025)
Subjects: Image and Video Processing (eess.IV)
[74] arXiv:2408.05892 [pdf, html, other]
Title: Polyp SAM 2: Advancing Zero shot Polyp Segmentation in Colorectal Cancer Detection
Mobina Mansoori, Sajjad Shahabodini, Jamshid Abouei, Konstantinos N. Plataniotis, Arash Mohammadi
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[75] arXiv:2408.05923 [pdf, html, other]
Title: Image Denoising Using Green Channel Prior
Zhaoming Kong, Fangxi Deng, Xiaowei Yang
Comments: arXiv admin note: text overlap with arXiv:2402.08235
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[76] arXiv:2408.06014 [pdf, html, other]
Title: A Sharpness Based Loss Function for Removing Out-of-Focus Blur
Uditangshu Aurangabadkar, Darren Ramsook, Anil Kokaram
Comments: 6 pages, IEEE MMSP
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[77] arXiv:2408.06049 [pdf, other]
Title: Hardware Architecture Design of Model-Based Image Reconstruction Towards Palm-size Photoacoustic Tomography
Yuwei Zheng, Zijian Gao, Yuting Shen, Jiadong Zhang, Daohuai Jiang, Fengyu Liu, Feng Gao, Fei Gao
Comments: 11 pages, 13 figures
Subjects: Image and Video Processing (eess.IV)
[78] arXiv:2408.06075 [pdf, html, other]
Title: Five Pitfalls When Assessing Synthetic Medical Images with Reference Metrics
Melanie Dohmen, Tuan Truong, Ivo M. Baltruschat, Matthias Lenga
Comments: 10 pages, 5 figures, presented at Deep Generative Models workshop @ MICCAI 2024
Journal-ref: In: Mukhopadhyay, A., Oksuz, I., Engelhardt, S., Mehrof, D., Yuan, Y. (eds) Deep Generative Models. DGM4MICCAI 2024. Lecture Notes in Computer Science, vol 15224. Springer, Cham
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[79] arXiv:2408.06170 [pdf, other]
Title: Zero-shot 3D Segmentation of Abdominal Organs in CT Scans Using Segment Anything Model 2: Adapting Video Tracking Capabilities for 3D Medical Imaging
Yosuke Yamagishi, Shouhei Hanaoka, Tomohiro Kikuchi, Takahiro Nakao, Yuta Nakamura, Yukihiro Nomura, Soichiro Miki, Takeharu Yoshikawa, Osamu Abe
Comments: 20 pages, 7 figures (including 2 supplemental figures), 4 tables
Journal-ref: JMIR AI. 2025;4:e72109
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[80] arXiv:2408.06358 [pdf, html, other]
Title: How good nnU-Net for Segmenting Cardiac MRI: A Comprehensive Evaluation
Malitha Gunawardhana, Fangqiang Xu, Jichao Zhao
Comments: add a supplementary material
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[81] arXiv:2408.06381 [pdf, html, other]
Title: Assessment of Cell Nuclei AI Foundation Models in Kidney Pathology
Junlin Guo, Siqi Lu, Can Cui, Ruining Deng, Tianyuan Yao, Zhewen Tao, Yizhe Lin, Marilyn Lionts, Quan Liu, Juming Xiong, Yu Wang, Shilin Zhao, Catie Chang, Mitchell Wilkes, Mengmeng Yin, Haichun Yang, Yuankai Huo
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[82] arXiv:2408.06403 [pdf, html, other]
Title: From Diagnostic CT to DTI Tractography labels: Using Deep Learning for Corticospinal Tract Injury Assessment and Outcome Prediction in Intracerebral Haemorrhage
Olivia N Murray, Hamied Haroon, Paul Ryu, Hiren Patel, George Harston, Marieke Wermer, Wilmar Jolink, Daniel Hanley, Catharina Klijn, Ulrike Hammerbeck, Adrian Parry-Jones, Timothy Cootes
Comments: Accepted to MICCAI Switch Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
[83] arXiv:2408.06459 [pdf, html, other]
Title: InfLocNet: Enhanced Lung Infection Localization and Disease Detection from Chest X-Ray Images Using Lightweight Deep Learning
Md. Asiful Islam Miah, Shourin Paul, Sunanda Das, M. M. A. Hashem
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[84] arXiv:2408.06600 [pdf, html, other]
Title: Deep Inertia $L_p$ Half-Quadratic Splitting Unrolling Network for Sparse View CT Reconstruction
Yu Guo, Caiying Wu, Yaxin Li, Qiyu Jin, Tieyong Zeng
Comments: This paper was accepted by IEEE Signal Processing Letters on July 28, 2024
Journal-ref: IEEE Signal Processing Letters, 2024, 31:2030-2034
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[85] arXiv:2408.06640 [pdf, html, other]
Title: Attention Based Feature Fusion Network for Monkeypox Skin Lesion Detection
Niloy Kumar Kundu, Mainul Karim, Sarah Kobir, Dewan Md. Farid
Comments: 6 pages with 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[86] arXiv:2408.06644 [pdf, html, other]
Title: Specialized Change Detection using Segment Anything
Tahir Ahmad, Sudipan Saha
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[87] arXiv:2408.06684 [pdf, html, other]
Title: How to Best Combine Demosaicing and Denoising?
Yu Guo, Qiyu Jin, Jean-Michel Morel, Gabriele Facciolo
Comments: This paper was accepted by Inverse Problems and Imaging in October 2023
Journal-ref: Inverse Problems and Imaging, 2024, 18(3):571-599
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[88] arXiv:2408.06727 [pdf, html, other]
Title: Machine Learning Interventions for Weed Detection using Multispectral Imagery and Unmanned Aerial Vehicles -- A Systematic Review
Drishti Goel (1), Bhavya Kapur (2), Prem Prakash Vuppuluri (3) ((1) Research Fellow, Microsoft, Bengaluru, India (2) Data Scientist, NeenOpal Intelligent Solutions Inc., Bengaluru, India (3) Assistant Professor, Dayalbagh Educational Institute (Deemed University), Agra, India)
Subjects: Image and Video Processing (eess.IV)
[89] arXiv:2408.06784 [pdf, html, other]
Title: Enhancing Diabetic Retinopathy Diagnosis: A Lightweight CNN Architecture for Efficient Exudate Detection in Retinal Fundus Images
Mujadded Al Rabbani Alif
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[90] arXiv:2408.06968 [pdf, html, other]
Title: Event-Stream Super Resolution using Sigma-Delta Neural Network
Waseem Shariff, Joe Lemley, Peter Corcoran
Comments: ECCV: The 18th European Conference on Computer Vision ECCV 2024 NeVi Workshop
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[91] arXiv:2408.07028 [pdf, html, other]
Title: Feature-Preserving Rate-Distortion Optimization in Image Coding for Machines
Samuel Fernández Menduiña, Eduardo Pavez, Antonio Ortega
Comments: 6 pages, 6 figures, MMSP
Subjects: Image and Video Processing (eess.IV); Signal Processing (eess.SP)
[92] arXiv:2408.07041 [pdf, html, other]
Title: Subjective and Objective Quality Assessment of Rendered Human Avatar Videos in Virtual Reality
Yu-Chih Chen, Avinab Saha, Alexandre Chapiro, Christian Häne, Jean-Charles Bazin, Bo Qiu, Stefano Zanetti, Ioannis Katsavounidis, Alan C. Bovik
Comments: Accepted to IEEE Transactions on Image Processing, 2024
Subjects: Image and Video Processing (eess.IV)
[93] arXiv:2408.07075 [pdf, html, other]
Title: UniFed: A Universal Federation of a Mixture of Highly Heterogeneous Medical Image Classification Tasks
Atefe Hassani, Islem Rekik
Comments: MLMI@MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[94] arXiv:2408.07079 [pdf, html, other]
Title: Anatomical Foundation Models for Brain MRIs
Carlo Alberto Barbano, Matteo Brunello, Benoit Dufumier, Marco Grangetto
Comments: Updated version; added ablation study
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[95] arXiv:2408.07109 [pdf, html, other]
Title: Efficient Deep Model-Based Optoacoustic Image Reconstruction
Christoph Dehner, Guillaume Zahnd
Comments: Preprint accepted at 2024 Ultrasonics, Ferroelectrics, and Frequency Control Joint Symposium
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[96] arXiv:2408.07114 [pdf, html, other]
Title: Investigation of unsupervised and supervised hyperspectral anomaly detection
Mazharul Hossain, Aaron Robinson, Lan Wang, Chrysanthe Preza
Comments: Published in Proceedings Volume 13138, Applications of Machine Learning 2024; 1313817 (2024). Event: Optical Engineering + Applications, 2024, San Diego, California, United States
Journal-ref: Applications of Machine Learning 2024. 13138 (2024) 251-261
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[97] arXiv:2408.07171 [pdf, html, other]
Title: BVI-UGC: A Video Quality Database for User-Generated Content Transcoding
Zihao Qi, Chen Feng, Fan Zhang, Xiaozhong Xu, Shan Liu, David Bull
Comments: 12 pages, 11 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[98] arXiv:2408.07264 [pdf, other]
Title: Lesion-aware network for diabetic retinopathy diagnosis
Xue Xia, Kun Zhan, Yuming Fang, Wenhui Jiang, Fei Shen
Comments: This is the submitted version without improvements by reviewers. The final version is published in International Journal of Imaging Systems and Technology (this https URL)
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[99] arXiv:2408.07293 [pdf, html, other]
Title: Discriminating retinal microvascular and neuronal differences related to migraines: Deep Learning based Crossectional Study
Feilong Tang, Matt Trinh, Annita Duong, Angelica Ly, Fiona Stapleton, Zhe Chen, Zongyuan Ge, Imran Razzak
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Neurons and Cognition (q-bio.NC)
[100] arXiv:2408.07325 [pdf, html, other]
Title: RoCoSDF: Row-Column Scanned Neural Signed Distance Fields for Freehand 3D Ultrasound Imaging Shape Reconstruction
Hongbo Chen, Yuchong Gao, Shuhang Zhang, Jiangjie Wu, Yuexin Ma, Rui Zheng
Comments: Accepted by MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Graphics (cs.GR)
[101] arXiv:2408.07349 [pdf, html, other]
Title: Automated Retinal Image Analysis and Medical Report Generation through Deep Learning
Jia-Hong Huang
Comments: Ph.D. thesis, 124 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Multimedia (cs.MM)
[102] arXiv:2408.07444 [pdf, html, other]
Title: Costal Cartilage Segmentation with Topology Guided Deformable Mamba: Method and Benchmark
Senmao Wang, Haifan Gong, Runmeng Cui, Boyao Wan, Yicheng Liu, Zhonglin Hu, Haiqing Yang, Jingyang Zhou, Bo Pan, Lin Lin, Haiyue Jiang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[103] arXiv:2408.07532 [pdf, html, other]
Title: Improved 3D Whole Heart Geometry from Sparse CMR Slices
Yiyang Xu, Hao Xu, Matthew Sinclair, Esther Puyol-Antón, Steven A Niederer, Amedeo Chiribiri, Steven E Williams, Michelle C Williams, Alistair A Young
Comments: 13 pages, STACOM2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[104] arXiv:2408.07580 [pdf, html, other]
Title: Theoretical and Practical Progress in Hyperspectral Pixel Unmixing with Large Spectral Libraries from a Sparse Perspective
Jade Preston, William Basener
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[105] arXiv:2408.07786 [pdf, html, other]
Title: Perspectives: Comparison of Deep Learning Segmentation Models on Biophysical and Biomedical Data
J Shepard Bryan IV, Pedro Pessoa, Meyam Tavakoli, Steve Presse
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Biological Physics (physics.bio-ph)
[106] arXiv:2408.07860 [pdf, other]
Title: A Novel Generative Artificial Intelligence Method for Interference Study on Multiplex Brightfield Immunohistochemistry Images
Satarupa Mukherjee, Jim Martin, Yao Nie
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[107] arXiv:2408.07903 [pdf, html, other]
Title: Deep Joint Denoising and Detection for Enhanced Intracellular Particle Analysis
Yao Yao, Ihor Smal, Ilya Grigoriev, Anna Akhmanova, Erik Meijering
Comments: 11 pages, 4 figures, 4 tables
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[108] arXiv:2408.07932 [pdf, html, other]
Title: MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion
Lucas Nedel Kirsten, Zhicheng Fu, Nikhil Ambha Madhusudhana
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[109] arXiv:2408.07947 [pdf, html, other]
Title: Conditional Brownian Bridge Diffusion Model for VHR SAR to Optical Image Translation
Seon-Hoon Kim, Dae-Won Chung
Comments: 5 pages, 2 figures, 1 table
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[110] arXiv:2408.08038 [pdf, html, other]
Title: PI-Att: Topology Attention for Segmentation Networks through Adaptive Persistence Image Representation
Mehmet Bahadir Erden, Sinan Unver, Ilke Ali Gurses, Rustu Turkay, Cigdem Gunduz-Demir
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[111] arXiv:2408.08115 [pdf, html, other]
Title: Learned denoising with simulated and experimental low-dose CT data
Maximilian B. Kiss, Ander Biguri, Carola-Bibiane Schönlieb, K. Joost Batenburg, Felix Lucka
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[112] arXiv:2408.08211 [pdf, html, other]
Title: Learned Multimodal Compression for Autonomous Driving
Hadi Hadizadeh, Ivan V. Bajić
Comments: 6 pages, 5 figures, IEEE MMSP 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[113] arXiv:2408.08228 [pdf, html, other]
Title: Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective
Zixuan Pan, Jun Xia, Zheyu Yan, Guoyue Xu, Yawen Wu, Zhenge Jia, Jianxu Chen, Yiyu Shi
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[114] arXiv:2408.08306 [pdf, html, other]
Title: Accelerated Image-Aware Generative Diffusion Modeling
Tanmay Asthana, Yufang Bao, Hamid Krim
Subjects: Image and Video Processing (eess.IV)
[115] arXiv:2408.08432 [pdf, html, other]
Title: Predictive uncertainty estimation in deep learning for lung carcinoma classification in digital pathology under real dataset shifts
Abdur R. Fayjie, Jutika Borah, Florencia Carbone, Jan Tack, Patrick Vandewalle
Comments: 17 pages, 2 figures, 5 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[116] arXiv:2408.08456 [pdf, html, other]
Title: Distributional Drift Detection in Medical Imaging with Sketching and Fine-Tuned Transformer
Yusen Wu, Phuong Nguyen, Rose Yesha, Yelena Yesha
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[117] arXiv:2408.08489 [pdf, html, other]
Title: DFT-Based Adversarial Attack Detection in MRI Brain Imaging: Enhancing Diagnostic Accuracy in Alzheimer's Case Studies
Mohammad Hossein Najafi, Mohammad Morsali, Mohammadmahdi Vahediahmar, Saeed Bagheri Shouraki
Comments: 10 pages, 4 figures, conference
Subjects: Image and Video Processing (eess.IV); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
[118] arXiv:2408.08616 [pdf, html, other]
Title: Reference-free Axial Super-resolution of 3D Microscopy Images using Implicit Neural Representation with a 2D Diffusion Prior
Kyungryun Lee, Won-Ki Jeong
Comments: MICCAI2024 accepted
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[119] arXiv:2408.08647 [pdf, html, other]
Title: Modeling the Neonatal Brain Development Using Implicit Neural Representations
Florentin Bieder, Paul Friedrich, Hélène Corbaz, Alicia Durrer, Julia Wolleb, Philippe C. Cattin
Comments: Preprint, Accepted for PRIME MICCAI 2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[120] arXiv:2408.08747 [pdf, html, other]
Title: MicroSSIM: Improved Structural Similarity for Comparing Microscopy Data
Ashesh Ashesh, Joran Deschamps, Florian Jug
Comments: Accepted at BIC workshop, ECCV 24
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[121] arXiv:2408.08784 [pdf, html, other]
Title: Multi-task Learning Approach for Intracranial Hemorrhage Prognosis
Miriam Cobo, Amaia Pérez del Barrio, Pablo Menéndez Fernández-Miranda, Pablo Sanz Bellón, Lara Lloret Iglesias, Wilson Silva
Comments: 16 pages. Accepted at Machine Learning in Medical Imaging Workshop @ MICCAI 2024 (MLMI2024). This is the submitted manuscript with added link to github repo, funding acknowledgements and authors' names and affiliations. No further post submission improvements or corrections were integrated. Final version not published yet
Journal-ref: Machine Learning in Medical Imaging: 15th International Workshop, MLMI 2024, Held in Conjunction with MICCAI 2024, Marrakesh, Morocco, October 6, 2024, Proceedings, Part II
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[122] arXiv:2408.08790 [pdf, html, other]
Title: A Disease-Specific Foundation Model Using Over 100K Fundus Images: Release and Validation for Abnormality and Multi-Disease Classification on Downstream Tasks
Boa Jang, Youngbin Ahn, Eun Kyung Choe, Chang Ki Yoon, Hyuk Jin Choi, Young-Gon Kim
Comments: 10 pages, 4 figures
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[123] arXiv:2408.08792 [pdf, html, other]
Title: Assessing Generalization Capabilities of Malaria Diagnostic Models from Thin Blood Smears
Louise Guillon, Soheib Biga, Axel Puyo, Grégoire Pasquier, Valentin Foucher, Yendoubé E. Kantchire, Stéphane E. Sossou, Ameyo M. Dorkenoo, Laurent Bonnardot, Marc Thellier, Laurence Lachaud, Renaud Piarroux
Comments: MICCAI 2024 AMAI Workshop, Accepted for presentation, Submitted Manuscript Version, 10 pages
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Quantitative Methods (q-bio.QM)
[124] arXiv:2408.08847 [pdf, html, other]
Title: HistoGym: A Reinforcement Learning Environment for Histopathological Image Analysis
Zhi-Bo Liu, Xiaobo Pang, Jizhao Wang, Shuai Liu, Chen Li
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[125] arXiv:2408.08881 [pdf, html, other]
Title: Challenge Summary U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation
Xin Wang, Xiaoyu Liu, Peng Huang, Pu Huang, Shu Hu, Hongtu Zhu
Comments: arXiv admin note: text overlap with arXiv:2405.17496
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[126] arXiv:2408.08883 [pdf, other]
Title: MR Optimized Reconstruction of Simultaneous Multi-Slice Imaging Using Diffusion Model
Ting Zhao, Zhuoxu Cui, Sen Jia, Qingyong Zhu, Congcong Liu, Yihang Zhou, Yanjie Zhu, Dong Liang, Haifeng Wang
Comments: Accepted as ISMRM 2024 Digital Poster 4024
Journal-ref: ISMRM 2024 Digital poster 4024
Subjects: Image and Video Processing (eess.IV)
[127] arXiv:2408.08887 [pdf, other]
Title: Tree species classification at the pixel-level using deep learning and multispectral time series in an imbalanced context
Florian Mouret (CESBIO, UO), David Morin (CESBIO), Milena Planells (CESBIO), Cécile Vincent-Barbaroux
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
[128] arXiv:2408.08939 [pdf, html, other]
Title: Oral squamous cell detection using deep learning
Samrat Kumar Dev Sharma
Comments: This paper has 13 pages and 9 figures
Subjects: Image and Video Processing (eess.IV)
[129] arXiv:2408.09044 [pdf, other]
Title: Explore Cross-Codec Quality-Rate Convex Hulls Relation for Adaptive Streaming
Masoumeh Farhadi Nia
Comments: 20 pages, 11 Figures
Subjects: Image and Video Processing (eess.IV)
[130] arXiv:2408.09218 [pdf, html, other]
Title: FQGA-single: Towards Fewer Training Epochs and Fewer Model Parameters for Image-to-Image Translation Tasks
Cho Yang
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
[131] arXiv:2408.09278 [pdf, html, other]
Title: Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney Pathology
Junchao Zhu, Mengmeng Yin, Ruining Deng, Yitian Long, Yu Wang, Yaohong Wang, Shilin Zhao, Haichun Yang, Yuankai Huo
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[132] arXiv:2408.09315 [pdf, html, other]
Title: Unpaired Volumetric Harmonization of Brain MRI with Conditional Latent Diffusion
Mengqi Wu, Minhui Yu, Shuaiming Jing, Pew-Thian Yap, Zhengwu Zhang, Mingxia Liu
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[133] arXiv:2408.09367 [pdf, html, other]
Title: Improving Lung Cancer Diagnosis and Survival Prediction with Deep Learning and CT Imaging
Xiawei Wang, James Sharpnack, Thomas C.M. Lee
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[134] arXiv:2408.09369 [pdf, html, other]
Title: Flemme: A Flexible and Modular Learning Platform for Medical Images
Guoqing Zhang, Jingyun Yang, Yang Li
Comments: 8 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[135] arXiv:2408.09432 [pdf, html, other]
Title: Deformation-aware GAN for Medical Image Synthesis with Substantially Misaligned Pairs
Bowen Xin, Tony Young, Claire E Wainwright, Tamara Blake, Leo Lebrat, Thomas Gaass, Thomas Benkert, Alto Stemmer, David Coman, Jason Dowling
Comments: Accepted by MIDL2024
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[136] arXiv:2408.09687 [pdf, html, other]
Title: TESL-Net: A Transformer-Enhanced CNN for Accurate Skin Lesion Segmentation
Shahzaib Iqbal, Muhammad Zeeshan, Mehwish Mehmood, Tariq M. Khan, Imran Razzak
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[137] arXiv:2408.09731 [pdf, html, other]
Title: Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning
Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, Xiantong Zhen, Pei Dong, Zhen Qian
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[138] arXiv:2408.09736 [pdf, html, other]
Title: Coarse-Fine View Attention Alignment-Based GAN for CT Reconstruction from Biplanar X-Rays
Zhi Qiao, Hanqiang Ouyang, Dongheng Chu, Huishu Yuan, Xiantong Zhen, Pei Dong, Zhen Qian
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[139] arXiv:2408.09754 [pdf, html, other]
Title: Efficient onboard multi-task AI architecture based on self-supervised learning
Gabriele Inzerillo, Diego Valsesia, Enrico Magli
Subjects: Image and Video Processing (eess.IV)
[140] arXiv:2408.09894 [pdf, other]
Title: Preoperative Rotator Cuff Tear Prediction from Shoulder Radiographs using a Convolutional Block Attention Module-Integrated Neural Network
Chris Hyunchul Jo, Jiwoong Yang, Byunghwan Jeon, Hackjoon Shim, Ikbeom Jang
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[141] arXiv:2408.09931 [pdf, html, other]
Title: Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation
Qianhui Men, Xiaoqing Guo, Aris T. Papageorghiou, J. Alison Noble
Comments: Accepted by MICCAI2024
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[142] arXiv:2408.10067 [pdf, html, other]
Title: Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development
Yuncheng Jiang, Yiwen Hu, Zixun Zhang, Jun Wei, Chun-Mei Feng, Xuemei Tang, Xiang Wan, Yong Liu, Shuguang Cui, Zhen Li
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[143] arXiv:2408.10236 [pdf, html, other]
Title: AID-DTI: Accelerating High-fidelity Diffusion Tensor Imaging with Detail-preserving Model-based Deep Learning
Wenxin Fan, Jian Cheng, Cheng Li, Jing Yang, Ruoyou Wu, Juan Zou, Shanshan Wang
Comments: 12 pages, 3 figures, MICCAI 2024 Workshop on Computational Diffusion MRI. arXiv admin note: text overlap with arXiv:2401.01693, arXiv:2405.03159
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[144] arXiv:2408.10283 [pdf, html, other]
Title: Perception-based multiplicative noise removal using SDEs
An Vuong, Thinh Nguyen
Comments: 15 pages, 6 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[145] arXiv:2408.10498 [pdf, html, other]
Title: Cervical Cancer Detection Using Multi-Branch Deep Learning Model
Tatsuhiro Baba, Abu Saleh Musa Miah, Jungpil Shin, Md. Al Mehedi Hasan
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[146] arXiv:2408.10572 [pdf, other]
Title: A Tutorial on Explainable Image Classification for Dementia Stages Using Convolutional Neural Network and Gradient-weighted Class Activation Mapping
Kevin Kam Fung Yuen
Comments: 15 pages, 11 figures, 3 tables
Subjects: Image and Video Processing (eess.IV); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
[147] arXiv:2408.10636 [pdf, other]
Title: UWF-RI2FA: Generating Multi-frame Ultrawide-field Fluorescein Angiography from Ultrawide-field Retinal Imaging Improves Diabetic Retinopathy Stratification
Ruoyu Chen, Kezheng Xu, Kangyan Zheng, Weiyi Zhang, Yan Lu, Danli Shi, Mingguang He
Comments: 22 pages, 2 figures
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[148] arXiv:2408.10656 [pdf, html, other]
Title: deepmriprep: Voxel-based Morphometry (VBM) Preprocessing via Deep Neural Networks
Lukas Fisch, Nils R. Winter, Janik Goltermann, Carlotta Barkhau, Daniel Emden, Jan Ernsting, Maximilian Konowski, Ramona Leenings, Tiana Borgers, Kira Flinkenflügel, Dominik Grotegerd, Anna Kraus, Elisabeth J. Leehr, Susanne Meinert, Frederike Stein, Lea Teutenberg, Florian Thomas-Odenthal, Paula Usemann, Marco Hermesdorf, Hamidreza Jamalabadi, Andreas Jansen, Igor Nenadic, Benjamin Straube, Tilo Kircher, Klaus Berger, Benjamin Risse, Udo Dannlowski, Tim Hahn
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
[149] arXiv:2408.10665 [pdf, html, other]
Title: End-to-end learned Lossy Dynamic Point Cloud Attribute Compression
Dat Thanh Nguyen, Daniel Zieger, Marc Stamminger, Andre Kaup
Comments: 6 pages, accepted for presentation at 2024 IEEE International Conference on Image Processing (ICIP) 2024
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG)
[150] arXiv:2408.10733 [pdf, html, other]
Title: Classification of Endoscopy and Video Capsule Images using CNN-Transformer Model
Aliza Subedi, Smriti Regmi, Nisha Regmi, Bhumi Bhusal, Ulas Bagci, Debesh Jha
Subjects: Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)