Skip to main content

Showing 1–50 of 68 results for author: Katsaggelos, A K

.
  1. arXiv:2505.19385  [pdf, ps, other

    cs.CV cs.AI

    Advancing Limited-Angle CT Reconstruction Through Diffusion-Based Sinogram Completion

    Authors: Jiaqi Guo, Santiago Lopez-Tapia, Aggelos K. Katsaggelos

    Abstract: Limited Angle Computed Tomography (LACT) often faces significant challenges due to missing angular information. Unlike previous methods that operate in the image domain, we propose a new method that focuses on sinogram inpainting. We leverage MR-SDEs, a variant of diffusion models that characterize the diffusion process with mean-reverting stochastic differential equations, to fill in missing angu… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Accepted at the 2025 IEEE International Conference on Image Processing (Oral)

  2. Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor

    Authors: Jiaqi Guo, Yunan Wu, Evangelos Kaimakamis, Georgios Petmezas, Vasileios E. Papageorgiou, Nicos Maglaveras, Aggelos K. Katsaggelos

    Abstract: With the advent of the COVID-19 pandemic, ultrasound imaging has emerged as a promising technique for COVID-19 detection, due to its non-invasive nature, affordability, and portability. In response, researchers have focused on developing AI-based scoring systems to provide real-time diagnostic support. However, the limited size and lack of proper annotation in publicly available ultrasound dataset… ▽ More

    Submitted 25 May, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: Accepted by IEEE ISBI 2025 (Selected for oral presentation); 2025/4/15 (v2): Corrected a notation error in Figure 2

    Journal ref: 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI), Houston, TX, USA, 2025, pp. 1-5

  3. arXiv:2501.01372  [pdf

    eess.IV cs.AI cs.CV

    ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI

    Authors: Neda Tavakoli, Amir Ali Rahsepar, Brandon C. Benefield, Daming Shen, Santiago López-Tapia, Florian Schiffers, Jeffrey J. Goldberger, Christine M. Albert, Edwin Wu, Aggelos K. Katsaggelos, Daniel C. Lee, Daniel Kim

    Abstract: Background: Late Gadolinium Enhancement (LGE) imaging is the gold standard for assessing myocardial fibrosis and scarring, with left ventricular (LV) LGE extent predicting major adverse cardiac events (MACE). Despite its importance, routine LGE-based LV scar quantification is hindered by labor-intensive manual segmentation and inter-observer variability. Methods: We propose ScarNet, a hybrid model… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: 31 pages, 8 figures

  4. arXiv:2411.11863  [pdf, ps, other

    eess.SP cs.LG

    Longitudinal Wrist PPG Analysis for Reliable Hypertension Risk Screening Using Deep Learning

    Authors: Hui Lin, Jiyang Li, Ramy Hussein, Xin Sui, Xiaoyu Li, Guangpu Zhu, Aggelos K. Katsaggelos, Zijing Zeng, Yelei Li

    Abstract: Hypertension is a leading risk factor for cardiovascular diseases. Traditional blood pressure monitoring methods are cumbersome and inadequate for continuous tracking, prompting the development of PPG-based cuffless blood pressure monitoring wearables. This study leverages deep learning models, including ResNet and Transformer, to analyze wrist PPG data collected with a smartwatch for efficient hy… ▽ More

    Submitted 2 November, 2024; originally announced November 2024.

    Comments: blood pressure, hypertension, cuffless, photoplethysmography, deep learning

  5. arXiv:2410.03276  [pdf, other

    cs.CV cs.LG

    Sm: enhanced localization in Multiple Instance Learning for medical imaging classification

    Authors: Francisco M. Castro-Macías, Pablo Morales-Álvarez, Yunan Wu, Rafael Molina, Aggelos K. Katsaggelos

    Abstract: Multiple Instance Learning (MIL) is widely used in medical imaging classification to reduce the labeling effort. While only bag labels are available for training, one typically seeks predictions at both bag and instance levels (classification and localization tasks, respectively). Early MIL methods treated the instances in a bag independently. Recent methods account for global and local dependenci… ▽ More

    Submitted 15 November, 2024; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: 24 pages, 14 figures, 2024 Conference on Neural Information Processing Systems (NeurIPS 2024)

  6. arXiv:2409.18340  [pdf, ps, other

    eess.IV cs.AI cs.CV

    DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning

    Authors: Hui Lin, Florian Schiffers, Santiago López-Tapia, Neda Tavakoli, Daniel Kim, Aggelos K. Katsaggelos

    Abstract: Unsupervised domain adaptation (UDA) is essential for medical image segmentation, especially in cross-modality data scenarios. UDA aims to transfer knowledge from a labeled source domain to an unlabeled target domain, thereby reducing the dependency on extensive manual annotations. This paper presents DRL-STNet, a novel framework for cross-modality medical image segmentation that leverages generat… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

    Comments: MICCAI 2024 Challenge, FLARE Challenge, Unsupervised domain adaptation, Organ segmentation, Feature disentanglement, Self-training

  7. arXiv:2409.13930  [pdf, other

    eess.IV cs.CV

    RN-SDEs: Limited-Angle CT Reconstruction with Residual Null-Space Diffusion Stochastic Differential Equations

    Authors: Jiaqi Guo, Santiago Lopez-Tapia, Wing Shun Li, Yunnan Wu, Marcelo Carignano, Vadim Backman, Vinayak P. Dravid, Aggelos K. Katsaggelos

    Abstract: Computed tomography is a widely used imaging modality with applications ranging from medical imaging to material analysis. One major challenge arises from the lack of scanning information at certain angles, leading to distorted CT images with artifacts. This results in an ill-posed problem known as the Limited Angle Computed Tomography (LACT) reconstruction problem. To address this problem, we pro… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  8. arXiv:2409.06738  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Characterization of Crystal Properties and Defects in CdZnTe Radiation Detectors

    Authors: Manuel Ballester, Jaromir Kaspar, Francesc Massanes, Srutarshi Banerjee, Alexander Hans Vija, Aggelos K. Katsaggelos

    Abstract: CdZnTe-based detectors are highly valued because of their high spectral resolution, which is an essential feature for nuclear medical imaging. However, this resolution is compromised when there are substantial defects in the CdZnTe crystals. In this study, we present a learning-based approach to determine the spatially dependent bulk properties and defects in semiconductor detectors. This characte… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  9. arXiv:2409.02323  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph physics.optics

    Review and Novel Formulae for Transmittance and Reflectance of Wedged Thin Films on absorbing Substrates

    Authors: Manuel Ballester, Emilio Marquez, John Bass, Christoph Wuersch, Florian Willomitzer, Aggelos K. Katsaggelos

    Abstract: Historically, spectroscopic techniques have been essential for studying the optical properties of thin solid films. However, existing formulae for both normal transmission and reflection spectroscopy often rely on simplified theoretical assumptions, which may not accurately align with real-world conditions. For instance, it is common to assume (1) that the thin solid layers are deposited on comple… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

  10. arXiv:2409.00777  [pdf, other

    cs.CV

    VDPI: Video Deblurring with Pseudo-inverse Modeling

    Authors: Zhihao Huang, Santiago Lopez-Tapia, Aggelos K. Katsaggelos

    Abstract: Video deblurring is a challenging task that aims to recover sharp sequences from blur and noisy observations. The image-formation model plays a crucial role in traditional model-based methods, constraining the possible solutions. However, this is only the case for some deep learning-based methods. Despite deep-learning models achieving better results, traditional model-based methods remain widely… ▽ More

    Submitted 1 September, 2024; originally announced September 2024.

  11. arXiv:2407.06535  [pdf, other

    physics.optics

    An Angular Spectrum Approach to Inverse Synthesis for the Characterization of Optical and Geometrical Properties of Semiconductor Thin Films

    Authors: John M. Bass, Manuel Ballester, Susana M. Fernández, Aggelos K. Katsaggelos, Emilio Márquez, Florian Willomitzer

    Abstract: To design semiconductor-based optical devices, the optical properties of the used semiconductor materials must be precisely measured over a large band. Transmission spectroscopy stands out as an inexpensive and widely available method for this measurement but requires model assumptions and reconstruction algorithms to convert the measured transmittance spectra into optical properties of the thin f… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 10 pages, 7 figures, 5 tables

  12. arXiv:2405.13168  [pdf, other

    physics.ins-det eess.IV

    Modeling and Simulation of Charge-Induced Signals in Photon-Counting CZT Detectors for Medical Imaging Applications

    Authors: Manuel Ballester, Jaromir Kaspar, Francesc Massanes, Srutarshi Banerjee, Alexander Hans Vija, Aggelos K. Katsaggelos

    Abstract: Photon-counting detectors based on CZT are essential in nuclear medical imaging, particularly for SPECT applications. Although CZT detectors are known for their precise energy resolution, defects within the CZT crystals significantly impact their performance. These defects result in inhomogeneous material properties throughout the bulk of the detector. The present work introduces an efficient comp… ▽ More

    Submitted 24 May, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  13. Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer

    Authors: Hui Lin, Charilaos Apostolidis, Aggelos K. Katsaggelos

    Abstract: Differences in image quality, lighting conditions, and patient demographics pose challenges to automated glaucoma detection from color fundus photography. Brighteye, a method based on Vision Transformer, is proposed for glaucoma detection and glaucomatous feature classification. Brighteye learns long-range relationships among pixels within large fundus images using a self-attention mechanism. Prio… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: ISBI 2024, JustRAIGS challenge, glaucoma detection

  14. arXiv:2404.15552  [pdf, other

    cs.CV astro-ph.IM cs.LG gr-qc

    Cross-Temporal Spectrogram Autoencoder (CTSAE): Unsupervised Dimensionality Reduction for Clustering Gravitational Wave Glitches

    Authors: Yi Li, Yunan Wu, Aggelos K. Katsaggelos

    Abstract: The advancement of The Laser Interferometer Gravitational-Wave Observatory (LIGO) has significantly enhanced the feasibility and reliability of gravitational wave detection. However, LIGO's high sensitivity makes it susceptible to transient noises known as glitches, which necessitate effective differentiation from real gravitational wave signals. Traditional approaches predominantly employ fully s… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  15. arXiv:2404.04663  [pdf, other

    cs.CV cs.AI

    Focused Active Learning for Histopathological Image Classification

    Authors: Arne Schmidt, Pablo Morales-Álvarez, Lee A. D. Cooper, Lee A. Newberg, Andinet Enquobahrie, Aggelos K. Katsaggelos, Rafael Molina

    Abstract: Active Learning (AL) has the potential to solve a major problem of digital pathology: the efficient acquisition of labeled data for machine learning algorithms. However, existing AL methods often struggle in realistic settings with artifacts, ambiguities, and class imbalances, as commonly seen in the medical field. The lack of precise uncertainty estimations leads to the acquisition of images with… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  16. Hyperbolic Secant representation of the logistic function: Application to probabilistic Multiple Instance Learning for CT intracranial hemorrhage detection

    Authors: F. M. Castro-Macías, P. Morales-Álvarez, Y. Wu, R. Molina, A. K. Katsaggelos

    Abstract: Multiple Instance Learning (MIL) is a weakly supervised paradigm that has been successfully applied to many different scientific areas and is particularly well suited to medical imaging. Probabilistic MIL methods, and more specifically Gaussian Processes (GPs), have achieved excellent results due to their high expressiveness and uncertainty quantification capabilities. One of the most successful G… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 48 pages, 12 figures, published in Artificial Intelligence Journal

    Journal ref: Journal: Artificial Intelligence, Pages: 104115, Publisher: Elsevier, Year: 2024

  17. arXiv:2403.10589  [pdf

    eess.IV cs.CV

    A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models

    Authors: Xijun Wang, Santiago López-Tapia, Alice Lucas, Xinyi Wu, Rafael Molina, Aggelos K. Katsaggelos

    Abstract: Generative Adversarial Networks (GANs) have shown great performance on super-resolution problems since they can generate more visually realistic images and video frames. However, these models often introduce side effects into the outputs, such as unexpected artifacts and noises. To reduce these artifacts and enhance the perceptual quality of the results, in this paper, we propose a general method… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  18. arXiv:2403.06961  [pdf, other

    cs.CV

    Explainable Transformer Prototypes for Medical Diagnoses

    Authors: Ugur Demir, Debesh Jha, Zheyuan Zhang, Elif Keles, Bradley Allen, Aggelos K. Katsaggelos, Ulas Bagci

    Abstract: Deployments of artificial intelligence in medical diagnostics mandate not just accuracy and efficacy but also trust, emphasizing the need for explainability in machine decisions. The recent trend in automated medical image diagnostics leans towards the deployment of Transformer-based architectures, credited to their impressive capabilities. Since the self-attention feature of transformers contribu… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  19. arXiv:2402.07371  [pdf, other

    cs.CV eess.IV

    Real-World Atmospheric Turbulence Correction via Domain Adaptation

    Authors: Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelos

    Abstract: Atmospheric turbulence, a common phenomenon in daily life, is primarily caused by the uneven heating of the Earth's surface. This phenomenon results in distorted and blurred acquired images or videos and can significantly impact downstream vision tasks, particularly those that rely on capturing clear, stable images or videos from outdoor environments, such as accurately detecting or recognizing ob… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  20. arXiv:2401.12913  [pdf, other

    gr-qc astro-ph.IM eess.IV

    Advancing Glitch Classification in Gravity Spy: Multi-view Fusion with Attention-based Machine Learning for Advanced LIGO's Fourth Observing Run

    Authors: Yunan Wu, Michael Zevin, Christopher P. L. Berry, Kevin Crowston, Carsten Østerlund, Zoheyr Doctor, Sharan Banagiri, Corey B. Jackson, Vicky Kalogera, Aggelos K. Katsaggelos

    Abstract: The first successful detection of gravitational waves by ground-based observatories, such as the Laser Interferometer Gravitational-Wave Observatory (LIGO), marked a revolutionary breakthrough in our comprehension of the Universe. However, due to the unprecedented sensitivity required to make such observations, gravitational-wave detectors also capture disruptive noise sources called glitches, pot… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  21. arXiv:2311.02290  [pdf, other

    eess.SP

    A Physics based Machine Learning Model to characterize Room Temperature Semiconductor Detectors in 3D

    Authors: Srutarshi Banerjee, Miesher Rodrigues, Manuel Ballester, Alexander H. Vija, Aggelos K. Katsaggelos

    Abstract: Room temperature semiconductor radiation detectors (RTSD) for X-ray and gamma-ray detection are vital tools for medical imaging, astrophysics and other applications. CdZnTe (CZT) has been the main RTSD for more than three decades with desired detection properties. In a typical pixelated configuration, CZT have electrodes on opposite ends. For advanced event reconstruction algorithms at sub-pixel l… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  22. arXiv:2310.15898  [pdf, other

    eess.IV cs.CV

    YOLO-Angio: An Algorithm for Coronary Anatomy Segmentation

    Authors: Tom Liu, Hui Lin, Aggelos K. Katsaggelos, Adrienne Kline

    Abstract: Coronary angiography remains the gold standard for diagnosis of coronary artery disease, the most common cause of death worldwide. While this procedure is performed more than 2 million times annually, there remain few methods for fast and accurate automated measurement of disease and localization of coronary anatomy. Here, we present our solution to the Automatic Region-based Coronary Artery Disea… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: MICCAI Conference ARCADE Grand Challenge, YOLO, Computer Vision,

  23. arXiv:2308.15530  [pdf, other

    gr-qc astro-ph.IM physics.soc-ph

    Gravity Spy: Lessons Learned and a Path Forward

    Authors: Michael Zevin, Corey B. Jackson, Zoheyr Doctor, Yunan Wu, Carsten Østerlund, L. Clifton Johnson, Christopher P. L. Berry, Kevin Crowston, Scott B. Coughlin, Vicky Kalogera, Sharan Banagiri, Derek Davis, Jane Glanzer, Renzhi Hao, Aggelos K. Katsaggelos, Oli Patane, Jennifer Sanchez, Joshua Smith, Siddharth Soni, Laura Trouille, Marissa Walker, Irina Aerith, Wilfried Domainko, Victor-Georges Baranowski, Gerhard Niklasch , et al. (1 additional authors not shown)

    Abstract: The Gravity Spy project aims to uncover the origins of glitches, transient bursts of noise that hamper analysis of gravitational-wave data. By using both the work of citizen-science volunteers and machine-learning algorithms, the Gravity Spy project enables reliable classification of glitches. Citizen science and machine learning are intrinsically coupled within the Gravity Spy framework, with mac… ▽ More

    Submitted 31 January, 2024; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: 33 pages, 5 figures, published in European Physical Journal Plus for focus issue on "Citizen science for physics: From Education and Outreach to Crowdsourcing fundamental research"

    Journal ref: The European Physical Journal Plus, 139, 100 (2024)

  24. arXiv:2307.09457  [pdf, other

    eess.IV cs.LG

    Smooth Attention for Deep Multiple Instance Learning: Application to CT Intracranial Hemorrhage Detection

    Authors: Yunan Wu, Francisco M. Castro-Macías, Pablo Morales-Álvarez, Rafael Molina, Aggelos K. Katsaggelos

    Abstract: Multiple Instance Learning (MIL) has been widely applied to medical imaging diagnosis, where bag labels are known and instance labels inside bags are unknown. Traditional MIL assumes that instances in each bag are independent samples from a given distribution. However, instances are often spatially or sequentially ordered, and one would expect similar diagnostic importance for neighboring instance… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  25. arXiv:2305.05077  [pdf, other

    cs.CV eess.IV

    Atmospheric Turbulence Correction via Variational Deep Diffusion

    Authors: Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelos

    Abstract: Atmospheric Turbulence (AT) correction is a challenging restoration task as it consists of two distortions: geometric distortion and spatially variant blur. Diffusion models have shown impressive accomplishments in photo-realistic image synthesis and beyond. In this paper, we propose a novel deep conditional diffusion model under a variational inference framework to solve the AT correction problem… ▽ More

    Submitted 26 July, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: This work has been accepted to the 2023 IEEE 6th International Conference on Multimedia Information Processing and Retrieval (MIPR)

  26. arXiv:2305.04186  [pdf, other

    cs.CV

    Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization

    Authors: Xijun Wang, Aggelos K. Katsaggelos

    Abstract: Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels. When humans watch videos, we can adapt our abstract-level knowledge about actions in different video scenarios and detect whether some actions are occurring. In this paper, we mimic how humans do and bring a new perspective for locating and… ▽ More

    Submitted 25 December, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

  27. arXiv:2303.17041  [pdf, other

    cs.MM cs.GR cs.LG

    Automatic Camera Trajectory Control with Enhanced Immersion for Virtual Cinematography

    Authors: Xinyi Wu, Haohong Wang, Aggelos K. Katsaggelos

    Abstract: User-generated cinematic creations are gaining popularity as our daily entertainment, yet it is a challenge to master cinematography for producing immersive contents. Many existing automatic methods focus on roughly controlling predefined shot types or movement patterns, which struggle to engage viewers with the circumstances of the actor. Real-world cinematographic rules show that directors can c… ▽ More

    Submitted 21 May, 2024; v1 submitted 29 March, 2023; originally announced March 2023.

  28. arXiv:2301.08798  [pdf

    eess.IV cs.CV

    DeepCOVID-Fuse: A Multi-modality Deep Learning Model Fusing Chest X-Radiographs and Clinical Variables to Predict COVID-19 Risk Levels

    Authors: Yunan Wu, Amil Dravid, Ramsey Michael Wehbe, Aggelos K. Katsaggelos

    Abstract: Propose: To present DeepCOVID-Fuse, a deep learning fusion model to predict risk levels in patients with confirmed coronavirus disease 2019 (COVID-19) and to evaluate the performance of pre-trained fusion models on full or partial combination of chest x-ray (CXRs) or chest radiograph and clinical variables. Materials and Methods: The initial CXRs, clinical variables and outcomes (i.e., mortality… ▽ More

    Submitted 20 January, 2023; originally announced January 2023.

  29. arXiv:2207.14392  [pdf, other

    eess.IV cs.CV cs.LG eess.SP

    A Deep Generative Approach to Oversampling in Ptychography

    Authors: Semih Barutcu, Aggelos K. Katsaggelos, Doğa Gürsoy

    Abstract: Ptychography is a well-studied phase imaging method that makes non-invasive imaging possible at a nanometer scale. It has developed into a mainstream technique with various applications across a range of areas such as material science or the defense industry. One major drawback of ptychography is the long data acquisition time due to the high overlap requirement between adjacent illumination areas… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

  30. arXiv:2205.13672  [pdf, other

    astro-ph.IM astro-ph.GA gr-qc

    Discriminative Dimensionality Reduction using Deep Neural Networks for Clustering of LIGO Data

    Authors: Sara Bahaadini, Yunan Wu, Scott Coughlin, Michael Zevin, Aggelos K. Katsaggelos

    Abstract: In this paper, leveraging the capabilities of neural networks for modeling the non-linearities that exist in the data, we propose several models that can project data into a low dimensional, discriminative, and smooth manifold. The proposed models can transfer knowledge from the domain of known classes to a new domain where the classes are unknown. A clustering algorithm is further applied in the… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  31. arXiv:2205.02397  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Compressive Ptychography using Deep Image and Generative Priors

    Authors: Semih Barutcu, Doğa Gürsoy, Aggelos K. Katsaggelos

    Abstract: Ptychography is a well-established coherent diffraction imaging technique that enables non-invasive imaging of samples at a nanometer scale. It has been extensively used in various areas such as the defense industry or materials science. One major limitation of ptychography is the long data acquisition time due to mechanical scanning of the sample; therefore, approaches to reduce the scan points a… ▽ More

    Submitted 23 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

  32. arXiv:2204.05376  [pdf, other

    cs.CV

    medXGAN: Visual Explanations for Medical Classifiers through a Generative Latent Space

    Authors: Amil Dravid, Florian Schiffers, Boqing Gong, Aggelos K. Katsaggelos

    Abstract: Despite the surge of deep learning in the past decade, some users are skeptical to deploy these models in practice due to their black-box nature. Specifically, in the medical space where there are severe potential repercussions, we need to develop methods to gain confidence in the models' decisions. To this end, we propose a novel medical imaging generative adversarial framework, medXGAN (medical… ▽ More

    Submitted 17 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: 10 pages, 11 figures, accepted to CVPR TCV workshop

    ACM Class: I.5.4; I.5.1; I.4.9; I.4.5; I.2.10

  33. arXiv:2203.16683  [pdf, other

    astro-ph.SR cs.LG

    Active Learning for Computationally Efficient Distribution of Binary Evolution Simulations

    Authors: Kyle Akira Rocha, Jeff J. Andrews, Christopher P. L. Berry, Zoheyr Doctor, Aggelos K. Katsaggelos, Juan Gabriel Serra Pérez, Pablo Marchant, Vicky Kalogera, Scott Coughlin, Simone S. Bavera, Aaron Dotter, Tassos Fragos, Konstantinos Kovlakas, Devina Misra, Zepei Xing, Emmanouil Zapartas

    Abstract: Binary stars undergo a variety of interactions and evolutionary phases, critical for predicting and explaining observed properties. Binary population synthesis with full stellar-structure and evolution simulations are computationally expensive requiring a large number of mass-transfer sequences. The recently developed binary population synthesis code POSYDON incorporates grids of MESA binary star… ▽ More

    Submitted 16 September, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: 21 pages, 10 figures, ApJ in press

    Journal ref: Astrophysical Journal; 938(1):64(15); 2022

  34. arXiv:2203.06448  [pdf

    cs.HC econ.TH q-bio.NC

    Discrete, recurrent, and scalable patterns in human judgement underlie affective picture ratings

    Authors: Emanuel A. Azcona, Byoung-Woo Kim, Nicole L. Vike, Sumra Bari, Shamal Lalvani, Leandros Stefanopoulos, Sean Woodward, Martin Block, Aggelos K. Katsaggelos, Hans C. Breiter

    Abstract: Operant keypress tasks, where each action has a consequence, have been analogized to the construct of "wanting" and produce lawful relationships in humans that quantify preferences for approach and avoidance behavior. It is unknown if rating tasks without an operant framework, which can be analogized to "liking", show similar lawful relationships. We studied three independent cohorts of participan… ▽ More

    Submitted 12 March, 2022; originally announced March 2022.

  35. arXiv:2201.09120  [pdf, other

    cs.CV eess.IV

    Investigating the Potential of Auxiliary-Classifier GANs for Image Classification in Low Data Regimes

    Authors: Amil Dravid, Florian Schiffers, Yunan Wu, Oliver Cossairt, Aggelos K. Katsaggelos

    Abstract: Generative Adversarial Networks (GANs) have shown promise in augmenting datasets and boosting convolutional neural networks' (CNN) performance on image classification tasks. But they introduce more hyperparameters to tune as well as the need for additional time and computational power to train supplementary to the CNN. In this work, we examine the potential for Auxiliary-Classifier GANs (AC-GANs)… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 4 pages content, 1 page references, 3 figures, 2 tables, to appear in ICASSP 2022

    ACM Class: I.5.4; I.5.1; I.4.9; I.2.10

  36. arXiv:2111.00116  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Visual Explanations for Convolutional Neural Networks via Latent Traversal of Generative Adversarial Networks

    Authors: Amil Dravid, Aggelos K. Katsaggelos

    Abstract: Lack of explainability in artificial intelligence, specifically deep neural networks, remains a bottleneck for implementing models in practice. Popular techniques such as Gradient-weighted Class Activation Mapping (Grad-CAM) provide a coarse map of salient features in an image, which rarely tells the whole story of what a convolutional neural network (CNN) learned. Using COVID-19 chest X-rays, we… ▽ More

    Submitted 1 November, 2021; v1 submitted 29 October, 2021; originally announced November 2021.

    Comments: 2 pages, 2 figures, to appear as extended abstract at AAAI-22

    ACM Class: I.5.4; I.5.1; I.4.9; I.2.10

  37. arXiv:2105.09892  [pdf, other

    eess.IV

    Improving Acquisition Speed of X-Ray Ptychography through Spatial Undersampling and Regularization

    Authors: Prasan Shedligeri, Florian Schiffers, Semih Barutcu, Pablo Ruiz, Aggelos K Katsaggelos, Oliver Cossairt

    Abstract: X-ray ptychography is one of the versatile techniques for nanometer resolution imaging. The magnitude of the diffraction patterns is recorded on a detector and the phase of the diffraction patterns is estimated using phase retrieval techniques. Most phase retrieval algorithms make the solution well-posed by relying on the constraints imposed by the overlapping region between neighboring diffractio… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: Accepted at ICIP 2021; 5 pages, 6 figures

  38. arXiv:2105.08205  [pdf, other

    cs.CV cs.AI

    Reinforcement Learning for Adaptive Video Compressive Sensing

    Authors: Sidi Lu, Xin Yuan, Aggelos K Katsaggelos, Weisong Shi

    Abstract: We apply reinforcement learning to video compressive sensing to adapt the compression ratio. Specifically, video snapshot compressive imaging (SCI), which captures high-speed video using a low-speed camera is considered in this work, in which multiple (B) video frames can be reconstructed from a snapshot measurement. One research gap in previous studies is how to adapt B in the video SCI system fo… ▽ More

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: 12 pages, 11 figures, 2 tables

    ACM Class: I.2.10

  39. arXiv:2105.05973  [pdf, other

    eess.IV cs.CV

    Removing Blocking Artifacts in Video Streams Using Event Cameras

    Authors: Henry H. Chopp, Srutarshi Banerjee, Oliver Cossairt, Aggelos K. Katsaggelos

    Abstract: In this paper, we propose EveRestNet, a convolutional neural network designed to remove blocking artifacts in videostreams using events from neuromorphic sensors. We first degrade the video frame using a quadtree structure to produce the blocking artifacts to simulate transmitting a video under a heavily constrained bandwidth. Events from the neuromorphic sensor are also simulated, but are transmi… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  40. arXiv:2103.12297  [pdf, other

    cs.CV

    Adaptive Illumination based Depth Sensing using Deep Superpixel and Soft Sampling Approximation

    Authors: Qiqin Dai, Fengqiang Li, Oliver Cossairt, Aggelos K Katsaggelos

    Abstract: Dense depth map capture is challenging in existing active sparse illumination based depth acquisition techniques, such as LiDAR. Various techniques have been proposed to estimate a dense depth map based on fusion of the sparse depth map measurement with the RGB image. Recent advances in hardware enable adaptive depth measurements resulting in further improvement of the dense depth map estimation.… ▽ More

    Submitted 22 February, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

  41. arXiv:2103.12104  [pdf, other

    gr-qc astro-ph.IM physics.ins-det

    Discovering features in gravitational-wave data through detector characterization, citizen science and machine learning

    Authors: S Soni, C P L Berry, S B Coughlin, M Harandi, C B Jackson, K Crowston, C Østerlund, O Patane, A K Katsaggelos, L Trouille, V-G Baranowski, W F Domainko, K Kaminski, M A Lobato Rodriguez, U Marciniak, P Nauta, G Niklasch, R R Rote, B Téglás, C Unsworth, C Zhang

    Abstract: The observation of gravitational waves is hindered by the presence of transient noise (glitches). We study data from the third observing run of the Advanced LIGO detectors, and identify new glitch classes. Using training sets assembled by monitoring of the state of the detector, and by citizen-science volunteers, we update the Gravity Spy machine-learning algorithm for glitch classification. We fi… ▽ More

    Submitted 6 September, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: 26 pages, 10 figures

    Journal ref: Classical and Quantum Gravity, 2021, Volume 38, Number 19

  42. Snapshot Compressive Imaging: Principle, Implementation, Theory, Algorithms and Applications

    Authors: Xin Yuan, David J. Brady, Aggelos K. Katsaggelos

    Abstract: Capturing high-dimensional (HD) data is a long-term challenge in signal processing and related fields. Snapshot compressive imaging (SCI) uses a two-dimensional (2D) detector to capture HD ($\ge3$D) data in a {\em snapshot} measurement. Via novel optical designs, the 2D detector samples the HD data in a {\em compressive} manner; following this, algorithms are employed to reconstruct the desired HD… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: Extension of X. Yuan, D. J. Brady and A. K. Katsaggelos, "Snapshot Compressive Imaging: Theory, Algorithms, and Applications," in IEEE Signal Processing Magazine, vol. 38, no. 2, pp. 65-88, March 2021, doi: 10.1109/MSP.2020.3023869

    Journal ref: in IEEE Signal Processing Magazine, vol. 38, no. 2, pp. 65-88, March 2021

  43. arXiv:2102.12046  [pdf, other

    eess.IV

    An Adaptive Video Acquisition Scheme for Object Tracking and its Performance Optimization

    Authors: Srutarshi Banerjee, Henry H. Chopp, Juan G. Serra, Hao Tian Yang, Oliver Cossairt, A. K. Katsaggelos

    Abstract: We present a novel adaptive host-chip modular architecture for video acquisition to optimize an overall objective task constrained under a given bit rate. The chip is a high resolution imaging sensor such as gigapixel focal plane array (FPA) with low computational power deployed on the field remotely, while the host is a server with high computational power. The communication channel data bandwidt… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

  44. SkinScan: Low-Cost 3D-Scanning for Dermatologic Diagnosis and Documentation

    Authors: Merlin A. Nau, Florian Schiffers, Yunhao Li, Bingjie Xu, Andreas Maier, Jack Tumblin, Marc Walton, Aggelos K. Katsaggelos, Florian Willomitzer, Oliver Cossairt

    Abstract: The utilization of computational photography becomes increasingly essential in the medical field. Today, imaging techniques for dermatology range from two-dimensional (2D) color imagery with a mobile device to professional clinical imaging systems measuring additional detailed three-dimensional (3D) data. The latter are commonly expensive and not accessible to a broad audience. In this work, we pr… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: 5 pages, 4 Figures, Submitted at ICIP 2021

  45. arXiv:2012.05214  [pdf, other

    cs.CV

    E3D: Event-Based 3D Shape Reconstruction

    Authors: Alexis Baudron, Zihao W. Wang, Oliver Cossairt, Aggelos K. Katsaggelos

    Abstract: 3D shape reconstruction is a primary component of augmented/virtual reality. Despite being highly advanced, existing solutions based on RGB, RGB-D and Lidar sensors are power and data intensive, which introduces challenges for deployment in edge devices. We approach 3D reconstruction with an event camera, a sensor with significantly lower power, latency and data expense while enabling high dynamic… ▽ More

    Submitted 10 December, 2020; v1 submitted 9 December, 2020; originally announced December 2020.

    Comments: Correct author names and only include primary author email

  46. arXiv:2012.04743  [pdf, other

    eess.IV cs.CV

    2-Step Sparse-View CT Reconstruction with a Domain-Specific Perceptual Network

    Authors: Haoyu Wei, Florian Schiffers, Tobias Würfl, Daming Shen, Daniel Kim, Aggelos K. Katsaggelos, Oliver Cossairt

    Abstract: Computed tomography is widely used to examine internal structures in a non-destructive manner. To obtain high-quality reconstructions, one typically has to acquire a densely sampled trajectory to avoid angular undersampling. However, many scenarios require a sparse-view measurement leading to streak-artifacts if unaccounted for. Current methods do not make full use of the domain-specific informati… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.

  47. arXiv:2008.06151  [pdf, other

    eess.IV cs.CV cs.LG math.SP q-bio.NC

    Interpretation of Brain Morphology in Association to Alzheimer's Disease Dementia Classification Using Graph Convolutional Networks on Triangulated Meshes

    Authors: Emanuel A. Azcona, Pierre Besson, Yunan Wu, Arjun Punjabi, Adam Martersteck, Amil Dravid, Todd B. Parrish, S. Kathleen Bandt, Aggelos K. Katsaggelos

    Abstract: We propose a mesh-based technique to aid in the classification of Alzheimer's disease dementia (ADD) using mesh representations of the cortex and subcortical structures. Deep learning methods for classification tasks that utilize structural neuroimaging often require extensive learning parameters to optimize. Frequently, these approaches for automated medical diagnosis also lack visual interpretab… ▽ More

    Submitted 20 August, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted for the Shape in Medical Imaging (ShapeMI) workshop at MICCAI International Conference 2020

  48. arXiv:2001.10964  [pdf, other

    cs.LG cs.CV stat.ML

    Examining the Benefits of Capsule Neural Networks

    Authors: Arjun Punjabi, Jonas Schmid, Aggelos K. Katsaggelos

    Abstract: Capsule networks are a recently developed class of neural networks that potentially address some of the deficiencies with traditional convolutional neural networks. By replacing the standard scalar activations with vectors, and by connecting the artificial neurons in a new way, capsule networks aim to be the next great development for computer vision applications. However, in order to determine wh… ▽ More

    Submitted 29 January, 2020; originally announced January 2020.

  49. arXiv:1912.12879  [pdf

    eess.IV cs.LG stat.ML

    Self-supervised Fine-tuning for Correcting Super-Resolution Convolutional Neural Networks

    Authors: Alice Lucas, Santiago Lopez-Tapia, Rafael Molina, Aggelos K. Katsaggelos

    Abstract: While Convolutional Neural Networks (CNNs) trained for image and video super-resolution (SR) regularly achieve new state-of-the-art performance, they also suffer from significant drawbacks. One of their limitations is their lack of robustness to unseen image formation models during training. Other limitations include the generation of artifacts and hallucinated content when training Generative Adv… ▽ More

    Submitted 15 June, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: 15 pages, 11 figures

  50. arXiv:1911.01915  [pdf, other

    cs.LG cs.CV gr-qc stat.ML

    Scalable Variational Gaussian Processes for Crowdsourcing: Glitch Detection in LIGO

    Authors: Pablo Morales-Álvarez, Pablo Ruiz, Scott Coughlin, Rafael Molina, Aggelos K. Katsaggelos

    Abstract: In the last years, crowdsourcing is transforming the way classification training sets are obtained. Instead of relying on a single expert annotator, crowdsourcing shares the labelling effort among a large number of collaborators. For instance, this is being applied to the data acquired by the laureate Laser Interferometer Gravitational Waves Observatory (LIGO), in order to detect glitches which mi… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 16 pages, under review