Search | arXiv e-print repository

arXiv:2505.19839 [pdf, ps, other]

Chance-constrained Solar PV Hosting Capacity Assessment for Distribution Grids Using Gaussian Process and Logit Learning

Authors: Sel Ly, Anshuman Singh, Petr Vorobev, Yeng Chai Soh, Hung Dinh Nguyen

Abstract: Growing penetration of distributed generation such as solar PV can increase the risk of over-voltage in distribution grids, affecting network security. Therefore, assessment of the so-called, PV hosting capacity (HC) - the maximum amount of PV that a given grid can accommodate becomes an important practical problem. In this paper, we propose a novel chance-constrained HC estimation framework using… ▽ More Growing penetration of distributed generation such as solar PV can increase the risk of over-voltage in distribution grids, affecting network security. Therefore, assessment of the so-called, PV hosting capacity (HC) - the maximum amount of PV that a given grid can accommodate becomes an important practical problem. In this paper, we propose a novel chance-constrained HC estimation framework using Gaussian Process and Logit learning that can account for uncertainty and risk management. Also, we consider the assessment of HC under different voltage control strategies. Our results have demonstrated that the proposed models can achieve high accuracy levels of up to 93% in predicting nodal over-voltage events on IEEE 33-bus and 123-bus test-cases. Thus, these models can be effectively employed to estimate the chance-constrained HC with various risk levels. Moreover, our proposed methods have simple forms and low computational costs of only a few seconds. △ Less

Submitted 26 May, 2025; originally announced May 2025.

arXiv:2503.08589 [pdf]

Integration of nested cross-validation, automated hyperparameter optimization, high-performance computing to reduce and quantify the variance of test performance estimation of deep learning models

Authors: Paul Calle, Averi Bates, Justin C. Reynolds, Yunlong Liu, Haoyang Cui, Sinaro Ly, Chen Wang, Qinghao Zhang, Alberto J. de Armendi, Shashank S. Shettar, Kar Ming Fung, Qinggong Tang, Chongle Pan

Abstract: The variability and biases in the real-world performance benchmarking of deep learning models for medical imaging compromise their trustworthiness for real-world deployment. The common approach of holding out a single fixed test set fails to quantify the variance in the estimation of test performance metrics. This study introduces NACHOS (Nested and Automated Cross-validation and Hyperparameter Op… ▽ More The variability and biases in the real-world performance benchmarking of deep learning models for medical imaging compromise their trustworthiness for real-world deployment. The common approach of holding out a single fixed test set fails to quantify the variance in the estimation of test performance metrics. This study introduces NACHOS (Nested and Automated Cross-validation and Hyperparameter Optimization using Supercomputing) to reduce and quantify the variance of test performance metrics of deep learning models. NACHOS integrates Nested Cross-Validation (NCV) and Automated Hyperparameter Optimization (AHPO) within a parallelized high-performance computing (HPC) framework. NACHOS was demonstrated on a chest X-ray repository and an Optical Coherence Tomography (OCT) dataset under multiple data partitioning schemes. Beyond performance estimation, DACHOS (Deployment with Automated Cross-validation and Hyperparameter Optimization using Supercomputing) is introduced to leverage AHPO and cross-validation to build the final model on the full dataset, improving expected deployment performance. The findings underscore the importance of NCV in quantifying and reducing estimation variance, AHPO in optimizing hyperparameters consistently across test folds, and HPC in ensuring computational feasibility. By integrating these methodologies, NACHOS and DACHOS provide a scalable, reproducible, and trustworthy framework for DL model evaluation and deployment in medical imaging. △ Less

Submitted 11 March, 2025; originally announced March 2025.

arXiv:2503.07157 [pdf, other]

MIRAM: Masked Image Reconstruction Across Multiple Scales for Breast Lesion Risk Prediction

Authors: Hung Q. Vo, Pengyu Yuan, Zheng Yin, Kelvin K. Wong, Chika F. Ezeana, Son T. Ly, Stephen T. C. Wong, Hien V. Nguyen

Abstract: Self-supervised learning (SSL) has garnered substantial interest within the machine learning and computer vision communities. Two prominent approaches in SSL include contrastive-based learning and self-distillation utilizing cropping augmentation. Lately, masked image modeling (MIM) has emerged as a more potent SSL technique, employing image inpainting as a pretext task. MIM creates a strong induc… ▽ More Self-supervised learning (SSL) has garnered substantial interest within the machine learning and computer vision communities. Two prominent approaches in SSL include contrastive-based learning and self-distillation utilizing cropping augmentation. Lately, masked image modeling (MIM) has emerged as a more potent SSL technique, employing image inpainting as a pretext task. MIM creates a strong inductive bias toward meaningful spatial and semantic understanding. This has opened up new opportunities for SSL to contribute not only to classification tasks but also to more complex applications like object detection and image segmentation. Building upon this progress, our research paper introduces a scalable and practical SSL approach centered around more challenging pretext tasks that facilitate the acquisition of robust features. Specifically, we leverage multi-scale image reconstruction from randomly masked input images as the foundation for feature learning. Our hypothesis posits that reconstructing high-resolution images enables the model to attend to finer spatial details, particularly beneficial for discerning subtle intricacies within medical images. The proposed SSL features help improve classification performance on the Curated Breast Imaging Subset of Digital Database for Screening Mammography (CBIS-DDSM) dataset. In pathology classification, our method demonstrates a 3\% increase in average precision (AP) and a 1\% increase in the area under the receiver operating characteristic curve (AUC) when compared to state-of-the-art (SOTA) algorithms. Moreover, in mass margins classification, our approach achieves a 4\% increase in AP and a 2\% increase in AUC. △ Less

Submitted 22 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

arXiv:2503.06759 [pdf, other]

Revisiting Invariant Learning for Out-of-Domain Generalization on Multi-Site Mammogram Datasets

Authors: Hung Q. Vo, Samira Zare, Son T. Ly, Lin Wang, Chika F. Ezeana, Xiaohui Yu, Kelvin K. Wong, Stephen T. C. Wong, Hien V. Nguyen

Abstract: Despite significant progress in robust deep learning techniques for mammogram breast cancer classification, their reliability in real-world clinical development settings remains uncertain. The translation of these models to clinical practice faces challenges due to variations in medical centers, imaging protocols, and patient populations. To enhance their robustness, invariant learning methods hav… ▽ More Despite significant progress in robust deep learning techniques for mammogram breast cancer classification, their reliability in real-world clinical development settings remains uncertain. The translation of these models to clinical practice faces challenges due to variations in medical centers, imaging protocols, and patient populations. To enhance their robustness, invariant learning methods have been proposed, prioritizing causal factors over misleading features. However, their effectiveness in clinical development and impact on mammogram classification require investigation. This paper reassesses the application of invariant learning for breast cancer risk estimation based on mammograms. Utilizing diverse multi-site public datasets, it represents the first study in this area. The objective is to evaluate invariant learning's benefits in developing robust models. Invariant learning methods, including Invariant Risk Minimization and Variance Risk Extrapolation, are compared quantitatively against Empirical Risk Minimization. Evaluation metrics include accuracy, average precision, and area under the curve. Additionally, interpretability is examined through class activation maps and visualization of learned representations. This research examines the advantages, limitations, and challenges of invariant learning for mammogram classification, guiding future studies to develop generalized methods for breast cancer prediction on whole mammograms in out-of-domain scenarios. △ Less

Submitted 9 March, 2025; originally announced March 2025.

arXiv:2411.19297 [pdf, ps, other]

Enhancing Parameter-Efficient Fine-Tuning of Vision Transformers through Frequency-Based Adaptation

Authors: Son Thai Ly, Hien V. Nguyen

Abstract: Adapting vision transformer foundation models through parameter-efficient fine-tuning (PEFT) methods has become increasingly popular. These methods optimize a limited subset of parameters, enabling efficient adaptation without the need to fine-tune the entire model while still achieving competitive performance. However, traditional PEFT methods may limit the model's capacity to capture complex pat… ▽ More Adapting vision transformer foundation models through parameter-efficient fine-tuning (PEFT) methods has become increasingly popular. These methods optimize a limited subset of parameters, enabling efficient adaptation without the need to fine-tune the entire model while still achieving competitive performance. However, traditional PEFT methods may limit the model's capacity to capture complex patterns, especially those associated with high-frequency spectra. This limitation becomes particularly problematic as existing research indicates that high-frequency features are crucial for distinguishing subtle image structures. To address this issue, we introduce FreqFit, a novel Frequency Fine-tuning module between ViT blocks to enhance model adaptability. FreqFit is simple yet surprisingly effective, and can be integrated with all existing PEFT methods to boost their performance. By manipulating features in the frequency domain, our approach allows models to capture subtle patterns more effectively. Extensive experiments on 24 datasets, using both supervised and self-supervised foundational models with various state-of-the-art PEFT methods, reveal that FreqFit consistently improves performance over the original PEFT methods with performance gains ranging from 1% to 16%. For instance, FreqFit-LoRA surpasses the performances of state-of-the-art baselines on CIFAR100 by more than 10% even without applying regularization or strong augmentation. For reproducibility purposes, the source code is available at https://github.com/tsly123/FreqFiT. △ Less

Submitted 28 November, 2024; originally announced November 2024.

Comments: 24 pages

arXiv:2306.05376 [pdf, other]

Anomaly Detection in Satellite Videos using Diffusion Models

Authors: Akash Awasthi, Son Ly, Jaer Nizam, Samira Zare, Videet Mehta, Safwan Ahmed, Keshav Shah, Ramakrishna Nemani, Saurabh Prasad, Hien Van Nguyen

Abstract: The definition of anomaly detection is the identification of an unexpected event. Real-time detection of extreme events such as wildfires, cyclones, or floods using satellite data has become crucial for disaster management. Although several earth-observing satellites provide information about disasters, satellites in the geostationary orbit provide data at intervals as frequent as every minute, ef… ▽ More The definition of anomaly detection is the identification of an unexpected event. Real-time detection of extreme events such as wildfires, cyclones, or floods using satellite data has become crucial for disaster management. Although several earth-observing satellites provide information about disasters, satellites in the geostationary orbit provide data at intervals as frequent as every minute, effectively creating a video from space. There are many techniques that have been proposed to identify anomalies in surveillance videos; however, the available datasets do not have dynamic behavior, so we discuss an anomaly framework that can work on very high-frequency datasets to find very fast-moving anomalies. In this work, we present a diffusion model which does not need any motion component to capture the fast-moving anomalies and outperforms the other baseline methods. △ Less

Submitted 25 May, 2023; originally announced June 2023.

arXiv:2210.12707 [pdf, other]

doi 10.1109/ICCD56317.2022.00070

Accelerating the training of single-layer binary neural networks using the HHL quantum algorithm

Authors: Sonia Lopez Alarcon, Cory Merkel, Martin Hoffnagle, Sabrina Ly, Alejandro Pozas-Kerstjens

Abstract: Binary Neural Networks are a promising technique for implementing efficient deep models with reduced storage and computational requirements. The training of these is however, still a compute-intensive problem that grows drastically with the layer size and data input. At the core of this calculation is the linear regression problem. The Harrow-Hassidim-Lloyd (HHL) quantum algorithm has gained relev… ▽ More Binary Neural Networks are a promising technique for implementing efficient deep models with reduced storage and computational requirements. The training of these is however, still a compute-intensive problem that grows drastically with the layer size and data input. At the core of this calculation is the linear regression problem. The Harrow-Hassidim-Lloyd (HHL) quantum algorithm has gained relevance thanks to its promise of providing a quantum state containing the solution of a linear system of equations. The solution is encoded in superposition at the output of a quantum circuit. Although this seems to provide the answer to the linear regression problem for the training neural networks, it also comes with multiple, difficult-to-avoid hurdles. This paper shows, however, that useful information can be extracted from the quantum-mechanical implementation of HHL, and used to reduce the complexity of finding the solution on the classical side. △ Less

Submitted 23 October, 2022; originally announced October 2022.

Comments: Accepted in the 40th IEEE International Conference on Computer Design (ICCD'22). 9 pages, 8 figures, IEEEtran V1.8b

Journal ref: Proceedings of the IEEE 40th International Conference on Computer Design (ICCD), 427-433 (2022)

arXiv:2205.05194 [pdf, other]

Multiplexed Immunofluorescence Brain Image Analysis Using Self-Supervised Dual-Loss Adaptive Masked Autoencoder

Authors: Son T. Ly, Bai Lin, Hung Q. Vo, Dragan Maric, Badri Roysam, Hien V. Nguyen

Abstract: Reliable large-scale cell detection and segmentation is the fundamental first step to understanding biological processes in the brain. The ability to phenotype cells at scale can accelerate preclinical drug evaluation and system-level brain histology studies. The impressive advances in deep learning offer a practical solution to cell image detection and segmentation. Unfortunately, categorizing ce… ▽ More Reliable large-scale cell detection and segmentation is the fundamental first step to understanding biological processes in the brain. The ability to phenotype cells at scale can accelerate preclinical drug evaluation and system-level brain histology studies. The impressive advances in deep learning offer a practical solution to cell image detection and segmentation. Unfortunately, categorizing cells and delineating their boundaries for training deep networks is an expensive process that requires skilled biologists. This paper presents a novel self-supervised Dual-Loss Adaptive Masked Autoencoder (DAMA) for learning rich features from multiplexed immunofluorescence brain images. DAMA's objective function minimizes the conditional entropy in pixel-level reconstruction and feature-level regression. Unlike existing self-supervised learning methods based on a random image masking strategy, DAMA employs a novel adaptive mask sampling strategy to maximize mutual information and effectively learn brain cell data. To the best of our knowledge, this is the first effort to develop a self-supervised learning method for multiplexed immunofluorescence brain images. Our extensive experiments demonstrate that DAMA features enable superior cell detection, segmentation, and classification performance without requiring many annotations. △ Less

Submitted 30 December, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

Comments: Adding new results on multiplexed image data and data efficiency. Pytorch code: https://github.com/hula-ai/DAMA

arXiv:2005.12580 [pdf, ps, other]

Stochastic ordering by g-expectations

Authors: Sel Ly, Nicolas Privault

Abstract: We derive sufficient conditions for the convex and monotonic g-stochastic ordering of diffusion processes under nonlinear g-expectations and g-evaluations. Our approach relies on comparison results for forward-backward stochastic differential equations and on several extensions of convexity, monotonicity and continuous dependence properties for the solutions of associated semilinear parabolic part… ▽ More We derive sufficient conditions for the convex and monotonic g-stochastic ordering of diffusion processes under nonlinear g-expectations and g-evaluations. Our approach relies on comparison results for forward-backward stochastic differential equations and on several extensions of convexity, monotonicity and continuous dependence properties for the solutions of associated semilinear parabolic partial differential equations. Applications to contingent claim price comparison under different hedging portfolio constraints are provided. △ Less

Submitted 12 April, 2022; v1 submitted 26 May, 2020; originally announced May 2020.

MSC Class: 60E15; 35B51; 60H10; 60H30

arXiv:1811.10907 [pdf, other]

Efficient Image Retrieval via Decoupling Diffusion into Online and Offline Processing

Authors: Fan Yang, Ryota Hinami, Yusuke Matsui, Steven Ly, Shin'ichi Satoh

Abstract: Diffusion is commonly used as a ranking or re-ranking method in retrieval tasks to achieve higher retrieval performance, and has attracted lots of attention in recent years. A downside to diffusion is that it performs slowly in comparison to the naive k-NN search, which causes a non-trivial online computational cost on large datasets. To overcome this weakness, we propose a novel diffusion techniq… ▽ More Diffusion is commonly used as a ranking or re-ranking method in retrieval tasks to achieve higher retrieval performance, and has attracted lots of attention in recent years. A downside to diffusion is that it performs slowly in comparison to the naive k-NN search, which causes a non-trivial online computational cost on large datasets. To overcome this weakness, we propose a novel diffusion technique in this paper. In our work, instead of applying diffusion to the query, we pre-compute the diffusion results of each element in the database, making the online search a simple linear combination on top of the k-NN search process. Our proposed method becomes 10~ times faster in terms of online search speed. Moreover, we propose to use late truncation instead of early truncation in previous works to achieve better retrieval performance. △ Less

Submitted 4 January, 2019; v1 submitted 27 November, 2018; originally announced November 2018.

Comments: Accepted by AAAI 2019

arXiv:1712.05692 [pdf]

doi 10.1126/scitranslmed.aal5088

Increased adaptive immune responses and proper feedback regulation protect against clinical dengue

Authors: Etienne Simon-Loriere, Veasna Duong, Ahmed Tawfik, Sivlin Ung, Sowath Ly, Isabelle Casademont, Matthieu Prot, Noémie Courtejoie, Kevin Bleakley, Philippe Buchy, Arnaud Tarantola, Philippe Dussart, Tineke Cantaert, Anavaj Sakuntabhai

Abstract: Dengue is the most prevalent arthropod-borne viral disease. Clinical symptoms of dengue virus (DENV) infection range from classical mild dengue fever to severe, life-threatening dengue shock syndrome. However, most DENV infections cause few or no symptoms. Asymptomatic DENV-infected patients provide a unique opportunity to decipher the host immune responses leading to virus elimination without neg… ▽ More Dengue is the most prevalent arthropod-borne viral disease. Clinical symptoms of dengue virus (DENV) infection range from classical mild dengue fever to severe, life-threatening dengue shock syndrome. However, most DENV infections cause few or no symptoms. Asymptomatic DENV-infected patients provide a unique opportunity to decipher the host immune responses leading to virus elimination without negative impact on t v 'health. We used an integrated approach of transcriptional profiling and immunological analysis comparing a Cambodian population of strictly asymptomatic viremic individuals with clinical dengue patients. Whereas inflammatory pathways and innate immune responses were similar between asymptomatic individuals and clinical dengue patients, expression of proteins related to antigen presentation and subsequent T and B cell activation pathways were differentially regulated, independent of viral load or previous DENV infection. Feedback mechanisms controlled the immune response in asymptomatic viremic individuals as demonstrated by increased activation of T cell apoptosis-related pathways and Fc$γ$RIIB signaling associated with decreased anti-DENV specific antibody concentrations. Taken together, our data illustrate that symptom-free DENV infection in children is determined by increased activation of the adaptive immune compartment and proper control mechanisms leading to elimination of viral infection without excessive immune activation, having implications for novel vaccine development strategies. △ Less

Submitted 11 December, 2017; originally announced December 2017.

Journal ref: Science Translational Medicine, American Association for the Advancement of Science, 2017, 9 (405), pp.eaal5088

Showing 1–11 of 11 results for author: Ly, S