Skip to main content

Showing 1–24 of 24 results for author: Halgamuge, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.18183  [pdf, other

    eess.SP cs.LG

    FRAME-C: A knowledge-augmented deep learning pipeline for classifying multi-electrode array electrophysiological signals

    Authors: Nisal Ranasinghe, Dzung Do-Ha, Simon Maksour, Tamasha Malepathirana, Sachith Seneviratne, Lezanne Ooi, Saman Halgamuge

    Abstract: Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disorder characterized by motor neuron degeneration, with alterations in neural excitability serving as key indicators. Recent advancements in induced pluripotent stem cell (iPSC) technology have enabled the generation of human iPSC-derived neuronal cultures, which, when combined with multi-electrode array (MEA) electrophysiology, pr… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  2. arXiv:2503.23617  [pdf, other

    cs.LG cs.AI

    Graph-Eq: Discovering Mathematical Equations using Graph Generative Models

    Authors: Nisal Ranasinghe, Damith Senanayake, Saman Halgamuge

    Abstract: The ability to discover meaningful, accurate, and concise mathematical equations that describe datasets is valuable across various domains. Equations offer explicit relationships between variables, enabling deeper insights into underlying data patterns. Most existing equation discovery methods rely on genetic programming, which iteratively searches the equation space but is often slow and prone to… ▽ More

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: 8 pages, 4 figures

    ACM Class: I.2

  3. arXiv:2503.22063  [pdf, other

    cs.LG

    Arch-LLM: Taming LLMs for Neural Architecture Generation via Unsupervised Discrete Representation Learning

    Authors: Deshani Geethika Poddenige, Sachith Seneviratne, Damith Senanayake, Mahesan Niranjan, PN Suganthan, Saman Halgamuge

    Abstract: Unsupervised representation learning has been widely explored across various modalities, including neural architectures, where it plays a key role in downstream applications like Neural Architecture Search (NAS). These methods typically learn an unsupervised representation space before generating/ sampling architectures for the downstream search. A common approach involves the use of Variational A… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  4. arXiv:2503.08055  [pdf, other

    cs.CV

    Unmasking the Unknown: Facial Deepfake Detection in the Open-Set Paradigm

    Authors: Nadarasar Bahavan, Sanjay Saha, Ken Chen, Sachith Seneviratne, Sanka Rasnayaka, Saman Halgamuge

    Abstract: Facial forgery methods such as deepfakes can be misused for identity manipulation and spreading misinformation. They have evolved alongside advancements in generative AI, leading to new and more sophisticated forgery techniques that diverge from existing 'known' methods. Conventional deepfake detection methods use the closedset paradigm, thus limiting their applicability to detecting forgeries cre… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  5. arXiv:2503.08049  [pdf, other

    cs.CV

    SphOR: A Representation Learning Perspective on Open-set Recognition for Identifying Unknown Classes in Deep Learning Models

    Authors: Nadarasar Bahavan, Sachith Seneviratne, Saman Halgamuge

    Abstract: The widespread use of deep learning classifiers necessitates Open-set recognition (OSR), which enables the identification of input data not only from classes known during training but also from unknown classes that might be present in test data. Many existing OSR methods are computationally expensive due to the reliance on complex generative models or suffer from high training costs. We investigat… ▽ More

    Submitted 19 March, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

  6. arXiv:2412.07114  [pdf, other

    cs.CV

    TT-MPD: Test Time Model Pruning and Distillation

    Authors: Haihang Wu, Wei Wang, Tamasha Malepathirana, Sachith Seneviratne, Denny Oetomo, Saman Halgamuge

    Abstract: Pruning can be an effective method of compressing large pre-trained models for inference speed acceleration. Previous pruning approaches rely on access to the original training dataset for both pruning and subsequent fine-tuning. However, access to the training data can be limited due to concerns such as data privacy and commercial confidentiality. Furthermore, with covariate shift (disparities be… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  7. arXiv:2412.04806  [pdf, other

    cs.LG cs.AI cs.CL

    Rethinking Time Series Forecasting with LLMs via Nearest Neighbor Contrastive Learning

    Authors: Jayanie Bogahawatte, Sachith Seneviratne, Maneesha Perera, Saman Halgamuge

    Abstract: Adapting Large Language Models (LLMs) that are extensively trained on abundant text data, and customizing the input prompt to enable time series forecasting has received considerable attention. While recent work has shown great potential for adapting the learned prior of LLMs, the formulation of the prompt to finetune LLMs remains challenging as prompt should be aligned with time series data. Addi… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

  8. arXiv:2411.10921  [pdf, other

    cs.LG cs.CV

    Distributed solar generation forecasting using attention-based deep neural networks for cloud movement prediction

    Authors: Maneesha Perera, Julian De Hoog, Kasun Bandara, Saman Halgamuge

    Abstract: Accurate forecasts of distributed solar generation are necessary to reduce negative impacts resulting from the increased uptake of distributed solar photovoltaic (PV) systems. However, the high variability of solar generation over short time intervals (seconds to minutes) caused by cloud movement makes this forecasting task difficult. To address this, using cloud images, which capture the second-t… ▽ More

    Submitted 16 November, 2024; originally announced November 2024.

  9. arXiv:2408.14780  [pdf, other

    cs.LG cs.AI

    GINN-KAN: Interpretability pipelining with applications in Physics Informed Neural Networks

    Authors: Nisal Ranasinghe, Yu Xia, Sachith Seneviratne, Saman Halgamuge

    Abstract: Neural networks are powerful function approximators, yet their ``black-box" nature often renders them opaque and difficult to interpret. While many post-hoc explanation methods exist, they typically fail to capture the underlying reasoning processes of the networks. A truly interpretable neural network would be trained similarly to conventional models using techniques such as backpropagation, but… ▽ More

    Submitted 28 August, 2024; v1 submitted 27 August, 2024; originally announced August 2024.

  10. arXiv:2408.02938  [pdf, other

    cs.DC

    Reinforcement Learning based Workflow Scheduling in Cloud and Edge Computing Environments: A Taxonomy, Review and Future Directions

    Authors: Amanda Jayanetti, Saman Halgamuge, Rajkumar Buyya

    Abstract: Deep Reinforcement Learning (DRL) techniques have been successfully applied for solving complex decision-making and control tasks in multiple fields including robotics, autonomous driving, healthcare and natural language processing. The ability of DRL agents to learn from experience and utilize real-time data for making decisions makes it an ideal candidate for dealing with the complexities associ… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  11. arXiv:2408.02926  [pdf, other

    cs.DC

    A Deep Reinforcement Learning Approach for Cost Optimized Workflow Scheduling in Cloud Computing Environments

    Authors: Amanda Jayanetti, Saman Halgamuge, Rajkumar Buyya

    Abstract: Cost optimization is a common goal of workflow schedulers operating in cloud computing environments. The use of spot instances is a potential means of achieving this goal, as they are offered by cloud providers at discounted prices compared to their on-demand counterparts in exchange for reduced reliability. This is due to the fact that spot instances are subjected to interruptions when spare comp… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 figures

  12. AniFaceDiff: Animating Stylized Avatars via Parametric Conditioned Diffusion Models

    Authors: Ken Chen, Sachith Seneviratne, Wei Wang, Dongting Hu, Sanjay Saha, Md. Tarek Hasan, Sanka Rasnayaka, Tamasha Malepathirana, Mingming Gong, Saman Halgamuge

    Abstract: Animating stylized avatars with dynamic poses and expressions has attracted increasing attention for its broad range of applications. Previous research has made significant progress by training controllable generative models to synthesize animations based on reference characteristics, pose, and expression conditions. However, the mechanisms used in these methods to control pose and expression ofte… ▽ More

    Submitted 2 December, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Journal ref: Pattern Recognition 170 (2026) 112017

  13. arXiv:2403.04492  [pdf, other

    cs.CV

    Discriminative Sample-Guided and Parameter-Efficient Feature Space Adaptation for Cross-Domain Few-Shot Learning

    Authors: Rashindrie Perera, Saman Halgamuge

    Abstract: In this paper, we look at cross-domain few-shot classification which presents the challenging task of learning new classes in previously unseen domains with few labelled examples. Existing methods, though somewhat effective, encounter several limitations, which we alleviate through two significant improvements. First, we introduce a lightweight parameter-efficient adaptation strategy to address ov… ▽ More

    Submitted 3 April, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Code is available at this link: https://github.com/rashindrie/DIPA

  14. arXiv:2403.01653  [pdf, other

    cs.LG

    Day-ahead regional solar power forecasting with hierarchical temporal convolutional neural networks using historical power generation and weather data

    Authors: Maneesha Perera, Julian De Hoog, Kasun Bandara, Damith Senanayake, Saman Halgamuge

    Abstract: Regional solar power forecasting, which involves predicting the total power generation from all rooftop photovoltaic systems in a region holds significant importance for various stakeholders in the energy sector. However, the vast amount of solar power generation and weather time series from geographically dispersed locations that need to be considered in the forecasting process makes accurate reg… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 37 pages, 16 figures, Accepted to the journal of Applied Energy

  15. arXiv:2401.03104  [pdf, other

    cs.LG

    When To Grow? A Fitting Risk-Aware Policy for Layer Growing in Deep Neural Networks

    Authors: Haihang Wu, Wei Wang, Tamasha Malepathirana, Damith Senanayake, Denny Oetomo, Saman Halgamuge

    Abstract: Neural growth is the process of growing a small neural network to a large network and has been utilized to accelerate the training of deep neural networks. One crucial aspect of neural growth is determining the optimal growth timing. However, few studies investigate this systematically. Our study reveals that neural growth inherently exhibits a regularization effect, whose intensity is influenced… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI'24

  16. arXiv:2312.10913  [pdf, other

    cs.LG cs.AI

    GINN-LP: A Growing Interpretable Neural Network for Discovering Multivariate Laurent Polynomial Equations

    Authors: Nisal Ranasinghe, Damith Senanayake, Sachith Seneviratne, Malin Premaratne, Saman Halgamuge

    Abstract: Traditional machine learning is generally treated as a black-box optimization problem and does not typically produce interpretable functions that connect inputs and outputs. However, the ability to discover such interpretable functions is desirable. In this work, we propose GINN-LP, an interpretable neural network to discover the form and coefficients of the underlying equation of a dataset, when… ▽ More

    Submitted 14 February, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: 14 pages, 12 figures, Accepted by AAAI24

  17. arXiv:2308.09297  [pdf, other

    cs.CV

    NAPA-VQ: Neighborhood Aware Prototype Augmentation with Vector Quantization for Continual Learning

    Authors: Tamasha Malepathirana, Damith Senanayake, Saman Halgamuge

    Abstract: Catastrophic forgetting; the loss of old knowledge upon acquiring new knowledge, is a pitfall faced by deep neural networks in real-world applications. Many prevailing solutions to this problem rely on storing exemplars (previously encountered data), which may not be feasible in applications with memory limitations or privacy constraints. Therefore, the recent focus has been on Non-Exemplar based… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  18. arXiv:2305.06564  [pdf, other

    cs.CV

    Undercover Deepfakes: Detecting Fake Segments in Videos

    Authors: Sanjay Saha, Rashindrie Perera, Sachith Seneviratne, Tamasha Malepathirana, Sanka Rasnayaka, Deshani Geethika, Terence Sim, Saman Halgamuge

    Abstract: The recent renaissance in generative models, driven primarily by the advent of diffusion models and iterative improvement in GAN methods, has enabled many creative applications. However, each advancement is also accompanied by a rise in the potential for misuse. In the arena of the deepfake generation, this is a key societal issue. In particular, the ability to modify segments of videos using such… ▽ More

    Submitted 24 August, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: ICCV 2023 Workshop and Challenge on DeepFake Analysis and Detection

  19. Interpretability and accessibility of machine learning in selected food processing, agriculture and health applications

    Authors: N. Ranasinghe, A. Ramanan, S. Fernando, P. N. Hameed, D. Herath, T. Malepathirana, P. Suganthan, M. Niranjan, S. Halgamuge

    Abstract: Artificial Intelligence (AI) and its data-centric branch of machine learning (ML) have greatly evolved over the last few decades. However, as AI is used increasingly in real world use cases, the importance of the interpretability of and accessibility to AI systems have become major research areas. The lack of interpretability of ML based systems is a major hindrance to widespread adoption of these… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: published in the "Journal of the National Science Foundation of Sri Lanka, Volume 50"

    Journal ref: Journal of the National Science Foundation of Sri Lanka (2022), Vol 50, 263-276

  20. Multi-Resolution, Multi-Horizon Distributed Solar PV Power Forecasting with Forecast Combinations

    Authors: Maneesha Perera, Julian De Hoog, Kasun Bandara, Saman Halgamuge

    Abstract: Distributed, small-scale solar photovoltaic (PV) systems are being installed at a rapidly increasing rate. This can cause major impacts on distribution networks and energy markets. As a result, there is a significant need for improved forecasting of the power generation of these systems at different time resolutions and horizons. However, the performance of forecasting models depends on the resolu… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Journal ref: Expert Systems with Applications 205 (2022)

  21. An Incentive-compatible Energy Trading Framework for Neighborhood Area Networks with Shared Energy Storage

    Authors: Chathurika P. Mediwaththe, Marnie Shaw, Saman Halgamuge, David B. Smith, Paul Scott

    Abstract: Here, a novel energy trading system is proposed for demand-side management of a neighborhood area network (NAN) consisting of a shared energy storage (SES) provider, users with non-dispatchable energy generation, and an electricity retailer. In a leader-follower Stackelberg game, the SES provider first maximizes their revenue by setting a price signal and trading energy with the grid. Then, by fol… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

    Comments: Accepted in IEEE Transactions on Sustainable Energy

  22. Self Organizing Nebulous Growths for Robust and Incremental Data Visualization

    Authors: Damith Senanayake, Wei Wang, Shalin H. Naik, Saman Halgamuge

    Abstract: Non-parametric dimensionality reduction techniques, such as t-SNE and UMAP, are proficient in providing visualizations for datasets of fixed sizes. However, they cannot incrementally map and insert new data points into an already provided data visualization. We present Self-Organizing Nebulous Growths (SONG), a parametric nonlinear dimensionality reduction technique that supports incremental data… ▽ More

    Submitted 1 October, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: in IEEE Transactions on Neural Networks and Learning Systems

  23. The Fast Heuristic Algorithms and Post-Processing Techniques to Design Large and Low-Cost Communication Networks

    Authors: Yahui Sun, Marcus Brazil, Doreen Thomas, Saman Halgamuge

    Abstract: It is challenging to design large and low-cost communication networks. In this paper, we formulate this challenge as the prize-collecting Steiner Tree Problem (PCSTP). The objective is to minimize the costs of transmission routes and the disconnected monetary or informational profits. Initially, we note that the PCSTP is MAX SNP-hard. Then, we propose some post-processing techniques to improve sub… ▽ More

    Submitted 8 January, 2019; originally announced February 2019.

    Journal ref: IEEE/ACM Transactions on Networking; Jan 2019

  24. arXiv:1812.09916  [pdf, other

    cs.LG cs.CV stat.ML

    Improving MMD-GAN Training with Repulsive Loss Function

    Authors: Wei Wang, Yuan Sun, Saman Halgamuge

    Abstract: Generative adversarial nets (GANs) are widely used to learn the data sampling process and their performance may heavily depend on the loss functions, given a limited computational budget. This study revisits MMD-GAN that uses the maximum mean discrepancy (MMD) as the loss function for GAN and makes two contributions. First, we argue that the existing MMD loss function may discourage the learning o… ▽ More

    Submitted 8 February, 2019; v1 submitted 24 December, 2018; originally announced December 2018.

    Comments: Published as a conference paper at ICLR 2019