-
Zero-Trust Foundation Models: A New Paradigm for Secure and Collaborative Artificial Intelligence for Internet of Things
Authors:
Kai Li,
Conggai Li,
Xin Yuan,
Shenghong Li,
Sai Zou,
Syed Sohail Ahmed,
Wei Ni,
Dusit Niyato,
Abbas Jamalipour,
Falko Dressler,
Ozgur B. Akan
Abstract:
This paper focuses on Zero-Trust Foundation Models (ZTFMs), a novel paradigm that embeds zero-trust security principles into the lifecycle of foundation models (FMs) for Internet of Things (IoT) systems. By integrating core tenets, such as continuous verification, least privilege access (LPA), data confidentiality, and behavioral analytics into the design, training, and deployment of FMs, ZTFMs ca…
▽ More
This paper focuses on Zero-Trust Foundation Models (ZTFMs), a novel paradigm that embeds zero-trust security principles into the lifecycle of foundation models (FMs) for Internet of Things (IoT) systems. By integrating core tenets, such as continuous verification, least privilege access (LPA), data confidentiality, and behavioral analytics into the design, training, and deployment of FMs, ZTFMs can enable secure, privacy-preserving AI across distributed, heterogeneous, and potentially adversarial IoT environments. We present the first structured synthesis of ZTFMs, identifying their potential to transform conventional trust-based IoT architectures into resilient, self-defending ecosystems. Moreover, we propose a comprehensive technical framework, incorporating federated learning (FL), blockchain-based identity management, micro-segmentation, and trusted execution environments (TEEs) to support decentralized, verifiable intelligence at the network edge. In addition, we investigate emerging security threats unique to ZTFM-enabled systems and evaluate countermeasures, such as anomaly detection, adversarial training, and secure aggregation. Through this analysis, we highlight key open research challenges in terms of scalability, secure orchestration, interpretable threat attribution, and dynamic trust calibration. This survey lays a foundational roadmap for secure, intelligent, and trustworthy IoT infrastructures powered by FMs.
△ Less
Submitted 26 May, 2025;
originally announced May 2025.
-
MILUV: A Multi-UAV Indoor Localization dataset with UWB and Vision
Authors:
Mohammed Ayman Shalaby,
Syed Shabbir Ahmed,
Nicholas Dahdah,
Charles Champagne Cossette,
Jerome Le Ny,
James Richard Forbes
Abstract:
This paper introduces MILUV, a Multi-UAV Indoor Localization dataset with UWB and Vision measurements. This dataset comprises 217 minutes of flight time over 36 experiments using three quadcopters, collecting ultra-wideband (UWB) ranging data such as the raw timestamps and channel-impulse response data, vision data from a stereo camera and a bottom-facing monocular camera, inertial measurement uni…
▽ More
This paper introduces MILUV, a Multi-UAV Indoor Localization dataset with UWB and Vision measurements. This dataset comprises 217 minutes of flight time over 36 experiments using three quadcopters, collecting ultra-wideband (UWB) ranging data such as the raw timestamps and channel-impulse response data, vision data from a stereo camera and a bottom-facing monocular camera, inertial measurement unit data, height measurements from a laser rangefinder, magnetometer data, and ground-truth poses from a motion-capture system. The UWB data is collected from up to 12 transceivers affixed to mobile robots and static tripods in both line-of-sight and non-line-of-sight conditions. The UAVs fly at a maximum speed of 4.418 m/s in an indoor environment with visual fiducial markers as features. MILUV is versatile and can be used for a wide range of applications beyond localization, but the primary purpose of MILUV is for testing and validating multi-robot UWB- and vision-based localization algorithms. The dataset can be downloaded at https://doi.org/10.25452/figshare.plus.28386041.v1. A development kit is presented alongside the MILUV dataset, which includes benchmarking algorithms such as visual-inertial odometry, UWB-based localization using an extended Kalman filter, and classification of CIR data using machine learning approaches. The development kit can be found at https://github.com/decargroup/miluv, and is supplemented with a website available at https://decargroup.github.io/miluv/.
△ Less
Submitted 19 April, 2025;
originally announced April 2025.
-
GroundHog: Revolutionizing GLDAS Groundwater Storage Downscaling for Enhanced Recharge Estimation in Bangladesh
Authors:
Saleh Sakib Ahmed,
Rashed Uz Zzaman,
Saifur Rahman Jony,
Faizur Rahman Himel,
Afroza Sharmin,
A. H. M. Khalequr Rahman,
M. Sohel Rahman,
Sara Nowreen
Abstract:
Long-term groundwater level (GWL) measurement is vital for effective policymaking and recharge estimation using annual maxima and minima. However, current methods prioritize short-term predictions and lack multi-year applicability, limiting their utility. Moreover, sparse in-situ measurements lead to reliance on low-resolution satellite data like GLDAS as the ground truth for Machine Learning mode…
▽ More
Long-term groundwater level (GWL) measurement is vital for effective policymaking and recharge estimation using annual maxima and minima. However, current methods prioritize short-term predictions and lack multi-year applicability, limiting their utility. Moreover, sparse in-situ measurements lead to reliance on low-resolution satellite data like GLDAS as the ground truth for Machine Learning models, further constraining accuracy. To overcome these challenges, we first develop an ML model to mitigate data gaps, achieving $R^2$ scores of 0.855 and 0.963 for maximum and minimum GWL predictions, respectively. Subsequently, using these predictions and well observations as ground truth, we train an Upsampling Model that uses low-resolution (25 km) GLDAS data as input to produce high-resolution (2 km) GWLs, achieving an excellent $R^2$ score of 0.96. Our approach successfully upscales GLDAS data for 2003-2024, allowing high-resolution recharge estimations and revealing critical trends for proactive resource management. Our method allows upsampling of groundwater storage (GWS) from GLDAS to high-resolution GWLs for any points independently of officially curated piezometer data, making it a valuable tool for decision-making.
△ Less
Submitted 28 March, 2025;
originally announced March 2025.
-
A light-weight model to generate NDWI from Sentinel-1
Authors:
Saleh Sakib Ahmed,
Saifur Rahman Jony,
Md. Toufikuzzaman,
Saifullah Sayed,
Rashed Uz Zzaman,
Sara Nowreen,
M. Sohel Rahman
Abstract:
The use of Sentinel-2 images to compute Normalized Difference Water Index (NDWI) has many applications, including water body area detection. However, cloud cover poses significant challenges in this regard, which hampers the effectiveness of Sentinel-2 images in this context. In this paper, we present a deep learning model that can generate NDWI given Sentinel-1 images, thereby overcoming this clo…
▽ More
The use of Sentinel-2 images to compute Normalized Difference Water Index (NDWI) has many applications, including water body area detection. However, cloud cover poses significant challenges in this regard, which hampers the effectiveness of Sentinel-2 images in this context. In this paper, we present a deep learning model that can generate NDWI given Sentinel-1 images, thereby overcoming this cloud barrier. We show the effectiveness of our model, where it demonstrates a high accuracy of 0.9134 and an AUC of 0.8656 to predict the NDWI. Additionally, we observe promising results with an R2 score of 0.4984 (for regressing the NDWI values) and a Mean IoU of 0.4139 (for the underlying segmentation task). In conclusion, our model offers a first and robust solution for generating NDWI images directly from Sentinel-1 images and subsequent use for various applications even under challenging conditions such as cloud cover and nighttime.
△ Less
Submitted 22 January, 2025;
originally announced January 2025.
-
FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings
Authors:
John Li,
Shehab Sarar Ahmed,
Deepak Nair
Abstract:
Despite the growing adoption of video processing via Internet of Things (IoT) devices due to their cost-effectiveness, transmitting captured data to nearby servers poses challenges due to varying timing constraints and scarcity of network bandwidth. Existing video compression methods face difficulties in recovering compressed data when incomplete data is provided. Here, we introduce FrameCorr, a d…
▽ More
Despite the growing adoption of video processing via Internet of Things (IoT) devices due to their cost-effectiveness, transmitting captured data to nearby servers poses challenges due to varying timing constraints and scarcity of network bandwidth. Existing video compression methods face difficulties in recovering compressed data when incomplete data is provided. Here, we introduce FrameCorr, a deep-learning based solution that utilizes previously received data to predict the missing segments of a frame, enabling the reconstruction of a frame from partially received data.
△ Less
Submitted 10 September, 2024; v1 submitted 4 September, 2024;
originally announced September 2024.
-
GraphAge: Unleashing the power of Graph Neural Network to Decode Epigenetic Aging
Authors:
Saleh Sakib Ahmed,
Nahian Shabab,
Md. Abul Hassan Samee,
M. Sohel Rahman
Abstract:
DNA methylation is a crucial epigenetic marker used in various clocks to predict epigenetic age. However, many existing clocks fail to account for crucial information about CpG sites and their interrelationships, such as co-methylation patterns. We present a novel approach to represent methylation data as a graph, using methylation values and relevant information about CpG sites as nodes, and rela…
▽ More
DNA methylation is a crucial epigenetic marker used in various clocks to predict epigenetic age. However, many existing clocks fail to account for crucial information about CpG sites and their interrelationships, such as co-methylation patterns. We present a novel approach to represent methylation data as a graph, using methylation values and relevant information about CpG sites as nodes, and relationships like co-methylation, same gene, and same chromosome as edges. We then use a Graph Neural Network (GNN) to predict age. Thus our model, GraphAge, leverages both structural and positional information for prediction as well as better interpretation. Although we had to train in a constrained compute setting, GraphAge still showed competitive performance with a Mean Absolute Error (MAE) of 3.207 and a Mean Squared Error (MSE) of 25.277, slightly outperforming the current state of the art. Perhaps more importantly, we utilized GNN explainer for interpretation purposes and were able to unearth interesting insights (e.g., key CpG sites, pathways, and their relationships through Methylation Regulated Networks in the context of aging), which were not possible to 'decode' without leveraging the unique capability of GraphAge to 'encode' various structural relationships. GraphAge has the potential to consume and utilize all relevant information (if available) about an individual that relates to the complex process of aging. So, in that sense, it is one of its kind and can be seen as the first benchmark for a multimodal model that can incorporate all this information in order to close the gap in our understanding of the true nature of aging.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Optimal Robot Formations: Balancing Range-Based Observability and User-Defined Configurations
Authors:
Syed Shabbir Ahmed,
Mohammed Ayman Shalaby,
Jerome Le Ny,
James Richard Forbes
Abstract:
This paper introduces a set of customizable and novel cost functions that enable the user to easily specify desirable robot formations, such as a ``high-coverage'' infrastructure-inspection formation, while maintaining high relative pose estimation accuracy. The overall cost function balances the need for the robots to be close together for good ranging-based relative localization accuracy and the…
▽ More
This paper introduces a set of customizable and novel cost functions that enable the user to easily specify desirable robot formations, such as a ``high-coverage'' infrastructure-inspection formation, while maintaining high relative pose estimation accuracy. The overall cost function balances the need for the robots to be close together for good ranging-based relative localization accuracy and the need for the robots to achieve specific tasks, such as minimizing the time taken to inspect a given area. The formations found by minimizing the aggregated cost function are evaluated in a coverage path planning task in simulation and experiment, where the robots localize themselves and unknown landmarks using a simultaneous localization and mapping algorithm based on the extended Kalman filter. Compared to an optimal formation that maximizes ranging-based relative localization accuracy, these formations significantly reduce the time to cover a given area with minimal impact on relative pose estimation accuracy.
△ Less
Submitted 10 April, 2025; v1 submitted 1 March, 2024;
originally announced March 2024.
-
Gaussian-Sum Filter for Range-based 3D Relative Pose Estimation in the Presence of Ambiguities
Authors:
Syed S. Ahmed,
Mohammed A. Shalaby,
Charles C. Cossette,
Jerome Le Ny,
James R. Forbes
Abstract:
Multi-robot systems must have the ability to accurately estimate relative states between robots in order to perform collaborative tasks, possibly with no external aiding. Three-dimensional relative pose estimation using range measurements oftentimes suffers from a finite number of non-unique solutions, or ambiguities. This paper: 1) identifies and accurately estimates all possible ambiguities in 2…
▽ More
Multi-robot systems must have the ability to accurately estimate relative states between robots in order to perform collaborative tasks, possibly with no external aiding. Three-dimensional relative pose estimation using range measurements oftentimes suffers from a finite number of non-unique solutions, or ambiguities. This paper: 1) identifies and accurately estimates all possible ambiguities in 2D; 2) treats them as components of a Gaussian mixture model; and 3) presents a computationally-efficient estimator, in the form of a Gaussian-sum filter (GSF), to realize range-based relative pose estimation in an infrastructure-free, 3D, setup. This estimator is evaluated in simulation and experiment and is shown to avoid divergence to local minima induced by the ambiguous poses. Furthermore, the proposed GSF outperforms an extended Kalman filter, demonstrates similar performance to the computationally-demanding particle filter, and is shown to be consistent.
△ Less
Submitted 18 September, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Studying and Recommending Information Highlighting in Stack Overflow Answers
Authors:
Shahla Shaan Ahmed,
Shaowei Wang,
Yuan Tian,
Tse-Hsun,
Chen,
Haoxiang Zhang
Abstract:
Context: Navigating the knowledge of Stack Overflow (SO) remains challenging. To make the posts vivid to users, SO allows users to write and edit posts with Markdown or HTML so that users can leverage various formatting styles (e.g., bold, italic, and code) to highlight the important information. Nonetheless, there have been limited studies on the highlighted information. Objective: We carried out…
▽ More
Context: Navigating the knowledge of Stack Overflow (SO) remains challenging. To make the posts vivid to users, SO allows users to write and edit posts with Markdown or HTML so that users can leverage various formatting styles (e.g., bold, italic, and code) to highlight the important information. Nonetheless, there have been limited studies on the highlighted information. Objective: We carried out the first large-scale exploratory study on the information highlighted in SO answers in our recent study. To extend our previous study, we develop approaches to automatically recommend highlighted content with formatting styles using neural network architectures initially designed for the Named Entity Recognition task. Method: In this paper, we studied 31,169,429 answers of Stack Overflow. For training recommendation models, we choose CNN-based and BERT-based models for each type of formatting (i.e., Bold, Italic, Code, and Heading) using the information highlighting dataset we collected from SO answers. Results: Our models achieve a precision ranging from 0.50 to 0.72 for different formatting types. It is easier to build a model to recommend Code than other types. Models for text formatting types (i.e., Heading, Bold, and Italic) suffer low recall. Our analysis of failure cases indicates that the majority of the failure cases are due to missing identification. One explanation is that the models are easy to learn the frequent highlighted words while struggling to learn less frequent words (i.g., long-tail knowledge). Conclusion: Our findings suggest that it is possible to develop recommendation models for highlighting information for answers with different formatting styles on Stack Overflow.
△ Less
Submitted 25 April, 2024; v1 submitted 2 January, 2024;
originally announced January 2024.
-
Nonlinear Polarization and Efficiency Droop in Hexagonal InGaN/GaN Disk-in-Wire LEDs
Authors:
Vinay Uday Chimalgi,
Md Rezaul Karim Nishat,
Shaikh Shahid Ahmed
Abstract:
Recent studies suggest that piezoelectric polarization can play an important role in determining the electronic and optical properties of nanoscale nitride heterostructures. Among a few models available, recent first-principles calculations performed by Prodhomme et al. provide a simple yet accurate description of linear and nonlinear piezoelectric coefficients in reduced dimensionality structures…
▽ More
Recent studies suggest that piezoelectric polarization can play an important role in determining the electronic and optical properties of nanoscale nitride heterostructures. Among a few models available, recent first-principles calculations performed by Prodhomme et al. provide a simple yet accurate description of linear and nonlinear piezoelectric coefficients in reduced dimensionality structures having wurtzite crystal symmetry. In this paper, first, within a fully atomistic VFF-sp3s* tight-binding framework, we employ the model proposed by Prodhomme et al. to evaluate the importance of nonlinear piezoelectricity on the single-particle electronic states and interband optical transitions in a recently reported hexagon shaped In0.25Ga0.75N/GaN disk-in-wire LED. The microscopically determined transition parameters are then incorporated into a TCAD toolkit to investigate how atomicity and the net polarization field affect the internal quantum efficiency of the LED and lead to a degraded efficiency droop characteristic
△ Less
Submitted 14 October, 2014; v1 submitted 26 March, 2013;
originally announced March 2013.
-
Quantitative Excited State Spectroscopy of a Single InGaAs Quantum Dot Molecule through Multi-million Atom Electronic Structure Calculations
Authors:
Muhammad Usman,
Yui-Hong Matthias Tan,
Hoon Ryu,
Shaikh S. Ahmed,
Hubert Krenner,
Timothy B. Boykin,
Gerhard Klimeck
Abstract:
Atomistic electronic structure calculations are performed to study the coherent inter-dot couplings of the electronic states in a single InGaAs quantum dot molecule. The experimentally observed excitonic spectrum [12] is quantitatively reproduced, and the correct energy states are identified based on a previously validated atomistic tight binding model. The extended devices are represented explici…
▽ More
Atomistic electronic structure calculations are performed to study the coherent inter-dot couplings of the electronic states in a single InGaAs quantum dot molecule. The experimentally observed excitonic spectrum [12] is quantitatively reproduced, and the correct energy states are identified based on a previously validated atomistic tight binding model. The extended devices are represented explicitly in space with 15 million atom structures. An excited state spectroscopy technique is presented in which the externally applied electric field is swept to probe the ladder of the electronic energy levels (electron or hole) of one quantum dot through anti-crossings with the energy levels of the other quantum dot in a two quantum dot molecule. This technique can be applied to estimate the spatial electron-hole spacing inside the quantum dot molecule as well as to reverse engineer quantum dot geometry parameters such as the quantum dot separation. Crystal deformation induced piezoelectric effects have been discussed in the literature as minor perturbations lifting degeneracies of the electron excited (P and D) states, thus affecting polarization alignment of wave function lobes for III-V Heterostructures such as single InAs/GaAs quantum dots. In contrast this work demonstrates the crucial importance of piezoelectricity to resolve the symmetries and energies of the excited states through matching the experimentally measured spectrum in an InGaAs quantum dot molecule under the influence of an electric field. Both linear and quadratic piezoelectric effects are studied for the first time for a quantum dot molecule and demonstrated to be indeed important. The net piezoelectric contribution is found to be critical in determining the correct energy spectrum, which is in contrast to recent studies reporting vanishing net piezoelectric contributions.
△ Less
Submitted 22 June, 2011; v1 submitted 18 August, 2010;
originally announced August 2010.