-
Dynamical Embedding of Single Channel Electroencephalogram for Artifact Subspace Reconstruction
Authors:
Doli Hazarika,
Vishnu KN,
Ramdas Ransing,
Cota Navin Gupta
Abstract:
This study introduces a novel framework to apply Artifact Subspace Reconstruction (ASR) algorithm on single-channel Electroencephalogram (EEG) data. ASR, renowned for its automated capability to effectively eliminate various artifacts like eye-blinks and eye movements from EEG signals. Importantly it has been implemented on android smartphones, but relied on multiple channels for principal compone…
▽ More
This study introduces a novel framework to apply Artifact Subspace Reconstruction (ASR) algorithm on single-channel Electroencephalogram (EEG) data. ASR, renowned for its automated capability to effectively eliminate various artifacts like eye-blinks and eye movements from EEG signals. Importantly it has been implemented on android smartphones, but relied on multiple channels for principal component subspace calculations. To overcome this limitation, we incorporate the established dynamical embedding approach into the algorithm, naming it Embedded-ASR (E-ASR). In our proposed method, an embedded matrix is first constructed from a single-channel EEG data using series of delay vectors. ASR is then applied to this embedded matrix, and the resulting cleaned single-channel EEG is reconstructed by removing the time lag and concatenating the rows of the embedded matrix. Data was collected from four subjects in resting states with eyes open from pre-frontal (Fp1 and Fp2) electrodes using CameraEEG app. To assess the effectiveness of the E-ASR algorithm on an EEG dataset with artifacts, we employed performance metrics such as relative root mean square error (RRMSE), correlation coefficient (CC), average power ratio as well as estimated the number of eye-blinks with and without the E-ASR approach. E-ASR was able to reduce artifacts from the semi-simulated EEG data, with an RRMSE of 45.45% and a CC of 0.91. For real EEG data, the counted eye-blinks were manually cross-checked with ground truth obtained from CameraEEG video data across all subjects for individual Fp1 and Fp2 electrodes. In conclusion, our study suggests E-ASR framework can remove artifacts from single channel EEG data. This promising algorithm might have potential for smartphone-based natural environment EEG applications, where minimal number of electrodes is a critical factor.
△ Less
Submitted 29 October, 2024; v1 submitted 28 June, 2024;
originally announced July 2024.
-
Systematic Review of Experimental Paradigms and Deep Neural Networks for Electroencephalography-Based Cognitive Workload Detection
Authors:
Vishnu KN,
Cota Navin Gupta
Abstract:
This article summarizes a systematic review of the electroencephalography (EEG)-based cognitive workload (CWL) estimation. The focus of the article is twofold: identify the disparate experimental paradigms used for reliably eliciting discreet and quantifiable levels of cognitive load and the specific nature and representational structure of the commonly used input formulations in deep neural netwo…
▽ More
This article summarizes a systematic review of the electroencephalography (EEG)-based cognitive workload (CWL) estimation. The focus of the article is twofold: identify the disparate experimental paradigms used for reliably eliciting discreet and quantifiable levels of cognitive load and the specific nature and representational structure of the commonly used input formulations in deep neural networks (DNNs) used for signal classification. The analysis revealed a number of studies using EEG signals in its native representation of a two-dimensional matrix for offline classification of CWL. However, only a few studies adopted an online or pseudo-online classification strategy for real-time CWL estimation. Further, only a couple of interpretable DNNs and a single generative model were employed for cognitive load detection till date during this review. More often than not, researchers were using DNNs as black-box type models. In conclusion, DNNs prove to be valuable tools for classifying EEG signals, primarily due to the substantial modeling power provided by the depth of their network architecture. It is further suggested that interpretable and explainable DNN models must be employed for cognitive workload estimation since existing methods are limited in the face of the non-stationary nature of the signal.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Optimally Coordinated Energy Management Framework for Profit Maximization Considering Dispatchable and Non-Dispatchable Energy Resources
Authors:
Rayees Ahmad Thokar,
Nikhil Gupta,
K. R. Niazi,
Anil Swarnkar,
Nand K. Meena,
Jin Yang
Abstract:
Contemporary distribution network can be seen with diverse dispatchable and non-dispatchable energy resources. The coordinated scheduling of these dispatchable resources with non-dispatchable resources can provide several techno-economic and social benefits. Since, battery energy storage systems (BESSs) and microturbine (MT) units are capital intensive, a thorough investigation of their coordinate…
▽ More
Contemporary distribution network can be seen with diverse dispatchable and non-dispatchable energy resources. The coordinated scheduling of these dispatchable resources with non-dispatchable resources can provide several techno-economic and social benefits. Since, battery energy storage systems (BESSs) and microturbine (MT) units are capital intensive, a thorough investigation of their coordinated scheduling on pure economic basis will be an interesting and challenging task while considering dynamic electricity price and uncertainty handling of non-dispatchable resources and load demand. This paper proposes a new methodology for optimal coordinated scheduling of BESSs and MT units considering existing renewable energy resources and dynamic electricity price to maximize daily profit function of the utility by employing a recently explored modified African buffalo optimization (MABO) algorithm. The key attributes of the proposed methodology are comprised of mean price-based adaptive scheduling embedded within a decision mechanism system (DMS) to maximize arbitrage benefits. DMS keeps a track of system states as a-priori thus guides the artificial intelligence based solution technique for sequential optimization. This may also reduce the computational burden of complex real-life engineering optimization problems. Further, a novel concept of fictitious charges is proposed to restrict the counterproductive operational management of BESSs. The application results investigated and compared on a benchmark 33-bus test distribution system highlights the importance of the proposed methodology.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
Unique Brain Network Identification Number for Parkinson's Individuals Using Structural MRI
Authors:
Tanmayee Samantaray,
Utsav Gupta,
Jitender Saini,
Cota Navin Gupta
Abstract:
We propose a novel algorithm called Unique Brain Network Identification Number, UBNIN for encoding the brain networks of individual subjects. To realize this objective, we employed structural MRI on 180 Parkinsons disease PD patients and 70 healthy controls HC from the National Institute of Mental Health and Neurosciences, India. We parcellated each subjects brain volume and constructed an individ…
▽ More
We propose a novel algorithm called Unique Brain Network Identification Number, UBNIN for encoding the brain networks of individual subjects. To realize this objective, we employed structural MRI on 180 Parkinsons disease PD patients and 70 healthy controls HC from the National Institute of Mental Health and Neurosciences, India. We parcellated each subjects brain volume and constructed an individual adjacency matrix using the correlation between the gray matter volumes of every pair of regions. The unique code is derived from values representing connections for every node i, weighted by a factor of 2^1-i. The numerical representation UBNIN was observed to be distinct for each individual brain network, which may also be applied to other neuroimaging modalities. This model may be implemented as a neural signature of a persons unique brain connectivity, thereby making it useful for brainprinting applications. Additionally, we segregated the above datasets into five age cohorts to study the variation in network topology over age. Sparsity was adopted as the threshold estimate to binarize each age-based correlation matrix. For each age cohort, a decreasing trend was observed in the mean clustering coefficient with increasing sparsity. Significantly different clustering coefficients were noted in PD between age cohort B and C, C and E, and in HC between E and B, E and C, E and D, and C and D. Our findings suggest network connectivity patterns change with age, indicating network disruption may be due to the underlying neuropathology. Varying clustering coefficients for different cohorts indicate that information transfer between neighboring nodes changes with age. This provides evidence of age related brain shrinkage and network degeneration. We also discuss limitations and provide an open-access link to software codes and a help file for the entire study.
△ Less
Submitted 19 September, 2023; v1 submitted 2 June, 2023;
originally announced June 2023.
-
A Bi-level Decision Framework for Incentive-Based Demand Response in Distribution Systems
Authors:
Vipin Chandra Pandey,
Nikhil Gupta,
Khaleequr Rehman Niazi,
Anil Swarnkar,
Tanuj Rawat,
Charalambos Konstantinou
Abstract:
In a growing retail electricity market, demand response (DR) is becoming an integral part of the system to enhance economic and operational performances. This is rendered as incentive-based DR (IBDR) in the proposed study. It presents a bi-level decision framework under the ambit of multiple demand response providers (DRPs) in the retail competition. It is formulated as a multi-leader-multi-follow…
▽ More
In a growing retail electricity market, demand response (DR) is becoming an integral part of the system to enhance economic and operational performances. This is rendered as incentive-based DR (IBDR) in the proposed study. It presents a bi-level decision framework under the ambit of multiple demand response providers (DRPs) in the retail competition. It is formulated as a multi-leader-multi-follower game, where multiple DRPs, as the DR stakeholders, are strategically interacting to optimize load serving entity cost at the upper level, and individual DRP as the aggregated customers is optimizing its cost at the lower level. The strategic behavior of DRPs is modeled in a game-theoretic framework using a generalized Stackelberg game. Further, the existence and uniqueness of the game are validated using variational inequalities. It is presented as a nonlinear problem to consider AC network constraints. An equilibrium problem with equilibrium constraints is used as a mathematical program to model the multi-leader-multi-follower, bi-level problem, which is simultaneously solved for all DRPs. The diagonalization method is employed to solve the problem. The detailed numerical analyses are conducted on IEEE 33-bus test and Indian-108 bus distribution systems to demonstrate the applicability and scalability of the proposed model and the suggested method.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Computer-aided Diagnosis of Malaria through Transfer Learning using the ResNet50 Backbone
Authors:
Sanya Sinha,
Nilay Gupta
Abstract:
According to the World Malaria Report of 2022, 247 million cases of malaria and 619,000 related deaths were reported in 2021. This highlights the predominance of the disease, especially in the tropical and sub-tropical regions of Africa, parts of South-east Asia, Central and Southern America. Malaria is caused due to the Plasmodium parasite which is circulated through the bites of the female Anoph…
▽ More
According to the World Malaria Report of 2022, 247 million cases of malaria and 619,000 related deaths were reported in 2021. This highlights the predominance of the disease, especially in the tropical and sub-tropical regions of Africa, parts of South-east Asia, Central and Southern America. Malaria is caused due to the Plasmodium parasite which is circulated through the bites of the female Anopheles mosquito. Hence, the detection of the parasite in human blood smears could confirm malarial infestation. Since the manual identification of Plasmodium is a lengthy and time-consuming task subject to variability in accuracy, we propose an automated, computer-aided diagnostic method to classify malarial thin smear blood cell images as parasitized and uninfected by using the ResNet50 Deep Neural Network. In this paper, we have used the pre-trained ResNet50 model on the open-access database provided by the National Library of Medicine's Lister Hill National Center for Biomedical Communication for 150 epochs. The results obtained showed accuracy, precision, and recall values of 98.75%, 99.3% and 99.5% on the ResNet50(proposed) model. We have compared these metrics with similar models such as VGG16, Watershed Segmentation and Random Forest, which showed better performance than traditional techniques as well.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Optimizing Real-Time Performances for Timed-Loop Racing under F1TENTH
Authors:
Nitish Gupta,
Kurt Wilson,
Zhishan Guo
Abstract:
Motion planning and control in autonomous car racing are one of the most challenging and safety-critical tasks due to high speed and dynamism. The lower-level control nodes are expected to be highly optimized due to resource constraints of onboard embedded processing units, although there are strict latency requirements. Some of these guarantees can be provided at the application level, such as us…
▽ More
Motion planning and control in autonomous car racing are one of the most challenging and safety-critical tasks due to high speed and dynamism. The lower-level control nodes are expected to be highly optimized due to resource constraints of onboard embedded processing units, although there are strict latency requirements. Some of these guarantees can be provided at the application level, such as using ROS2's Real-Time executors. However, the performance can be far from satisfactory as many modern control algorithms (such as Model Predictive Control) rely on solving complicated online optimization problems at each iteration. In this paper, we present a simple yet effective multi-threading technique to optimize the throughput of online-control algorithms for resource-constrained autonomous racing platforms. We achieve this by maintaining a systematic pool of worker threads solving the optimization problem in parallel which can improve the system performance by reducing latency between control input commands. We further demonstrate the effectiveness of our method using the Model Predictive Contouring Control (MPCC) algorithm running on Nvidia's Xavier AGX platform.
△ Less
Submitted 8 December, 2022;
originally announced December 2022.
-
Classification of Melanocytic Nevus Images using BigTransfer (BiT)
Authors:
Sanya Sinha,
Nilay Gupta
Abstract:
Skin cancer is a fatal disease that takes a heavy toll over human lives annually. The colored skin images show a significant degree of resemblance between different skin lesions such as melanoma and nevus, making identification and diagnosis more challenging. Melanocytic nevi may mature to cause fatal melanoma. Therefore, the current management protocol involves the removal of those nevi that appe…
▽ More
Skin cancer is a fatal disease that takes a heavy toll over human lives annually. The colored skin images show a significant degree of resemblance between different skin lesions such as melanoma and nevus, making identification and diagnosis more challenging. Melanocytic nevi may mature to cause fatal melanoma. Therefore, the current management protocol involves the removal of those nevi that appear intimidating. However, this necessitates resilient classification paradigms for classifying benign and malignant melanocytic nevi. Early diagnosis necessitates a dependable automated system for melanocytic nevi classification to render diagnosis efficient, timely, and successful. An automated classification algorithm is proposed in the given research. A neural network previously-trained on a separate problem statement is leveraged in this technique for classifying melanocytic nevus images. The suggested method uses BigTransfer (BiT), a ResNet-based transfer learning approach for classifying melanocytic nevi as malignant or benign. The results obtained are compared to that of current techniques, and the new method's classification rate is proven to outperform that of existing methods.
△ Less
Submitted 6 April, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
FREGAN : an application of generative adversarial networks in enhancing the frame rate of videos
Authors:
Rishik Mishra,
Neeraj Gupta,
Nitya Shukla
Abstract:
A digital video is a collection of individual frames, while streaming the video the scene utilized the time slice for each frame. High refresh rate and high frame rate is the demand of all high technology applications. The action tracking in videos becomes easier and motion becomes smoother in gaming applications due to the high refresh rate. It provides a faster response because of less time in b…
▽ More
A digital video is a collection of individual frames, while streaming the video the scene utilized the time slice for each frame. High refresh rate and high frame rate is the demand of all high technology applications. The action tracking in videos becomes easier and motion becomes smoother in gaming applications due to the high refresh rate. It provides a faster response because of less time in between each frame that is displayed on the screen. FREGAN (Frame Rate Enhancement Generative Adversarial Network) model has been proposed, which predicts future frames of a video sequence based on a sequence of past frames. In this paper, we investigated the GAN model and proposed FREGAN for the enhancement of frame rate in videos. We have utilized Huber loss as a loss function in the proposed FREGAN. It provided excellent results in super-resolution and we have tried to reciprocate that performance in the application of frame rate enhancement. We have validated the effectiveness of the proposed model on the standard datasets (UCF101 and RFree500). The experimental outcomes illustrate that the proposed model has a Peak signal-to-noise ratio (PSNR) of 34.94 and a Structural Similarity Index (SSIM) of 0.95.
△ Less
Submitted 1 November, 2021;
originally announced November 2021.
-
On Accelerating Distributed Convex Optimizations
Authors:
Kushal Chakrabarti,
Nirupam Gupta,
Nikhil Chopra
Abstract:
This paper studies a distributed multi-agent convex optimization problem. The system comprises multiple agents in this problem, each with a set of local data points and an associated local cost function. The agents are connected to a server, and there is no inter-agent communication. The agents' goal is to learn a parameter vector that optimizes the aggregate of their local costs without revealing…
▽ More
This paper studies a distributed multi-agent convex optimization problem. The system comprises multiple agents in this problem, each with a set of local data points and an associated local cost function. The agents are connected to a server, and there is no inter-agent communication. The agents' goal is to learn a parameter vector that optimizes the aggregate of their local costs without revealing their local data points. In principle, the agents can solve this problem by collaborating with the server using the traditional distributed gradient-descent method. However, when the aggregate cost is ill-conditioned, the gradient-descent method (i) requires a large number of iterations to converge, and (ii) is highly unstable against process noise. We propose an iterative pre-conditioning technique to mitigate the deleterious effects of the cost function's conditioning on the convergence rate of distributed gradient-descent. Unlike the conventional pre-conditioning techniques, the pre-conditioner matrix in our proposed technique updates iteratively to facilitate implementation on the distributed network. In the distributed setting, we provably show that the proposed algorithm converges linearly with an improved rate of convergence than the traditional and adaptive gradient-descent methods. Additionally, for the special case when the minimizer of the aggregate cost is unique, our algorithm converges superlinearly. We demonstrate our algorithm's superior performance compared to prominent distributed algorithms for solving real logistic regression problems and emulating neural network training via a noisy quadratic model, thereby signifying the proposed algorithm's efficiency for distributively solving non-convex optimization. Moreover, we empirically show that the proposed algorithm results in faster training without compromising the generalization performance.
△ Less
Submitted 19 August, 2021;
originally announced August 2021.
-
An Adaptive Demand Response Framework using Price Elasticity Model in Distribution Networks: A Case Study
Authors:
Vipin Chandra Pandey,
Nikhil Gupta,
K. R. Niazi,
Anil Swarnkar,
Rayees Ahmad Thokar
Abstract:
Price elasticity model (PEM) is an appealing and modest model for assessing the potential of flexible demand in DR. It measures the customers demand sensitivity through elasticity in relation to price variation. However, application of PEM in DR is partially apprehensible on attributing the adaptability and adjustability with intertemporal constraints in DR. Thus, this article presents an adaptive…
▽ More
Price elasticity model (PEM) is an appealing and modest model for assessing the potential of flexible demand in DR. It measures the customers demand sensitivity through elasticity in relation to price variation. However, application of PEM in DR is partially apprehensible on attributing the adaptability and adjustability with intertemporal constraints in DR. Thus, this article presents an adaptive economic DR framework with attributes of DR via a dynamic elasticity approach to model customers sensitivity. This dynamic elasticity is modeled through the deterministic and stochastic approaches. Both approaches envision the notion of load recovery for shiftable/flexible loads to make the proposed DR framework adaptive and adjustable relative to price variation. In stochastic approach, a geometric Brownian motion is employed to emulate load recovery with inclusion of intertemporal constraint of load flexibility. The proposed mathematical model shows what should be the customers elasticity value to achieve the factual DR. The case study is carried out on standard IEEE 33 distribution system bus load data to assess technical and socio-economic impact of DR on customers and is also compared with the exiting model.
△ Less
Submitted 31 May, 2021;
originally announced May 2021.
-
Rational Dynamic Price Model for Demand Response Programs in Modern Distribution Systems
Authors:
Rayees A. Thokar,
Nikhil Gupta,
K. R. Niazi,
Anil Swarnkar,
Nand K. Meena
Abstract:
Demand response (DR) refers to change in electricity consumption pattern of customers during on-peak hours in lieu of financial gains to reduce stress on distribution systems. Existing dynamic price models have not provided adequate success to price-based demand response (PBDR) programs. It happened as these models have raised typical socio-economic problems pertaining to cross-subsidy, free-rider…
▽ More
Demand response (DR) refers to change in electricity consumption pattern of customers during on-peak hours in lieu of financial gains to reduce stress on distribution systems. Existing dynamic price models have not provided adequate success to price-based demand response (PBDR) programs. It happened as these models have raised typical socio-economic problems pertaining to cross-subsidy, free-riders, social inequity, assured profit of utilities, financial gains and comfort of customers, etc. This paper presents a new dynamic price model for PBDR in distribution systems which aims to overcome some of the above mentioned problems of the existing price models. The main aim of the developed price model is to overcome the problems of cross-subsidy and free-riders of the existing price models for widespread acceptance, deployment and efficient utilization of PBDR programs in contemporary distribution systems. Proposed price model generates demand-linked price signal that imposes different price signals to different customers during on-peak hours and remains static otherwise. This makes proposed model a class apart from other existing models. The novelty of the proposed model lies in the fact that the financial benefits and penalties pertaining to DR are self-adjusted among customers while preserving social equity and profit of the utility. Such an ideology has not been yet addressed in the literature. Detailed investigation of application results on a standard test bench reveals that the proposed model equally cares regarding the interests of both customers and utility. For economic assessment, a comparison of the proposed price model with the existing pricing models is also performed.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
Robustness of Iteratively Pre-Conditioned Gradient-Descent Method: The Case of Distributed Linear Regression Problem
Authors:
Kushal Chakrabarti,
Nirupam Gupta,
Nikhil Chopra
Abstract:
This paper considers the problem of multi-agent distributed linear regression in the presence of system noises. In this problem, the system comprises multiple agents wherein each agent locally observes a set of data points, and the agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. We consider a server-based distributed architecture wher…
▽ More
This paper considers the problem of multi-agent distributed linear regression in the presence of system noises. In this problem, the system comprises multiple agents wherein each agent locally observes a set of data points, and the agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. We consider a server-based distributed architecture where the agents interact with a common server to solve the problem; however, the server cannot access the agents' data points. We consider a practical scenario wherein the system either has observation noise, i.e., the data points observed by the agents are corrupted, or has process noise, i.e., the computations performed by the server and the agents are corrupted. In noise-free systems, the recently proposed distributed linear regression algorithm, named the Iteratively Pre-conditioned Gradient-descent (IPG) method, has been claimed to converge faster than related methods. In this paper, we study the robustness of the IPG method, against both the observation noise and the process noise. We empirically show that the robustness of the IPG method compares favorably to the state-of-the-art algorithms.
△ Less
Submitted 26 January, 2021;
originally announced January 2021.
-
Accelerating Distributed SGD for Linear Regression using Iterative Pre-Conditioning
Authors:
Kushal Chakrabarti,
Nirupam Gupta,
Nikhil Chopra
Abstract:
This paper considers the multi-agent distributed linear least-squares problem. The system comprises multiple agents, each agent with a locally observed set of data points, and a common server with whom the agents can interact. The agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. In the server-based distributed settings, the server cann…
▽ More
This paper considers the multi-agent distributed linear least-squares problem. The system comprises multiple agents, each agent with a locally observed set of data points, and a common server with whom the agents can interact. The agents' goal is to compute a linear model that best fits the collective data points observed by all the agents. In the server-based distributed settings, the server cannot access the data points held by the agents. The recently proposed Iteratively Pre-conditioned Gradient-descent (IPG) method has been shown to converge faster than other existing distributed algorithms that solve this problem. In the IPG algorithm, the server and the agents perform numerous iterative computations. Each of these iterations relies on the entire batch of data points observed by the agents for updating the current estimate of the solution. Here, we extend the idea of iterative pre-conditioning to the stochastic settings, where the server updates the estimate and the iterative pre-conditioning matrix based on a single randomly selected data point at every iteration. We show that our proposed Iteratively Pre-conditioned Stochastic Gradient-descent (IPSG) method converges linearly in expectation to a proximity of the solution. Importantly, we empirically show that the proposed IPSG method's convergence rate compares favorably to prominent stochastic algorithms for solving the linear least-squares problem in server-based networks.
△ Less
Submitted 28 November, 2020; v1 submitted 15 November, 2020;
originally announced November 2020.
-
Byzantine Fault-Tolerance in Decentralized Optimization under Minimal Redundancy
Authors:
Nirupam Gupta,
Thinh T. Doan,
Nitin H. Vaidya
Abstract:
This paper considers the problem of Byzantine fault-tolerance in multi-agent decentralized optimization. In this problem, each agent has a local cost function. The goal of a decentralized optimization algorithm is to allow the agents to cooperatively compute a common minimum point of their aggregate cost function. We consider the case when a certain number of agents may be Byzantine faulty. Such f…
▽ More
This paper considers the problem of Byzantine fault-tolerance in multi-agent decentralized optimization. In this problem, each agent has a local cost function. The goal of a decentralized optimization algorithm is to allow the agents to cooperatively compute a common minimum point of their aggregate cost function. We consider the case when a certain number of agents may be Byzantine faulty. Such faulty agents may not follow a prescribed algorithm, and they may share arbitrary or incorrect information with other non-faulty agents. Presence of such Byzantine agents renders a typical decentralized optimization algorithm ineffective. We propose a decentralized optimization algorithm with provable exact fault-tolerance against a bounded number of Byzantine agents, provided the non-faulty agents have a minimal redundancy.
△ Less
Submitted 30 September, 2020;
originally announced September 2020.
-
Iterative Pre-Conditioning for Expediting the Gradient-Descent Method: The Distributed Linear Least-Squares Problem
Authors:
Kushal Chakrabarti,
Nirupam Gupta,
Nikhil Chopra
Abstract:
This paper considers the multi-agent linear least-squares problem in a server-agent network. In this problem, the system comprises multiple agents, each having a set of local data points, that are connected to a server. The goal for the agents is to compute a linear mathematical model that optimally fits the collective data points held by all the agents, without sharing their individual local data…
▽ More
This paper considers the multi-agent linear least-squares problem in a server-agent network. In this problem, the system comprises multiple agents, each having a set of local data points, that are connected to a server. The goal for the agents is to compute a linear mathematical model that optimally fits the collective data points held by all the agents, without sharing their individual local data points. This goal can be achieved, in principle, using the server-agent variant of the traditional iterative gradient-descent method. The gradient-descent method converges linearly to a solution, and its rate of convergence is lower bounded by the conditioning of the agents' collective data points. If the data points are ill-conditioned, the gradient-descent method may require a large number of iterations to converge.
We propose an iterative pre-conditioning technique that mitigates the deleterious effect of the conditioning of data points on the rate of convergence of the gradient-descent method. We rigorously show that the resulting pre-conditioned gradient-descent method, with the proposed iterative pre-conditioning, achieves superlinear convergence when the least-squares problem has a unique solution. In general, the convergence is linear with improved rate of convergence in comparison to the traditional gradient-descent method and the state-of-the-art accelerated gradient-descent methods. We further illustrate the improved rate of convergence of our proposed algorithm through experiments on different real-world least-squares problems in both noise-free and noisy computation environment.
△ Less
Submitted 6 August, 2021; v1 submitted 6 August, 2020;
originally announced August 2020.
-
Machine Learning Automatically Detects COVID-19 using Chest CTs in a Large Multicenter Cohort
Authors:
Eduardo Jose Mortani Barbosa Jr.,
Bogdan Georgescu,
Shikha Chaganti,
Gorka Bastarrika Aleman,
Jordi Broncano Cabrero,
Guillaume Chabin,
Thomas Flohr,
Philippe Grenier,
Sasa Grbic,
Nakul Gupta,
François Mellot,
Savvas Nicolaou,
Thomas Re,
Pina Sanelli,
Alexander W. Sauter,
Youngjin Yoo,
Valentin Ziebandt,
Dorin Comaniciu
Abstract:
Objectives: To investigate machine-learning classifiers and interpretable models using chest CT for detection of COVID-19 and differentiation from other pneumonias, ILD and normal CTs.
Methods: Our retrospective multi-institutional study obtained 2096 chest CTs from 16 institutions (including 1077 COVID-19 patients). Training/testing cohorts included 927/100 COVID-19, 388/33 ILD, 189/33 other pn…
▽ More
Objectives: To investigate machine-learning classifiers and interpretable models using chest CT for detection of COVID-19 and differentiation from other pneumonias, ILD and normal CTs.
Methods: Our retrospective multi-institutional study obtained 2096 chest CTs from 16 institutions (including 1077 COVID-19 patients). Training/testing cohorts included 927/100 COVID-19, 388/33 ILD, 189/33 other pneumonias, and 559/34 normal (no pathologies) CTs. A metric-based approach for classification of COVID-19 used interpretable features, relying on logistic regression and random forests. A deep learning-based classifier differentiated COVID-19 via 3D features extracted directly from CT attenuation and probability distribution of airspace opacities.
Results: Most discriminative features of COVID-19 are percentage of airspace opacity and peripheral and basal predominant opacities, concordant with the typical characterization of COVID-19 in the literature. Unsupervised hierarchical clustering compares feature distribution across COVID-19 and control cohorts. The metrics-based classifier achieved AUC=0.83, sensitivity=0.74, and specificity=0.79 of versus respectively 0.93, 0.90, and 0.83 for the DL-based classifier. Most of ambiguity comes from non-COVID-19 pneumonia with manifestations that overlap with COVID-19, as well as mild COVID-19 cases. Non-COVID-19 classification performance is 91% for ILD, 64% for other pneumonias and 94% for no pathologies, which demonstrates the robustness of our method against different compositions of control groups.
Conclusions: Our new method accurately discriminates COVID-19 from other types of pneumonia, ILD, and no pathologies CTs, using quantitative imaging features derived from chest CT, while balancing interpretability of results and classification performance, and therefore may be useful to facilitate diagnosis of COVID-19.
△ Less
Submitted 9 October, 2020; v1 submitted 8 June, 2020;
originally announced June 2020.
-
Small, Sparse, but Substantial: Techniques for Segmenting Small Agricultural Fields Using Sparse Ground Data
Authors:
Smit Marvaniya,
Umamaheswari Devi,
Jagabondhu Hazra,
Shashank Mujumdar,
Nitin Gupta
Abstract:
The recent thrust on digital agriculture (DA) has renewed significant research interest in the automated delineation of agricultural fields. Most prior work addressing this problem have focused on detecting medium to large fields, while there is strong evidence that around 40\% of the fields world-wide and 70% of the fields in Asia and Africa are small. The lack of adequate labeled images for smal…
▽ More
The recent thrust on digital agriculture (DA) has renewed significant research interest in the automated delineation of agricultural fields. Most prior work addressing this problem have focused on detecting medium to large fields, while there is strong evidence that around 40\% of the fields world-wide and 70% of the fields in Asia and Africa are small. The lack of adequate labeled images for small fields, huge variations in their color, texture, and shape, and faint boundary lines separating them make it difficult to develop an end-to-end learning model for detecting such fields. Hence, in this paper, we present a multi-stage approach that uses a combination of machine learning and image processing techniques. In the first stage, we leverage state-of-the-art edge detection algorithms such as holistically-nested edge detection (HED) to extract first-level contours and polygons. In the second stage, we propose image-processing techniques to identify polygons that are non-fields, over-segmentations, or noise and eliminate them. The next stage tackles under-segmentations using a combination of a novel ``cut-point'' based technique and localized second-level edge detection to obtain individual parcels. Since a few small, non-cropped but vegetated or constructed pockets can be interspersed in areas that are predominantly croplands, in the final stage, we train a classifier for identifying each parcel from the previous stage as an agricultural field or not. In an evaluation using high-resolution imagery, we show that our approach has a high F-Score of 0.84 in areas with large fields and reasonable accuracy with an F-Score of 0.73 in areas with small fields, which is encouraging.
△ Less
Submitted 5 May, 2020;
originally announced May 2020.
-
Iterative Pre-Conditioning to Expedite the Gradient-Descent Method
Authors:
Kushal Chakrabarti,
Nirupam Gupta,
Nikhil Chopra
Abstract:
This paper considers the problem of multi-agent distributed optimization. In this problem, there are multiple agents in the system, and each agent only knows its local cost function. The objective for the agents is to collectively compute a common minimum of the aggregate of all their local cost functions. In principle, this problem is solvable using a distributed variant of the traditional gradie…
▽ More
This paper considers the problem of multi-agent distributed optimization. In this problem, there are multiple agents in the system, and each agent only knows its local cost function. The objective for the agents is to collectively compute a common minimum of the aggregate of all their local cost functions. In principle, this problem is solvable using a distributed variant of the traditional gradient-descent method, which is an iterative method. However, the speed of convergence of the traditional gradient-descent method is highly influenced by the conditioning of the optimization problem being solved. Specifically, the method requires a large number of iterations to converge to a solution if the optimization problem is ill-conditioned.
In this paper, we propose an iterative pre-conditioning approach that can significantly attenuate the influence of the problem's conditioning on the convergence-speed of the gradient-descent method. The proposed pre-conditioning approach can be easily implemented in distributed systems and has minimal computation and communication overhead. For now, we only consider a specific distributed optimization problem wherein the individual local cost functions of the agents are quadratic. Besides the theoretical guarantees, the improved convergence speed of our approach is demonstrated through experiments on a real data-set.
△ Less
Submitted 29 March, 2020; v1 submitted 13 March, 2020;
originally announced March 2020.
-
Privacy of Agents' Costs in Peer-to-Peer Distributed Optimization
Authors:
Nirupam Gupta,
Nikhil Chopra
Abstract:
In this paper, we propose a protocol that preserves (statistical) privacy of agents' costs in peer-to-peer distributed optimization against a passive adversary that corrupts certain number of agents in the network. The proposed protocol guarantees privacy of the affine parts of the honest agents' costs (agents that are not corrupted by the adversary) if the corrupted agents do not form a vertex cu…
▽ More
In this paper, we propose a protocol that preserves (statistical) privacy of agents' costs in peer-to-peer distributed optimization against a passive adversary that corrupts certain number of agents in the network. The proposed protocol guarantees privacy of the affine parts of the honest agents' costs (agents that are not corrupted by the adversary) if the corrupted agents do not form a vertex cut of the underlying communication topology. Therefore, if the (passive) adversary corrupts at most t arbitrary agents in the network then the proposed protocol can preserve the privacy of the affine parts of the remaining honest agents' costs if the communication topology has (t+1)-connectivity. The proposed privacy protocol is a composition of a privacy mechanism (we propose) with any (non-private) distributed optimization algorithm.
△ Less
Submitted 1 May, 2019;
originally announced May 2019.
-
Statistical Privacy in Distributed Average Consensus on Bounded Real Inputs
Authors:
Nirupam Gupta,
Jonathan Katz,
Nikhil Chopra
Abstract:
This paper proposes a privacy protocol for distributed average consensus algorithms on bounded real-valued inputs that guarantees statistical privacy of honest agents' inputs against colluding (passive adversarial) agents, if the set of colluding agents is not a vertex cut in the underlying communication network. This implies that privacy of agents' inputs is preserved against $t$ number of arbitr…
▽ More
This paper proposes a privacy protocol for distributed average consensus algorithms on bounded real-valued inputs that guarantees statistical privacy of honest agents' inputs against colluding (passive adversarial) agents, if the set of colluding agents is not a vertex cut in the underlying communication network. This implies that privacy of agents' inputs is preserved against $t$ number of arbitrary colluding agents if the connectivity of the communication network is at least $(t+1)$. A similar privacy protocol has been proposed for the case of bounded integral inputs in our previous paper~\cite{gupta2018information}. However, many applications of distributed consensus concerning distributed control or state estimation deal with real-valued inputs. Thus, in this paper we propose an extension of the privacy protocol in~\cite{gupta2018information}, for bounded real-valued agents' inputs, where bounds are known apriori to all the agents.
△ Less
Submitted 20 March, 2019;
originally announced March 2019.
-
Information-Theoretic Privacy in Distributed Average Consensus
Authors:
Nirupam Gupta,
Jonathan Katz,
Nikhil Chopra
Abstract:
We present a distributed average consensus protocol that preserves the privacy of agents' inputs. Unlike the differential privacy mechanisms, the presented protocol does not affect the accuracy of the output. It is shown that the protocol preserves the information-theoretic privacy of the agents' inputs against colluding passive adversarial (or honest-but-curious) agents in the network, if the adv…
▽ More
We present a distributed average consensus protocol that preserves the privacy of agents' inputs. Unlike the differential privacy mechanisms, the presented protocol does not affect the accuracy of the output. It is shown that the protocol preserves the information-theoretic privacy of the agents' inputs against colluding passive adversarial (or honest-but-curious) agents in the network, if the adversarial agents do not constitute a vertex cut in the underlying communication network. This implies that we can guarantee information-theoretic privacy of all the honest agents' inputs against $t$ arbitrary colluding passive adversarial agents if the network is $(t+1)$-connected. The protocol is constructed by composing a distributed privacy mechanism that we propose with any (non-private) distributed average consensus algorithm.
△ Less
Submitted 11 April, 2020; v1 submitted 5 September, 2018;
originally announced September 2018.
-
Modeling and Control of Inverted Flight of a Variable-Pitch Quadrotor
Authors:
Namrata Gupta,
Mangal Kothari,
Abhishek
Abstract:
This paper carries out the mathematical modeling, simulation, and control law design for a quadrotor with variable-pitch propellers. The use of variable-pitch propeller for thrust variation instead of RPM regulation facilitates generation of negative thrust, thereby augmenting the rate of change of thrust generation amenable for aggressive maneuvering. Blade element theory along with momentum theo…
▽ More
This paper carries out the mathematical modeling, simulation, and control law design for a quadrotor with variable-pitch propellers. The use of variable-pitch propeller for thrust variation instead of RPM regulation facilitates generation of negative thrust, thereby augmenting the rate of change of thrust generation amenable for aggressive maneuvering. Blade element theory along with momentum theory is used to estimate propeller thrust and torque essential for formulating equation of motion of the vehicle. The proposed flight dynamics model is used for non-linear control design using dynamic inversion technique, which is then used to stabilize, track reference trajectory, and simulate flip maneuver. The rotor torque is an irrational function of the control input which makes the control design challenging. To address this problem, the control design employs three loops. The outer loop solves the translational dynamics to generate the thrust, pitch angle, and roll angle commands required to track the prescribed trajectory. Using the command generated in the outer loop, the inner loop simplifies the rotational dynamics to provide the desired rate of angular velocities. A control allocation loop is added to address the problem of nonlinearity associated with rotor torque. This is done by introducing the derivative of thrust coefficient as a virtual control input. These virtual inputs determine the derivatives of thrust and body moments, which in turn is used to generate the required thrust and body moments. The concept is validated by showing attitude stabilization in real flight for a variable pitch quadrotor. The performance of the proposed design is shown through simulated results for attitude stabilization and trajectory following. Reverse thrust capability of variable-pitch quadrotor is also shown by performing flip maneuver in which quadrotor roll angle changes from 0 to 180 degrees.
△ Less
Submitted 19 September, 2017;
originally announced September 2017.
-
Trajectory Tracking Using Motion Primitives for the Purcell's Swimmer
Authors:
Sudin Kadam,
Kedar Joshi,
Naman Gupta,
Pulkit Katdare,
Ravi Banavar
Abstract:
Locomotion at low Reynolds numbers is a topic of growing interest, spurred by its various engineering and medical applications. This paper presents a novel prototype and a locomotion algorithm for the 3-link planar Purcell's swimmer based on Lie algebraic notions. The kinematic model based on Cox theory of the prototype swimmer is a driftless control-affine system. Using the existing strong contro…
▽ More
Locomotion at low Reynolds numbers is a topic of growing interest, spurred by its various engineering and medical applications. This paper presents a novel prototype and a locomotion algorithm for the 3-link planar Purcell's swimmer based on Lie algebraic notions. The kinematic model based on Cox theory of the prototype swimmer is a driftless control-affine system. Using the existing strong controllability and related results, the existence of motion primitives is initially shown. The Lie algebra of the control vector fields is then used to synthesize control profiles to generate motions along the basis of the Lie algebra associated with the structure group of the system. An open loop control system with vision-based positioning is successfully implemented which allows tracking any given continuous trajectory of the position and orientation of the swimmer's base link. Alongside, the paper also provides a theoretical interpretation of the symmetry arguments presented in the existing literature to generate the control profiles of the swimmer.
△ Less
Submitted 6 July, 2017; v1 submitted 4 March, 2017;
originally announced March 2017.
-
The Pin-Bang Theory: Discovering The Pinterest World
Authors:
Sudip Mittal,
Neha Gupta,
Prateek Dewan,
Ponnurangam Kumaraguru
Abstract:
Pinterest is an image-based online social network, which was launched in the year 2010 and has gained a lot of traction, ever since. Within 3 years, Pinterest has attained 48.7 million unique users. This stupendous growth makes it interesting to study Pinterest, and gives rise to multiple questions about it's users, and content. We characterized Pinterest on the basis of large scale crawls of 3.3…
▽ More
Pinterest is an image-based online social network, which was launched in the year 2010 and has gained a lot of traction, ever since. Within 3 years, Pinterest has attained 48.7 million unique users. This stupendous growth makes it interesting to study Pinterest, and gives rise to multiple questions about it's users, and content. We characterized Pinterest on the basis of large scale crawls of 3.3 million user profiles, and 58.8 million pins. In particular, we explored various attributes of users, pins, boards, pin sources, and user locations, in detail and performed topical analysis of user generated textual content. The characterization revealed most prominent topics among users and pins, top image sources, and geographical distribution of users on Pinterest. We then investigated this social network from a privacy and security standpoint, and found traces of malware in the form of pin sources. Instances of Personally Identifiable Information (PII) leakage were also discovered in the form of phone numbers, BBM (Blackberry Messenger) pins, and email addresses. Further, our analysis demonstrated how Pinterest is a potential venue for copyright infringement, by showing that almost half of the images shared on Pinterest go uncredited. To the best of our knowledge, this is the first attempt to characterize Pinterest at such a large scale.
△ Less
Submitted 18 July, 2013;
originally announced July 2013.
-
A Probabilistic Transmission Expansion Planning Methodology based on Roulette Wheel Selection and Social Welfare
Authors:
Neeraj Gupta,
Rajiv Shekhar,
Prem Kumar Kalra
Abstract:
A new probabilistic methodology for transmission expansion planning (TEP) that does not require a priori specification of new/additional transmission capacities and uses the concept of social welfare has been proposed. Two new concepts have been introduced in this paper: (i) roulette wheel methodology has been used to calculate the capacity of new transmission lines and (ii) load flow analysis has…
▽ More
A new probabilistic methodology for transmission expansion planning (TEP) that does not require a priori specification of new/additional transmission capacities and uses the concept of social welfare has been proposed. Two new concepts have been introduced in this paper: (i) roulette wheel methodology has been used to calculate the capacity of new transmission lines and (ii) load flow analysis has been used to calculate expected demand not served (EDNS). The overall methodology has been implemented on a modified IEEE 5-bus test system. Simulations show an important result: addition of only new transmission lines is not sufficient to minimize EDNS.
△ Less
Submitted 12 March, 2012;
originally announced March 2012.