Search | arXiv e-print repository

An Exploration of Internal States in Collaborative Problem Solving

Authors: Sifatul Anindho, Videep Venkatesha, Mariah Bradford, Anne M. Cleary, Nathaniel Blanchard

Abstract: Collaborative problem solving (CPS) is a complex cognitive, social, and emotional process that is increasingly prevalent in educational and professional settings. This study investigates the emotional states of individuals during CPS using a mixed-methods approach. Teams of four first completed a novel CPS task. Immediately after, each individual was placed in an isolated room where they reviewed… ▽ More Collaborative problem solving (CPS) is a complex cognitive, social, and emotional process that is increasingly prevalent in educational and professional settings. This study investigates the emotional states of individuals during CPS using a mixed-methods approach. Teams of four first completed a novel CPS task. Immediately after, each individual was placed in an isolated room where they reviewed the video of their group performing the task and self-reported their internal experiences throughout the task. We performed a linguistic analysis of these internal monologues, providing insights into the range of emotions individuals experience during CPS. Our analysis showed distinct patterns in language use, including characteristic unigrams and bigrams, key words and phrases, emotion labels, and semantic similarity between emotion-related words. △ Less

Submitted 2 July, 2025; originally announced July 2025.

Comments: Accepted to the International Conference on Human-Computer Interaction (HCII) 2025

arXiv:2507.01166 [pdf, ps, other]

A Methodological Framework for Capturing Cognitive-Affective States in Collaborative Learning

Authors: Sifatul Anindho, Videep Venkatesha, Nathaniel Blanchard

Abstract: Identification of affective and attentional states of individuals within groups is difficult to obtain without disrupting the natural flow of collaboration. Recent work from our group used a retrospect cued recall paradigm where participants spoke about their cognitive-affective states while they viewed videos of their groups. We then collected additional participants where their reports were cons… ▽ More Identification of affective and attentional states of individuals within groups is difficult to obtain without disrupting the natural flow of collaboration. Recent work from our group used a retrospect cued recall paradigm where participants spoke about their cognitive-affective states while they viewed videos of their groups. We then collected additional participants where their reports were constrained to a subset of pre-identified cognitive-affective states. In this latter case, participants either self reported or reported in response to probes. Here, we present an initial analysis of the frequency and temporal distribution of participant reports, and how the distributions of labels changed across the two collections. Our approach has implications for the educational data mining community in tracking cognitive-affective states in collaborative learning more effectively and in developing improved adaptive learning systems that can detect and respond to cognitive-affective states. △ Less

Submitted 1 July, 2025; originally announced July 2025.

Comments: Accepted to the Interactive Workshop: Multimodal, Multiparty Learning Analytics (MMLA) at the conference Educational Data Mining (EDM) 2025

arXiv:2506.00743 [pdf, ps, other]

Assortment of Attention Heads: Accelerating Federated PEFT with Head Pruning and Strategic Client Selection

Authors: Yeshwanth Venkatesha, Souvik Kundu, Priyadarshini Panda

Abstract: Parameter Efficient Fine-Tuning (PEFT) has become the de-facto approach in adapting Large Language Models (LLMs) for downstream tasks in Natural Language Processing. However, its adoption in privacy-preserving distributed learning frameworks, such as Federated Learning (FL), remains relatively limited. This is mainly due to challenges specific to FL, such as resource-constrained devices and divers… ▽ More Parameter Efficient Fine-Tuning (PEFT) has become the de-facto approach in adapting Large Language Models (LLMs) for downstream tasks in Natural Language Processing. However, its adoption in privacy-preserving distributed learning frameworks, such as Federated Learning (FL), remains relatively limited. This is mainly due to challenges specific to FL, such as resource-constrained devices and diverse data distributions among clients. In this paper, we propose an efficient method to perform PEFT within the FL framework for Multi-Head Attention (MHA) based language models. We address the challenges through head pruning, a novel head-specific weighted aggregation mechanism, and a client selection strategy. Head pruning minimizes training complexity within the clients, guided by the importance score computed based on the confidence of the attention head. Weighted aggregation of heads ensures the global model captures crucial updates from diverse clients complementing our client selection strategy. We show results on the MultiNLI benchmark along with 20 Newsgroups, XL-Sum, and E2E NLG datasets. We use the MultiNLI dataset and T5-small model with LoRA as our PEFT method, attaining sparsity levels of up to 90%, resulting in a communication advantage of up to 1.8x and a reduction in training OPs of 3.9x while maintaining the accuracy drop under 2%. △ Less

Submitted 31 May, 2025; originally announced June 2025.

arXiv:2505.21594 [pdf, ps, other]

Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits

Authors: Yeshwanth Venkatesha, Souvik Kundu, Priyadarshini Panda

Abstract: Large Language Models (LLMs) enable various applications on edge devices such as smartphones, wearables, and embodied robots. However, their deployment often depends on expensive cloud-based APIs, creating high operational costs, which limit access for smaller organizations and raise sustainability concerns. Certain LLMs can be deployed on-device, offering a cost-effective solution with reduced la… ▽ More Large Language Models (LLMs) enable various applications on edge devices such as smartphones, wearables, and embodied robots. However, their deployment often depends on expensive cloud-based APIs, creating high operational costs, which limit access for smaller organizations and raise sustainability concerns. Certain LLMs can be deployed on-device, offering a cost-effective solution with reduced latency and improved privacy. Yet, limited computing resources constrain the size and accuracy of models that can be deployed, necessitating a collaborative design between edge and cloud. We propose a fast and cost-effective speculative edge-cloud decoding framework with a large target model on the server and a small draft model on the device. By introducing early exits in the target model, tokens are generated mid-verification, allowing the client to preemptively draft subsequent tokens before final verification, thus utilizing idle time and enhancing parallelism between edge and cloud. Using an NVIDIA Jetson Nano (client) and an A100 GPU (server) with Vicuna-68M (draft) and Llama2-7B (target) models, our method achieves up to a 35% reduction in latency compared to cloud-based autoregressive decoding, with an additional 11% improvement from preemptive drafting. To demonstrate real-world applicability, we deploy our method on the Unitree Go2 quadruped robot using Vision-Language Model (VLM) based control, achieving a 21% speedup over traditional cloud-based autoregressive decoding. These results demonstrate the potential of our framework for real-time LLM and VLM applications on resource-constrained edge devices. △ Less

Submitted 27 May, 2025; originally announced May 2025.

arXiv:2503.09511 [pdf, other]

TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues

Authors: Hannah VanderHoeven, Brady Bhalla, Ibrahim Khebour, Austin Youngren, Videep Venkatesha, Mariah Bradford, Jack Fitzgerald, Carlos Mabrey, Jingxuan Tu, Yifan Zhu, Kenneth Lai, Changsoo Jung, James Pustejovsky, Nikhil Krishnaswamy

Abstract: We present TRACE, a novel system for live *common ground* tracking in situated collaborative tasks. With a focus on fast, real-time performance, TRACE tracks the speech, actions, gestures, and visual attention of participants, uses these multimodal inputs to determine the set of task-relevant propositions that have been raised as the dialogue progresses, and tracks the group's epistemic position a… ▽ More We present TRACE, a novel system for live *common ground* tracking in situated collaborative tasks. With a focus on fast, real-time performance, TRACE tracks the speech, actions, gestures, and visual attention of participants, uses these multimodal inputs to determine the set of task-relevant propositions that have been raised as the dialogue progresses, and tracks the group's epistemic position and beliefs toward them as the task unfolds. Amid increased interest in AI systems that can mediate collaborations, TRACE represents an important step forward for agents that can engage with multiparty, multimodal discourse. △ Less

Submitted 12 March, 2025; originally announced March 2025.

Comments: 11 pages, 4 tables, 4 figures, to appear at NAACL 2025 Demos program, Albuquerque, NM, USA

arXiv:2502.02692 [pdf, other]

Intelligent Sensing-to-Action for Robust Autonomy at the Edge: Opportunities and Challenges

Authors: Amit Ranjan Trivedi, Sina Tayebati, Hemant Kumawat, Nastaran Darabi, Divake Kumar, Adarsh Kumar Kosta, Yeshwanth Venkatesha, Dinithi Jayasuriya, Nethmi Jayasinghe, Priyadarshini Panda, Saibal Mukhopadhyay, Kaushik Roy

Abstract: Autonomous edge computing in robotics, smart cities, and autonomous vehicles relies on the seamless integration of sensing, processing, and actuation for real-time decision-making in dynamic environments. At its core is the sensing-to-action loop, which iteratively aligns sensor inputs with computational models to drive adaptive control strategies. These loops can adapt to hyper-local conditions,… ▽ More Autonomous edge computing in robotics, smart cities, and autonomous vehicles relies on the seamless integration of sensing, processing, and actuation for real-time decision-making in dynamic environments. At its core is the sensing-to-action loop, which iteratively aligns sensor inputs with computational models to drive adaptive control strategies. These loops can adapt to hyper-local conditions, enhancing resource efficiency and responsiveness, but also face challenges such as resource constraints, synchronization delays in multi-modal data fusion, and the risk of cascading errors in feedback loops. This article explores how proactive, context-aware sensing-to-action and action-to-sensing adaptations can enhance efficiency by dynamically adjusting sensing and computation based on task demands, such as sensing a very limited part of the environment and predicting the rest. By guiding sensing through control actions, action-to-sensing pathways can improve task relevance and resource use, but they also require robust monitoring to prevent cascading errors and maintain reliability. Multi-agent sensing-action loops further extend these capabilities through coordinated sensing and actions across distributed agents, optimizing resource use via collaboration. Additionally, neuromorphic computing, inspired by biological systems, provides an efficient framework for spike-based, event-driven processing that conserves energy, reduces latency, and supports hierarchical control--making it ideal for multi-agent optimization. This article highlights the importance of end-to-end co-design strategies that align algorithmic models with hardware and environmental dynamics and improve cross-layer interdependencies to improve throughput, precision, and adaptability for energy-efficient edge autonomy in complex environments. △ Less

Submitted 4 February, 2025; originally announced February 2025.

arXiv:2501.11618 [pdf, other]

Enhancing IoT Network Security through Adaptive Curriculum Learning and XAI

Authors: Sathwik Narkedimilli, Sujith Makam, Amballa Venkata Sriram, Sai Prashanth Mallellu, MSVPJ Sathvik, Ranga Rao Venkatesha Prasad

Abstract: To address the critical need for secure IoT networks, this study presents a scalable and lightweight curriculum learning framework enhanced with Explainable AI (XAI) techniques, including LIME, to ensure transparency and adaptability. The proposed model employs novel neural network architecture utilized at every stage of Curriculum Learning to efficiently capture and focus on both short- and long-… ▽ More To address the critical need for secure IoT networks, this study presents a scalable and lightweight curriculum learning framework enhanced with Explainable AI (XAI) techniques, including LIME, to ensure transparency and adaptability. The proposed model employs novel neural network architecture utilized at every stage of Curriculum Learning to efficiently capture and focus on both short- and long-term temporal dependencies, improve learning stability, and enhance accuracy while remaining lightweight and robust against noise in sequential IoT data. Robustness is achieved through staged learning, where the model iteratively refines itself by removing low-relevance features and optimizing performance. The workflow includes edge-optimized quantization and pruning to ensure portability that could easily be deployed in the edge-IoT devices. An ensemble model incorporating Random Forest, XGBoost, and the staged learning base further enhances generalization. Experimental results demonstrate 98% accuracy on CIC-IoV-2024 and CIC-APT-IIoT-2024 datasets and 97% on EDGE-IIoT, establishing this framework as a robust, transparent, and high-performance solution for IoT network security. △ Less

Submitted 20 January, 2025; originally announced January 2025.

Comments: 2 tables, 5 figures

arXiv:2412.03065 [pdf, other]

doi 10.1016/j.dark.2024.101759

Chebyshev cosmography in the framework of extended symmetric teleparallel theory

Authors: Sai Swagat Mishra, N. S. Kavya, P. K. Sahoo, V. Venkatesha

Abstract: Cosmography has been extensively utilized to constrain the kinematic state of the Universe using measured distances. In this work, we propose a new method to reconstruct coupling theories using the first kind of Chebyshev polynomial for two variables in which the functional form of the $f(Q,T)$ theory has been obtained. Further, the unknowns that appeared in the series are constrained using the co… ▽ More Cosmography has been extensively utilized to constrain the kinematic state of the Universe using measured distances. In this work, we propose a new method to reconstruct coupling theories using the first kind of Chebyshev polynomial for two variables in which the functional form of the $f(Q,T)$ theory has been obtained. Further, the unknowns that appeared in the series are constrained using the cosmographic parameters. We find the explicit form of the luminosity distance in terms of cosmographic parameters to perform MCMC analysis using the PANTHEON+SH0ES data set. Through the distance modulus function, we observe that the result comes out to be an excellent match to the standard cosmological model and data. △ Less

Submitted 4 December, 2024; originally announced December 2024.

Comments: Accepted version in Physics of the Dark Universe

Journal ref: Physics of the Dark Universe 47 (2025) 101759

arXiv:2412.02707 [pdf, other]

doi 10.1093/mnras/stae2472

Unveiling the Effects of Coupling Extended Proca-Nuevo Gravity on Cosmic Expansion with Recent Observations

Authors: L. Sudharani, N. S. Kavya, V. Venkatesha

Abstract: We study Coupling Extended Proca-Nuevo gravity, a non-linear theory extending from dRGT massive gravity with a spin-1 field. This theory is shown to yield reliable, ghost-free cosmological solutions, modeling both the Universe's thermal history and late-time acceleration. By analyzing data from Dark energy spectroscopic instruments (DESI), Cosmic Chronometer (CCh), Gamma Ray Bursts (GRBs), and Typ… ▽ More We study Coupling Extended Proca-Nuevo gravity, a non-linear theory extending from dRGT massive gravity with a spin-1 field. This theory is shown to yield reliable, ghost-free cosmological solutions, modeling both the Universe's thermal history and late-time acceleration. By analyzing data from Dark energy spectroscopic instruments (DESI), Cosmic Chronometer (CCh), Gamma Ray Bursts (GRBs), and Type Ia Supernova (SNeIa), we derive parameter constraints with up to 3$σ$ confidence, demonstrating good agreement with observations. Our comparison of $BAO$ data from $WiggleZ$ and $DESI$ highlights its constraining power on the Hubble constant. The analysis of the cosmographic parameter, $q$ shows the statistical compatibility with the recent data. Further, this indicates that Universe's current accelerated expansion aligns with quintessential behavior. △ Less

Submitted 26 November, 2024; originally announced December 2024.

Comments: published version in MNRAS

Journal ref: Monthly Notices of the Royal Astronomical Society 535 (2), 1998-2008, 2024

arXiv:2410.19301 [pdf, other]

Any Other Thoughts, Hedgehog? Linking Deliberation Chains in Collaborative Dialogues

Authors: Abhijnan Nath, Videep Venkatesha, Mariah Bradford, Avyakta Chelle, Austin Youngren, Carlos Mabrey, Nathaniel Blanchard, Nikhil Krishnaswamy

Abstract: Question-asking in collaborative dialogue has long been established as key to knowledge construction, both in internal and collaborative problem solving. In this work, we examine probing questions in collaborative dialogues: questions that explicitly elicit responses from the speaker's interlocutors. Specifically, we focus on modeling the causal relations that lead directly from utterances earlier… ▽ More Question-asking in collaborative dialogue has long been established as key to knowledge construction, both in internal and collaborative problem solving. In this work, we examine probing questions in collaborative dialogues: questions that explicitly elicit responses from the speaker's interlocutors. Specifically, we focus on modeling the causal relations that lead directly from utterances earlier in the dialogue to the emergence of the probing question. We model these relations using a novel graph-based framework of deliberation chains, and reframe the problem of constructing such chains as a coreference-style clustering problem. Our framework jointly models probing and causal utterances and the links between them, and we evaluate on two challenging collaborative task datasets: the Weights Task and DeliData. Our results demonstrate the effectiveness of our theoretically-grounded approach compared to both baselines and stronger coreference approaches, and establish a standard of performance in this novel task. △ Less

Submitted 25 October, 2024; originally announced October 2024.

Comments: Accepted at Findings of EMNLP 2024

arXiv:2407.09589 [pdf, other]

doi 10.1093/mnras/stae1723

Can $f(T)$ models play a bridge between early and late time Universe?

Authors: N. S. Kavya, Sai Swagat Mishra, P. K. Sahoo, V. Venkatesha

Abstract: The ability of Big Bang Nucleosynthesis theory to accurately predict the primordial abundances of helium and deuterium, as well as the baryon content of the Universe, is considered one of the most significant achievements in modern physics. In the present study, we consider two highly motivated hybrid $f(T)$ models and constrain them using the observations from the Big Bang Nucleosynthesis era. In… ▽ More The ability of Big Bang Nucleosynthesis theory to accurately predict the primordial abundances of helium and deuterium, as well as the baryon content of the Universe, is considered one of the most significant achievements in modern physics. In the present study, we consider two highly motivated hybrid $f(T)$ models and constrain them using the observations from the Big Bang Nucleosynthesis era. In addition, using late-time observations of Cosmic Chronometers and Gamma-Ray-Bursts, the ranges of the model parameters are confined which are in good agreement with early time bounds. Subsequently, the common ranges obtained from the analysis for early and late time are summarized. Further, we verify the intermediating epochs by investigating the profiles of cosmographic parameters using the model parameter values from the common range. From this study, we find the considered teleparallel models are viable candidates to explain the primordial-intermediating-present epochs. △ Less

Submitted 19 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

Comments: MNRAS accepted version

Journal ref: Monthly Notices of the Royal Astronomical Society, 532(3) (2024) 3126

arXiv:2406.06661 [pdf, other]

doi 10.3847/1538-4357/ad5555

Constraining extended teleparallel gravity via cosmography: A model-independent approach

Authors: Sai Swagat Mishra, N. S. Kavya, P. K. Sahoo, V. Venkatesha

Abstract: As a classical approach, the dynamics of the Universe, influenced by its dark components, are unveiled through prior modifications of Einstein's equations. Cosmography, on the other hand, is a highly efficient tool for reconstructing any modified theory in a model-independent manner. By employing kinematic variables, it offers a profound explanation for cosmic expansion. Although the cosmographica… ▽ More As a classical approach, the dynamics of the Universe, influenced by its dark components, are unveiled through prior modifications of Einstein's equations. Cosmography, on the other hand, is a highly efficient tool for reconstructing any modified theory in a model-independent manner. By employing kinematic variables, it offers a profound explanation for cosmic expansion. Although the cosmographical approach has been highly successful in several geometric theories in recent years, it has not been extensively explored in coupled gravities. With this in mind, we intend to constrain an extended teleparallel gravity model, $f(T,\mathcal{T})$, through cosmographic parameters. We utilize Taylor series expansion, assuming a minimally coupled form, to constrain the unknowns involved in the series. To achieve this, we conduct a Markov Chain Monte Carlo analysis (MCMC) using three different datasets (CC, BAO, and Pantheon+SH0ES). The constrained results obtained from MCMC are then compared and verified using various cosmological parameters. Finally, we compare the resulting models with \textbf{three} well-known $f(T,\mathcal{T})$ models. △ Less

Submitted 10 June, 2024; originally announced June 2024.

Comments: ApJ accepted version

Journal ref: The Astrophysical Journal 970 (2024) 57

arXiv:2401.17625 [pdf, other]

doi 10.1140/epjp/s13360-024-04955-0

Cosmic expansion beyond $Λ$CDM: Investigating power-law and logarithmic corrections

Authors: M. Koussour, N. S. Kavya, V. Venkatesha, N. Myrzakulov

Abstract: The cosmic acceleration observed in the expansion of the Universe has sparked extensive research into the nature of dark energy, which is known to constitute approximately 70\% of the Universe's energy content. In this study, we explore two parametrizations of the Hubble parameter, namely power-law and logarithmic corrections, as alternatives to the standard $Λ$CDM model. Using observational data… ▽ More The cosmic acceleration observed in the expansion of the Universe has sparked extensive research into the nature of dark energy, which is known to constitute approximately 70\% of the Universe's energy content. In this study, we explore two parametrizations of the Hubble parameter, namely power-law and logarithmic corrections, as alternatives to the standard $Λ$CDM model. Using observational data from Cosmic Chronometers (CC), Pantheon+, and the Baryonic Acoustic Oscillations (BAO) datasets, we investigate the dynamics of essential cosmological parameters, including the deceleration parameter, energy density, pressure, and equation of state (EoS) parameter. The $Om(z)$ diagnostic test is employed to classify different dark energy models. Our cosmological models, with the power-law and logarithmic corrections, are found to provide a good fit to the recent observational data and efficiently describe the cosmic expansion scenario. △ Less

Submitted 31 January, 2024; originally announced January 2024.

Comments: The European Physical Journal Plus accepted version

Journal ref: Eur. Phys. J. Plus 139, 179 (2024)

arXiv:2312.14364 [pdf, other]

doi 10.1109/JSEN.2024.3397490

GreenScan: Towards large-scale terrestrial monitoring the health of urban trees using mobile sensing

Authors: Akshit Gupta, Simone Mora, Fan Zhang, Martine Rutten, R. Venkatesha Prasad, Carlo Ratti

Abstract: Healthy urban greenery is a fundamental asset to mitigate climate change phenomena such as extreme heat and air pollution. However, urban trees are often affected by abiotic and biotic stressors that hamper their functionality, and whenever not timely managed, even their survival. While the current greenery inspection techniques can help in taking effective measures, they often require a high amou… ▽ More Healthy urban greenery is a fundamental asset to mitigate climate change phenomena such as extreme heat and air pollution. However, urban trees are often affected by abiotic and biotic stressors that hamper their functionality, and whenever not timely managed, even their survival. While the current greenery inspection techniques can help in taking effective measures, they often require a high amount of human labor, making frequent assessments infeasible at city-wide scales. In this paper, we present GreenScan, a ground-based sensing system designed to provide health assessments of urban trees at high spatio-temporal resolutions, with low costs. The system utilises thermal and multi-spectral imaging sensors fused using a custom computer vision model in order to estimate two tree health indexes. The evaluation of the system was performed through data collection experiments in Cambridge, USA. Overall, this work illustrates a novel approach for autonomous mobile ground-based tree health monitoring on city-wide scales at high temporal resolutions with low-costs. △ Less

Submitted 6 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

Comments: 13 pages, submitted to IEEE Sensors

Report number: Volume: 24 Issue: 13

Journal ref: IEEE Sensors Journal 2024

arXiv:2312.09640 [pdf, other]

doi 10.1088/1572-9494/ad15fa

Conformally symmetric wormhole solutions supported by non-commutative geometry in $f(Q,T)$ gravity

Authors: Chaitra Chooda Chalavadi, V. Venkatesha, N. S. Kavya, S. V. Divya Rashmi

Abstract: This manuscript investigates wormhole solutions within the framework of extended symmetric teleparallel gravity, incorporating non-commutative geometry, and conformal symmetries. To achieve this, we examine the linear wormhole model with anisotropic fluid under Gaussian and Lorentzian distributions. The primary objective is to derive wormhole solutions while considering the influence of the shape… ▽ More This manuscript investigates wormhole solutions within the framework of extended symmetric teleparallel gravity, incorporating non-commutative geometry, and conformal symmetries. To achieve this, we examine the linear wormhole model with anisotropic fluid under Gaussian and Lorentzian distributions. The primary objective is to derive wormhole solutions while considering the influence of the shape function on model parameters under Gaussian and Lorentzian distributions. The resulting shape function satisfies all the necessary conditions for a traversable wormhole. Furthermore, we analyze the characteristics of the energy conditions and provide a detailed graphical discussion of the matter contents via energy conditions. Additionally, we explore the effect of anisotropy under Gaussian and Lorentzian distributions. Finally, we present our conclusions based on the obtained results. △ Less

Submitted 15 December, 2023; originally announced December 2023.

Comments: Accepted version in Communications in Theoretical Physics

arXiv:2310.02678 [pdf, other]

doi 10.1109/LSENS.2023.3307121

Open Gimbal: A 3 Degrees of Freedom Open Source Sensing and Testing Platform for Nano and Micro UAVs

Authors: Suryansh Sharma, Tristan Dijkstra, R. Venkatesha Prasad

Abstract: Testing the aerodynamics of micro- and nano-UAVs without actually flying is highly challenging. To address this issue, we introduce Open Gimbal, a specially designed 3 Degrees of Freedom platform that caters to the unique requirements of micro- and nano-UAVs. This platform allows for unrestricted and free rotational motion, enabling comprehensive experimentation and evaluation of these UAVs. Our a… ▽ More Testing the aerodynamics of micro- and nano-UAVs without actually flying is highly challenging. To address this issue, we introduce Open Gimbal, a specially designed 3 Degrees of Freedom platform that caters to the unique requirements of micro- and nano-UAVs. This platform allows for unrestricted and free rotational motion, enabling comprehensive experimentation and evaluation of these UAVs. Our approach focuses on simplicity and accessibility. We developed an open-source, 3D printable electro-mechanical design that has minimal size and low complexity. This design facilitates easy replication and customization, making it widely accessible to researchers and developers. Addressing the challenges of sensing flight dynamics at a small scale, we have devised an integrated wireless batteryless sensor subsystem. Our innovative solution eliminates the need for complex wiring and instead uses wireless power transfer for sensor data reception. To validate the effectiveness of open gimbal, we thoroughly evaluate and test its communication link and sensing performance using a typical nano-quadrotor. Through comprehensive testing, we verify the reliability and accuracy of open gimbal in real-world scenarios. These advancements provide valuable tools and insights for researchers and developers working with mUAVs and nUAVs, contributing to the progress of this rapidly evolving field. △ Less

Submitted 4 October, 2023; originally announced October 2023.

Comments: Link to open source repository: https://doi.org/10.5281/zenodo.8052218

Journal ref: in IEEE Sensors Letters, vol. 7, no. 9, pp. 1-4, Sept. 2023, Art no. 2502704

arXiv:2309.00077 [pdf, other]

Governing accelerating Universe via newly reconstructed Hubble parameter by employing empirical data simulations

Authors: L. Sudharani, Kazuharu Bamba, N. S. Kavya, V. Venkatesha

Abstract: A new parametrization of the Hubble parameter is proposed to explore the issue of the cosmological landscape. The constraints on model parameters are derived through the Markov Chain Monte Carlo (MCMC) method by employing a comprehensive union of datasets such as 34 data points from cosmic chronometers (CC), 42 points from baryonic acoustic oscillations (BAO), a recently updated set of 1701 Panthe… ▽ More A new parametrization of the Hubble parameter is proposed to explore the issue of the cosmological landscape. The constraints on model parameters are derived through the Markov Chain Monte Carlo (MCMC) method by employing a comprehensive union of datasets such as 34 data points from cosmic chronometers (CC), 42 points from baryonic acoustic oscillations (BAO), a recently updated set of 1701 Pantheon$^+$ (P22) data points derived from Type Ia supernovae (SNeIa), and 162 data points from gamma-ray bursts (GRBs). Furthermore, the models are compared by using the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), so that a comparative assessment of model performance can be available. Additionally, we compare the Dainotti relation via Gaussian likelihood analysis versus new likelihoods and Calibration of the Dainotti relation through a model-independent method. The kinematic behavior of the models is also investigated by encompassing the transition from deceleration to acceleration and the evolution of the jerk parameter. From the analysis of the parametric models, it is strongly indicated that the Universe is currently undergoing an accelerated phase with diagnostics of the model validating the quintessence phase. △ Less

Submitted 3 May, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

Comments: 16 pages, 8 figures, version accepted for publication in Physics of the Dark Universe

Report number: FU-PCG-122

arXiv:2308.07862 [pdf, other]

doi 10.1016/j.newast.2023.102090

Wormhole Geometry and Three-Dimensional Embedding in Extended Symmetric Teleparallel Gravity

Authors: V. Venkatesha, Chaitra Chooda Chalavadi, N. S. Kavya, P. K. Sahoo

Abstract: In the present manuscript, we study traversable wormhole solutions in the background of extended symmetric teleparallel gravity with matter coupling. With the anisotropic matter distribution we probe the wormhole geometry for two different gravity models. Primarily, we consider the linear model $ f(Q,T) =Q + 2 \, ξ\,T$. Firstly, we presume a logarithmic form of shape function and analyze the scena… ▽ More In the present manuscript, we study traversable wormhole solutions in the background of extended symmetric teleparallel gravity with matter coupling. With the anisotropic matter distribution we probe the wormhole geometry for two different gravity models. Primarily, we consider the linear model $ f(Q,T) =Q + 2 \, ξ\,T$. Firstly, we presume a logarithmic form of shape function and analyze the scenario for different redshift functions. Secondly, for a specific form of energy density, we derive a shape function and note its satisfying behavior. Next, for the non-linear model $f(Q,T) = Q + α,Q^2 + β,T$ and a specific shape function we examine the wormhole solution. Further, with the aid of embedding diagrams, we interpreted the geometry of wormhole models. Finally, we conclude results. △ Less

Submitted 25 July, 2023; originally announced August 2023.

Comments: New Astronomy published version

Journal ref: New Astronomy, 105 (2024) 102090

arXiv:2308.01385 [pdf, other]

doi 10.1145/3570361.3592498

BEAVIS: Balloon Enabled Aerial Vehicle for IoT and Sensing

Authors: Suryansh Sharma, Ashutosh Simha, R. Venkatesha Prasad, Shubham Deshmukh, Kavin B. Saravanan, Ravi Ramesh, Luca Mottola

Abstract: UAVs are becoming versatile and valuable platforms for various applications. However, the main limitation is their flying time. We present BEAVIS, a novel aerial robotic platform striking an unparalleled trade-off between the manoeuvrability of drones and the long lasting capacity of blimps. BEAVIS scores highly in applications where drones enjoy unconstrained mobility yet suffer from limited life… ▽ More UAVs are becoming versatile and valuable platforms for various applications. However, the main limitation is their flying time. We present BEAVIS, a novel aerial robotic platform striking an unparalleled trade-off between the manoeuvrability of drones and the long lasting capacity of blimps. BEAVIS scores highly in applications where drones enjoy unconstrained mobility yet suffer from limited lifetime. A nonlinear flight controller exploiting novel, unexplored, aerodynamic phenomena to regulate the ambient pressure and enable all translational and yaw degrees of freedom is proposed without direct actuation in the vertical direction. BEAVIS has built-in rotor fault detection and tolerance. We explain the design and the necessary background in detail. We verify the dynamics of BEAVIS and demonstrate its distinct advantages, such as agility, over existing platforms including the degrees of freedom akin to a drone with 11.36x increased lifetime. We exemplify the potential of BEAVIS to become an invaluable platform for many applications. △ Less

Submitted 2 August, 2023; originally announced August 2023.

Comments: To be published in the 29th Annual International Conference on Mobile Computing and Networking (ACM MobiCom 23), October 2-6, 2023, Madrid, Spain. ACM, New York, NY, USA, 15 pages

arXiv:2307.10286 [pdf]

Distributed Sensing, Computing, Communication, and Control Fabric: A Unified Service-Level Architecture for 6G

Authors: Dejan Vukobratović, Nikolaos Bartzoudis, Mona Ghassemian, Firooz Saghezchi, Peizheng Li, Adnan Aijaz, Ricardo Martinez, Xueli An, Ranga Rao Venkatesha Prasad, Helge Lüders, Shahid Mumtaz

Abstract: With the advent of the multimodal immersive communication system, people can interact with each other using multiple devices for sensing, communication and/or control either onsite or remotely. As a breakthrough concept, a distributed sensing, computing, communications, and control (DS3C) fabric is introduced in this paper for provisioning 6G services in multi-tenant environments in a unified mann… ▽ More With the advent of the multimodal immersive communication system, people can interact with each other using multiple devices for sensing, communication and/or control either onsite or remotely. As a breakthrough concept, a distributed sensing, computing, communications, and control (DS3C) fabric is introduced in this paper for provisioning 6G services in multi-tenant environments in a unified manner. The DS3C fabric can be further enhanced by natively incorporating intelligent algorithms for network automation and managing networking, computing, and sensing resources efficiently to serve vertical use cases with extreme and/or conflicting requirements. As such, the paper proposes a novel end-to-end 6G system architecture with enhanced intelligence spanning across different network, computing, and business domains, identifies vertical use cases and presents an overview of the relevant standardization and pre-standardization landscape. △ Less

Submitted 18 July, 2023; originally announced July 2023.

arXiv:2307.02498 [pdf, other]

doi 10.1016/j.aop.2023.169383

On possible wormhole solutions supported by non-commutative geometry within $f(R, L_m)$ gravity

Authors: N. S. Kavya, V. Venkatesha, G. Mustafa, P. K. Sahoo

Abstract: Non-commutativity is a key feature of spacetime geometry. The current article explores the traversable wormhole solutions in the framework of $f(R,L_m)$ gravity within non-commutative geometry. By using the Gaussian and Lorentzian distributions, we construct tideless wormholes for the nonlinear $f(R,L_m)$ model $f(R,L_m)=\dfrac{R}{2}+L_m^α$. For both cases, we derive shape functions and discuss th… ▽ More Non-commutativity is a key feature of spacetime geometry. The current article explores the traversable wormhole solutions in the framework of $f(R,L_m)$ gravity within non-commutative geometry. By using the Gaussian and Lorentzian distributions, we construct tideless wormholes for the nonlinear $f(R,L_m)$ model $f(R,L_m)=\dfrac{R}{2}+L_m^α$. For both cases, we derive shape functions and discuss the required different properties with satisfying behavior. For the required wormhole properties, we develop some new constraints. The influence of the involved model parameter on energy conditions is analyzed graphically which provides a discussion about the nature of exotic matter. Further, we check the physical behavior regarding the stability of wormhole solutions through the TOV equation. An interesting feature regarding the stability of the obtained solutions via the speed of sound parameters within the scope of average pressure is discussed. Finally, we conclude our results. △ Less

Submitted 4 July, 2023; originally announced July 2023.

Comments: AOP accepted version

Journal ref: Annals of Physics, 455, 169383 (2023)

arXiv:2306.08856 [pdf, other]

doi 10.1016/j.cjph.2024.01.004

Exploring wormhole solutions in curvature-matter coupling gravity supported by noncommutative geometry and conformal symmetry

Authors: N. S. Kavya, G. Mustafa, V. Venkatesha, P. K. Sahoo

Abstract: This article explores new physically viable wormhole solutions within the framework of f(R,Lm) gravity theory, incorporating noncommutative backgrounds and conformal symmetries. The study investigates the impact of model parameters on the existence and properties of wormholes. The derived shape function is found to obey all the required criteria. Specific attention is given to traceless wormholes… ▽ More This article explores new physically viable wormhole solutions within the framework of f(R,Lm) gravity theory, incorporating noncommutative backgrounds and conformal symmetries. The study investigates the impact of model parameters on the existence and properties of wormholes. The derived shape function is found to obey all the required criteria. Specific attention is given to traceless wormholes with Gaussian and Lorentzian distributions, investigating the behavior of the shape functions and energy conditions. In both cases, the presence of exotic fluid is confirmed. △ Less

Submitted 7 January, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: Accepted version in Chinese Journal of Physics

arXiv:2305.18416 [pdf, other]

doi 10.1145/3583781.3590241

Examining the Role and Limits of Batchnorm Optimization to Mitigate Diverse Hardware-noise in In-memory Computing

Authors: Abhiroop Bhattacharjee, Abhishek Moitra, Youngeun Kim, Yeshwanth Venkatesha, Priyadarshini Panda

Abstract: In-Memory Computing (IMC) platforms such as analog crossbars are gaining focus as they facilitate the acceleration of low-precision Deep Neural Networks (DNNs) with high area- & compute-efficiencies. However, the intrinsic non-idealities in crossbars, which are often non-deterministic and non-linear, degrade the performance of the deployed DNNs. In addition to quantization errors, most frequently… ▽ More In-Memory Computing (IMC) platforms such as analog crossbars are gaining focus as they facilitate the acceleration of low-precision Deep Neural Networks (DNNs) with high area- & compute-efficiencies. However, the intrinsic non-idealities in crossbars, which are often non-deterministic and non-linear, degrade the performance of the deployed DNNs. In addition to quantization errors, most frequently encountered non-idealities during inference include crossbar circuit-level parasitic resistances and device-level non-idealities such as stochastic read noise and temporal drift. In this work, our goal is to closely examine the distortions caused by these non-idealities on the dot-product operations in analog crossbars and explore the feasibility of a nearly training-less solution via crossbar-aware fine-tuning of batchnorm parameters in real-time to mitigate the impact of the non-idealities. This enables reduction in hardware costs in terms of memory and training energy for IMC noise-aware retraining of the DNN weights on crossbars. △ Less

Submitted 28 May, 2023; originally announced May 2023.

Comments: Accepted in Great Lakes Symposium on VLSI 2023 (GLSVLSI 2023) conference

Journal ref: Great Lakes Symposium on VLSI 2023 (GLSVLSI 2023) conference

arXiv:2305.07135 [pdf, other]

Divide-and-Conquer the NAS puzzle in Resource Constrained Federated Learning Systems

Authors: Yeshwanth Venkatesha, Youngeun Kim, Hyoungseob Park, Priyadarshini Panda

Abstract: Federated Learning (FL) is a privacy-preserving distributed machine learning approach geared towards applications in edge devices. However, the problem of designing custom neural architectures in federated environments is not tackled from the perspective of overall system efficiency. In this paper, we propose DC-NAS -- a divide-and-conquer approach that performs supernet-based Neural Architecture… ▽ More Federated Learning (FL) is a privacy-preserving distributed machine learning approach geared towards applications in edge devices. However, the problem of designing custom neural architectures in federated environments is not tackled from the perspective of overall system efficiency. In this paper, we propose DC-NAS -- a divide-and-conquer approach that performs supernet-based Neural Architecture Search (NAS) in a federated system by systematically sampling the search space. We propose a novel diversified sampling strategy that balances exploration and exploitation of the search space by initially maximizing the distance between the samples and progressively shrinking this distance as the training progresses. We then perform channel pruning to reduce the training complexity at the devices further. We show that our approach outperforms several sampling strategies including Hadamard sampling, where the samples are maximally separated. We evaluate our method on the CIFAR10, CIFAR100, EMNIST, and TinyImagenet benchmarks and show a comprehensive analysis of different aspects of federated learning such as scalability, and non-IID data. DC-NAS achieves near iso-accuracy as compared to full-scale federated NAS with 50% fewer resources. △ Less

Submitted 11 May, 2023; originally announced May 2023.

arXiv:2305.04707 [pdf, other]

doi 10.1088/1402-4896/acd483

Geometric structures of Morris-Thorne wormhole metric in $f(R,L_m)$ gravity and energy conditions

Authors: V. Venkatesha, N. S. Kavya, P. K. Sahoo

Abstract: The aim of this manuscript is to study the traversable wormhole (WH) geometries in the curvature matter coupling gravity. We investigate static spherically symmetric Morris-Thorne WHs within the context of $f(R,L_m)$ gravity. To accomplish this, we examine the WH model in four different cases (i) linear $f(R,L_m)$ model, $f(R,L_m)=αR+βL_m$ with anisotropic matter distribution having the relation… ▽ More The aim of this manuscript is to study the traversable wormhole (WH) geometries in the curvature matter coupling gravity. We investigate static spherically symmetric Morris-Thorne WHs within the context of $f(R,L_m)$ gravity. To accomplish this, we examine the WH model in four different cases (i) linear $f(R,L_m)$ model, $f(R,L_m)=αR+βL_m$ with anisotropic matter distribution having the relation $p_r=m p_t$ (ii) linear $f(R,L_m)$ model having anisotropic matter distribution along with the equation of state parameter, $p_r=ωρ$, (iii) non-linear model $f(R,L_m)=\dfrac{1}{2}R+L_m^η$ with specific form of energy density and (iv) non-linear $f(R,L_m)$ model, $f(R,L_m)=\dfrac{1}{2}R+(1+ξR)L_m$ with isotropic matter distribution and having the linear relation between pressure and energy density, $p=ωρ$. Additionally, in the latter case, we consider a specific power-law shape function $b(r)=r_0 \left(\dfrac{r_0}{r}\right)^n$. Furthermore, we analyze the energy conditions for each WH model to verify their physical viability. As a novel outcome, we can see the validation of the null energy condition for the $f(R,L_m)$ model that suggests ruling out the necessity of exotic matter for the traversability of the WH. At last, an embedding diagram for each model is illustrated that describes the WH geometry. △ Less

Submitted 8 May, 2023; originally announced May 2023.

Comments: Physica Scripta accepted version

Journal ref: Physica Scripta 98(6) (2023) 065020

arXiv:2305.01469 [pdf, other]

doi 10.1016/j.cjph.2023.05.002

Static traversable wormhole solutions in $f(R,L_m)$ gravity

Authors: N. S. Kavya, V. Venkatesha, G. Mustafa, P. K. Sahoo, S. V. Divya Rashmi

Abstract: In this study, we explore the new wormhole solutions in the framework of new modified $f(R,L_m)$ gravity. To obtain a characteristic wormhole solution, we use anisotropic matter distribution and a specific form of energy density. As second adopt the isotropic case with a linear EoS relation as a general technique for the system and discuss several physical attributes of the system under the wormho… ▽ More In this study, we explore the new wormhole solutions in the framework of new modified $f(R,L_m)$ gravity. To obtain a characteristic wormhole solution, we use anisotropic matter distribution and a specific form of energy density. As second adopt the isotropic case with a linear EoS relation as a general technique for the system and discuss several physical attributes of the system under the wormhole geometry. Detailed analytical and graphical discussion about the matter contents via energy conditions is discussed. In both cases, the shape function of wormhole geometry satisfies the required conditions. Several interesting points have evolved from the entire investigation along with the features of the exotic matter within the wormhole geometry. Finally, we have concluding remarks. △ Less

Submitted 2 May, 2023; originally announced May 2023.

Comments: Chinese Journal of Physics accepted version

Journal ref: Chinese Journal of Physics (2023)

arXiv:2304.05072 [pdf]

Enhancement in Reliability for Multi-core system consisting of One Instruction Cores

Authors: Shashikiran Venkatesha, Ranjani Parthasarathi

Abstract: Rapid CMOS device size reduction resulted in billions of transistors on a chip have led to integration of many cores leading to many challenges such as increased power dissipation, thermal dissipation, occurrence of transient faults and permanent faults. The mitigation of transient faults and permanent faults at the core level has become an important design parameter in a multi-core scenario. Core… ▽ More Rapid CMOS device size reduction resulted in billions of transistors on a chip have led to integration of many cores leading to many challenges such as increased power dissipation, thermal dissipation, occurrence of transient faults and permanent faults. The mitigation of transient faults and permanent faults at the core level has become an important design parameter in a multi-core scenario. Core level techniques is a redundancy-based fault mitigation technique that improves the lifetime reliability of multi-core systems. In an asymmetric multi-core system, the smaller cores provide fault tolerance to larger cores is a core level fault mitigation technique that has gained momentum and focus from many researchers. The paper presents an economical, asymmetric multi-core system with one instruction cores (MCSOIC). The term Hardware Cost Estimation signifies power and area estimation for MCS-OIC. In MCSOIC, OIC is a warm standby redundant core. OICs provide functional support to conventional cores for shorter periods of time. To evaluate the idea, different configurations of MCSOIC is synthesized using FPGA and ASIC. The maximum power overhead and maximum area overhead are 0.46% and 11.4% respectively. The behavior of OICs in MCS-OIC is modelled using a One-Shot System (OSS) model for reliability analysis. The model parameters namely, readiness, wakeup probability and start-up-strategy for OSS are mapped to the multi-core systems with OICs. Expressions for system reliability is derived. System reliability is estimated for special cases. △ Less

Submitted 11 April, 2023; originally announced April 2023.

Comments: 46 pages

arXiv:2211.14406 [pdf, other]

Exploring Temporal Information Dynamics in Spiking Neural Networks

Authors: Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Anna Hambitzer, Priyadarshini Panda

Abstract: Most existing Spiking Neural Network (SNN) works state that SNNs may utilize temporal information dynamics of spikes. However, an explicit analysis of temporal information dynamics is still missing. In this paper, we ask several important questions for providing a fundamental understanding of SNNs: What are temporal information dynamics inside SNNs? How can we measure the temporal information dyna… ▽ More Most existing Spiking Neural Network (SNN) works state that SNNs may utilize temporal information dynamics of spikes. However, an explicit analysis of temporal information dynamics is still missing. In this paper, we ask several important questions for providing a fundamental understanding of SNNs: What are temporal information dynamics inside SNNs? How can we measure the temporal information dynamics? How do the temporal information dynamics affect the overall learning performance? To answer these questions, we estimate the Fisher Information of the weights to measure the distribution of temporal information during training in an empirical manner. Surprisingly, as training goes on, Fisher information starts to concentrate in the early timesteps. After training, we observe that information becomes highly concentrated in earlier few timesteps, a phenomenon we refer to as temporal information concentration. We observe that the temporal information concentration phenomenon is a common learning feature of SNNs by conducting extensive experiments on various configurations such as architecture, dataset, optimization strategy, time constant, and timesteps. Furthermore, to reveal how temporal information concentration affects the performance of SNNs, we design a loss function to change the trend of temporal information. We find that temporal information concentration is crucial to building a robust SNN but has little effect on classification accuracy. Finally, we propose an efficient iterative pruning method based on our observation on temporal information concentration. Code is available at https://github.com/Intelligent-Computing-Lab-Yale/Exploring-Temporal-Information-Dynamics-in-Spiking-Neural-Networks. △ Less

Submitted 30 November, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

Comments: Accepted to AAAI2023

arXiv:2210.09307 [pdf, other]

doi 10.1016/j.dark.2022.101126

Constraining Anisotropic Cosmological Model in $f(\mathcal{R},\mathscr{L}_m)$ Gravity

Authors: N. S. Kavya, V. Venkatesha, Sanjay Mandal, P. K. Sahoo

Abstract: The observational evidence regarding the present cosmological aspects tells us about the presence of very little anisotropy in the universe on a large scale. Here, in this paper, we attempt to study locally rotationally symmetric (LRS) homogeneous Bianchi-I spacetime with the isotropic matter distribution. This is done within the framework of $f(\mathcal{R},\mathscr{L}_m)$ gravity. Particularly, w… ▽ More The observational evidence regarding the present cosmological aspects tells us about the presence of very little anisotropy in the universe on a large scale. Here, in this paper, we attempt to study locally rotationally symmetric (LRS) homogeneous Bianchi-I spacetime with the isotropic matter distribution. This is done within the framework of $f(\mathcal{R},\mathscr{L}_m)$ gravity. Particularly, we consider a non-linear $f(\mathcal{R},\mathscr{L}_m)$ model, $f(\mathcal{R},\mathscr{L}_m)=\dfrac{1}{2}\mathcal{R}+\mathscr{L}_m^{\,α}$. Furthermore, $ω$, the equation of state parameter, which is vital stuff in determining the present phase of the universe is constrained. To constrain the model parameters and the equation of state parameter, we use 57 Hubble data points and 1048 Pantheon supernovae type Ia data sample. And, for our statistical analysis, we use Markoc Chain Monte Carlo (MCMC) simulation. Moreover, with the help of obtained values of parameters, we measure the anisotropy parameter for our model. △ Less

Submitted 17 October, 2022; originally announced October 2022.

Comments: Physics of the Dark Universe accepted version

Journal ref: Physics of the Dark Universe 38 (2022) 101126

arXiv:2207.06847 [pdf, other]

Covy: An AI-powered Robot with a Compound Vision System for Detecting Breaches in Social Distancing

Authors: Serge Saaybi, Amjad Yousef Majid, R Venkatesha Prasad, Anis Koubaa, Chris Verhoeven

Abstract: This paper introduces a compound vision system that enables robots to localize people up to 15m away using a cheap camera. And, it proposes a robust navigation stack that combines Deep Reinforcement Learning (DRL) and a probabilistic localization method. To test the efficacy of these systems, we prototyped a low-cost mobile robot that we call Covy. Covy can be used for applications such as promoti… ▽ More This paper introduces a compound vision system that enables robots to localize people up to 15m away using a cheap camera. And, it proposes a robust navigation stack that combines Deep Reinforcement Learning (DRL) and a probabilistic localization method. To test the efficacy of these systems, we prototyped a low-cost mobile robot that we call Covy. Covy can be used for applications such as promoting social distancing during pandemics or estimating the density of a crowd. We evaluated Covy's performance through extensive sets of experiments both in simulated and realistic environments. Our results show that Covy's compound vision algorithm doubles the range of the used depth camera, and its hybrid navigation stack is more robust than a pure DRL-based one. △ Less

Submitted 23 August, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

arXiv:2207.01382 [pdf, other]

Exploring Lottery Ticket Hypothesis in Spiking Neural Networks

Authors: Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Ruokai Yin, Priyadarshini Panda

Abstract: Spiking Neural Networks (SNNs) have recently emerged as a new generation of low-power deep neural networks, which is suitable to be implemented on low-power mobile/edge devices. As such devices have limited memory storage, neural pruning on SNNs has been widely explored in recent years. Most existing SNN pruning works focus on shallow SNNs (2~6 layers), however, deeper SNNs (>16 layers) are propos… ▽ More Spiking Neural Networks (SNNs) have recently emerged as a new generation of low-power deep neural networks, which is suitable to be implemented on low-power mobile/edge devices. As such devices have limited memory storage, neural pruning on SNNs has been widely explored in recent years. Most existing SNN pruning works focus on shallow SNNs (2~6 layers), however, deeper SNNs (>16 layers) are proposed by state-of-the-art SNN works, which is difficult to be compatible with the current SNN pruning work. To scale up a pruning technique towards deep SNNs, we investigate Lottery Ticket Hypothesis (LTH) which states that dense networks contain smaller subnetworks (i.e., winning tickets) that achieve comparable performance to the dense networks. Our studies on LTH reveal that the winning tickets consistently exist in deep SNNs across various datasets and architectures, providing up to 97% sparsity without huge performance degradation. However, the iterative searching process of LTH brings a huge training computational cost when combined with the multiple timesteps of SNNs. To alleviate such heavy searching cost, we propose Early-Time (ET) ticket where we find the important weight connectivity from a smaller number of timesteps. The proposed ET ticket can be seamlessly combined with a common pruning techniques for finding winning tickets, such as Iterative Magnitude Pruning (IMP) and Early-Bird (EB) tickets. Our experiment results show that the proposed ET ticket reduces search time by up to 38% compared to IMP or EB methods. Code is available at Github. △ Less

Submitted 20 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

Comments: Accepted to European Conference on Computer Vision (ECCV) 2022

arXiv:2206.04898 [pdf, other]

doi 10.46298/cm.9695

Quasi Yamabe Solitons on 3-Dimensional Contact Metric Manifolds with Q\varphi=\varphi Q

Authors: V. Venkatesha, H. Aruna Kumara

Abstract: In this paper we initiate the study of quasi Yamabe soliton on 3-dimensional contact metric manifold with Q\varphi=\varphi Q and prove that if a 3-dimensional contact metric manifold M such that Q\varphi=\varphi Q admits a quasi Yamabe soliton with non-zero soliton vector field V being point-wise collinear with the Reeb vector field ξ, then V is a constant multiple of ξ, the scalar curvature is co… ▽ More In this paper we initiate the study of quasi Yamabe soliton on 3-dimensional contact metric manifold with Q\varphi=\varphi Q and prove that if a 3-dimensional contact metric manifold M such that Q\varphi=\varphi Q admits a quasi Yamabe soliton with non-zero soliton vector field V being point-wise collinear with the Reeb vector field ξ, then V is a constant multiple of ξ, the scalar curvature is constant and the manifold is Sasakian. Moreover, V is Killing. Finally, we prove that if M is a 3-dimensional compact contact metric manifold such that Q\varphi=\varphi Q endowed with a quasi Yamabe soliton, then either M is flat or soliton is trivial. △ Less

Submitted 10 August, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

Journal ref: Communications in Mathematics, Volume 30 (2022), Issue 1 (September 19, 2022) cm:9695

arXiv:2204.05274 [pdf, other]

doi 10.1145/3489517.3530473

MIME: Adapting a Single Neural Network for Multi-task Inference with Memory-efficient Dynamic Pruning

Authors: Abhiroop Bhattacharjee, Yeshwanth Venkatesha, Abhishek Moitra, Priyadarshini Panda

Abstract: Recent years have seen a paradigm shift towards multi-task learning. This calls for memory and energy-efficient solutions for inference in a multi-task scenario. We propose an algorithm-hardware co-design approach called MIME. MIME reuses the weight parameters of a trained parent task and learns task-specific threshold parameters for inference on multiple child tasks. We find that MIME results in… ▽ More Recent years have seen a paradigm shift towards multi-task learning. This calls for memory and energy-efficient solutions for inference in a multi-task scenario. We propose an algorithm-hardware co-design approach called MIME. MIME reuses the weight parameters of a trained parent task and learns task-specific threshold parameters for inference on multiple child tasks. We find that MIME results in highly memory-efficient DRAM storage of neural-network parameters for multiple tasks compared to conventional multi-task inference. In addition, MIME results in input-dependent dynamic neuronal pruning, thereby enabling energy-efficient inference with higher throughput on a systolic-array hardware. Our experiments with benchmark datasets (child tasks)- CIFAR10, CIFAR100, and Fashion-MNIST, show that MIME achieves ~3.48x memory-efficiency and ~2.4-3.1x energy-savings compared to conventional multi-task inference in Pipelined task mode. △ Less

Submitted 11 April, 2022; originally announced April 2022.

Comments: Accepted in Design Automation Conference (DAC), 2022

Journal ref: 59th Design Automation Conference (DAC), 2022

arXiv:2203.13321 [pdf, other]

Addressing Client Drift in Federated Continual Learning with Adaptive Optimization

Authors: Yeshwanth Venkatesha, Youngeun Kim, Hyoungseob Park, Yuhang Li, Priyadarshini Panda

Abstract: Federated learning has been extensively studied and is the prevalent method for privacy-preserving distributed learning in edge devices. Correspondingly, continual learning is an emerging field targeted towards learning multiple tasks sequentially. However, there is little attention towards additional challenges emerging when federated aggregation is performed in a continual learning system. We id… ▽ More Federated learning has been extensively studied and is the prevalent method for privacy-preserving distributed learning in edge devices. Correspondingly, continual learning is an emerging field targeted towards learning multiple tasks sequentially. However, there is little attention towards additional challenges emerging when federated aggregation is performed in a continual learning system. We identify \textit{client drift} as one of the key weaknesses that arise when vanilla federated averaging is applied in such a system, especially since each client can independently have different order of tasks. We outline a framework for performing Federated Continual Learning (FCL) by using NetTailor as a candidate continual learning approach and show the extent of the problem of client drift. We show that adaptive federated optimization can reduce the adverse impact of client drift and showcase its effectiveness on CIFAR100, MiniImagenet, and Decathlon benchmarks. Further, we provide an empirical analysis highlighting the interplay between different hyperparameters such as client and server learning rates, the number of local training iterations, and communication rounds. Finally, we evaluate our framework on useful characteristics of federated learning systems such as scalability, robustness to the skewness in clients' data distribution, and stragglers. △ Less

Submitted 24 March, 2022; originally announced March 2022.

arXiv:2203.07830 [pdf]

A Survey of fault models and fault tolerance methods for 2D bus-based multi-core systems and TSV based 3D NOC many-core systems

Authors: Shashikiran Venkatesha, Ranjani Parthasarathi

Abstract: Reliability has taken centre stage in the development of high-performance computing processors. A Surge of interest is noticeable in recent times in formulating fault and failure models, understanding failure mechanism and strategizing fault mitigation methods for improving the reliability of the system. The article presents a congregation of concepts illustrated one after the other for a better u… ▽ More Reliability has taken centre stage in the development of high-performance computing processors. A Surge of interest is noticeable in recent times in formulating fault and failure models, understanding failure mechanism and strategizing fault mitigation methods for improving the reliability of the system. The article presents a congregation of concepts illustrated one after the other for a better understanding of damages caused by radiation, relevant fault models, and effects of faults. We examine the state of art fault mitigation techniques at the logical layer for digital CMOS based design and SRAM based FPGA. CMOS SRAM structure is the same for both digital CMOS and FPGA. Understanding of resilient SRAM based FPGA is necessary for developing resilient prototypes and it facilitates a faster integration of digital CMOS designs. At the micro-architectural and architectural layer, error detection and recovery methods are discussed for bus-based multi-core systems. The Through silicon via based 3D Network on chip is the prospective solution for integrating many cores on single die. A suitable interconnection approach for petascale computing on many-core systems. The article presents an elaborate discussion on fault models, failure mechanisms, resilient 3D routers, defect tolerance methods for the TSV based 3D NOC many-core systems. Core redundancy, self-diagnosis and distributed diagnosis at the hardware level are examined for many-core systems. The article presents a gamut of fault tolerance solutions from logic level to processor core level in a multi-core and many-core scenario. △ Less

Submitted 15 March, 2022; originally announced March 2022.

Comments: An Elaborate survey on fault models and fault tolerant designs for multi-core and many-core systems

arXiv:2202.03133 [pdf, other]

Rate Coding or Direct Coding: Which One is Better for Accurate, Robust, and Energy-efficient Spiking Neural Networks?

Authors: Youngeun Kim, Hyoungseob Park, Abhishek Moitra, Abhiroop Bhattacharjee, Yeshwanth Venkatesha, Priyadarshini Panda

Abstract: Recent Spiking Neural Networks (SNNs) works focus on an image classification task, therefore various coding techniques have been proposed to convert an image into temporal binary spikes. Among them, rate coding and direct coding are regarded as prospective candidates for building a practical SNN system as they show state-of-the-art performance on large-scale datasets. Despite their usage, there is… ▽ More Recent Spiking Neural Networks (SNNs) works focus on an image classification task, therefore various coding techniques have been proposed to convert an image into temporal binary spikes. Among them, rate coding and direct coding are regarded as prospective candidates for building a practical SNN system as they show state-of-the-art performance on large-scale datasets. Despite their usage, there is little attention to comparing these two coding schemes in a fair manner. In this paper, we conduct a comprehensive analysis of the two codings from three perspectives: accuracy, adversarial robustness, and energy-efficiency. First, we compare the performance of two coding techniques with various architectures and datasets. Then, we measure the robustness of the coding techniques on two adversarial attack methods. Finally, we compare the energy-efficiency of two coding schemes on a digital hardware platform. Our results show that direct coding can achieve better accuracy especially for a small number of timesteps. In contrast, rate coding shows better robustness to adversarial attacks owing to the non-differentiable spike generation process. Rate coding also yields higher energy-efficiency than direct coding which requires multi-bit precision for the first layer. Our study explores the characteristics of two codings, which is an important design consideration for building SNNs. The code is made available at https://github.com/Intelligent-Computing-Lab-Yale/Rate-vs-Direct. △ Less

Submitted 12 April, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

Comments: Accepted to ICASSP2022

arXiv:2201.10355 [pdf, other]

Neural Architecture Search for Spiking Neural Networks

Authors: Youngeun Kim, Yuhang Li, Hyoungseob Park, Yeshwanth Venkatesha, Priyadarshini Panda

Abstract: Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent high-sparsity activation. However, most prior SNN methods use ANN-like architectures (e.g., VGG-Net or ResNet), which could provide sub-optimal performance for temporal sequence processing of binary information in SNNs. To add… ▽ More Spiking Neural Networks (SNNs) have gained huge attention as a potential energy-efficient alternative to conventional Artificial Neural Networks (ANNs) due to their inherent high-sparsity activation. However, most prior SNN methods use ANN-like architectures (e.g., VGG-Net or ResNet), which could provide sub-optimal performance for temporal sequence processing of binary information in SNNs. To address this, in this paper, we introduce a novel Neural Architecture Search (NAS) approach for finding better SNN architectures. Inspired by recent NAS approaches that find the optimal architecture from activation patterns at initialization, we select the architecture that can represent diverse spike activation patterns across different data samples without training. Moreover, to further leverage the temporal information among the spikes, we search for feed forward connections as well as backward connections (i.e., temporal feedback connections) between layers. Interestingly, SNASNet found by our search algorithm achieves higher performance with backward connections, demonstrating the importance of designing SNN architecture for suitably using temporal information. We conduct extensive experiments on three image recognition benchmarks where we show that SNASNet achieves state-of-the-art performance with significantly lower timesteps (5 timesteps). Code is available at Github. △ Less

Submitted 20 July, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

Comments: Accepted to European Conference on Computer Vision (ECCV) 2022

arXiv:2112.14952 [pdf]

A Survey of fault mitigation techniques for multi-core architectures

Authors: Shashikiran Venkatesha, Ranjani Parthasarathi

Abstract: Fault tolerance in multi-core architecture has attracted attention of research community for the past 20 years. Rapid improvements in the CMOS technology resulted in exponential growth of transistor density. It resulted in increased challenges for designing resilient multi-core architecture at the same pace. The article presents a survey of fault tolerant methods like fault detection, recovery, re… ▽ More Fault tolerance in multi-core architecture has attracted attention of research community for the past 20 years. Rapid improvements in the CMOS technology resulted in exponential growth of transistor density. It resulted in increased challenges for designing resilient multi-core architecture at the same pace. The article presents a survey of fault tolerant methods like fault detection, recovery, re-configurability and repair techniques for multi-core architectures. Salvaging at micro-architectural and architectural level are also discussed. Gamut of fault tolerant approaches discussed in this article have tangible improvements on the reliability of the multi-core architectures. Every concept in the seminal articles is examined with respect to relevant metrics like performance cost, area overhead, fault coverage, level of protection, detection latency and Mean Time To Failure. The existing literature is critically examined. New research directions in the form of new fault tolerant design alternatives for both homogeneous and heterogeneous multi-core architectures are presented. Brief on an analytical approach for fault tolerating model is suggested for Intel and AMD based modern homogeneous multi-core architecture are presented to enhance the understanding of the readers about the architecture with respect to performance degradation, memory access time and execution time. △ Less

Submitted 30 December, 2021; originally announced December 2021.

Comments: 21 pages

arXiv:2111.01634 [pdf, other]

Towards Enabling High-Five Over WiFi

Authors: Vineet Gokhale, Mohamad Eid, Kees Kroep, R. Venkatesha Prasad, Vijay Rao

Abstract: The next frontier for immersive applications is enabling sentience over the Internet. Tactile Internet (TI) envisages transporting skills by providing Ultra-Low Latency (ULL) communications for transporting touch senses. In this work, we focus our study on the first/last mile communication, where the future generation WiFi-7 is pitched as the front-runner for ULL applications. We discuss a few can… ▽ More The next frontier for immersive applications is enabling sentience over the Internet. Tactile Internet (TI) envisages transporting skills by providing Ultra-Low Latency (ULL) communications for transporting touch senses. In this work, we focus our study on the first/last mile communication, where the future generation WiFi-7 is pitched as the front-runner for ULL applications. We discuss a few candidate features of WiFi-7 and highlight its major pitfalls with respect to ULL communication. Further, through a specific implementation of WiFi-7 (vanilla WiFi-7) in our custom simulator, we demonstrate the impact of one of the pitfalls - standard practice of using jitter buffer in conjunction with frame aggregation - on TI communication. To circumvent this, we propose Non-Buffered Scheme (NoBuS) - a simple MAC layer enhancement for enabling TI applications on WiFi-7. NoBuS trades off packet loss for latency enabling swift synchronization between the master and controlled domains. Our findings reveal that employing NoBuS yields a significant improvement in RMSE of TI signals. Further, we show that the worst-case WiFi latency with NoBuS is 3.72 ms - an order of magnitude lower than vanilla WiFi-7 even under highly congested network conditions. △ Less

Submitted 2 November, 2021; originally announced November 2021.

arXiv:2110.01411 [pdf, other]

Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey

Authors: Amjad Yousef Majid, Serge Saaybi, Tomas van Rietbergen, Vincent Francois-Lavet, R Venkatesha Prasad, Chris Verhoeven

Abstract: Deep Reinforcement Learning (DRL) and Evolution Strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist. To get insights into the strengths and weaknesses of DRL versus ESs, an analysis of their respective capabilities and limitations is provided. After presenting their fundamental concepts and algorithms, a comparison i… ▽ More Deep Reinforcement Learning (DRL) and Evolution Strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist. To get insights into the strengths and weaknesses of DRL versus ESs, an analysis of their respective capabilities and limitations is provided. After presenting their fundamental concepts and algorithms, a comparison is provided on key aspects such as scalability, exploration, adaptation to dynamic environments, and multi-agent learning. Then, the benefits of hybrid algorithms that combine concepts from DRL and ESs are highlighted. Finally, to have an indication about how they compare in real-world applications, a survey of the literature for the set of applications they support is provided. △ Less

Submitted 28 September, 2021; originally announced October 2021.

arXiv:2108.10168 [pdf]

CGEMs: A Metric Model for Automatic Code Generation using GPT-3

Authors: Aishwarya Narasimhan, Krishna Prasad Agara Venkatesha Rao, Veena M B

Abstract: Today, AI technology is showing its strengths in almost every industry and walks of life. From text generation, text summarization, chatbots, NLP is being used widely. One such paradigm is automatic code generation. An AI could be generating anything; hence the output space is unconstrained. A self-driving car is driven for 100 million miles to validate its safety, but tests cannot be written to m… ▽ More Today, AI technology is showing its strengths in almost every industry and walks of life. From text generation, text summarization, chatbots, NLP is being used widely. One such paradigm is automatic code generation. An AI could be generating anything; hence the output space is unconstrained. A self-driving car is driven for 100 million miles to validate its safety, but tests cannot be written to monitor and cover an unconstrained space. One of the solutions to validate AI-generated content is to constrain the problem and convert it from abstract to realistic, and this can be accomplished by either validating the unconstrained algorithm using theoretical proofs or by using Monte-Carlo simulation methods. In this case, we use the latter approach to test/validate a statistically significant number of samples. This hypothesis of validating the AI-generated code is the main motive of this work and to know if AI-generated code is reliable, a metric model CGEMs is proposed. This is an extremely challenging task as programs can have different logic with different naming conventions, but the metrics must capture the structure and logic of the program. This is similar to the importance grammar carries in AI-based text generation, Q&A, translations, etc. The various metrics that are garnered in this work to support the evaluation of generated code are as follows: Compilation, NL description to logic conversion, number of edits needed, some of the commonly used static-code metrics and NLP metrics. These metrics are applied to 80 codes generated using OpenAI's GPT-3. Post which a Neural network is designed for binary classification (acceptable/not acceptable quality of the generated code). The inputs to this network are the values of the features obtained from the metrics. The model achieves a classification accuracy of 76.92% and an F1 score of 55.56%. XAI is augmented for model interpretability. △ Less

Submitted 23 August, 2021; originally announced August 2021.

Comments: 11 pages, 6 figures, 2 tables

arXiv:2107.08868 [pdf, other]

Energy Efficient Data Recovery from Corrupted LoRa Frames

Authors: Niloofar Yazdani, Nikolaos Kouvelas, R Venkatesha Prasad, Daniel E. Lucani

Abstract: High frame-corruption is widely observed in Long Range Wide Area Networks (LoRaWAN) due to the coexistence with other networks in ISM bands and an Aloha-like MAC layer. LoRa's Forward Error Correction (FEC) mechanism is often insufficient to retrieve corrupted data. In fact, real-life measurements show that at least one-fourth of received transmissions are corrupted. When more frames are dropped,… ▽ More High frame-corruption is widely observed in Long Range Wide Area Networks (LoRaWAN) due to the coexistence with other networks in ISM bands and an Aloha-like MAC layer. LoRa's Forward Error Correction (FEC) mechanism is often insufficient to retrieve corrupted data. In fact, real-life measurements show that at least one-fourth of received transmissions are corrupted. When more frames are dropped, LoRa nodes usually switch over to higher spreading factors (SF), thus increasing transmission times and increasing the required energy. This paper introduces ReDCoS, a novel coding technique at the application layer that improves recovery of corrupted LoRa frames, thus reducing the overall transmission time and energy invested by LoRa nodes by several-fold. ReDCoS utilizes lightweight coding techniques to pre-encode the transmitted data. Therefore, the inbuilt Cyclic Redundancy Check (CRC) that follows is computed based on an already encoded data. At the receiver, we use both the CRC and the coded data to recover data from a corrupted frame beyond the built-in Error Correcting Code (ECC). We compare the performance of ReDCoS to (I) the standard FEC of vanilla-LoRaWAN, and to (ii) RS coding applied as ECC to the data of LoRaWAN. The results indicated a 54x and 13.5x improvement of decoding ratio, respectively, when 20 data symbols were sent. Furthermore, we evaluated ReDCoS on-field using LoRa SX1261 transceivers showing that it outperformed RS-coding by factor of at least 2x (and up to 6x) in terms of the decoding ratio while consuming 38.5% less energy per correctly received transmission. △ Less

Submitted 19 July, 2021; originally announced July 2021.

Comments: 6 pages

arXiv:2107.05343 [pdf, other]

ETVO: Effectively Measuring Tactile Internet with Experimental Validation

Authors: H. J. C. Kroep, V. Gokhale, J. Verburg, R. Venkatesha Prasad

Abstract: The next frontier in communications is teleoperation -- manipulation and control of remote environments with feedback. Compared to conventional networked applications, teleoperation poses widely different requirements, ultra-low latency (ULL) is primary. Realizing ULL communication demands significant redesign of conventional networking techniques, and the network infrastructure envisioned for ach… ▽ More The next frontier in communications is teleoperation -- manipulation and control of remote environments with feedback. Compared to conventional networked applications, teleoperation poses widely different requirements, ultra-low latency (ULL) is primary. Realizing ULL communication demands significant redesign of conventional networking techniques, and the network infrastructure envisioned for achieving this is termed as Tactile Internet (TI). The design of the network infrastructure and meaningful performance metrics are crucial for seamless TI communication. However, existing performance metrics fall severely short of comprehensively characterizing TI performance. We take the first step towards bridging this gap. We take Dynamic Time Warping(DTW) as the basis of our work and identify necessary changes for characterizing TI performance. Through substantial refinements to DTW, we design Effective Time- and Value-Offset (ETVO) -- a new method for measuring the fine-grained performance of TI systems. Through an in-depth objective analysis, we demonstrate the improvements of ETVO over DTW. Through human-in-the-loop subjective experiments, we demonstrate how and why existing QoS and QoE methods fall short of estimating the TI session performance accurately. Using subjective experiments, we demonstrate the behavior of the proposed metrics, their ability to match theoretically derived performance, and finally their ability to reflect user satisfaction in a practical setting. The results are highly encouraging. △ Less

Submitted 12 July, 2021; originally announced July 2021.

Comments: arXiv admin note: text overlap with arXiv:2001.01770

arXiv:2106.06579 [pdf, other]

doi 10.1109/TSP.2021.3121632

Federated Learning with Spiking Neural Networks

Authors: Yeshwanth Venkatesha, Youngeun Kim, Leandros Tassiulas, Priyadarshini Panda

Abstract: As neural networks get widespread adoption in resource-constrained embedded devices, there is a growing need for low-power neural systems. Spiking Neural Networks (SNNs)are emerging to be an energy-efficient alternative to the traditional Artificial Neural Networks (ANNs) which are known to be computationally intensive. From an application perspective, as federated learning involves multiple energ… ▽ More As neural networks get widespread adoption in resource-constrained embedded devices, there is a growing need for low-power neural systems. Spiking Neural Networks (SNNs)are emerging to be an energy-efficient alternative to the traditional Artificial Neural Networks (ANNs) which are known to be computationally intensive. From an application perspective, as federated learning involves multiple energy-constrained devices, there is a huge scope to leverage energy efficiency provided by SNNs. Despite its importance, there has been little attention on training SNNs on a large-scale distributed system like federated learning. In this paper, we bring SNNs to a more realistic federated learning scenario. Specifically, we propose a federated learning framework for decentralized and privacy-preserving training of SNNs. To validate the proposed federated learning framework, we experimentally evaluate the advantages of SNNs on various aspects of federated learning with CIFAR10 and CIFAR100 benchmarks. We observe that SNNs outperform ANNs in terms of overall accuracy by over 15% when the data is distributed across a large number of clients in the federation while providing up to5.3x energy efficiency. In addition to efficiency, we also analyze the sensitivity of the proposed federated SNN framework to data distribution among the clients, stragglers, and gradient noise and perform a comprehensive comparison with ANNs. △ Less

Submitted 11 June, 2021; originally announced June 2021.

arXiv:2104.03414 [pdf, other]

PrivateSNN: Privacy-Preserving Spiking Neural Networks

Authors: Youngeun Kim, Yeshwanth Venkatesha, Priyadarshini Panda

Abstract: How can we bring both privacy and energy-efficiency to a neural system? In this paper, we propose PrivateSNN, which aims to build low-power Spiking Neural Networks (SNNs) from a pre-trained ANN model without leaking sensitive information contained in a dataset. Here, we tackle two types of leakage problems: 1) Data leakage is caused when the networks access real training data during an ANN-SNN con… ▽ More How can we bring both privacy and energy-efficiency to a neural system? In this paper, we propose PrivateSNN, which aims to build low-power Spiking Neural Networks (SNNs) from a pre-trained ANN model without leaking sensitive information contained in a dataset. Here, we tackle two types of leakage problems: 1) Data leakage is caused when the networks access real training data during an ANN-SNN conversion process. 2) Class leakage is caused when class-related features can be reconstructed from network parameters. In order to address the data leakage issue, we generate synthetic images from the pre-trained ANNs and convert ANNs to SNNs using the generated images. However, converted SNNs remain vulnerable to class leakage since the weight parameters have the same (or scaled) value with respect to ANN parameters. Therefore, we encrypt SNN weights by training SNNs with a temporal spike-based learning rule. Updating weight parameters with temporal data makes SNNs difficult to be interpreted in the spatial domain. We observe that the encrypted PrivateSNN eliminates data and class leakage issues with a slight performance drop (less than ~2) and significant energy-efficiency gain (about 55x) compared to the standard ANN. We conduct extensive experiments on various datasets including CIFAR10, CIFAR100, and TinyImageNet, highlighting the importance of privacy-preserving SNN training. △ Less

Submitted 21 May, 2022; v1 submitted 7 April, 2021; originally announced April 2021.

Comments: Accepted to AAAI2022

arXiv:2102.00702 [pdf, other]

FEEL: Fast, Energy-Efficient Localization for Autonomous Indoor Vehicles

Authors: Vineet Gokhale, Gerardo Moyers Barrera, R. Venkatesha Prasad

Abstract: Autonomous vehicles have created a sensation in both outdoor and indoor applications. The famous indoor use-case is process automation inside a warehouse using Autonomous Indoor Vehicles (AIV). These vehicles need to locate themselves not only with an accuracy of a few centimetres but also within a few milliseconds in an energy-efficient manner. Due to these challenges, localization is a holy grai… ▽ More Autonomous vehicles have created a sensation in both outdoor and indoor applications. The famous indoor use-case is process automation inside a warehouse using Autonomous Indoor Vehicles (AIV). These vehicles need to locate themselves not only with an accuracy of a few centimetres but also within a few milliseconds in an energy-efficient manner. Due to these challenges, localization is a holy grail. In this paper, we propose FEEL - an indoor localization system that uses a fusion of three low-energy sensors: IMU, UWB, and radar. We provide detailed software and hardware architecture of FEEL. Further, we propose Adaptive Sensing Algorithm (ASA) for opportunistically minimizing energy consumption of FEEL by adjusting the sensing frequency to the dynamics of the physical environment. Our extensive performance evaluation over diverse test settings reveal that FEEL provides a localization accuracy of <7cm with ultra-low latency of around 3ms. Further, ASA yields up to 20% energy saving with only a marginal trade-off in accuracy. △ Less

Submitted 1 February, 2021; originally announced February 2021.

arXiv:2101.04354 [pdf, other]

Activation Density based Mixed-Precision Quantization for Energy Efficient Neural Networks

Authors: Karina Vasquez, Yeshwanth Venkatesha, Abhiroop Bhattacharjee, Abhishek Moitra, Priyadarshini Panda

Abstract: As neural networks gain widespread adoption in embedded devices, there is a need for model compression techniques to facilitate deployment in resource-constrained environments. Quantization is one of the go-to methods yielding state-of-the-art model compression. Most approaches take a fully trained model, apply different heuristics to determine the optimal bit-precision for different layers of the… ▽ More As neural networks gain widespread adoption in embedded devices, there is a need for model compression techniques to facilitate deployment in resource-constrained environments. Quantization is one of the go-to methods yielding state-of-the-art model compression. Most approaches take a fully trained model, apply different heuristics to determine the optimal bit-precision for different layers of the network, and retrain the network to regain any drop in accuracy. Based on Activation Density (AD)-the proportion of non-zero activations in a layer-we propose an in-training quantization method. Our method calculates bit-width for each layer during training yielding a mixed precision model with competitive accuracy. Since we train lower precision models during training, our approach yields the final quantized model at lower training complexity and also eliminates the need for re-training. We run experiments on benchmark datasets like CIFAR-10, CIFAR-100, TinyImagenet on VGG19/ResNet18 architectures and report the accuracy and energy estimates for the same. We achieve ~4.5x benefit in terms of estimated multiply-and-accumulate (MAC) reduction while reducing the training complexity by 50% in our experiments. To further evaluate the energy benefits of our proposed method, we develop a mixed-precision scalable Process In Memory (PIM) hardware accelerator platform. The hardware platform incorporates shift-add functionality for handling multi-bit precision neural network models. Evaluating the quantized models obtained with our proposed method on the PIM platform yields ~5x energy reduction compared to 16-bit models. Additionally, we find that integrating AD based quantization with AD based pruning (both conducted during training) yields up to ~198x and ~44x energy reductions for VGG19 and ResNet18 architectures respectively on PIM platform compared to baseline 16-bit precision, unpruned models. △ Less

Submitted 12 January, 2021; originally announced January 2021.

Comments: Published in Design, Automation and Test in Europe (DATE) conference

arXiv:2003.11920 [pdf, other]

A simple Stochastic SIR model for COVID 19 Infection Dynamics for Karnataka: Learning from Europe

Authors: Ashutosh Simha, R. Venkatesha Prasad, Sujay Narayana

Abstract: In this short note we model the region-wise trends of the evolution to COVID-19 infections using a stochastic SIR model. The SIR dynamics are expressed using \textit{Itô-stochastic differential equations}. We first derive the parameters of the model from the available daily data from European regions based on a 24-day history of infections, recoveries and deaths. The derived parameters have been a… ▽ More In this short note we model the region-wise trends of the evolution to COVID-19 infections using a stochastic SIR model. The SIR dynamics are expressed using \textit{Itô-stochastic differential equations}. We first derive the parameters of the model from the available daily data from European regions based on a 24-day history of infections, recoveries and deaths. The derived parameters have been aggregated to project future trends for the Indian subcontinent, which is currently at an early stage in the infection cycle. The projections are meant to serve as a guideline for strategizing the socio-political counter measures to mitigate COVID-19. △ Less

Submitted 16 April, 2020; v1 submitted 26 March, 2020; originally announced March 2020.

arXiv:2003.00786 [pdf, ps, other]

doi 10.1142/S0219887820501054

Riemann solitons and almost Riemann solitons on almost Kenmotsu manifolds

Authors: V. Venkatesha, H. Aruna Kumara, Devaraja Mallesha Naik

Abstract: The aim of this article is to study the Riemann soliton and gradient almost Riemann soliton on certain class of almost Kenmotsu manifolds. Also some suitable examples of Kenmotsu and $(κ,μ)'$-almost Kenmotsu manifolds are constructed to justify our results. The aim of this article is to study the Riemann soliton and gradient almost Riemann soliton on certain class of almost Kenmotsu manifolds. Also some suitable examples of Kenmotsu and $(κ,μ)'$-almost Kenmotsu manifolds are constructed to justify our results. △ Less

Submitted 2 March, 2020; originally announced March 2020.

MSC Class: 53C25; 53C15; 53D15

arXiv:2002.02949 [pdf, other]

Activation Density driven Energy-Efficient Pruning in Training

Authors: Timothy Foldy-Porto, Yeshwanth Venkatesha, Priyadarshini Panda

Abstract: Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original with comparable degrees of accuracy. Typical pruning methods require large, fully trained networks as a starting point from which they perform a time-intensive iterative pruning and retraining procedure to regain the original accuracy. We propose a novel pruning method that prune… ▽ More Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original with comparable degrees of accuracy. Typical pruning methods require large, fully trained networks as a starting point from which they perform a time-intensive iterative pruning and retraining procedure to regain the original accuracy. We propose a novel pruning method that prunes a network real-time during training, reducing the overall training time to achieve an efficient compressed network. We introduce an activation density based analysis to identify the optimal relative sizing or compression for each layer of the network. Our method is architecture agnostic, allowing it to be employed on a wide variety of systems. For VGG-19 and ResNet18 on CIFAR-10, CIFAR-100, and TinyImageNet, we obtain exceedingly sparse networks (up to $200 \times$ reduction in parameters and over $60 \times$ reduction in inference compute operations in the best case) with accuracy comparable to the baseline network. By reducing the network size periodically during training, we achieve total training times that are shorter than those of previously proposed pruning methods. Furthermore, training compressed networks at different epochs with our proposed method yields considerable reduction in training compute complexity ($1.6\times$ to $3.2\times$ lower) at near iso-accuracy as compared to a baseline network trained entirely from scratch. △ Less

Submitted 12 October, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

Comments: 8 pages, 5 figures, 4 tables (Accepted in ICPR 2020)

Showing 1–50 of 73 results for author: Venkatesha