Search | arXiv e-print repository

FAV-NSS: An HIL Framework for Accelerating Validation of Automotive Network Security Strategies

Authors: Changhong Li, Shashwat Khandelwal, Shreejith Shanker

Abstract: Complex electronic control unit (ECU) architectures, software models and in-vehicle networks are consistently improving safety and comfort functions in modern vehicles. However, the extended functionality and increased connectivity introduce new security risks and vulnerabilities that can be exploited on legacy automotive networks such as the controller area network (CAN). With the rising complexi… ▽ More Complex electronic control unit (ECU) architectures, software models and in-vehicle networks are consistently improving safety and comfort functions in modern vehicles. However, the extended functionality and increased connectivity introduce new security risks and vulnerabilities that can be exploited on legacy automotive networks such as the controller area network (CAN). With the rising complexity of vehicular systems and attack vectors, the need for a flexible hardware-in-the-loop (HIL) test fixture that can inject attacks and validate the performance of countermeasures in near-real-world conditions in real time is vital. This paper presents an FPGA-based HIL framework tailored towards validating network security approaches (IDS, IPS) and smart integration strategies of such capabilities for an automotive CAN bus. FAV-NSS replicates an actual vehicular system environment with functional ECUs and network infrastructure on an FPGA, allowing functional validation of IDS/IPS algorithms, accelerator designs and integration schemes (software task on ECU, dedicated accelerator). To show the efficacy of FAV-NSS, we evaluate an IDS accelerator integration problem, both as a traditional coupled accelerator (to the ECU), and secondly close to the CAN controller (mimicking an extended CAN controller). We show that the latter strategy can be fully validated by our framework, which would otherwise require integration of specialised CAN modules into otherwise standard HIL fixtures with ability to instrument internal signals for characterising timing performance. The tests demonstrate a promising latency reduction of 6.3x when compared to the traditional coupled accelerator. Our case study demonstrates the potential of FAV-NSS for accelerating the optimisation, integration and verification of smart ECUs and communication controllers in current and future vehicular systems. △ Less

Submitted 21 May, 2025; originally announced May 2025.

Comments: Accepted to 2025 IEEE 36th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

arXiv:2505.14924 [pdf, ps, other]

SecCAN: An Extended CAN Controller with Embedded Intrusion Detection

Authors: Shashwat Khandelwal, Shreejith Shanker

Abstract: Recent research has highlighted the vulnerability of in-vehicle network protocols such as controller area networks (CAN) and proposed machine learning-based intrusion detection systems (IDSs) as an effective mitigation technique. However, their efficient integration into vehicular architecture is non-trivial, with existing methods relying on electronic control units (ECUs)-coupled IDS accelerators… ▽ More Recent research has highlighted the vulnerability of in-vehicle network protocols such as controller area networks (CAN) and proposed machine learning-based intrusion detection systems (IDSs) as an effective mitigation technique. However, their efficient integration into vehicular architecture is non-trivial, with existing methods relying on electronic control units (ECUs)-coupled IDS accelerators or dedicated ECUs as IDS accelerators. Here, initiating IDS requires complete reception of a CAN message from the controller, incurring data movement and software overheads. In this paper, we present SecCAN, a novel CAN controller architecture that embeds IDS capability within the datapath of the controller. This integration allows IDS to tap messages directly from within the CAN controller as they are received from the bus, removing overheads incurred by existing ML-based IDSs. A custom-quantised machine-learning accelerator is developed as the IDS engine and embedded into SecCAN's receive data path, with optimisations to overlap the IDS inference with the protocol's reception window. We implement SecCAN on AMD XCZU7EV FPGA to quantify its performance and benefits in hardware, using multiple attack datasets. We show that SecCAN can completely hide the IDS latency within the CAN reception window for all CAN packet sizes and detect multiple attacks with state-of-the-art accuracy with zero software overheads on the ECU and low energy overhead (73.7 uJ per message) for IDS inference. Also, SecCAN incurs limited resource overhead compared to a standard CAN controller (< 30% LUT, < 1% FF), making it ideally suited for automotive deployment. △ Less

Submitted 20 May, 2025; originally announced May 2025.

Comments: 4 pages, 3 figures, 3 tables, Accepted in IEEE Embedded Systems Letters (https://ieee-ceda.org/publication/esl)

arXiv:2504.10856 [pdf, ps, other]

On five-dimensional curvature squared supergravity and holography

Authors: Gregory Gold, Peng-Ju Hu, Jessica Hutomo, Saurish Khandelwal, Mehmet Ozkan, Yi Pang, Gabriele Tartaglino-Mazzucchelli

Abstract: In this work, we report the recent progress in obtaining new curvature-squared invariants in 5D, N=1 gauged minimal supergravity. We exhibit the structure of various composite multiplets that are pivotal in the construction. We also present the form of the gauged Riemann-squared and Gauss-Bonnet superinvariants in a dilaton-Weyl multiplet. As a first application of the new curvature squared invari… ▽ More In this work, we report the recent progress in obtaining new curvature-squared invariants in 5D, N=1 gauged minimal supergravity. We exhibit the structure of various composite multiplets that are pivotal in the construction. We also present the form of the gauged Riemann-squared and Gauss-Bonnet superinvariants in a dilaton-Weyl multiplet. As a first application of the new curvature squared invariants, we compute their corrections to holographic central charges and the Euclidean action of supersymmetric charged rotating black holes, exhibiting exact matching between the gravity and CFT results. △ Less

Submitted 15 April, 2025; originally announced April 2025.

Comments: 25 pages, contribution to the proceedings for the MATRIX workshop "New Deformations of Quantum Field and Gravity Theories"

arXiv:2503.00956 [pdf, other]

Simulating quantum instruments with projective measurements and quantum post-processing

Authors: Shishir Khandelwal, Armin Tavakoli

Abstract: Quantum instruments describe both the classical outcome and the updated state associated with a quantum measurement. We ask whether these processes can be simulated using only a natural subset of resources, namely projective measurements on the system and quantum processing of the post-measurement states. We show that the simulability of instruments can be connected to an entanglement classificati… ▽ More Quantum instruments describe both the classical outcome and the updated state associated with a quantum measurement. We ask whether these processes can be simulated using only a natural subset of resources, namely projective measurements on the system and quantum processing of the post-measurement states. We show that the simulability of instruments can be connected to an entanglement classification problem. This leads to a computationally efficient necessary condition for simulation of generic instruments and to a complete characterisation for qubits. We use this to address relevant quantum information tasks, namely (i) the noise-tolerance of standard qubit unsharp measurements, (ii) non-projective advantages in information-disturbance trade-offs, and (iii) increased sequential Bell inequality violations under projective measurements. Moreover, we consider also $d$-dimensional Lüders instruments that correspond to weak versions of standard basis measurements and show that for large $d$ these can permit scalable noise-advantages over projective implementations. △ Less

Submitted 26 May, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

Comments: 6+13 pages

arXiv:2410.20788 [pdf, other]

SCULPT: Systematic Tuning of Long Prompts

Authors: Shanu Kumar, Akhila Yesantarao Venkata, Shubhanshu Khandelwal, Bishal Santra, Parag Agrawal, Manish Gupta

Abstract: Prompt optimization is essential for effective utilization of large language models (LLMs) across diverse tasks. While existing optimization methods are effective in optimizing short prompts, they struggle with longer, more complex ones, often risking information loss and being sensitive to small perturbations. To address these challenges, we propose SCULPT (Systematic Tuning of Long Prompts), a f… ▽ More Prompt optimization is essential for effective utilization of large language models (LLMs) across diverse tasks. While existing optimization methods are effective in optimizing short prompts, they struggle with longer, more complex ones, often risking information loss and being sensitive to small perturbations. To address these challenges, we propose SCULPT (Systematic Tuning of Long Prompts), a framework that treats prompt optimization as a hierarchical tree refinement problem. SCULPT represents prompts as tree structures, enabling targeted modifications while preserving contextual integrity. It employs a Critic-Actor framework that generates reflections and applies actions to refine the prompt. Evaluations demonstrate SCULPT's effectiveness on long prompts, its robustness to adversarial perturbations, and its ability to generate high-performing prompts even without any initial human-written prompt. Compared to existing state of the art methods, SCULPT consistently improves LLM performance by preserving essential task information while applying structured refinements. Both qualitative and quantitative analyses show that SCULPT produces more stable and interpretable prompt modifications, ensuring better generalization across tasks. △ Less

Submitted 23 March, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

arXiv:2409.19034 [pdf, ps, other]

On 4D, ${\mathcal{N}=2}$ deformed vector multiplets and partial supersymmetry breaking in off-shell supergravity

Authors: Gregory Gold, Saurish Khandelwal, Gabriele Tartaglino-Mazzucchelli

Abstract: Electric and magnetic Fayet-Ilioupulous (FI) terms are used to engineer partial breaking of ${\mathcal{N}=2}$ global supersymmetry for systems of vector multiplets. The magnetic FI term induces a deformation of the off-shell field transformations associated with an imaginary constant shift of the triplet of auxiliary fields of the vector multiplet. In this paper, we elaborate on the deformation of… ▽ More Electric and magnetic Fayet-Ilioupulous (FI) terms are used to engineer partial breaking of ${\mathcal{N}=2}$ global supersymmetry for systems of vector multiplets. The magnetic FI term induces a deformation of the off-shell field transformations associated with an imaginary constant shift of the triplet of auxiliary fields of the vector multiplet. In this paper, we elaborate on the deformation of off-shell vector multiplets in supergravity, both in components and superspace. In a superconformal framework, the deformations are associated with (composite) linear multiplets. We engineer an off-shell model that exhibits partial local supersymmetry breaking with a zero cosmological constant. This is based on the hyper-dilaton Weyl multiplet introduced in arXiv:2203.12203, coupled to the SU(1,1)/U(1) special-Kähler sigma model in a symplectic frame admitting a holomorphic prepotential, with one compensating and one physical vector multiplet, the latter magnetically deformed. △ Less

Submitted 27 September, 2024; originally announced September 2024.

Comments: 77 pages (paper) + 14 pages (supplementary file)

arXiv:2409.08100 [pdf, other]

Emergent Liouvillian exceptional points from exact principles

Authors: Shishir Khandelwal, Gianmichele Blasi

Abstract: Recent years have seen a surge of interest in exceptional points in open quantum systems. The natural approach in this area has been the use of Markovian master equations. While the resulting Liouvillian EPs have been seen in a variety of systems and have been associated to numerous exotic effects, it is an open question whether such degeneracies and their peculiarities can persist beyond the vali… ▽ More Recent years have seen a surge of interest in exceptional points in open quantum systems. The natural approach in this area has been the use of Markovian master equations. While the resulting Liouvillian EPs have been seen in a variety of systems and have been associated to numerous exotic effects, it is an open question whether such degeneracies and their peculiarities can persist beyond the validity of master equations. In this work, taking the example of a dissipative double-quantum-dot system, we show that Heisenberg equations for the system exhibit the same EPs as the corresponding master equations. To highlight the importance of this finding, we prove that the paradigmatic property associated to EPs - critical damping, persists well beyond the validity of master equations. Our results demonstrate that Liouvillian EPs can arise from underlying fundamental exact principles, rather than merely as a consequence of approximations involved in deriving master equations. △ Less

Submitted 1 October, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

Comments: 6+5 pages, 3 figures

arXiv:2407.17647 [pdf, other]

An Energy-Efficient Artefact Detection Accelerator on FPGAs for Hyper-Spectral Satellite Imagery

Authors: Cornell Castelino, Shashwat Khandelwal, Shanker Shreejith, Sharatchandra Varma Bogaraju

Abstract: Hyper-Spectral Imaging (HSI) is a crucial technique for analysing remote sensing data acquired from Earth observation satellites. The rich spatial and spectral information obtained through HSI allows for better characterisation and exploration of the Earth's surface over traditional techniques like RGB and Multi-Spectral imaging on the downlinked image data at ground stations. Sometimes, these ima… ▽ More Hyper-Spectral Imaging (HSI) is a crucial technique for analysing remote sensing data acquired from Earth observation satellites. The rich spatial and spectral information obtained through HSI allows for better characterisation and exploration of the Earth's surface over traditional techniques like RGB and Multi-Spectral imaging on the downlinked image data at ground stations. Sometimes, these images do not contain meaningful information due to the presence of clouds or other artefacts, limiting their usefulness. Transmission of such artefact HSI images leads to wasteful use of already scarce energy and time costs required for communication. While detecting such artefacts before transmitting the HSI image is desirable, the computational complexity of these algorithms and the limited power budget on satellites (especially CubeSats) are key constraints. This paper presents an unsupervised learning-based convolutional autoencoder (CAE) model for artefact identification of acquired HSI images at the satellite and a deployment architecture on AMD's Zynq Ultrascale FPGAs. The model is trained and tested on widely used HSI image datasets: Indian Pines, Salinas Valley, the University of Pavia and the Kennedy Space Center. For deployment, the model is quantised to 8-bit precision, fine-tuned using the Vitis-AI framework and integrated as a subordinate accelerator using AMD's Deep-Learning Processing Units (DPU) instance on the Zynq device. Our tests show that the model can process each spectral band in an HSI image in 4 ms, 2.6x better than INT8 inference on Nvidia's Jetson platform & 1.27x better than SOTA artefact detectors. Our model also achieves an f1-score of 92.8% and FPR of 0% across the dataset, while consuming 21.52 mJ per HSI image, 3.6x better than INT8 Jetson inference & 7.5x better than SOTA artefact detectors, making it a viable architecture for deployment in CubeSats. △ Less

Submitted 24 July, 2024; originally announced July 2024.

Journal ref: 27th Euromicro Conference on Digital System Design (DSD), 2024

arXiv:2406.19687 [pdf, ps, other]

Supergravity Component Reduction with Computer Algebra

Authors: Gregory Gold, Saurish Khandelwal, Gabriele Tartaglino-Mazzucchelli

Abstract: Using an interplay between superspace and component superconformal tensor calculus techniques, recently, the off-shell construction of the supersymmetric extension of the three independent curvature-squared invariants for minimal (N = 1) gauged supergravity in five dimensions (5D) was completed. A key ingredient in obtaining these results is the implementation of computer algebra algorithms. In th… ▽ More Using an interplay between superspace and component superconformal tensor calculus techniques, recently, the off-shell construction of the supersymmetric extension of the three independent curvature-squared invariants for minimal (N = 1) gauged supergravity in five dimensions (5D) was completed. A key ingredient in obtaining these results is the implementation of computer algebra algorithms. In this report, we describe how to use cadabra to systematically study component reduction from superspace with computer algebra in the case of 5D, N = 1 supergravity. △ Less

Submitted 28 June, 2024; originally announced June 2024.

Comments: 22 pages; contribution to the proceedings of the MATRIX program "New Deformations of Quantum Field and Gravity Theories", 22 Jan - 2 Feb 2024

arXiv:2404.15144 [pdf, other]

doi 10.3390/e26060497

Finite-time dynamics of an entanglement engine: current, fluctuations and kinetic uncertainty relations

Authors: Jeanne Bourgeois, Gianmichele Blasi, Shishir Khandelwal, Géraldine Haack

Abstract: Entanglement engines are autonomous quantum thermal machines designed to generate entanglement from the presence of a particle current flowing through the device. In this work, we investigate the functioning of a two-qubit entanglement engine beyond the steady-state regime. Within a master equation approach, we derive the time-dependent state, the particle current, as well as the associated curren… ▽ More Entanglement engines are autonomous quantum thermal machines designed to generate entanglement from the presence of a particle current flowing through the device. In this work, we investigate the functioning of a two-qubit entanglement engine beyond the steady-state regime. Within a master equation approach, we derive the time-dependent state, the particle current, as well as the associated current correlation functions. Our findings establish a direct connection between coherence and internal current, elucidating the existence of a critical current that serves as an indicator for entanglement in the steady state. We then apply our results to investigate kinetic uncertainty relations (KURs) at finite times. We demonstrate that there are more than one possible definitions for KURs at finite times. While the two definitions agree in the steady-state regime, they lead to different parameter's ranges for violating KUR at finite times. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Journal ref: Entropy 2024, 26(6), 497

arXiv:2402.12407 [pdf, other]

Accelerating local laplacian filters on FPGAs

Authors: Shashwat Khandelwal, Ziaul Choudhury, Shashwat Shrivastava, Suresh Purini

Abstract: Images when processed using various enhancement techniques often lead to edge degradation and other unwanted artifacts such as halos. These artifacts pose a major problem for photographic applications where they can denude the quality of an image. There is a plethora of edge-aware techniques proposed in the field of image processing. However, these require the application of complex optimization o… ▽ More Images when processed using various enhancement techniques often lead to edge degradation and other unwanted artifacts such as halos. These artifacts pose a major problem for photographic applications where they can denude the quality of an image. There is a plethora of edge-aware techniques proposed in the field of image processing. However, these require the application of complex optimization or post-processing methods. Local Laplacian Filtering is an edge-aware image processing technique that involves the construction of simple Gaussian and Laplacian pyramids. This technique can be successfully applied for detail smoothing, detail enhancement, tone mapping and inverse tone mapping of an image while keeping it artifact-free. The problem though with this approach is that it is computationally expensive. Hence, parallelization schemes using multi-core CPUs and GPUs have been proposed. As is well known, they are not power-efficient, and a well-designed hardware architecture on an FPGA can do better on the performance per watt metric. In this paper, we propose a hardware accelerator, which exploits fully the available parallelism in the Local Laplacian Filtering algorithm, while minimizing the utilization of on-chip FPGA resources. On Virtex-7 FPGA, we obtain a 7.5x speed-up to process a 1 MB image when compared to an optimized baseline CPU implementation. To the best of our knowledge, we are not aware of any other hardware accelerators proposed in the research literature for the Local Laplacian Filtering problem. △ Less

Submitted 18 February, 2024; originally announced February 2024.

Comments: 6 pages, 5 figures, 2 tables

Journal ref: 10.1109/FPL50879.2020.00028

arXiv:2402.06655 [pdf, other]

Adversarial Text Purification: A Large Language Model Approach for Defense

Authors: Raha Moraffah, Shubh Khandelwal, Amrita Bhattacharjee, Huan Liu

Abstract: Adversarial purification is a defense mechanism for safeguarding classifiers against adversarial attacks without knowing the type of attacks or training of the classifier. These techniques characterize and eliminate adversarial perturbations from the attacked inputs, aiming to restore purified samples that retain similarity to the initially attacked ones and are correctly classified by the classif… ▽ More Adversarial purification is a defense mechanism for safeguarding classifiers against adversarial attacks without knowing the type of attacks or training of the classifier. These techniques characterize and eliminate adversarial perturbations from the attacked inputs, aiming to restore purified samples that retain similarity to the initially attacked ones and are correctly classified by the classifier. Due to the inherent challenges associated with characterizing noise perturbations for discrete inputs, adversarial text purification has been relatively unexplored. In this paper, we investigate the effectiveness of adversarial purification methods in defending text classifiers. We propose a novel adversarial text purification that harnesses the generative capabilities of Large Language Models (LLMs) to purify adversarial text without the need to explicitly characterize the discrete noise perturbations. We utilize prompt engineering to exploit LLMs for recovering the purified examples for given adversarial examples such that they are semantically similar and correctly classified. Our proposed method demonstrates remarkable performance over various classifiers, improving their accuracy under the attack by over 65% on average. △ Less

Submitted 4 February, 2024; originally announced February 2024.

Comments: PAKDD 2024

arXiv:2401.12240 [pdf, other]

doi 10.23919/DATE56975.2023.10137016

Quantised Neural Network Accelerators for Low-Power IDS in Automotive Networks

Authors: Shashwat Khandelwal, Anneliese Walsh, Shanker Shreejith

Abstract: In this paper, we explore low-power custom quantised Multi-Layer Perceptrons (MLPs) as an Intrusion Detection System (IDS) for automotive controller area network (CAN). We utilise the FINN framework from AMD/Xilinx to quantise, train and generate hardware IP of our MLP to detect denial of service (DoS) and fuzzying attacks on CAN network, using ZCU104 (XCZU7EV) FPGA as our target ECU architecture… ▽ More In this paper, we explore low-power custom quantised Multi-Layer Perceptrons (MLPs) as an Intrusion Detection System (IDS) for automotive controller area network (CAN). We utilise the FINN framework from AMD/Xilinx to quantise, train and generate hardware IP of our MLP to detect denial of service (DoS) and fuzzying attacks on CAN network, using ZCU104 (XCZU7EV) FPGA as our target ECU architecture with integrated IDS capabilities. Our approach achieves significant improvements in latency (0.12 ms per-message processing latency) and inference energy consumption (0.25 mJ per inference) while achieving similar classification performance as state-of-the-art approaches in the literature. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 2 pages, 1 figure, 2 tables. arXiv admin note: text overlap with arXiv:2401.11030

Journal ref: 2023 Design, Automation & Test in Europe Conference & Exhibition (DATE)

arXiv:2401.12234 [pdf, other]

doi 10.1109/ICFPT56656.2022.9974508

A Lightweight FPGA-based IDS-ECU Architecture for Automotive CAN

Authors: Shashwat Khandelwal, Shreejith Shanker

Abstract: Recent years have seen an exponential rise in complex software-driven functionality in vehicles, leading to a rising number of electronic control units (ECUs), network capabilities, and interfaces. These expanded capabilities also bring-in new planes of vulnerabilities making intrusion detection and management a critical capability; however, this can often result in more ECUs and network elements… ▽ More Recent years have seen an exponential rise in complex software-driven functionality in vehicles, leading to a rising number of electronic control units (ECUs), network capabilities, and interfaces. These expanded capabilities also bring-in new planes of vulnerabilities making intrusion detection and management a critical capability; however, this can often result in more ECUs and network elements due to the high computational overheads. In this paper, we present a consolidated ECU architecture incorporating an Intrusion Detection System (IDS) for Automotive Controller Area Network (CAN) along with traditional ECU functionality on an off-the-shelf hybrid FPGA device, with near-zero overhead for the ECU functionality. We propose two quantised multi-layer perceptrons (QMLP's) as isolated IDSs for detecting a range of attack vectors including Denial-of-Service, Fuzzing and Spoofing, which are accelerated using off-the-shelf deep-learning processing unit (DPU) IP block from Xilinx, operating fully transparently to the software on the ECU. The proposed models achieve the state-of-the-art classification accuracy for all the attacks, while we observed a 15x reduction in power consumption when compared against the GPU-based implementation of the same models quantised using Nvidia libraries. We also achieved a 2.3x speed up in per-message processing latency (at 0.24 ms from the arrival of a CAN message) to meet the strict end-to-end latency on critical CAN nodes and a 2.6x reduction in power consumption for inference when compared to the state-of-the-art IDS models on embedded IDS and loosely coupled IDS accelerators (GPUs) discussed in the literature. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 9 pages, 3 figures, 11 tables

Journal ref: 2022 International Conference on Field-Programmable Technology (ICFPT)

arXiv:2401.11030 [pdf, other]

doi 10.1109/FPL60245.2023.00040

Exploring Highly Quantised Neural Networks for Intrusion Detection in Automotive CAN

Authors: Shashwat Khandelwal, Shreejith Shanker

Abstract: Vehicles today comprise intelligent systems like connected autonomous driving and advanced driving assistance systems (ADAS) to enhance the driving experience, which is enabled through increased connectivity to infrastructure and fusion of information from different sensing modes. However, the rising connectivity coupled with the legacy network architecture within vehicles can be exploited for lau… ▽ More Vehicles today comprise intelligent systems like connected autonomous driving and advanced driving assistance systems (ADAS) to enhance the driving experience, which is enabled through increased connectivity to infrastructure and fusion of information from different sensing modes. However, the rising connectivity coupled with the legacy network architecture within vehicles can be exploited for launching active and passive attacks on critical vehicle systems and directly affecting the safety of passengers. Machine learning-based intrusion detection models have been shown to successfully detect multiple targeted attack vectors in recent literature, whose deployments are enabled through quantised neural networks targeting low-power platforms. Multiple models are often required to simultaneously detect multiple attack vectors, increasing the area, (resource) cost, and energy consumption. In this paper, we present a case for utilising custom-quantised MLP's (CQMLP) as a multi-class classification model, capable of detecting multiple attacks from the benign flow of controller area network (CAN) messages. The specific quantisation and neural architecture are determined through a joint design space exploration, resulting in our choice of the 2-bit precision and the n-layer MLP. Our 2-bit version is trained using Brevitas and optimised as a dataflow hardware model through the FINN toolflow from AMD/Xilinx, targeting an XCZU7EV device. We show that the 2-bit CQMLP model, when integrated as the IDS, can detect malicious attack messages (DoS, fuzzing, and spoofing attack) with a very high accuracy of 99.9%, on par with the state-of-the-art methods in the literature. Furthermore, the dataflow model can perform line rate detection at a latency of 0.11 ms from message reception while consuming 0.23 mJ/inference, making it ideally suited for integration with an ECU in critical CAN networks. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 7 pages, 5 figures, 6 tables. arXiv admin note: substantial text overlap with arXiv:2401.10724

Journal ref: 2023 33rd International Conference on Field-Programmable Logic and Applications (FPL)

arXiv:2401.10724 [pdf, other]

doi 10.1109/ASAP57973.2023.00033

Real-Time Zero-Day Intrusion Detection System for Automotive Controller Area Network on FPGAs

Authors: Shashwat Khandelwal, Shreejith Shanker

Abstract: Increasing automation in vehicles enabled by increased connectivity to the outside world has exposed vulnerabilities in previously siloed automotive networks like controller area networks (CAN). Attributes of CAN such as broadcast-based communication among electronic control units (ECUs) that lowered deployment costs are now being exploited to carry out active injection attacks like denial of serv… ▽ More Increasing automation in vehicles enabled by increased connectivity to the outside world has exposed vulnerabilities in previously siloed automotive networks like controller area networks (CAN). Attributes of CAN such as broadcast-based communication among electronic control units (ECUs) that lowered deployment costs are now being exploited to carry out active injection attacks like denial of service (DoS), fuzzing, and spoofing attacks. Research literature has proposed multiple supervised machine learning models deployed as Intrusion detection systems (IDSs) to detect such malicious activity; however, these are largely limited to identifying previously known attack vectors. With the ever-increasing complexity of active injection attacks, detecting zero-day (novel) attacks in these networks in real-time (to prevent propagation) becomes a problem of particular interest. This paper presents an unsupervised-learning-based convolutional autoencoder architecture for detecting zero-day attacks, which is trained only on benign (attack-free) CAN messages. We quantise the model using Vitis-AI tools from AMD/Xilinx targeting a resource-constrained Zynq Ultrascale platform as our IDS-ECU system for integration. The proposed model successfully achieves equal or higher classification accuracy (> 99.5%) on unseen DoS, fuzzing, and spoofing attacks from a publicly available attack dataset when compared to the state-of-the-art unsupervised learning-based IDSs. Additionally, by cleverly overlapping IDS operation on a window of CAN messages with the reception, the model is able to meet line-rate detection (0.43 ms per window) of high-speed CAN, which when coupled with the low energy consumption per inference, makes this architecture ideally suited for detecting zero-day attacks on critical CAN networks. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 8 pages, 6 figures, 7 tables

Journal ref: 2023 IEEE 34th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

arXiv:2401.10689 [pdf, other]

doi 10.1109/FPL57034.2022.00070

A Lightweight Multi-Attack CAN Intrusion Detection System on Hybrid FPGAs

Authors: Shashwat Khandelwal, Shreejith Shanker

Abstract: Rising connectivity in vehicles is enabling new capabilities like connected autonomous driving and advanced driver assistance systems (ADAS) for improving the safety and reliability of next-generation vehicles. This increased access to in-vehicle functions compromises critical capabilities that use legacy invehicle networks like Controller Area Network (CAN), which has no inherent security or auth… ▽ More Rising connectivity in vehicles is enabling new capabilities like connected autonomous driving and advanced driver assistance systems (ADAS) for improving the safety and reliability of next-generation vehicles. This increased access to in-vehicle functions compromises critical capabilities that use legacy invehicle networks like Controller Area Network (CAN), which has no inherent security or authentication mechanism. Intrusion detection and mitigation approaches, particularly using machine learning models, have shown promising results in detecting multiple attack vectors in CAN through their ability to generalise to new vectors. However, most deployments require dedicated computing units like GPUs to perform line-rate detection, consuming much higher power. In this paper, we present a lightweight multi-attack quantised machine learning model that is deployed using Xilinx's Deep Learning Processing Unit IP on a Zynq Ultrascale+ (XCZU3EG) FPGA, which is trained and validated using the public CAN Intrusion Detection dataset. The quantised model detects denial of service and fuzzing attacks with an accuracy of above 99 % and a false positive rate of 0.07%, which are comparable to the state-of-the-art techniques in the literature. The Intrusion Detection System (IDS) execution consumes just 2.0 W with software tasks running on the ECU and achieves a 25 % reduction in per-message processing latency over the state-of-the-art implementations. This deployment allows the ECU function to coexist with the IDS with minimal changes to the tasks, making it ideal for real-time IDS in in-vehicle systems. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 5 pages, 2 figures, 6 tables

Journal ref: 32nd International Conference on Field-Programmable Logic and Applications (FPL) FPL 2022, 425-429

arXiv:2401.10674 [pdf, other]

doi 10.1109/ASAP54787.2022.00023

Deep Learning-based Embedded Intrusion Detection System for Automotive CAN

Authors: Shashwat Khandelwal, Eashan Wadhwa, Shreejith Shanker

Abstract: Rising complexity of in-vehicle electronics is enabling new capabilities like autonomous driving and active safety. However, rising automation also increases risk of security threats which is compounded by lack of in-built security measures in legacy networks like CAN, allowing attackers to observe, tamper and modify information shared over such broadcast networks. Various intrusion detection appr… ▽ More Rising complexity of in-vehicle electronics is enabling new capabilities like autonomous driving and active safety. However, rising automation also increases risk of security threats which is compounded by lack of in-built security measures in legacy networks like CAN, allowing attackers to observe, tamper and modify information shared over such broadcast networks. Various intrusion detection approaches have been proposed to detect and tackle such threats, with machine learning models proving highly effective. However, deploying machine learning models will require high processing power through high-end processors or GPUs to perform them close to line rate. In this paper, we propose a hybrid FPGA-based ECU approach that can transparently integrate IDS functionality through a dedicated off-the-shelf hardware accelerator that implements a deep-CNN intrusion detection model. Our results show that the proposed approach provides an average accuracy of over 99% across multiple attack datasets with 0.64% false detection rates while consuming 94% less energy and achieving 51.8% reduction in per-message processing latency when compared to IDS implementations on GPUs. △ Less

Submitted 19 January, 2024; originally announced January 2024.

Comments: 5 pages, 1 figure, 8 tables

Journal ref: IEEE 33rd International Conference on Application-specific Systems, Architectures and Processors (ASAP), Gothenburg, Sweden, 2022, pp. 88-92

arXiv:2401.01776 [pdf, other]

doi 10.1038/s41534-025-00981-7

Maximal steady-state entanglement in autonomous quantum thermal machines

Authors: Shishir Khandelwal, Björn Annby-Andersson, Giovanni Francesco Diotallevi, Andreas Wacker, Armin Tavakoli

Abstract: We devise an autonomous quantum thermal machine consisting of three pairwise-interacting qubits, two of which are locally coupled to thermal reservoirs. The machine operates autonomously, as it requires no time-coherent control, external driving or quantum bath engineering, and is instead propelled by a chemical potential bias. Under ideal conditions, we show that this out-of-equilibrium system ca… ▽ More We devise an autonomous quantum thermal machine consisting of three pairwise-interacting qubits, two of which are locally coupled to thermal reservoirs. The machine operates autonomously, as it requires no time-coherent control, external driving or quantum bath engineering, and is instead propelled by a chemical potential bias. Under ideal conditions, we show that this out-of-equilibrium system can deterministically generate a maximally entangled steady-state between two of the qubits, or any desired pure two-qubit entangled state, emerging as a dark state of the system. We study the robustness of entanglement production with respect to several relevant parameters, obtaining nearly-maximally-entangled states well-away from the ideal regime of operation. Furthermore, we show that our machine architecture can be generalised to a configuration with $2n-1$ qubits, in which only a potential bias and two-body interactions are sufficient to generate genuine multipartite maximally entangled steady states in the form of a W state of $n$ qubits. △ Less

Submitted 6 January, 2025; v1 submitted 3 January, 2024; originally announced January 2024.

Comments: 6+6 pages, 7 figures

Journal ref: npj Quantum Information 11, 28 (2025)

arXiv:2312.15065 [pdf, other]

doi 10.1103/PhysRevResearch.6.043091

Exact finite-time correlation functions for multi-terminal setups: Connecting theoretical frameworks for quantum transport and thermodynamics

Authors: Gianmichele Blasi, Shishir Khandelwal, Géraldine Haack

Abstract: Transport in open quantum systems can be explored through various theoretical frameworks, including the quantum master equation, scattering matrix, and Heisenberg equation of motion. The choice of framework depends on factors such as the presence of interactions, the coupling strength between the system and environment, and whether the focus is on steady-state or transient regimes. Existing litera… ▽ More Transport in open quantum systems can be explored through various theoretical frameworks, including the quantum master equation, scattering matrix, and Heisenberg equation of motion. The choice of framework depends on factors such as the presence of interactions, the coupling strength between the system and environment, and whether the focus is on steady-state or transient regimes. Existing literature treats these frameworks independently, lacking a unified perspective. Our work addresses this gap by clarifying the role and status of these approaches using a minimal single-level quantum dot model in a two-terminal setup under voltage and temperature biases. We derive analytical expressions for particle and energy currents and their fluctuations in both steady-state and transient regimes. Exact results from the Heisenberg equation are shown to align with scattering matrix and master equation approaches within their respective validity regimes. Crucially, we establish a protocol for the weak-coupling limit, bridging the applicability of master equations at weak-coupling with Heisenberg or scattering matrix approaches at arbitrary coupling strength. △ Less

Submitted 13 November, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Journal ref: Phys. Rev. Research 6, 043091 (2024)

arXiv:2311.17095 [pdf, other]

Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models

Authors: Jiayun Luo, Siddhesh Khandelwal, Leonid Sigal, Boyang Li

Abstract: From image-text pairs, large-scale vision-language models (VLMs) learn to implicitly associate image regions with words, which prove effective for tasks like visual question answering. However, leveraging the learned association for open-vocabulary semantic segmentation remains a challenge. In this paper, we propose a simple, yet extremely effective, training-free technique, Plug-and-Play Open-Voc… ▽ More From image-text pairs, large-scale vision-language models (VLMs) learn to implicitly associate image regions with words, which prove effective for tasks like visual question answering. However, leveraging the learned association for open-vocabulary semantic segmentation remains a challenge. In this paper, we propose a simple, yet extremely effective, training-free technique, Plug-and-Play Open-Vocabulary Semantic Segmentation (PnP-OVSS) for this task. PnP-OVSS leverages a VLM with direct text-to-image cross-attention and an image-text matching loss. To balance between over-segmentation and under-segmentation, we introduce Salience Dropout; by iteratively dropping patches that the model is most attentive to, we are able to better resolve the entire extent of the segmentation mask. PnP-OVSS does not require any neural network training and performs hyperparameter tuning without the need for any segmentation annotations, even for a validation set. PnP-OVSS demonstrates substantial improvements over comparable baselines (+26.2% mIoU on Pascal VOC, +20.5% mIoU on MS COCO, +3.1% mIoU on COCO Stuff and +3.0% mIoU on ADE20K). Our codebase is at https://github.com/letitiabanana/PnP-OVSS. △ Less

Submitted 15 June, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

Comments: Accepted to CVPR 2024; Earlier version of this paper contained an unintentional error stemming from a bug in the code. This version corrects this error, which had to do with filtering of class names. In consultation with CVPR Program Chairs it was suggested errata be submitted as the updated (fixed) code reinforced original findings (albeit with slightly different final numbers)

arXiv:2311.00679 [pdf, ps, other]

Components of curvature-squared invariants of minimal supergravity in five dimensions

Authors: Gregory Gold, Jessica Hutomo, Saurish Khandelwal, Gabriele Tartaglino-Mazzucchelli

Abstract: We present for the first time the component structure of the supersymmetric completions for all curvature-squared invariants of five-dimensional, off-shell (gauged) minimal supergravity, including all fermions. This is achieved by using an interplay between superspace and superconformal tensor calculus techniques, and by employing results from arXiv:1410.8682 and arXiv:2302.14295. Our analysis is… ▽ More We present for the first time the component structure of the supersymmetric completions for all curvature-squared invariants of five-dimensional, off-shell (gauged) minimal supergravity, including all fermions. This is achieved by using an interplay between superspace and superconformal tensor calculus techniques, and by employing results from arXiv:1410.8682 and arXiv:2302.14295. Our analysis is based on using a standard Weyl multiplet of conformal supergravity coupled to a vector and a linear multiplet compensator to engineer off-shell Poincaré supergravity. We compute all the descendants of the composite linear multiplets that describe gauged supergravity together with the three independent four-derivative invariants. These are the building blocks of the locally superconformal invariant actions. A derivation of the primary equations of motion for minimal gauged off-shell supergravity deformed by an arbitrary combination of these three locally superconformal invariants, is then provided. Finally, all the covariant descendants in the multiplets of equations of motion are obtained by applying a series of $Q$-supersymmetry transformations, equivalent to successively applying superspace spinor derivatives to the primary equations of motion. △ Less

Submitted 1 November, 2023; originally announced November 2023.

Comments: 99 pages of manuscript + 228 pages of supplementary file

arXiv:2310.11381 [pdf, other]

doi 10.1103/PhysRevLett.133.070403

Chiral Bell-state transfer via dissipative Liouvillian dynamics

Authors: Shishir Khandelwal, Weijian Chen, Kater W. Murch, Géraldine Haack

Abstract: Chiral state transfer along closed loops in the vicinity of an exceptional point is one of the many counter-intuitive observations in non-Hermitian physics. The application of this property beyond proof-of-principle in quantum physics, is an open question. In this work, we demonstrate chiral state conversion between singlet and triplet Bell states through fully-quantum Liouvillian dynamics. Crucia… ▽ More Chiral state transfer along closed loops in the vicinity of an exceptional point is one of the many counter-intuitive observations in non-Hermitian physics. The application of this property beyond proof-of-principle in quantum physics, is an open question. In this work, we demonstrate chiral state conversion between singlet and triplet Bell states through fully-quantum Liouvillian dynamics. Crucially, we demonstrate that this property can be used for the chiral production of Bell states from separable states with a high fidelity and for a large range of parameters. Additionally, we show that the removal of quantum jumps from the dynamics through postselection can result in near-perfect Bell states from initially separable states. Our work presents the first application of chiral state transfer in quantum information processing and demonstrates a novel way to control entangled states by means of dissipation engineering. △ Less

Submitted 17 September, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 4+6 pages, 7 figures

Journal ref: Phys. Rev. Lett. 133, 070403 (2024)

arXiv:2309.07637 [pdf, ps, other]

All Gauged Curvature Squared Supergravities in Five Dimensions

Authors: Gregory Gold, Jessica Hutomo, Saurish Khandelwal, Mehmet Ozkan, Yi Pang, Gabriele Tartaglino-Mazzucchelli

Abstract: We present a complete basis to study gauged curvature-squared supergravity in five dimensions. We replace the conventional ungauged Riemann-squared action with a new Log-invariant, offering a comprehensive framework for all gauged curvature-squared supergravities. Our findings address long-standing challenges and have implications for precision tests in the AdS/CFT correspondence. We present a complete basis to study gauged curvature-squared supergravity in five dimensions. We replace the conventional ungauged Riemann-squared action with a new Log-invariant, offering a comprehensive framework for all gauged curvature-squared supergravities. Our findings address long-standing challenges and have implications for precision tests in the AdS/CFT correspondence. △ Less

Submitted 30 September, 2023; v1 submitted 14 September, 2023; originally announced September 2023.

Comments: 5+1 pages; v2: two references added, minor typos and comments corrected

arXiv:2308.10649 [pdf, ps, other]

Reinforcement Learning Based Sensor Optimization for Bio-markers

Authors: Sajal Khandelwal, Pawan Kumar, Syed Azeemuddin

Abstract: Radio frequency (RF) biosensors, in particular those based on inter-digitated capacitors (IDCs), are pivotal in areas like biomedical diagnosis, remote sensing, and wireless communication. Despite their advantages of low cost and easy fabrication, their sensitivity can be hindered by design imperfections, environmental factors, and circuit noise. This paper investigates enhancing the sensitivity o… ▽ More Radio frequency (RF) biosensors, in particular those based on inter-digitated capacitors (IDCs), are pivotal in areas like biomedical diagnosis, remote sensing, and wireless communication. Despite their advantages of low cost and easy fabrication, their sensitivity can be hindered by design imperfections, environmental factors, and circuit noise. This paper investigates enhancing the sensitivity of IDC-based RF sensors using novel reinforcement learning based Binary Particle Swarm Optimization (RLBPSO), and it is compared to Ant Colony Optimization (ACO), and other state-of-the-art methods. By focusing on optimizing design parameters like electrode design and finger width, the proposed study found notable improvements in sensor sensitivity. The proposed RLBPSO method shows best optimized design for various frequency ranges when compared to current state-of-the-art methods. △ Less

Submitted 21 August, 2023; originally announced August 2023.

Comments: 7 pages, 4 tables

arXiv:2308.05101 [pdf, other]

DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels

Authors: Soumadeep Saha, Utpal Garain, Arijit Ukil, Arpan Pal, Sundeep Khandelwal

Abstract: The enormous demand for annotated data brought forth by deep learning techniques has been accompanied by the problem of annotation noise. Although this issue has been widely discussed in machine learning literature, it has been relatively unexplored in the context of "multi-label classification" (MLC) tasks which feature more complicated kinds of noise. Additionally, when the domain in question ha… ▽ More The enormous demand for annotated data brought forth by deep learning techniques has been accompanied by the problem of annotation noise. Although this issue has been widely discussed in machine learning literature, it has been relatively unexplored in the context of "multi-label classification" (MLC) tasks which feature more complicated kinds of noise. Additionally, when the domain in question has certain logical constraints, noisy annotations often exacerbate their violations, making such a system unacceptable to an expert. This paper studies the effect of label noise on domain rule violation incidents in the MLC task, and incorporates domain rules into our learning algorithm to mitigate the effect of noise. We propose the Domain Obedient Self-supervised Training (DOST) paradigm which not only makes deep learning models more aligned to domain rules, but also improves learning performance in key metrics and minimizes the effect of annotation noise. This novel approach uses domain guidance to detect offending annotations and deter rule-violating predictions in a self-supervised manner, thus making it more "data efficient" and domain compliant. Empirical studies, performed over two large scale multi-label classification datasets, demonstrate that our method results in improvement across the board, and often entirely counteracts the effect of noise. △ Less

Submitted 9 August, 2023; originally announced August 2023.

Comments: Submitted to IEEE TNNLS on March 7th 2023. 8 pages, 4 figures

ACM Class: I.2.6; I.2.0

arXiv:2303.02441 [pdf, other]

doi 10.1016/j.commatsci.2023.112182

$ρ$-CP: Open Source Dislocation Density Based Crystal Plasticity Framework for Simulating Temperature- and Strain Rate-Dependent Deformation

Authors: Anirban Patra, Suketa Chaudhary, Namit Pai, Tarakram Ramgopal, Sarthak Khandelwal, Adwitiya Rao, David L. McDowell

Abstract: This work presents an open source, dislocation density based crystal plasticity modeling framework, $ρ$-CP. A Kocks-type thermally activated flow is used for accounting for the temperature and strain rate effects on the crystallographic shearing rate. Slip system-level mobile and immobile dislocation densities, as well slip system-level backstress, are used as internal state variables for represen… ▽ More This work presents an open source, dislocation density based crystal plasticity modeling framework, $ρ$-CP. A Kocks-type thermally activated flow is used for accounting for the temperature and strain rate effects on the crystallographic shearing rate. Slip system-level mobile and immobile dislocation densities, as well slip system-level backstress, are used as internal state variables for representing the substructure evolution during plastic deformation. A fully implicit numerical integration scheme is presented for the time integration of the finite deformation plasticity model. The framework is implemented and integrated with the open source finite element solver, Multiphysics Object-Oriented Simulation Environment (MOOSE). Example applications of the model are demonstrated for predicting the anisotropic mechanical response of single and polycrystalline hcp magnesium, strain rate effects and cyclic deformation of polycrystalline fcc OFHC copper, and temperature and strain rate effects on the thermo-mechanical deformation of polycrystalline bcc tantanlum. Simulations of realistic Voronoi-tessellated microstructures as well as Electron Back Scatter Diffraction (EBSD) microstructures are demonstrated to highlight the model's ability to predict large deformation and misorientation development during plastic deformation. △ Less

Submitted 30 March, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

Comments: 30 pages, 19 figures, 5 tables, v2

MSC Class: 74D10

Journal ref: Computational Materials Science, Vol. 224 (2023) 112182

arXiv:2302.14295 [pdf, ps, other]

doi 10.1103/PhysRevD.107.106013

On curvature-squared invariants of minimal five-dimensional supergravity from superspace

Authors: Gregory Gold, Jessica Hutomo, Saurish Khandelwal, Gabriele Tartaglino-Mazzucchelli

Abstract: We elaborate on the off-shell superspace construction of curvature-squared invariants in minimal five-dimensional supergravity. This is described by the standard Weyl multiplet of conformal supergravity coupled to two compensators being a vector multiplet and a linear multiplet. In this set-up, we review the definition of the off-shell two-derivative gauged supergravity together with the three ind… ▽ More We elaborate on the off-shell superspace construction of curvature-squared invariants in minimal five-dimensional supergravity. This is described by the standard Weyl multiplet of conformal supergravity coupled to two compensators being a vector multiplet and a linear multiplet. In this set-up, we review the definition of the off-shell two-derivative gauged supergravity together with the three independent four-derivative superspace invariants defined in arXiv:1410.8682. We provide the explicit expression for the linear multiplet based on a prepotential given by the logarithm of the vector multiplet primary superfield. We then present for the first time the primary equations of motion for minimal gauged off-shell supergravity deformed by an arbitrary combination of these three four-derivative locally superconformal invariants. We also identify a four-derivative invariant based on the linear multiplet compensator and the kinetic superfield of a vector multiplet which can be used to engineer an alternative supersymmetric completion of the scalar curvature squared. △ Less

Submitted 16 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

Comments: 35 pages. v2: minor corrections, typos corrected

arXiv:2302.07319 [pdf, other]

Frustratingly Simple but Effective Zero-shot Detection and Segmentation: Analysis and a Strong Baseline

Authors: Siddhesh Khandelwal, Anirudth Nambirajan, Behjat Siddiquie, Jayan Eledath, Leonid Sigal

Abstract: Methods for object detection and segmentation often require abundant instance-level annotations for training, which are time-consuming and expensive to collect. To address this, the task of zero-shot object detection (or segmentation) aims at learning effective methods for identifying and localizing object instances for the categories that have no supervision available. Constructing architectures… ▽ More Methods for object detection and segmentation often require abundant instance-level annotations for training, which are time-consuming and expensive to collect. To address this, the task of zero-shot object detection (or segmentation) aims at learning effective methods for identifying and localizing object instances for the categories that have no supervision available. Constructing architectures for these tasks requires choosing from a myriad of design options, ranging from the form of the class encoding used to transfer information from seen to unseen categories, to the nature of the function being optimized for learning. In this work, we extensively study these design choices, and carefully construct a simple yet extremely effective zero-shot recognition method. Through extensive experiments on the MSCOCO dataset on object detection and segmentation, we highlight that our proposed method outperforms existing, considerably more complex, architectures. Our findings and method, which we propose as a competitive future baseline, point towards the need to revisit some of the recent design trends in zero-shot detection / segmentation. △ Less

Submitted 14 February, 2023; originally announced February 2023.

Comments: 17 Pages, 7 Figures

arXiv:2301.13085 [pdf, other]

doi 10.1103/PhysRevB.108.174418

Dynamical nuclear polarization for dissipation-induced entanglement in NV centers

Authors: Shishir Khandelwal, Shashwat Kumar, Nicolas Palazzo, Géraldine Haack, Mayeul Chipaux

Abstract: We propose a practical implementation of a two-qubit entanglement engine which denotes a scheme to generate quantum correlations through purely dissipative processes. On a diamond platform, the electron spin transitions of two Nitrogen-Vacancy (NV) centers play the role of artificial atoms (qubits), interacting through a dipole-dipole Hamiltonian. The surrounding Carbon-13 nuclear spins act as spi… ▽ More We propose a practical implementation of a two-qubit entanglement engine which denotes a scheme to generate quantum correlations through purely dissipative processes. On a diamond platform, the electron spin transitions of two Nitrogen-Vacancy (NV) centers play the role of artificial atoms (qubits), interacting through a dipole-dipole Hamiltonian. The surrounding Carbon-13 nuclear spins act as spin baths playing the role of thermal reservoirs at well-defined temperatures and exchanging heat through the NV center qubits. In our scheme, a key challenge is therefore to create a temperature gradient between two spin baths surrounding each NV center, for which we propose the exploit the recent progresses in dynamical nuclear polarization, combined with microscopy superresolution methods. We discuss how these techniques should allow us to initialize such a long lasting out-of-equilibrium polarization situation between them, effectively leading to suitable conditions to run the entanglement engine successfully. Within a quantum master equation approach, we make theoretical predictions using state-of-the-art values for experimental parameters. We obtain promising values for the concurrence, reaching theoretical maxima. △ Less

Submitted 12 July, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: 12 Pages, 3 Figures, 1 Table

Journal ref: Phys. Rev. B 108, 174418 (2023)

arXiv:2301.00493 [pdf, other]

Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

Authors: Benjamin Wilson, William Qi, Tanmay Agarwal, John Lambert, Jagjeet Singh, Siddhesh Khandelwal, Bowen Pan, Ratnesh Kumar, Andrew Hartnett, Jhony Kaesemodel Pontes, Deva Ramanan, Peter Carr, James Hays

Abstract: We introduce Argoverse 2 (AV2) - a collection of three datasets for perception and forecasting research in the self-driving domain. The annotated Sensor Dataset contains 1,000 sequences of multimodal data, encompassing high-resolution imagery from seven ring cameras, and two stereo cameras in addition to lidar point clouds, and 6-DOF map-aligned pose. Sequences contain 3D cuboid annotations for 26… ▽ More We introduce Argoverse 2 (AV2) - a collection of three datasets for perception and forecasting research in the self-driving domain. The annotated Sensor Dataset contains 1,000 sequences of multimodal data, encompassing high-resolution imagery from seven ring cameras, and two stereo cameras in addition to lidar point clouds, and 6-DOF map-aligned pose. Sequences contain 3D cuboid annotations for 26 object categories, all of which are sufficiently-sampled to support training and evaluation of 3D perception models. The Lidar Dataset contains 20,000 sequences of unlabeled lidar point clouds and map-aligned pose. This dataset is the largest ever collection of lidar sensor data and supports self-supervised learning and the emerging task of point cloud forecasting. Finally, the Motion Forecasting Dataset contains 250,000 scenarios mined for interesting and challenging interactions between the autonomous vehicle and other actors in each local scene. Models are tasked with the prediction of future motion for "scored actors" in each scenario and are provided with track histories that capture object location, heading, velocity, and category. In all three datasets, each scenario contains its own HD Map with 3D lane and crosswalk geometry - sourced from data captured in six distinct cities. We believe these datasets will support new and existing machine learning research problems in ways that existing datasets do not. All datasets are released under the CC BY-NC-SA 4.0 license. △ Less

Submitted 1 January, 2023; originally announced January 2023.

Comments: Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks

arXiv:2209.05748 [pdf, ps, other]

doi 10.1103/PhysRevD.107.046009

Hyper-Dilaton Weyl Multiplets of 5D and 6D Minimal Conformal Supergravity

Authors: Jessica Hutomo, Saurish Khandelwal, Gabriele Tartaglino-Mazzucchelli, Jesse Woods

Abstract: By extending the recent analysis of arXiv:2203.12203 for ${\mathcal{N}}=2$ conformal supergravity in four dimensions, we define new hyper-dilaton Weyl multiplets for five-dimensional ${\mathcal{N}}=1$, and six-dimensional ${\mathcal{N}}=(1,0)$ conformal supergravities. These are constructed by coupling the five- and six-dimensional standard Weyl multiplets to on-shell hypermultiplets and reinterpr… ▽ More By extending the recent analysis of arXiv:2203.12203 for ${\mathcal{N}}=2$ conformal supergravity in four dimensions, we define new hyper-dilaton Weyl multiplets for five-dimensional ${\mathcal{N}}=1$, and six-dimensional ${\mathcal{N}}=(1,0)$ conformal supergravities. These are constructed by coupling the five- and six-dimensional standard Weyl multiplets to on-shell hypermultiplets and reinterpreting the systems as new multiplets of conformal supergravity. In the five-dimensional case, we also construct a new hyper-dilaton Poincaré supergravity by coupling to an off-shell vector multiplet compensator. As in four dimensions, a $BF$-coupling induces a non-trivial scalar potential for the five-dimensional dilaton that admits AdS$_5$ vacua. △ Less

Submitted 27 February, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

Comments: 56 pages; v2: minor corrections, version published in PRD

arXiv:2208.10809 [pdf, other]

doi 10.1103/PhysRevResearch.5.013129

Characterizing the performance of heat rectifiers

Authors: Shishir Khandelwal, Martí Perarnau-Llobet, Stella Seah, Nicolas Brunner, Géraldine Haack

Abstract: A physical system connected to two thermal reservoirs at different temperatures is said to act as a heat rectifier when it is able to bias the heat current in a given direction, similarly to an electronic diode. We propose to quantify the performance of a heat rectifier by mapping out the trade-off between heat currents and rectification. By optimizing over the system's parameters, we obtain Paret… ▽ More A physical system connected to two thermal reservoirs at different temperatures is said to act as a heat rectifier when it is able to bias the heat current in a given direction, similarly to an electronic diode. We propose to quantify the performance of a heat rectifier by mapping out the trade-off between heat currents and rectification. By optimizing over the system's parameters, we obtain Pareto fronts, which can be efficiently computed using general coefficients of performance. This approach naturally highlights the fundamental trade-off between heat rectification and conduction, and allows for a meaningful comparison between different devices for heat rectification. We illustrate the practical relevance of these ideas on three minimal models for spin-boson nanoscale rectifiers, i.e., systems consisting of one or two interacting qubits coupled to bosonic reservoirs biased in temperature. Our results demonstrate the superiority of two strongly-interacting qubits for heat rectification. △ Less

Submitted 23 August, 2022; originally announced August 2022.

Comments: 10 pages, 6 figures. Comments welcome

Journal ref: Phys. Rev. Research 5, 013129 (2023)

arXiv:2208.08426 [pdf, other]

doi 10.1145/3555156

"We Need a Woman in Music": Exploring Wikipedia's Values on Article Priority

Authors: Mo Houtti, Isaac Johnson, Joel Cepeda, Soumya Khandelwal, Aviral Bhatnagar, Loren Terveen

Abstract: Wikipedia -- like most peer production communities -- suffers from a basic problem: the amount of work that needs to be done (articles to be created and improved) exceeds the available resources (editor effort). Recommender systems have been deployed to address this problem, but they have tended to recommend work tasks that match individuals' personal interests, ignoring more global community valu… ▽ More Wikipedia -- like most peer production communities -- suffers from a basic problem: the amount of work that needs to be done (articles to be created and improved) exceeds the available resources (editor effort). Recommender systems have been deployed to address this problem, but they have tended to recommend work tasks that match individuals' personal interests, ignoring more global community values. In English Wikipedia, discussion about Vital articles constitutes a proxy for community values about the types of articles that are most important, and should therefore be prioritized for improvement. We first analyzed these discussions, finding that an article's priority is considered a function of 1) its inherent importance and 2) its effects on Wikipedia's global composition. One important example of the second consideration is balance, including along the dimensions of gender and geography. We then conducted a quantitative analysis evaluating how four different article prioritization methods -- two from prior research -- would affect Wikipedia's overall balance on these two dimensions; we found significant differences among the methods. We discuss the implications of our results, including particularly how they can guide the design of recommender systems that take into account community values, not just individuals' interests. △ Less

Submitted 17 August, 2022; originally announced August 2022.

Comments: To appear at the 25th ACM Conference On Computer-Supported Cooperative Work And Social Computing (CSCW 2022)

arXiv:2207.13440 [pdf, other]

Iterative Scene Graph Generation

Authors: Siddhesh Khandelwal, Leonid Sigal

Abstract: The task of scene graph generation entails identifying object entities and their corresponding interaction predicates in a given image (or video). Due to the combinatorially large solution space, existing approaches to scene graph generation assume certain factorization of the joint distribution to make the estimation feasible (e.g., assuming that objects are conditionally independent of predicate… ▽ More The task of scene graph generation entails identifying object entities and their corresponding interaction predicates in a given image (or video). Due to the combinatorially large solution space, existing approaches to scene graph generation assume certain factorization of the joint distribution to make the estimation feasible (e.g., assuming that objects are conditionally independent of predicate predictions). However, this fixed factorization is not ideal under all scenarios (e.g., for images where an object entailed in interaction is small and not discernible on its own). In this work, we propose a novel framework for scene graph generation that addresses this limitation, as well as introduces dynamic conditioning on the image, using message passing in a Markov Random Field. This is implemented as an iterative refinement procedure wherein each modification is conditioned on the graph generated in the previous iteration. This conditioning across refinement steps allows joint reasoning over entities and relations. This framework is realized via a novel and end-to-end trainable transformer-based architecture. In addition, the proposed framework can improve existing approach performance. Through extensive experiments on Visual Genome and Action Genome benchmark datasets we show improved performance on the scene graph generation. △ Less

Submitted 27 July, 2022; originally announced July 2022.

Comments: 25 pages, 10 images, 9 tables

arXiv:2203.12203 [pdf, ps, other]

doi 10.1007/JHEP09(2022)016

Hyper-Dilaton Weyl Multiplet of 4D, ${\mathcal{N}}=2$ Conformal Supergravity

Authors: Gregory Gold, Saurish Khandelwal, William Kitchin, Gabriele Tartaglino-Mazzucchelli

Abstract: We define a new dilaton Weyl multiplet of ${\mathcal{N}}=2$ conformal supergravity in four dimensions. This is constructed by reinterpreting the equations of motion of an on-shell hypermultiplet as constraints that render some of the fields of the standard Weyl multiplet composite. The independent bosonic components include four scalar fields and a triplet of gauge two-forms. The resulting, so-cal… ▽ More We define a new dilaton Weyl multiplet of ${\mathcal{N}}=2$ conformal supergravity in four dimensions. This is constructed by reinterpreting the equations of motion of an on-shell hypermultiplet as constraints that render some of the fields of the standard Weyl multiplet composite. The independent bosonic components include four scalar fields and a triplet of gauge two-forms. The resulting, so-called, hyper-dilaton Weyl multiplet defines a $24+24$ off-shell representation of the local ${\mathcal{N}}=2$ superconformal algebra. By coupling the hyper-dilaton Weyl multiplet to an off-shell vector multiplet compensator, we obtain one of the two minimal $32+32$ off-shell multiplets of ${\mathcal{N}}=2$ Poincaré supergravity constructed by Müller in 1986. On-shell, this contains the minimal ${\mathcal{N}}=2$ Poincaré supergravity multiplet together with a hypermultiplet where one of its physical scalars plays the role of a dilaton, while its three other scalars are dualised to a triplet of real gauge two-forms. Interestingly, a $BF$-coupling induces a scalar potential for the dilaton without a standard gauging. △ Less

Submitted 23 September, 2024; v1 submitted 23 March, 2022; originally announced March 2022.

Comments: v3: typo corrected in eq. (2.33)

Journal ref: JHEP 09 (2022), 016

arXiv:2104.14733 [pdf, other]

A Compact Model for SiC Power MOSFETs for Large Current and High Voltage Operation

Authors: Cristino Salcines, Sourabh Khandelwal, Ingmar Kallfass

Abstract: This work presents a physics based compact model for SiC power MOSFETs that accurately describes the I-V characteristics up to large voltages and currents. Charge-based formulations accounting for the different physics of SiC power MOSFETs are presented. The formulations account for the effect of the large SiC/SiO2 interface traps density characteristic of SiC MOSFETs and its dependence with tempe… ▽ More This work presents a physics based compact model for SiC power MOSFETs that accurately describes the I-V characteristics up to large voltages and currents. Charge-based formulations accounting for the different physics of SiC power MOSFETs are presented. The formulations account for the effect of the large SiC/SiO2 interface traps density characteristic of SiC MOSFETs and its dependence with temperature. The modeling of interface charge density is found to be necessary to describe the electrostatics of SiC power MOSFETs when operating at simultaneous high current and high voltage regions. The proposed compact model accurately fits the measurement data extracted of a 160 milli ohms, 1200V SiC power MOSFET in the complete IV plane from drain-voltage $V_d$ = 5mV up to 800 V and current ranges from few mA to 30 A. △ Less

Submitted 29 April, 2021; originally announced April 2021.

arXiv:2104.14207 [pdf, other]

Segmentation-grounded Scene Graph Generation

Authors: Siddhesh Khandelwal, Mohammed Suhail, Leonid Sigal

Abstract: Scene graph generation has emerged as an important problem in computer vision. While scene graphs provide a grounded representation of objects, their locations and relations in an image, they do so only at the granularity of proposal bounding boxes. In this work, we propose the first, to our knowledge, framework for pixel-level segmentation-grounded scene graph generation. Our framework is agnosti… ▽ More Scene graph generation has emerged as an important problem in computer vision. While scene graphs provide a grounded representation of objects, their locations and relations in an image, they do so only at the granularity of proposal bounding boxes. In this work, we propose the first, to our knowledge, framework for pixel-level segmentation-grounded scene graph generation. Our framework is agnostic to the underlying scene graph generation method and address the lack of segmentation annotations in target scene graph datasets (e.g., Visual Genome) through transfer and multi-task learning from, and with, an auxiliary dataset (e.g., MS COCO). Specifically, each target object being detected is endowed with a segmentation mask, which is expressed as a lingual-similarity weighted linear combination over categories that have annotations present in an auxiliary dataset. These inferred masks, along with a novel Gaussian attention mechanism which grounds the relations at a pixel-level within the image, allow for improved relation prediction. The entire framework is end-to-end trainable and is learned in a multi-task manner with both target and auxiliary datasets. △ Less

Submitted 29 April, 2021; originally announced April 2021.

Comments: 11 pages, 3 figures, 4 tables

arXiv:2101.11553 [pdf, other]

doi 10.1103/PRXQuantum.2.040346

Signatures of Liouvillian exceptional points in a quantum thermal machine

Authors: Shishir Khandelwal, Nicolas Brunner, Géraldine Haack

Abstract: Viewing a quantum thermal machine as a non-Hermitian quantum system, we characterize in full generality its analytical time-dependent dynamics by deriving the spectrum of its non-Hermitian Liouvillian for an arbitrary initial state. We show that the thermal machine features a number of Liouvillian exceptional points (EPs) for experimentally realistic parameters, in particular a third-dorder except… ▽ More Viewing a quantum thermal machine as a non-Hermitian quantum system, we characterize in full generality its analytical time-dependent dynamics by deriving the spectrum of its non-Hermitian Liouvillian for an arbitrary initial state. We show that the thermal machine features a number of Liouvillian exceptional points (EPs) for experimentally realistic parameters, in particular a third-dorder exceptional point that leaves signatures both in short and long-time regimes. Remarkably, we demonstrate that this EP corresponds to a regime of critical decay for the quantum thermal machine towards its steady state, bearing a striking resemblance with a critically damped harmonic oscillator. These results open up exciting possibilities for the precise dynamical control of quantum thermal machines exploiting exceptional points from non-Hermitian physics and are amenable to state-of-the-art solid-state platforms such as semiconducting and superconducting devices. △ Less

Submitted 22 November, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

Comments: 6 pages, discussion on critical decay expanded

Journal ref: PRX Quantum 2, 040346 (2021)

arXiv:2008.10587 [pdf, other]

What-If Motion Prediction for Autonomous Driving

Authors: Siddhesh Khandelwal, William Qi, Jagjeet Singh, Andrew Hartnett, Deva Ramanan

Abstract: Forecasting the long-term future motion of road actors is a core challenge to the deployment of safe autonomous vehicles (AVs). Viable solutions must account for both the static geometric context, such as road lanes, and dynamic social interactions arising from multiple actors. While recent deep architectures have achieved state-of-the-art performance on distance-based forecasting metrics, these a… ▽ More Forecasting the long-term future motion of road actors is a core challenge to the deployment of safe autonomous vehicles (AVs). Viable solutions must account for both the static geometric context, such as road lanes, and dynamic social interactions arising from multiple actors. While recent deep architectures have achieved state-of-the-art performance on distance-based forecasting metrics, these approaches produce forecasts that are predicted without regard to the AV's intended motion plan. In contrast, we propose a recurrent graph-based attentional approach with interpretable geometric (actor-lane) and social (actor-actor) relationships that supports the injection of counterfactual geometric goals and social contexts. Our model can produce diverse predictions conditioned on hypothetical or "what-if" road lanes and multi-actor interactions. We show that such an approach could be used in the planning loop to reason about unobserved causes or unlikely futures that are directly relevant to the AV's intended route. △ Less

Submitted 24 August, 2020; originally announced August 2020.

Comments: 16 pages, 6 tables, 6 figures

arXiv:2006.07502 [pdf, other]

UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation

Authors: Siddhesh Khandelwal, Raghav Goyal, Leonid Sigal

Abstract: Methods for object detection and segmentation rely on large scale instance-level annotations for training, which are difficult and time-consuming to collect. Efforts to alleviate this look at varying degrees and quality of supervision. Weakly-supervised approaches draw on image-level labels to build detectors/segmentors, while zero/few-shot methods assume abundant instance-level data for a set of… ▽ More Methods for object detection and segmentation rely on large scale instance-level annotations for training, which are difficult and time-consuming to collect. Efforts to alleviate this look at varying degrees and quality of supervision. Weakly-supervised approaches draw on image-level labels to build detectors/segmentors, while zero/few-shot methods assume abundant instance-level data for a set of base classes, and none to a few examples for novel classes. This taxonomy has largely siloed algorithmic designs. In this work, we aim to bridge this divide by proposing an intuitive and unified semi-supervised model that is applicable to a range of supervision: from zero to a few instance-level samples per novel class. For base classes, our model learns a mapping from weakly-supervised to fully-supervised detectors/segmentors. By learning and leveraging visual and lingual similarities between the novel and base classes, we transfer those mappings to obtain detectors/segmentors for novel classes; refining them with a few novel class instance-level annotated samples, if available. The overall model is end-to-end trainable and highly flexible. Through extensive experiments on MS-COCO and Pascal VOC benchmark datasets we show improved performance in a variety of settings. △ Less

Submitted 3 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

Comments: 22 Pages, 8 Figures, 13 Tables

arXiv:2003.01426 [pdf, other]

doi 10.1088/1367-2630/ab9983

Critical heat current for operating an entanglement engine

Authors: Shishir Khandelwal, Nicolas Palazzo, Nicolas Brunner, Géraldine Haack

Abstract: Autonomous entanglement engines have recently been proposed to generate steady-state bipartite and multipartite entanglement exploiting only incoherent interactions with thermal baths at different temperatures. In this work, we investigate the interplay between heat current and entanglement in a two-qubit entanglement engine, deriving a critical heat current for successful operation of the engine,… ▽ More Autonomous entanglement engines have recently been proposed to generate steady-state bipartite and multipartite entanglement exploiting only incoherent interactions with thermal baths at different temperatures. In this work, we investigate the interplay between heat current and entanglement in a two-qubit entanglement engine, deriving a critical heat current for successful operation of the engine, i.e. a cut-off above which entanglement is present. The heat current can thus be seen as a witness to the presence of entanglement. In the regime of weak-inter qubit coupling, we also investigate the effect of two experimentally relevant parameters for the qubits, the energy detuning and tunnelling, on the entanglement production. Finally, we show that the regime of strong inter-qubit coupling provides no clear advantage over the weak regime, in the context of out-of-equilibrium entanglement engines. △ Less

Submitted 14 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

Comments: 18 pages, 6 figures, discussion on strong inter-qubit coupling added

Journal ref: New J. Phys. 22, 073039 (2020)

arXiv:1907.05866 [pdf, ps, other]

doi 10.1103/PhysRevD.100.105013

Equivalent Dual Theories for 3D N=2 Supergravity

Authors: Nabamita Banerjee, Saurish Khandelwal, Parita Shah

Abstract: N=2 three dimensional Supergravity with internal $R-$symmetry generators can be understood as a two dimensional chiral Wess-Zumino-Witten model. In this paper, we present the reduced phase space description of the theory, which turns out to be flat limit of a generalised Liouville theory, up to zero modes. The reduced phase space description can also be explained as a gauged chiral Wess-Zumino-Wit… ▽ More N=2 three dimensional Supergravity with internal $R-$symmetry generators can be understood as a two dimensional chiral Wess-Zumino-Witten model. In this paper, we present the reduced phase space description of the theory, which turns out to be flat limit of a generalised Liouville theory, up to zero modes. The reduced phase space description can also be explained as a gauged chiral Wess-Zumino-Witten model. We show that both these descriptions possess identical gauge and global (quantum N=2 superBMS$_3$) symmetries. △ Less

Submitted 11 July, 2019; originally announced July 2019.

Comments: 20+2 pages. arXiv admin note: text overlap with arXiv:1905.10239

Journal ref: Phys. Rev. D 100, 105013 (2019)

arXiv:1905.09400 [pdf, other]

AttentionRNN: A Structured Spatial Attention Mechanism

Authors: Siddhesh Khandelwal, Leonid Sigal

Abstract: Visual attention mechanisms have proven to be integrally important constituent components of many modern deep neural architectures. They provide an efficient and effective way to utilize visual information selectively, which has shown to be especially valuable in multi-modal learning tasks. However, all prior attention frameworks lack the ability to explicitly model structural dependencies among a… ▽ More Visual attention mechanisms have proven to be integrally important constituent components of many modern deep neural architectures. They provide an efficient and effective way to utilize visual information selectively, which has shown to be especially valuable in multi-modal learning tasks. However, all prior attention frameworks lack the ability to explicitly model structural dependencies among attention variables, making it difficult to predict consistent attention masks. In this paper we develop a novel structured spatial attention mechanism which is end-to-end trainable and can be integrated with any feed-forward convolutional neural network. This proposed AttentionRNN layer explicitly enforces structure over the spatial attention variables by sequentially predicting attention values in the spatial mask in a bi-directional raster-scan and inverse raster-scan order. As a result, each attention value depends not only on local image or contextual information, but also on the previously predicted attention values. Our experiments show consistent quantitative and qualitative improvements on a variety of recognition tasks and datasets; including image categorization, question answering and image generation. △ Less

Submitted 22 May, 2019; originally announced May 2019.

arXiv:1904.02178 [pdf, other]

doi 10.22331/q-2020-08-14-309

Universal quantum modifications to general relativistic time dilation in delocalised clocks

Authors: Shishir Khandelwal, Maximilian P. E. Lock, Mischa P. Woods

Abstract: The theory of relativity associates a proper time with each moving object via its world line. In quantum theory however, such well-defined trajectories are forbidden. After introducing a general characterisation of quantum clocks, we demonstrate that, in the weak-field, low-velocity limit, all "good" quantum clocks experience time dilation as dictated by general relativity when their state of moti… ▽ More The theory of relativity associates a proper time with each moving object via its world line. In quantum theory however, such well-defined trajectories are forbidden. After introducing a general characterisation of quantum clocks, we demonstrate that, in the weak-field, low-velocity limit, all "good" quantum clocks experience time dilation as dictated by general relativity when their state of motion is classical (i.e. Gaussian). For nonclassical states of motion, on the other hand, we find that quantum interference effects may give rise to a significant discrepancy between the proper time and the time measured by the clock. The universality of this discrepancy implies that it is not simply a systematic error, but rather a quantum modification to the proper time itself. We also show how the clock's delocalisation leads to a larger uncertainty in the time it measures -- a consequence of the unavoidable entanglement between the clock time and its center-of-mass degrees of freedom. We demonstrate how this lost precision can be recovered by performing a measurement of the clock's state of motion alongside its time reading. △ Less

Submitted 12 August, 2020; v1 submitted 3 April, 2019; originally announced April 2019.

Comments: 7 + 10 pages. V3: accepted version

Journal ref: Quantum 4, 309 (2020)

arXiv:1804.06987 [pdf, other]

Improving Distantly Supervised Relation Extraction using Word and Entity Based Attention

Authors: Sharmistha Jat, Siddhesh Khandelwal, Partha Talukdar

Abstract: Relation extraction is the problem of classifying the relationship between two entities in a given sentence. Distant Supervision (DS) is a popular technique for developing relation extractors starting with limited supervision. We note that most of the sentences in the distant supervision relation extraction setting are very long and may benefit from word attention for better sentence representatio… ▽ More Relation extraction is the problem of classifying the relationship between two entities in a given sentence. Distant Supervision (DS) is a popular technique for developing relation extractors starting with limited supervision. We note that most of the sentences in the distant supervision relation extraction setting are very long and may benefit from word attention for better sentence representation. Our contributions in this paper are threefold. Firstly, we propose two novel word attention models for distantly- supervised relation extraction: (1) a Bi-directional Gated Recurrent Unit (Bi-GRU) based word attention model (BGWA), (2) an entity-centric attention model (EA), and (3) a combination model which combines multiple complementary models using weighted voting method for improved relation extraction. Secondly, we introduce GDS, a new distant supervision dataset for relation extraction. GDS removes test data noise present in all previous distant- supervision benchmark datasets, making credible automatic evaluation possible. Thirdly, through extensive experiments on multiple real-world datasets, we demonstrate the effectiveness of the proposed methods. △ Less

Submitted 18 April, 2018; originally announced April 2018.

arXiv:1701.04600 [pdf, ps, other]

Faster K-Means Cluster Estimation

Authors: Siddhesh Khandelwal, Amit Awekar

Abstract: There has been considerable work on improving popular clustering algorithm `K-means' in terms of mean squared error (MSE) and speed, both. However, most of the k-means variants tend to compute distance of each data point to each cluster centroid for every iteration. We propose a fast heuristic to overcome this bottleneck with only marginal increase in MSE. We observe that across all iterations of… ▽ More There has been considerable work on improving popular clustering algorithm `K-means' in terms of mean squared error (MSE) and speed, both. However, most of the k-means variants tend to compute distance of each data point to each cluster centroid for every iteration. We propose a fast heuristic to overcome this bottleneck with only marginal increase in MSE. We observe that across all iterations of K-means, a data point changes its membership only among a small subset of clusters. Our heuristic predicts such clusters for each data point by looking at nearby clusters after the first iteration of k-means. We augment well known variants of k-means with our heuristic to demonstrate effectiveness of our heuristic. For various synthetic and real-world datasets, our heuristic achieves speed-up of up-to 3 times when compared to efficient variants of k-means. △ Less

Submitted 17 January, 2017; originally announced January 2017.

Comments: 6 pages, Accepted at ECIR 2017

arXiv:1207.7274 [pdf]

The Dynamics of Health Behavior Sentiments on a Large Online Social Network

Authors: Marcel Salathé, Duy Q. Vu, Shashank Khandelwal, David R. Hunter

Abstract: Modifiable health behaviors, a leading cause of illness and death in many countries, are often driven by individual beliefs and sentiments about health and disease. Individual behaviors affecting health outcomes are increasingly modulated by social networks, for example through the associations of like-minded individuals - homophily - or through peer influence effects. Using a statistical approach… ▽ More Modifiable health behaviors, a leading cause of illness and death in many countries, are often driven by individual beliefs and sentiments about health and disease. Individual behaviors affecting health outcomes are increasingly modulated by social networks, for example through the associations of like-minded individuals - homophily - or through peer influence effects. Using a statistical approach to measure the individual temporal effects of a large number of variables pertaining to social network statistics, we investigate the spread of a health sentiment towards a new vaccine on Twitter, a large online social network. We find that the effects of neighborhood size and exposure intensity are qualitatively very different depending on the type of sentiment. Generally, we find that larger numbers of opinionated neighbors inhibit the expression of sentiments. We also find that exposure to negative sentiment is contagious - by which we merely mean predictive of future negative sentiment expression - while exposure to positive sentiments is generally not. In fact, exposure to positive sentiments can even predict increased negative sentiment expression. Our results suggest that the effects of peer influence and social contagion on the dynamics of behavioral spread on social networks are strongly content-dependent. △ Less

Submitted 31 July, 2012; originally announced July 2012.

arXiv:1105.4502 [pdf]

doi 10.1371/journal.pcbi.1002199

Assessing Vaccination Sentiments with Online Social Media: Implications for Infectious Disease Dynamics and Control

Authors: Marcel Salathé, Shashank Khandelwal

Abstract: There is great interest in the dynamics of health behaviors in social networks and how they affect collective public health outcomes, but measuring population health behaviors over time and space requires substantial resources. Here, we use publicly available data from 101,853 users of online social media collected over a time period of almost six months to measure the spatio-temporal sentiment to… ▽ More There is great interest in the dynamics of health behaviors in social networks and how they affect collective public health outcomes, but measuring population health behaviors over time and space requires substantial resources. Here, we use publicly available data from 101,853 users of online social media collected over a time period of almost six months to measure the spatio-temporal sentiment towards a new vaccine. We validated our approach by identifying a strong correlation between sentiments expressed online and CDC- estimated vaccination rates by region. Analysis of the network of opinionated users showed that information flows more often between users who share the same sentiments - and less often between users who do not share the same sentiments - than expected by chance alone. We also found that most communities are dominated by either positive or negative sentiments towards the novel vaccine. Simulations of infectious disease transmission show that if clusters of negative vaccine sentiments lead to clusters of unprotected individuals, the likelihood of disease outbreaks are greatly increased. Online social media provide unprecedented access to data allowing for inexpensive and efficient tools to identify target areas for intervention efforts and to evaluate their effectiveness. △ Less

Submitted 30 July, 2011; v1 submitted 23 May, 2011; originally announced May 2011.

Comments: Accepted for publication in PLoS Computational Biology

arXiv:0808.0509 [pdf, ps, other]

doi 10.1186/1471-2105-10-405

Evolving Clustered Random Networks

Authors: Shweta Bansal, Shashank Khandelwal, Lauren Ancel Meyers

Abstract: We propose a Markov chain simulation method to generate simple connected random graphs with a specified degree sequence and level of clustering. The networks generated by our algorithm are random in all other respects and can thus serve as generic models for studying the impacts of degree distributions and clustering on dynamical processes as well as null models for detecting other structural pr… ▽ More We propose a Markov chain simulation method to generate simple connected random graphs with a specified degree sequence and level of clustering. The networks generated by our algorithm are random in all other respects and can thus serve as generic models for studying the impacts of degree distributions and clustering on dynamical processes as well as null models for detecting other structural properties in empirical networks. △ Less

Submitted 4 August, 2008; originally announced August 2008.

Journal ref: BMC Bioinformatics, Vol 10: 405, 2009

Showing 1–50 of 50 results for author: Khandelwal, S