Showing 1–50 of 119 results for author: Varshney, L R

Searching in archive cs.

  1. arXiv:2505.20841  [pdf, ps, other]

    cs.CL

    Concealment of Intent: A Game-Theoretic Analysis

    Authors: Xinbo Wu, Abhishek Umrawal, Lav R. Varshney

    Abstract: As large language models (LLMs) grow more capable, concerns about their safe deployment have also grown. Although alignment mechanisms have been introduced to deter misuse, they remain vulnerable to carefully designed adversarial prompts. In this work, we present a scalable attack strategy: intent-hiding adversarial prompting, which conceals malicious intent through the composition of skills. We d… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2504.20090  [pdf, other]

    cs.AI cs.IR cs.LG

    Spark: A System for Scientifically Creative Idea Generation

    Authors: Aishik Sanyal, Samuel Schapiro, Sumuk Shashidhar, Royce Moon, Lav R. Varshney, Dilek Hakkani-Tur

    Abstract: Recently, large language models (LLMs) have shown promising abilities to generate novel research ideas in science, a direction which coincides with many foundational principles in computational creativity (CC). In light of these developments, we present an idea generation system named Spark that couples retrieval-augmented idea generation using LLMs with a reviewer model named Judge trained on 600… ▽ More

    Submitted 21 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted at ICCC 2025

  3. arXiv:2504.19327  [pdf, other]

    cs.CV cs.AI

    Platonic Grounding for Efficient Multimodal Language Models

    Authors: Moulik Choraria, Xinbo Wu, Akhil Bhimaraju, Nitesh Sekhar, Yue Wu, Xu Zhang, Prateek Singhal, Lav R. Varshney

    Abstract: The hyperscaling of data and parameter count in Transformer-based models is yielding diminishing performance improvement, especially when weighed against training costs. Such plateauing indicates the importance of methods for more efficient finetuning and inference, while retaining similar performance. This is especially relevant for multimodal learning paradigms, where inference costs of processi… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  4. arXiv:2504.18687  [pdf, ps, other]

    cs.AI

    Transformational Creativity in Science: A Graphical Theory

    Authors: Samuel Schapiro, Jonah Black, Lav R. Varshney

    Abstract: Creative processes are typically divided into three types: combinatorial, exploratory, and transformational. Here, we provide a graphical theory of transformational scientific creativity, synthesizing Boden's insight that transformational creativity arises from changes in the "enabling constraints" of a conceptual space and Kuhn's structure of scientific revolutions as resulting from paradigm shif… ▽ More

    Submitted 20 May, 2025; v1 submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted at ICCC 2025

  5. arXiv:2502.15436  [pdf, other]

    cs.LG cs.AI cs.CL cs.DC

    Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning

    Authors: Raghav Singhal, Kaustubh Ponkshe, Rohit Vartak, Lav R. Varshney, Praneeth Vepakomma

    Abstract: Low-Rank Adaptation (LoRA) has become ubiquitous for efficiently fine-tuning foundation models. However, federated fine-tuning using LoRA is challenging due to suboptimal updates arising from traditional federated averaging of individual adapters. Existing solutions either incur prohibitively high communication cost that scales linearly with the number of clients or suffer from performance degrada… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: Raghav Singhal and Kaustubh Ponkshe contributed equally to this work

  6. arXiv:2502.05352  [pdf, other]

    cs.AI cs.DC cs.MA

    ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

    Authors: Saurabh Jha, Rohan Arora, Yuji Watanabe, Takumi Yanagawa, Yinfang Chen, Jackson Clark, Bhavya Bhavya, Mudit Verma, Harshit Kumar, Hirokuni Kitahara, Noah Zheutlin, Saki Takano, Divya Pathak, Felix George, Xinbo Wu, Bekir O. Turkkan, Gerard Vanloo, Michael Nidd, Ting Dai, Oishik Chatterjee, Pranjal Gupta, Suranjana Samanta, Pooja Aggarwal, Rong Lee, Pavankumar Murali, et al. (18 additional authors not shown)

    Abstract: Realizing the vision of using AI agents to automate critical IT tasks depends on the ability to measure and understand effectiveness of proposed solutions. We introduce ITBench, a framework that offers a systematic methodology for benchmarking AI agents to address real-world IT automation tasks. Our initial release targets three key areas: Site Reliability Engineering (SRE), Compliance and Securit… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  7. arXiv:2411.18798  [pdf, other]

    cs.CR cs.DC cs.IT eess.SY

    Formal Verification of Digital Twins with TLA and Information Leakage Control

    Authors: Luwen Huang, Lav R. Varshney, Karen E. Willcox

    Abstract: Verifying the correctness of a digital twin provides a formal guarantee that the digital twin operates as intended. Digital twin verification is challenging due to the presence of uncertainties in the virtual representation, the physical environment, and the bidirectional flow of information between physical and virtual. A further challenge is that a digital twin of a complex system is composed of… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 23 pages

  8. arXiv:2410.14665  [pdf, ps, other]

    cs.LG cs.AI

    Online Reinforcement Learning with Passive Memory

    Authors: Anay Pattanaik, Lav R. Varshney

    Abstract: This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide theoretical guarantees for regret that turns out to be near-minimax optimal. Results show that the quality of passive memory determines sub-optimality of the incurre… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

  9. arXiv:2410.01243  [pdf, other]

    cs.IT

    An Information Theory of Compute-Optimal Size Scaling, Emergence, and Plateaus in Language Models

    Authors: Anuj K. Nayak, Lav R. Varshney

    Abstract: Recent empirical studies show three phenomena with increasing size of language models: compute-optimal size scaling, emergent capabilities, and performance plateauing. We present a simple unified mathematical framework to explain all of these language model scaling phenomena, building on recent skill-text bipartite graph frameworks for semantic learning. Modeling the learning of concepts from text… ▽ More

    Submitted 15 October, 2024; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: 14 pages, 5 figures

  10. arXiv:2409.06343  [pdf, other]

    cs.IT cs.AI cs.LG

    Compute-Update Federated Learning: A Lattice Coding Approach Over-the-Air

    Authors: Seyed Mohammad Azimi-Abarghouyi, Lav R. Varshney

    Abstract: This paper introduces a federated learning framework that enables over-the-air computation via digital communications, using a new joint source-channel coding scheme. Without relying on channel state information at devices, this scheme employs lattice codes to both quantize model parameters and exploit interference from the devices. We propose a novel receiver structure at the server, designed to… ▽ More

    Submitted 5 November, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    Comments: Extended version of the preprint available at arXiv:2403.01023

  11. arXiv:2407.11780  [pdf, other]

    cs.CL cs.AI

    SwitchCIT: Switching for Continual Instruction Tuning

    Authors: Xinbo Wu, Max Hartman, Vidhata Arjun Jayaraman, Lav R. Varshney

    Abstract: Large language models (LLMs) and multimodal models (MMs) have exhibited impressive capabilities in various domains, particularly in general language understanding and visual reasoning. However, these models, trained on massive data, may not be finely optimized for specific tasks triggered by instructions. Continual instruction tuning is crucial to adapt a large model to evolving tasks and domains,… ▽ More

    Submitted 18 December, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  12. arXiv:2407.05669  [pdf, other]

    cs.SI cs.AI cs.DS stat.ML

    Fractional Budget Allocation for Influence Maximization under General Marketing Strategies

    Authors: Akhil Bhimaraju, Eliot W. Robson, Lav R. Varshney, Abhishek K. Umrawal

    Abstract: We consider the fractional influence maximization problem, i.e., identifying users on a social network to be incentivized with potentially partial discounts to maximize the influence on the network. The larger the discount given to a user, the higher the likelihood of its activation (adopting a new product or innovation), who then attempts to activate its neighboring users, causing a cascade effec… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 5 pages, 2 figures, and 1 table

    MSC Class: 05C85; 60J60; 68R05; 68R10; 68T01; 90C27; 90C35

    ACM Class: F.2.2; G.1.2; G.1.6; G.2.1; G.2.2; G.3; I.2.0; J.4

  13. arXiv:2406.05599  [pdf, other]

    quant-ph cs.IT

    Reliable Quantum Memories with Unreliable Components

    Authors: Anuj K. Nayak, Eric Chitambar, Lav R. Varshney

    Abstract: Quantum memory systems are vital in quantum information processing for dependable storage and retrieval of quantum states. Inspired by classical reliability theories that synthesize reliable computing systems from unreliable components, we formalize the problem of reliable storage of quantum information using noisy components. We introduce the notion of stable quantum memories and define the stora… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 15 pages, 3 figures

  14. arXiv:2405.03862  [pdf, other]

    cs.AI cs.CL

    Persona Inconstancy in Multi-Agent LLM Collaboration: Conformity, Confabulation, and Impersonation

    Authors: Razan Baltaji, Babak Hemmatian, Lav R. Varshney

    Abstract: Multi-agent AI systems can be used for simulating collective decision-making in scientific and practical applications. They can also be used to introduce a diverse group discussion step in chatbot pipelines, enhancing the cultural sensitivity of the chatbot's responses. These applications, however, are predicated on the ability of AI agents to reliably adopt assigned personas and mimic human inter… ▽ More

    Submitted 14 August, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 16 pages, 8 figures, 3 tables

    ACM Class: I.2.7

    Journal ref: The 2nd Workshop on Cross-Cultural Considerations in NLP (2024)

  15. arXiv:2404.03131  [pdf, other]

    cs.IT

    Semantic Compression with Information Lattice Learning

    Authors: Haizi Yu, Lav R. Varshney

    Abstract: Data-driven artificial intelligence (AI) techniques are becoming prominent for learning in support of data compression, but are focused on standard problems such as text compression. To instead address the emerging problem of semantic compression, we argue that the lattice theory of information is particularly expressive and mathematically precise in capturing notions of abstraction as a form of l… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  16. arXiv:2403.01023  [pdf, other]

    cs.IT cs.LG

    Federated Learning via Lattice Joint Source-Channel Coding

    Authors: Seyed Mohammad Azimi-Abarghouyi, Lav R. Varshney

    Abstract: This paper introduces a universal federated learning framework that enables over-the-air computation via digital communications, using a new joint source-channel coding scheme. Without relying on channel state information at devices, this scheme employs lattice codes to both quantize model parameters and exploit interference from the devices. A novel two-layer receiver structure at the server is d… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  17. arXiv:2402.12151  [pdf, other]

    cs.CL cs.AI

    Transformer-based Causal Language Models Perform Clustering

    Authors: Xinbo Wu, Lav R. Varshney

    Abstract: Even though large language models (LLMs) have demonstrated remarkable capability in solving various natural language tasks, the capability of an LLM to follow human instructions is still a concern. Recent works have shown great improvements in the instruction-following capability via additional training for instruction-following tasks. However, the mechanisms responsible for effective instruction-… ▽ More

    Submitted 3 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Added new experimental results and fixed some errors

  18. arXiv:2311.07449  [pdf, other]

    cs.CV

    Semantically Grounded QFormer for Efficient Vision Language Understanding

    Authors: Moulik Choraria, Xinbo Wu, Sourya Basu, Nitesh Sekhar, Yue Wu, Xu Zhang, Prateek Singhal, Lav R. Varshney

    Abstract: General purpose Vision Language Models (VLMs) have received tremendous interest in recent years, owing to their ability to learn rich vision-language correlations as well as their broad zero-shot competencies. One immensely popular line of work utilizes frozen unimodal models, by bridging vision representations to language using a trainable module called the QFormer. However, this method relies he… ▽ More

    Submitted 16 December, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Preprint Under Review

  19. arXiv:2310.18368  [pdf, other]

    cs.CL

    Muslim-Violence Bias Persists in Debiased GPT Models

    Authors: Babak Hemmatian, Razan Baltaji, Lav R. Varshney

    Abstract: Abid et al. (2021) showed a tendency in GPT-3 to generate mostly violent completions when prompted about Muslims, compared with other religions. Two pre-registered replication attempts found few violent completions and only a weak anti-Muslim bias in the more recent InstructGPT, fine-tuned to eliminate biased and toxic outputs. However, more pre-registered experiments showed that using common name… ▽ More

    Submitted 9 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 2 pages, 2 figures. This work will be presented at the MusIML NeurIPS workshop

    ACM Class: I.2.7

  20. arXiv:2310.09675  [pdf, other]

    cs.LG cs.AI cs.CL

    Efficient Model-Agnostic Multi-Group Equivariant Networks

    Authors: Razan Baltaji, Sourya Basu, Lav R. Varshney

    Abstract: Constructing model-agnostic group equivariant networks, such as equitune (Basu et al., 2023b) and its generalizations (Kim et al., 2023), can be computationally expensive for large product groups. We address this problem by providing efficient model-agnostic equivariant designs for two related problems: one where the network has multiple inputs each with potentially different groups acting on them… ▽ More

    Submitted 7 October, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Journal ref: Published in Transactions on Machine Learning Research (10/2024)

  21. arXiv:2310.05884  [pdf, other]

    cs.LG cs.AI cs.CL

    A Meta-Learning Perspective on Transformers for Causal Language Modeling

    Authors: Xinbo Wu, Lav R. Varshney

    Abstract: The Transformer architecture has become prominent in developing large causal language models. However, mechanisms to explain its capabilities are not well understood. Focused on the training process, here we establish a meta-learning view of the Transformer architecture when trained for the causal language modeling task, by explicating an inner optimization process within the Transformer. Further,… ▽ More

    Submitted 25 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  22. arXiv:2309.16911  [pdf, other]

    cs.DS math.OC

    Dynamic Batching of Online Arrivals to Leverage Economies of Scale

    Authors: Akhil Bhimaraju, S. Rasoul Etesami, Lav R. Varshney

    Abstract: Many settings, such as medical testing of patients in hospitals or matching riders to drivers in ride-hailing platforms, require handling arrivals over time. In such applications, it is often beneficial to group the arriving orders, samples, or requests into batches and process the larger batches rather than individual arrivals. However, waiting too long to create larger batches incurs a waiting c… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 31 pages, 14 figures

  23. On Simultaneous Information and Energy Transmission through Quantum Channels

    Authors: Bishal Kumar Das, Lav R. Varshney, Vaibhav Madhok

    Abstract: The optimal rate at which information can be sent through a quantum channel when the transmitted signal must simultaneously carry some minimum amount of energy is characterized. To do so, we introduce the quantum-classical analogue of the capacity-power function and generalize results in classical information theory for transmitting classical information through noisy channels. We show that the ca… ▽ More

    Submitted 1 January, 2025; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: 16 pages, 18 figures

    Report number: Phys. Rev. A 111, 012609

    Journal ref: Published 8 January, 2025

  24. arXiv:2307.07843  [pdf, other]

    cs.LG cs.CL

    Transformers are Universal Predictors

    Authors: Sourya Basu, Moulik Choraria, Lav R. Varshney

    Abstract: We find limits to the Transformer architecture for language modeling and show it has a universal prediction property in an information-theoretic sense. We further analyze performance in non-asymptotic data regimes to understand the role of various components of the Transformer architecture, especially in the context of data-efficient training. We validate our theoretical analysis with experiments… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

    Comments: Neural Compression Workshop (ICML 2023)

  25. arXiv:2305.09900  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    Efficient Equivariant Transfer Learning from Pretrained Models

    Authors: Sourya Basu, Pulkit Katdare, Prasanna Sattigeri, Vijil Chenthamarakshan, Katherine Driggs-Campbell, Payel Das, Lav R. Varshney

    Abstract: Efficient transfer learning algorithms are key to the success of foundation models on diverse downstream tasks even with limited data. Recent works of Basu et al. (2023) and Kaba et al. (2022) propose group averaging (equitune) and optimization-based methods, respectively, over features from group-transformed inputs to obtain equivariant outputs from non-equivariant neural networks. While Kaba et… ▽ More

    Submitted 10 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Journal ref: NeurIPS 2023

  26. arXiv:2305.08559  [pdf, other]

    cs.IT cs.LG econ.EM

    Designing Discontinuities

    Authors: Ibtihal Ferwana, Suyoung Park, Ting-Yi Wu, Lav R. Varshney

    Abstract: Discontinuities can be fairly arbitrary but also cause a significant impact on outcomes in larger systems. Indeed, their arbitrariness is why they have been used to infer causal relationships among variables in numerous settings. Regression discontinuity from econometrics assumes the existence of a discontinuous variable that splits the population into distinct partitions to estimate the causal ef… ▽ More

    Submitted 27 December, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: A short version is accepted in Neural Compression ICML Workshop July 19th, 2023

  27. arXiv:2304.13907  [pdf, other]

    cs.SI

    Network Analysis as a Tool for Shaping Conservation and Development Policy: A Case Study of Timber Market Optimization in India

    Authors: Xiou Ge, Sarah E. Brown, Pushpendra Rana, Lav R. Varshney, Daniel C. Miller

    Abstract: The incorporation of trees on farms can help to improve livelihoods and build resilience among small-holder farmers in developing countries. On-farm trees can help generate additional income from commercial tree harvest as well as contribute significant environmental benefits and ecosystem services to increase resiliency. Long-term benefits from tree-based livelihoods, however, depend on sustain… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Paper accepted to proceedings of the 5th Data for Good Exchange (D4GX)

  28. arXiv:2301.12067  [pdf, other]

    cs.LG cs.CV

    Learning Optimal Features via Partial Invariance

    Authors: Moulik Choraria, Ibtihal Ferwana, Ankur Mani, Lav R. Varshney

    Abstract: Learning models that are robust to distribution shifts is a key concern in the context of their real-life applicability. Invariant Risk Minimization (IRM) is a popular framework that aims to learn robust models from multiple environments. The success of IRM requires an important assumption: the underlying causal mechanisms/features remain invariant across environments. When not satisfied, we show… ▽ More

    Submitted 3 April, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: Presented at the 37th AAAI Conference on Artificial Intelligence, 2023

  29. arXiv:2210.06475  [pdf, other]

    cs.LG cs.CL

    Equi-Tuning: Group Equivariant Fine-Tuning of Pretrained Models

    Authors: Sourya Basu, Prasanna Sattigeri, Karthikeyan Natesan Ramamurthy, Vijil Chenthamarakshan, Kush R. Varshney, Lav R. Varshney, Payel Das

    Abstract: We introduce equi-tuning, a novel fine-tuning method that transforms (potentially non-equivariant) pretrained models into group equivariant models while incurring minimum $L_2$ loss between the feature representations of the pretrained and the equivariant models. Large pretrained models can be equi-tuned for different groups to satisfy the needs of various downstream tasks. Equi-tuned models benef… ▽ More

    Submitted 4 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Journal ref: AAAI 2023

  30. arXiv:2208.04417  [pdf]

    cs.CL cs.AI

    Debiased Large Language Models Still Associate Muslims with Uniquely Violent Acts

    Authors: Babak Hemmatian, Lav R. Varshney

    Abstract: Recent work demonstrates a bias in the GPT-3 model towards generating violent text completions when prompted about Muslims, compared with Christians and Hindus. Two pre-registered replication attempts, one exact and one approximate, found only the weakest bias in the more recent Instruct Series version of GPT-3, fine-tuned to eliminate biased and toxic outputs. Few violent completions were observe… ▽ More

    Submitted 10 August, 2022; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: 6 pages, 1 figure, 3 tables

    MSC Class: 68T50; 91F20

    ACM Class: I.2.7

  31. arXiv:2204.05397  [pdf, other]

    cs.AI cs.CY

    Accelerated Design and Deployment of Low-Carbon Concrete for Data Centers

    Authors: Xiou Ge, Richard T. Goodwin, Haizi Yu, Pablo Romero, Omar Abdelrahman, Amruta Sudhalkar, Julius Kusuma, Ryan Cialdella, Nishant Garg, Lav R. Varshney

    Abstract: Concrete is the most widely used engineered material in the world with more than 10 billion tons produced annually. Unfortunately, with that scale comes a significant burden in terms of energy, water, and release of greenhouse gases and other pollutants; indeed 8% of worldwide carbon emissions are attributed to the production of cement, a key ingredient in concrete. As such, there is interest in c… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 18 pages. arXiv admin note: text overlap with arXiv:1905.08222

  32. arXiv:2204.02586  [pdf, other]

    cs.IT

    Hypergraph-based Source Codes for Function Computation Under Maximal Distortion

    Authors: Sourya Basu, Daewon Seo, Lav R. Varshney

    Abstract: This work investigates functional source coding problems with maximal distortion, motivated by approximate function computation in many modern applications. The maximal distortion treats imprecise reconstruction of a function value as good as perfect computation if it deviates less than a tolerance level, while treating reconstruction that differs by more than that level as a failure. Using a geom… ▽ More

    Submitted 28 December, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: to appear in IEEE Journal on Selected Areas in Information Theory (JSAIT)

  33. arXiv:2201.08815  [pdf, other]

    cs.CV cs.AI

    Learning from One and Only One Shot

    Authors: Haizi Yu, Igor Mineyev, Lav R. Varshney, James A. Evans

    Abstract: Humans can generalize from only a few examples and from little pretraining on similar tasks. Yet, machine learning (ML) typically requires large data to learn or pre-learn to transfer. Motivated by nativism and artificial general intelligence, we directly model human-innate priors in abstract visual tasks such as character and doodle recognition. This yields a white-box model that learns general-a… ▽ More

    Submitted 21 May, 2024; v1 submitted 14 January, 2022; originally announced January 2022.

  34. arXiv:2112.09346  [pdf, other]

    cs.LG

    Balancing Fairness and Robustness via Partial Invariance

    Authors: Moulik Choraria, Ibtihal Ferwana, Ankur Mani, Lav R. Varshney

    Abstract: The Invariant Risk Minimization (IRM) framework aims to learn invariant features from a set of environments for solving the out-of-distribution (OOD) generalization problem. The underlying assumption is that the causal components of the data generating distributions remain constant across the environments or alternately, the data "overlaps" across environments to find meaningful invariant features… ▽ More

    Submitted 24 December, 2021; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Accepted at the Algorithmic Fairness through the Lens of Causality and Robustness (AFCR) Workshop, NeurIPS 2021

  35. arXiv:2109.01520  [pdf, other]

    cs.IT eess.SP

    Optimizing the Energy Efficiency of Unreliable Memories for Quantized Kalman Filtering

    Authors: Jonathan Kern, Elsa Dupraz, Abdeldjalil Aïssa-El-Bey, Lav R. Varshney, François Leduc-Primeau

    Abstract: This paper presents a quantized Kalman filter implemented using unreliable memories. We consider that both the quantization and the unreliable memories introduce errors in the computations, and develop an error propagation model that takes into account these two sources of errors. In addition to providing updated Kalman filter equations, the proposed error model accurately predicts the covariance… ▽ More

    Submitted 3 September, 2021; originally announced September 2021.

    Comments: 29 pages, 8 figures, Submitted to IEEE Transactions on Signal Processing

  36. arXiv:2107.09794  [pdf, other]

    cs.IT astro-ph.IM physics.pop-ph

    Limits of Detecting Extraterrestrial Civilizations

    Authors: Ian George, Xinan Chen, Lav R. Varshney

    Abstract: The search for extraterrestrial intelligence (SETI) is a scientific endeavor which struggles with unique issues -- a strong indeterminacy in what data to look for and when to do so. This has led to attempts at finding both fundamental limits of the communication between extraterrestrial intelligence and human civilizations, as well as benchmarks so as to predict what kinds of signals we might most… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: Main Text: 16 pages, 1 Figure. Comments welcome

  37. arXiv:2106.03357  [pdf, other]

    stat.ML cs.LG

    Evaluating State-of-the-Art Classification Models Against Bayes Optimality

    Authors: Ryan Theisen, Huan Wang, Lav R. Varshney, Caiming Xiong, Richard Socher

    Abstract: Evaluating the inherent difficulty of a given data-driven classification problem is important for establishing absolute benchmarks and evaluating progress in the field. To this end, a natural quantity to consider is the Bayes error, which measures the optimal classification error theoretically achievable for a given data distribution. While generally an intractable quantity, we show that we… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  38. arXiv:2104.04848  [pdf, other]

    cs.LG

    Autoequivariant Network Search via Group Decomposition

    Authors: Sourya Basu, Akshayaa Magesh, Harshit Yadav, Lav R. Varshney

    Abstract: Recent works show that group equivariance as an inductive bias improves neural network performance for both classification and generation. However, designing group-equivariant neural networks is challenging when the group of interest is large and is unknown. Moreover, inducing equivariance can significantly reduce the number of independent parameters in a network with fixed feature size, affecting… ▽ More

    Submitted 8 June, 2021; v1 submitted 10 April, 2021; originally announced April 2021.

  39. arXiv:2103.11982  [pdf, other]

    cs.NI cs.ET eess.SP

    Wireless Network Coding with Intelligent Reflecting Surfaces

    Authors: Amanat Kafizov, Ahmed Elzanaty, Lav R. Varshney, Mohamed-Slim Alouini

    Abstract: Conventional wireless techniques are becoming inadequate for beyond fifth-generation (5G) networks due to latency and bandwidth considerations. To improve the error performance and throughput of wireless communication systems, we propose physical layer network coding (PNC) in an intelligent reflecting surface (IRS)-assisted environment. We consider an IRS-aided butterfly network, where we propose… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

  40. Expected Extinction Times of Epidemics with State-Dependent Infectiousness

    Authors: Akhil Bhimaraju, Avhishek Chatterjee, Lav R. Varshney

    Abstract: We model an epidemic where the per-person infectiousness in a network of geographic localities changes with the total number of active cases. This would happen as people adopt more stringent non-pharmaceutical precautions when the population has a larger number of active cases. We show that there exists a sharp threshold such that when the curing rate for the infection is above this threshold, the… ▽ More

    Submitted 5 December, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: To appear in IEEE Transactions on Network Science and Engineering

  41. arXiv:2101.04810  [pdf, other]

    cs.IT eess.SP

    Wireless Power Transfer for Future Networks: Signal Processing, Machine Learning, Computing, and Sensing

    Authors: Bruno Clerckx, Kaibin Huang, Lav R. Varshney, Sennur Ulukus, Mohamed-Slim Alouini

    Abstract: Wireless power transfer (WPT) is an emerging paradigm that will enable using wireless to its full potential in future networks, not only to convey information but also to deliver energy. Such networks will enable trillions of future low-power devices to sense, compute, connect, and energize anywhere, anytime, and on the move. The design of such future networks brings new challenges and opportuniti… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: Overview paper submitted for publication

  42. arXiv:2012.05756  [pdf, other]

    cs.LG

    Adversarial Linear Contextual Bandits with Graph-Structured Side Observations

    Authors: Lingda Wang, Bingcong Li, Huozhi Zhou, Georgios B. Giannakis, Lav R. Varshney, Zhizhen Zhao

    Abstract: This paper studies the adversarial graphical contextual bandits, a variant of adversarial multi-armed bandits that leverage two categories of the most common side information: contexts and side observations. In this setting, a learning agent repeatedly chooses from a set of $K$ actions after being presented with a $d$-dimensional context vector. The agent not only incurs and observes… ▽ More

    Submitted 16 February, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

    Comments: fix some typos

  43. arXiv:2012.03900  [pdf, other]

    cs.LG cs.AI cs.CY cs.SI

    GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning

    Authors: Govardana Sachithanandam Ramachandran, Ivan Brugere, Lav R. Varshney, Caiming Xiong

    Abstract: Disparate access to resources by different subpopulations is a prevalent issue in societal and sociotechnical networks. For example, urban infrastructure networks may enable certain racial groups to more easily access resources such as high-quality schools, grocery stores, and polling places. Similarly, social networks within universities and organizations may enable certain groups to more easily…

    Submitted 9 April, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

  44. arXiv:2011.04069  [pdf, ps, other]

    cs.IT

    The Twelvefold Way of Non-Sequential Lossless Compression

    Authors: Taha Ameen ur Rahman, Alton S. Barbehenn, Xinan Chen, Hassan Dbouk, James A. Douglas, Yuncong Geng, Ian George, John B. Harvill, Sung Woo Jeon, Kartik K. Kansal, Kiwook Lee, Kelly A. Levick, Bochao Li, Ziyue Li, Yashaswini Murthy, Adarsh Muthuveeru-Subramaniam, S. Yagiz Olmez, Matthew J. Tomei, Tanya Veeravalli, Xuechao Wang, Eric A. Wayman, Fan Wu, Peng Xu, Shen Yan, Heling Zhang, et al. (5 additional authors not shown)

    Abstract: Many information sources are not just sequences of distinguishable symbols but rather have invariances governed by alternative counting paradigms such as permutations, combinations, and partitions. We consider an entire classification of these invariances called the twelvefold way in enumerative combinatorics and develop a method to characterize lossless compression limits. Explicit computations f…

    Submitted 20 January, 2021; v1 submitted 8 November, 2020; originally announced November 2020.

    Comments: DCC 2021

  45. arXiv:2010.07126  [pdf]

    cs.AI

    Explaining Creative Artifacts

    Authors: Lav R. Varshney, Nazneen Fatema Rajani, Richard Socher

    Abstract: Human creativity is often described as the mental process of combining associative elements into a new form, but emerging computational creativity algorithms may not operate in this manner. Here we develop an inverse problem formulation to deconstruct the products of combinatorial and compositional creativity into associative chains as a form of post-hoc interpretation that matches the human creat…

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: 2020 Workshop on Human Interpretability in Machine Learning (WHI), at ICML 2020

  46. arXiv:2010.04244  [pdf, other]

    cs.LG stat.ML

    Nonstationary Reinforcement Learning with Linear Function Approximation

    Authors: Huozhi Zhou, Jinglin Chen, Lav R. Varshney, Ashish Jagmohan

    Abstract: We consider reinforcement learning (RL) in episodic Markov decision processes (MDPs) with linear function approximation under drifting environment. Specifically, both the reward and state transition functions can evolve over time but their total variations do not exceed a $\textit{variation budget}$. We first develop $\texttt{LSVI-UCB-Restart}$ algorithm, an optimistic modification of least-square…

    Submitted 13 April, 2024; v1 submitted 8 October, 2020; originally announced October 2020.

  47. arXiv:2009.08002  [pdf]

    cs.CY cs.AI

    Planting trees at the right places: Recommending suitable sites for growing trees using algorithm fusion

    Authors: Pushpendra Rana, Lav R Varshney

    Abstract: Large-scale planting of trees has been proposed as a low-cost natural solution for carbon mitigation, but is hampered by poor selection of plantation sites, especially in developing countries. To aid in site selection, we develop the ePSA (e-Plantation Site Assistant) recommendation system based on algorithm fusion that combines physics-based/traditional forestry science knowledge with machine lea…

    Submitted 27 November, 2020; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: 26 pages, 4 figures, 2 tables, 2 supplemental tables

  48. arXiv:2009.02603  [pdf, ps, other]

    cs.CY

    Respect for Human Autonomy in Recommender Systems

    Authors: Lav R. Varshney

    Abstract: Recommender systems can influence human behavior in significant ways, in some cases making people more machine-like. In this sense, recommender systems may be deleterious to notions of human autonomy. Many ethical systems point to respect for human autonomy as a key principle arising from human rights considerations, and several emerging frameworks for AI include this principle. Yet, no specific f…

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: 2-page position paper presented at the 3rd FAccTRec Workshop on Responsible Recommendation (RecSys 2020 Workshop)

  49. arXiv:2007.14966  [pdf, other]

    cs.CL cs.IT

    Mirostat: A Neural Text Decoding Algorithm that Directly Controls Perplexity

    Authors: Sourya Basu, Govardana Sachitanandam Ramachandran, Nitish Shirish Keskar, Lav R. Varshney

    Abstract: Neural text decoding is important for generating high-quality texts using language models. To generate high-quality text, popular decoding algorithms like top-k, top-p (nucleus), and temperature-based sampling truncate or distort the unreliable low probability tail of the language model. Though these methods generate high-quality text after parameter tuning, they are ad hoc. Not much is known abou…

    Submitted 14 January, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 25 pages, 12 figures

  50. arXiv:2006.15222  [pdf, other]

    cs.CL cs.LG q-bio.BM

    BERTology Meets Biology: Interpreting Attention in Protein Language Models

    Authors: Jesse Vig, Ali Madani, Lav R. Varshney, Caiming Xiong, Richard Socher, Nazneen Fatema Rajani

    Abstract: Transformer architectures have proven to learn useful representations for protein classification and generation tasks. However, these representations present challenges in interpretability. In this work, we demonstrate a set of methods for analyzing protein Transformer models through the lens of attention. We show that attention: (1) captures the folding structure of proteins, connecting amino aci…

    Submitted 28 March, 2021; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: To appear in ICLR 2021

    ACM Class: I.2