Showing 1–50 of 76 results for author: Srivastava, P

  1. arXiv:2506.16507  [pdf, ps, other]

    cs.LG

    Robust Reward Modeling via Causal Rubrics

    Authors: Pragya Srivastava, Harman Singh, Rahul Madhavan, Gandharv Patil, Sravanti Addepalli, Arun Suggala, Rengarajan Aravamudhan, Soumya Sharma, Anirban Laha, Aravindan Raghuveer, Karthikeyan Shanmugam, Doina Precup

    Abstract: Reward models (RMs) are fundamental to aligning Large Language Models (LLMs) via human feedback, yet they often suffer from reward hacking. They tend to latch on to superficial or spurious attributes, such as response length or formatting, mistaking these cues learned from correlations in training data for the true causal drivers of quality (e.g., factuality, relevance). This occurs because standa…

    Submitted 19 June, 2025; originally announced June 2025.

  2. arXiv:2506.01147  [pdf, other]

    cs.CL cs.LG

    A Word is Worth 4-bit: Efficient Log Parsing with Binary Coded Decimal Recognition

    Authors: Prerak Srivastava, Giulio Corallo, Sergey Rybalko

    Abstract: System-generated logs are typically converted into categorical log templates through parsing. These templates are crucial for generating actionable insights in various downstream tasks. However, existing parsers often fail to capture fine-grained template details, leading to suboptimal accuracy and reduced utility in downstream tasks requiring precise pattern identification. We propose a character…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: Pre-print of our accepted paper at IEEE International Conference on Web Services (ICWS 2025). 4 pages, 2 figures

  3. arXiv:2505.23788  [pdf, ps, other]

    cs.CL cs.AI

    Nine Ways to Break Copyright Law and Why Our LLM Won't: A Fair Use Aligned Generation Framework

    Authors: Aakash Sen Sharma, Debdeep Sanyal, Priyansh Srivastava, Sundar Atreya H., Shirish Karande, Mohan Kankanhalli, Murari Mandal

    Abstract: Large language models (LLMs) commonly risk copyright infringement by reproducing protected content verbatim or with insufficient transformative modifications, posing significant ethical, legal, and practical concerns. Current inference-time safeguards predominantly rely on restrictive refusal-based filters, often compromising the practical utility of these models. To address this, we collaborated…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 30 Pages

  4. arXiv:2505.13353  [pdf, other]

    cs.CL cs.LG cs.SE

    Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning

    Authors: Adam Štorek, Mukur Gupta, Samira Hajizadeh, Prashast Srivastava, Suman Jana

    Abstract: Although modern Large Language Models (LLMs) support extremely large contexts, their effectiveness in utilizing long context for code reasoning remains unclear. This paper investigates LLM reasoning ability over code snippets within large repositories and how it relates to their recall ability. Specifically, we differentiate between lexical code recall (verbatim retrieval) and semantic code recall…

    Submitted 20 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

  5. arXiv:2504.09388  [pdf, other]

    cs.IT cs.CC cs.DM

    The Rate-Immediacy Barrier in Explicit Tree Code Constructions

    Authors: Gil Cohen, Leonard J. Schulman, Piyush Srivastava

    Abstract: Since the introduction of tree codes by Schulman (STOC 1993), explicit construction of such codes has remained a notorious challenge. While the construction of asymptotically-good explicit tree codes continues to be elusive, a work by Cohen, Haeupler and Schulman (STOC 2018), as well as the state-of-the-art construction by Ben Yaacov, Cohen, and Yankovitz (STOC 2022) have achieved codes with rate…

    Submitted 12 April, 2025; originally announced April 2025.

  6. arXiv:2503.14281  [pdf, other]

    cs.CR cs.LG cs.SE

    XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants

    Authors: Adam Štorek, Mukur Gupta, Noopur Bhatt, Aditya Gupta, Janie Kim, Prashast Srivastava, Suman Jana

    Abstract: AI coding assistants are widely used for tasks like code generation. These tools now require large and complex contexts, automatically sourced from various origins (across files, projects, and contributors), forming part of the prompt fed to underlying LLMs. This automatic context-gathering introduces new vulnerabilities, allowing attackers to subtly poison input to co…

    Submitted 20 May, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  7. arXiv:2503.11099  [pdf, other]

    cs.DS cs.LG math.PR

    Approximating the Total Variation Distance between Gaussians

    Authors: Arnab Bhattacharyya, Weiming Feng, Piyush Srivastava

    Abstract: The total variation distance is a metric of central importance in statistics and probability theory. However, somewhat surprisingly, questions about computing it algorithmically appear not to have been systematically studied until very recently. In this paper, we contribute to this line of work by studying this question in the important special case of multivariate Gaussians. More formally, we con…

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: Accepted by AISTATS 2025

  8. arXiv:2503.06459  [pdf, other]

    math.CO cs.DS

    Deterministically approximating the volume of a Kostka polytope

    Authors: Hariharan Narayanan, Piyush Srivastava

    Abstract: Polynomial-time deterministic approximation of volumes of polytopes, up to an approximation factor that grows at most sub-exponentially with the dimension, remains an open problem. Recent work on this question has focused on identifying interesting classes of polytopes for which such approximation algorithms can be obtained. In this paper, we focus on one such class of polytopes: the Kostka polyto…

    Submitted 5 April, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

    Comments: Added further discussion

  9. Autotelic Reinforcement Learning: Exploring Intrinsic Motivations for Skill Acquisition in Open-Ended Environments

    Authors: Prakhar Srivastava, Jasmeet Singh

    Abstract: This paper presents a comprehensive overview of autotelic Reinforcement Learning (RL), emphasizing the role of intrinsic motivations in the open-ended formation of skill repertoires. We delineate the distinctions between knowledge-based and competence-based intrinsic motivations, illustrating how these concepts inform the development of autonomous agents capable of generating and pursuing self-def…

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 12 pages, 12 figures

    Journal ref: International Journal of Computer Trends and Technology, vol. 73, 2025

  10. arXiv:2501.16039  [pdf, ps, other]

    cs.DS cs.CC math.GR

    Complexity of Minimal Faithful Permutation Degree for Fitting-free Groups

    Authors: Michael Levet, Pranjal Srivastava, Dhara Thakkar

    Abstract: In this paper, we investigate the complexity of computing the minimal faithful permutation degree for groups without abelian normal subgroups. When our groups are given as quotients of permutation groups, we establish that this problem is in $\textsf{P}$. Furthermore, in the setting of permutation groups, we obtain an upper bound of $\textsf{NC}$ for this problem. This improves upon the work of Da…

    Submitted 28 April, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

  11. arXiv:2501.02895  [pdf]

    eess.IV cs.CV

    Region of Interest based Medical Image Compression

    Authors: Utkarsh Prakash Srivastava, Toshiaki Fujii

    Abstract: The vast volume of medical image data necessitates efficient compression techniques to support remote healthcare services. This paper explores Region of Interest (ROI) coding to address the balance between compression rate and image quality. By leveraging UNET segmentation on the Brats 2020 dataset, we accurately identify tumor regions, which are critical for diagnosis. These regions are then subj…

    Submitted 6 January, 2025; originally announced January 2025.

    Comments: 8 pages, 7 figures

  12. arXiv:2501.00658  [pdf, other]

    cs.LG

    Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

    Authors: Peihao Wang, Ruisi Cai, Yuehao Wang, Jiajun Zhu, Pragya Srivastava, Zhangyang Wang, Pan Li

    Abstract: Structured State Space Models (SSMs) have emerged as alternatives to transformers. While SSMs are often regarded as effective in capturing long-sequence dependencies, we rigorously demonstrate that they are inherently limited by strong recency bias. Our empirical studies also reveal that this bias impairs the models' ability to recall distant information and introduces robustness issues. Our scali…

    Submitted 10 March, 2025; v1 submitted 31 December, 2024; originally announced January 2025.

    Comments: International Conference on Learning Representations (ICLR), 2025

  13. arXiv:2412.20213  [pdf, other]

    cs.CL cs.AI

    Decoding Emotion: Speech Perception Patterns in Individuals with Self-reported Depression

    Authors: Guneesh Vats, Priyanka Srivastava, Chiranjeevi Yarra

    Abstract: The current study examines the relationship between self-reported depression and the perception of affective speech within the Indian population. PANAS and PHQ-9 were used to assess current mood and depression, respectively. Participants' emotional reactivity was recorded on a valence and arousal scale against the affective speech audio presented in a sequence. No significant differences between t…

    Submitted 28 December, 2024; originally announced December 2024.

  14. arXiv:2411.04718  [pdf, ps, other]

    cs.DS

    Approximate counting of permutation patterns

    Authors: Omri Ben-Eliezer, Slobodan Mitrović, Pranjal Srivastava

    Abstract: We consider the problem of counting the copies of a length-$k$ pattern $σ$ in a sequence $f \colon [n] \to \mathbb{R}$, where a copy is a subset of indices $i_1 < \ldots < i_k \in [n]$ such that $f(i_j) < f(i_\ell)$ if and only if $σ(j) < σ(\ell)$. This problem is motivated by a range of connections and applications in ranking, nonparametric statistics, combinatorics, and fine-grained complexity,…

    Submitted 7 November, 2024; originally announced November 2024.

  15. arXiv:2410.11105  [pdf, other]

    astro-ph.SR astro-ph.GA astro-ph.IM cs.LG

    Emulators for stellar profiles in binary population modeling

    Authors: Elizabeth Teng, Ugur Demir, Zoheyr Doctor, Philipp M. Srivastava, Shamal Lalvani, Vicky Kalogera, Aggelos Katsaggelos, Jeff J. Andrews, Simone S. Bavera, Max M. Briel, Seth Gossage, Konstantinos Kovlakas, Matthias U. Kruckow, Kyle Akira Rocha, Meng Sun, Zepei Xing, Emmanouil Zapartas

    Abstract: Knowledge about the internal physical structure of stars is crucial to understanding their evolution. The novel binary population synthesis code POSYDON includes a module for interpolating the stellar and binary properties of any system at the end of binary MESA evolution based on a pre-computed set of models. In this work, we present a new emulation method for predicting stellar profiles, i.e., t…

    Submitted 11 February, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

    Comments: 12 pages, 10 figures. Accepted for publication by Astronomy and Computing

  16. arXiv:2409.14769  [pdf, other]

    cs.CL

    Language-Agnostic Analysis of Speech Depression Detection

    Authors: Sona Binu, Jismi Jose, Fathima Shimna K V, Alino Luke Hans, Reni K. Cherian, Starlet Ben Alex, Priyanka Srivastava, Chiranjeevi Yarra

    Abstract: The people with Major Depressive Disorder (MDD) exhibit the symptoms of tonal variations in their speech compared to the healthy counterparts. However, these tonal variations not only confine to the state of MDD but also on the language, which has unique tonal patterns. This work analyzes automatic speech-based depression detection across two languages, English and Malayalam, which exhibits distin…

    Submitted 23 September, 2024; originally announced September 2024.

  17. arXiv:2406.14861  [pdf, other]

    eess.SY cs.ET

    Resilience of the Electric Grid through Trustable IoT-Coordinated Assets (Extended version)

    Authors: Vineet J. Nair, Venkatesh Venkataramanan, Priyank Srivastava, Partha S. Sarker, Anurag Srivastava, Laurentiu D. Marinovici, Jun Zha, Christopher Irwin, Prateek Mittal, John Williams, Jayant Kumar, H. Vincent Poor, Anuradha M. Annaswamy

    Abstract: The electricity grid has evolved from a physical system to a cyber-physical system with digital devices that perform measurement, control, communication, computation, and actuation. The increased penetration of distributed energy resources (DERs) including renewable generation, flexible loads, and storage provides extraordinary opportunities for improvements in efficiency and sustainability. Howev…

    Submitted 30 January, 2025; v1 submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted to the Proceedings of the National Academy of Sciences (PNAS) 2025. Extended version with supplementary information included

  18. arXiv:2406.04517  [pdf, other]

    cs.CR

    FOX: Coverage-guided Fuzzing as Online Stochastic Control

    Authors: Dongdong She, Adam Storek, Yuchong Xie, Seoyoung Kweon, Prashast Srivastava, Suman Jana

    Abstract: Fuzzing is an effective technique for discovering software vulnerabilities by generating random test inputs and executing them against the target program. However, fuzzing large and complex programs remains challenging due to difficulties in uncovering deeply hidden vulnerabilities. This paper addresses the limitations of existing coverage-guided fuzzers, focusing on the scheduler and mutator comp…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: To Appear in Proceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security (CCS '24)

  19. arXiv:2405.16639  [pdf, other]

    cs.LG

    A direct proof of a unified law of robustness for Bregman divergence losses

    Authors: Santanu Das, Jatin Batra, Piyush Srivastava

    Abstract: In contemporary deep learning practice, models are often trained to near zero loss i.e. to nearly interpolate the training data. However, the number of parameters in the model is usually far more than the number of data points n, the theoretical minimum needed for interpolation: a phenomenon referred to as overparameterization. In an interesting piece of work, Bubeck and Sellke considered a natura…

    Submitted 21 April, 2025; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 18 pages; fixed a typo in a citation

  20. A New Construction of Optimal Symmetrical ZCCS

    Authors: Rajen Kumar, Prashant Kumar Srivastava, Sudhan Majhi

    Abstract: We propose new constructions for a two-dimensional ($2$D) perfect array, complete complementary code (CCC), and multiple CCCs as an optimal symmetrical $Z$-complementary code set (ZCCS). We propose a method to generate a two-dimensional perfect array and CCC. By utilising mutually orthogonal sequences, we developed a method to extend the length of a CCC without affecting the set or code size. Addi…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted in 'IEEE International Symposium on Information Theory (ISIT 2024)'

  21. arXiv:2403.10644  [pdf, other]

    cs.IT

    Multiple Spectrally Null Constrained Complete Complementary Codes of Various Lengths Over Small Alphabet

    Authors: Rajen Kumar, Palash Sarkar, Prashant Kumar Srivastava, Sudhan Majhi

    Abstract: Complete complementary codes (CCCs) are highly valuable in the fields of information security, radar and communication. The spectrally null constrained (SNC) problem arises in radar and modern communication systems due to the reservation or prohibition of specific spectrums from transmission. The literature on SNC-CCCs is somewhat limited in comparison to the literature on traditional CCCs. The ma…

    Submitted 11 October, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: This paper has been presented in "SEquences and Their Applications (SETA) 2024"

  22. arXiv:2402.11194  [pdf, other]

    cs.CL

    Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering

    Authors: Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth

    Abstract: Large Language Models (LLMs), excel in natural language understanding, but their capability for complex mathematical reasoning with an amalgamation of structured tables and unstructured text is uncertain. This study explores LLMs' mathematical reasoning on four financial tabular question-answering datasets: TATQA, FinQA, ConvFinQA, and Multihiertt. Through extensive experiments with various models…

    Submitted 29 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 25 pages, 17 figures

  23. arXiv:2402.06733  [pdf, other]

    cs.CL cs.AI cs.LG

    NICE: To Optimize In-Context Examples or Not?

    Authors: Pragya Srivastava, Satvik Golechha, Amit Deshpande, Amit Sharma

    Abstract: Recent work shows that in-context learning and optimization of in-context examples (ICE) can significantly improve the accuracy of large language models (LLMs) on a wide range of tasks, leading to an apparent consensus that ICE optimization is crucial for better performance. However, most of these studies assume a fixed or no instruction provided in the prompt. We challenge this consensus by inves…

    Submitted 6 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted as a full paper (9 pages) at ACL 2024 (Main)

    Journal ref: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics 2024 (Volume 1: Long Papers)

  24. arXiv:2402.06086  [pdf, other]

    cs.DC cs.AI cs.DS

    Rhizomes and Diffusions for Processing Highly Skewed Graphs on Fine-Grain Message-Driven Systems

    Authors: Bibrak Qamar Chandio, Prateek Srivastava, Maciej Brodowicz, Martin Swany, Thomas Sterling

    Abstract: The paper provides a unified co-design of 1) a programming and execution model that allows spawning tasks from within the vertex data at runtime, 2) language constructs for \textit{actions} that send work to where the data resides, combining parallel expressiveness of local control objects (LCOs) to implement asynchronous graph processing primitives, 3) and an innovative vertex-centric data-struct…

    Submitted 7 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.02576

    ACM Class: C.1.4; C.3; C.4; D.1.3

  25. arXiv:2402.03702  [pdf, ps, other]

    cs.IT cs.NI

    On Learning Spatial Provenance in Privacy-Constrained Wireless Networks

    Authors: Manish Bansal, Pramsu Srivastava, J. Harshan

    Abstract: In Vehicle-to-Everything networks that involve multi-hop communication, the Road Side Units (RSUs) typically aim to collect location information from the participating vehicles to provide security and network diagnostics features. While the vehicles commonly use the Global Positioning System (GPS) for navigation, they may refrain from sharing their precise GPS coordinates with the RSUs due to priv…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: To be presented in IEEE WCNC 2024

  26. arXiv:2312.06071  [pdf, other]

    cs.CV cs.LG physics.ao-ph stat.ML

    Precipitation Downscaling with Spatiotemporal Video Diffusion

    Authors: Prakhar Srivastava, Ruihan Yang, Gavin Kerrigan, Gideon Dresdner, Jeremy McGibbon, Christopher Bretherton, Stephan Mandt

    Abstract: In climate science and meteorology, high-resolution local precipitation (rain and snowfall) predictions are limited by the computational costs of simulation-based methods. Statistical downscaling, or super-resolution, is a common workaround where a low-resolution prediction is improved using statistical approaches. Unlike traditional computer vision tasks, weather and climate applications require…

    Submitted 20 June, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  27. arXiv:2311.02103  [pdf, other]

    cs.LG cs.AI cs.PL

    Relax: Composable Abstractions for End-to-End Dynamic Machine Learning

    Authors: Ruihang Lai, Junru Shao, Siyuan Feng, Steven S. Lyubomirsky, Bohan Hou, Wuwei Lin, Zihao Ye, Hongyi Jin, Yuchen Jin, Jiawei Liu, Lesheng Jin, Yaxing Cai, Ziheng Jiang, Yong Wu, Sunghyun Park, Prakalp Srivastava, Jared G. Roesch, Todd C. Mowry, Tianqi Chen

    Abstract: Dynamic shape computations have become critical in modern machine learning workloads, especially in emerging large language models. The success of these models has driven the demand for their universal deployment across a diverse set of backend environments. In this paper, we present Relax, a compiler abstraction for optimizing end-to-end dynamic machine learning workloads. Relax introduces a cros…

    Submitted 6 February, 2025; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: To appear at ASPLOS 2025 (16 pages, 20 figures)

  28. arXiv:2310.10691  [pdf, other]

    cs.LG cs.AR

    Enhancing ML model accuracy for Digital VLSI circuits using diffusion models: A study on synthetic data generation

    Authors: Prasha Srivastava, Pawan Kumar, Zia Abbas

    Abstract: Generative AI has seen remarkable growth over the past few years, with diffusion models being state-of-the-art for image generation. This study investigates the use of diffusion models in generating artificial data generation for electronic circuits for enhancing the accuracy of subsequent machine learning models in tasks such as performance assessment, design, and testing when training data is us…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 7 pages, submitted to NeurIPS workshop 2023

  29. arXiv:2307.10200  [pdf, other]

    cs.CY cs.AI cs.CL cs.LG

    Disentangling Societal Inequality from Model Biases: Gender Inequality in Divorce Court Proceedings

    Authors: Sujan Dutta, Parth Srivastava, Vaishnavi Solunke, Swaprava Nath, Ashiqur R. KhudaBukhsh

    Abstract: Divorce is the legal dissolution of a marriage by a court. Since this is usually an unpleasant outcome of a marital union, each party may have reasons to call the decision to quit which is generally documented in detail in the court proceedings. Via a substantial corpus of 17,306 court proceedings, this paper investigates gender inequality through the lens of divorce court proceedings. While emerg…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: This paper is accepted at IJCAI 2023 (AI for good track)

  30. arXiv:2305.04433  [pdf, other]

    math.OC cs.LG

    Accelerated Algorithms for a Class of Optimization Problems with Equality and Box Constraints

    Authors: Anjali Parashar, Priyank Srivastava, Anuradha M. Annaswamy

    Abstract: Convex optimization with equality and inequality constraints is a ubiquitous problem in several optimization and control problems in large-scale systems. Recently there has been a lot of interest in establishing accelerated convergence of the loss function. A class of high-order tuners was recently proposed in an effort to lead to accelerated convergence for the case when no constraints are pres…

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: 6 pages, accepted in ACC 2023 (American Control Conference, 2023)

  31. arXiv:2305.01290  [pdf, other]

    cs.IT

    A Construction of Arbitrarily Large Type-II $Z$ Complementary Code Set

    Authors: Rajen Kumar, Prashant Kumar Srivastava, Sudhan Majhi

    Abstract: For a type-I $(K,M,Z,N)$-ZCCS, it follows $K \leq M \left\lfloor \frac{N}{Z}\right\rfloor$. In this paper, we propose a construction of type-II $(p^{k+n},p^k,p^{n+r}-p^r+1,p^{n+r})$-$Z$ complementary code set (ZCCS) using an extended Boolean function, its properties of Hamiltonian paths and the concept of isolated vertices, where $p\ge 2$. However, the proposed type-II ZCCS provides…

    Submitted 14 May, 2024; v1 submitted 2 May, 2023; originally announced May 2023.

  32. arXiv:2302.13906  [pdf]

    cs.CL cs.AI

    Argument Mining using BERT and Self-Attention based Embeddings

    Authors: Pranjal Srivastava, Pranav Bhatnagar, Anurag Goel

    Abstract: Argument mining automatically identifies and extracts the structure of inference and reasoning conveyed in natural language arguments. To the best of our knowledge, most of the state-of-the-art works in this field have focused on using tree-like structures and linguistic modeling. But, these approaches are not able to model more complex structures which are often found in online forums and real wo…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N)

  33. arXiv:2302.07566  [pdf, other]

    cs.LG

    Qualitative Data Augmentation for Performance Prediction in VLSI circuits

    Authors: Prasha Srivastava, Pawan Kumar, Zia Abbas

    Abstract: Various studies have shown the advantages of using Machine Learning (ML) techniques for analog and digital IC design automation and optimization. Data scarcity is still an issue for electronic designs, while training highly accurate ML models. This work proposes generating and evaluating artificial data using generative adversarial networks (GANs) for circuit data to aid and improve the accuracy o…

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 14 pages, 13 figures

  34. arXiv:2301.08695  [pdf, other]

    cs.DC cs.LG

    Baechi: Fast Device Placement of Machine Learning Graphs

    Authors: Beomyeol Jeon, Linda Cai, Chirag Shetty, Pallavi Srivastava, Jintao Jiang, Xiaolan Ke, Yitao Meng, Cong Xie, Indranil Gupta

    Abstract: Machine Learning graphs (or models) can be challenging or impossible to train when either devices have limited memory, or models are large. To split the model across devices, learning-based approaches are still popular. While these result in model placements that train fast on data (i.e., low step times), learning-based model-parallelism is time-consuming, taking many hours or days to create a pla…

    Submitted 20 January, 2023; originally announced January 2023.

    Comments: Extended version of SoCC 2020 paper: https://dl.acm.org/doi/10.1145/3419111.3421302

  35. arXiv:2211.16958  [pdf, ps, other]

    cs.SD eess.AS

    How to (virtually) train your speaker localizer

    Authors: Prerak Srivastava, Antoine Deleforge, Archontis Politis, Emmanuel Vincent

    Abstract: Learning-based methods have become ubiquitous in speaker localization. Existing systems rely on simulated training sets for the lack of sufficiently large, diverse and annotated real datasets. Most room acoustics simulators used for this purpose rely on the image source method (ISM) because of its computational efficiency. This paper argues that carefully extending the ISM to incorporate more real…

    Submitted 25 May, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Published in INTERSPEECH 2023

  36. arXiv:2211.04439  [pdf, other]

    cs.DS cs.CG math.PR

    Sampling from convex sets with a cold start using multiscale decompositions

    Authors: Hariharan Narayanan, Amit Rajaraman, Piyush Srivastava

    Abstract: Running a random walk in a convex body $K\subseteq\mathbb{R}^n$ is a standard approach to sample approximately uniformly from the body. The requirement is that from a suitable initial distribution, the distribution of the walk comes close to the uniform distribution $π_K$ on $K$ after a number of steps polynomial in $n$ and the aspect ratio $R/r$ (i.e., when $rB_2 \subseteq K \subseteq RB_{2}$).…

    Submitted 22 November, 2024; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Changes from v3: Added further discussion/details, and fixed some typos. This version should be close to the final version

    Journal ref: Probab. Theory Relat. Fields (December 2024, online)

  37. arXiv:2210.17284  [pdf, other]

    cs.LG

    Towards Zero-Shot and Few-Shot Table Question Answering using GPT-3

    Authors: Pragya Srivastava, Tanuja Ganu, Saikat Guha

    Abstract: We present very early results on using GPT-3 to perform question answering on tabular data. We find that stock pre-trained GPT-3 is able to zero-shot learn the table structure from a serialized JSON array-of-arrays representation, and able to answer lookup queries and simple comparison questions in natural language without any fine-tuning. We further find that simple prompt engineering to include…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 7 pages

    MSC Class: 14J60 (Primary)

  38. arXiv:2207.09133  [pdf, other]

    cs.SD eess.AS

    Realistic sources, receivers and walls improve the generalisability of virtually-supervised blind acoustic parameter estimators

    Authors: Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent

    Abstract: Blind acoustic parameter estimation consists in inferring the acoustic properties of an environment from recordings of unknown sound sources. Recent works in this area have utilized deep neural networks trained either partially or exclusively on simulated data, due to the limited availability of real annotated measurements. In this paper, we study whether a model purely trained using a fast image-…

    Submitted 19 July, 2022; originally announced July 2022.

  39. A Construction of Type-II ZCCS for the MC-CDMA System with Low PMEPR

    Authors: Rajen Kumar, Sushant Kumar Jha, Prashant Kumar Srivastava, Sudhan Majhi

    Abstract: In this letter, we propose a novel construction of type-II $Z$-complementary code set (ZCCS) having arbitrary sequence length using the Kronecker product between a complete complementary code (CCC) and mutually orthogonal uni-modular sequences. In this construction, Barker sequences are used to reduce row sequence peak-to-mean envelope power ratio (PMEPR) for some specific lengths sequence and col…

    Submitted 22 August, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

  40. arXiv:2206.09812  [pdf, other]

    cs.LG

    ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets

    Authors: Kristian Schultz, Saptarshi Bej, Waldemar Hahn, Markus Wolfien, Prashant Srivastava, Olaf Wolkenhauer

    Abstract: Data is commonly stored in tabular format. Several fields of research are prone to small imbalanced tabular data. Supervised Machine Learning on such data is often difficult due to class imbalance. Synthetic data generation, i.e., oversampling, is a common remedy used to improve classifier performance. State-of-the-art linear interpolation approaches, such as LoRAS and ProWRAS can be used to gener…

    Submitted 13 July, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

  41. arXiv:2204.04868  [pdf, other]

    cs.DM cs.DS math-ph math.CO

    On complex roots of the independence polynomial

    Authors: Ferenc Bencs, Péter Csikvári, Piyush Srivastava, Jan Vondrák

    Abstract: It is known from the work of Shearer (1985) (and also Scott and Sokal (2005)) that the independence polynomial $Z_G(λ)$ of a graph $G$ of maximum degree at most $d+1$ does not vanish provided that $\vertλ\vert \leq \frac{d^d}{(d+1)^{d+1}}$. Significant extensions of this result have recently been given in the case $\Re λ\geq 0$ by Peters and Regts (2019) and Bencs and Csikvári (arxiv:1807.08963).…

    Submitted 13 November, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Extended version, to appear in proceedings of SODA 2023

  42. arXiv:2203.09481

    cs.CV cs.LG stat.ML

    Diffusion Probabilistic Modeling for Video Generation

    Authors: Ruihan Yang, Prakhar Srivastava, Stephan Mandt

    Abstract: Denoising diffusion probabilistic models are a promising new class of generative models that mark a milestone in high-quality image generation. This paper showcases their ability to sequentially generate video, surpassing prior methods in perceptual and probabilistic forecasting metrics. We propose an autoregressive, end-to-end optimized video diffusion model inspired by recent advances in neural…

    Submitted 7 December, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

  43. arXiv:2202.09418   

    cs.DC

    Uniting Control and Data Parallelism: Towards Scalable Memory-Driven Dynamic Graph Processing

    Authors: Bibrak Qamar Chandio, Thomas Sterling, Prateek Srivastava

    Abstract: Control parallelism and data parallelism are mostly reasoned about and optimized as separate functions. Because of this, workloads that are irregular, fine-grained and dynamic, such as dynamic graph processing, become very hard to scale. An experimental research approach to computer architecture that synthesizes prior techniques of parallel computing along with new innovations is proposed in this paper. We e…

    Submitted 7 March, 2023; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: The paper did not publish and we are working on a new paper that is very different than this one but contains some information that is in this paper

  44. arXiv:2202.00471   

    cs.CL cs.CY

    Causal effect of racial bias in data and machine learning algorithms on user persuasiveness & discriminatory decision making: An Empirical Study

    Authors: Kinshuk Sengupta, Praveen Ranjan Srivastava

    Abstract: Language data and models demonstrate various types of bias, be it ethnic, religious, gender, or socioeconomic. When trained on a racially biased dataset, AI/NLP models instigate poor model explainability, influence user experience during decision making and thus further magnify societal biases, raising profound ethical implications for society. The motivation of the study is to…

    Submitted 25 November, 2022; v1 submitted 22 January, 2022; originally announced February 2022.

    Comments: Fresh experiments need to be added to the design of experiments

  45. arXiv:2112.09047

    physics.soc-ph cs.DL

    Citation inequity and gendered citation practices in contemporary physics

    Authors: Erin G. Teich, Jason Z. Kim, Christopher W. Lynn, Samantha C. Simon, Andrei A. Klishin, Karol P. Szymula, Pragya Srivastava, Lee C. Bassett, Perry Zurn, Jordan D. Dworkin, Dani S. Bassett

    Abstract: The historical and contemporary under-attribution of women's contributions to scientific scholarship is well-known and well-studied, with effects that are felt today in myriad ways by women scientists. One measure of this under-attribution is the so-called citation gap between men and women: the under-citation of papers authored by women relative to expected rates coupled with a corresponding over…

    Submitted 16 December, 2021; originally announced December 2021.

  46. HRNET: AI on Edge for mask detection and social distancing

    Authors: Kinshuk Sengupta, Praveen Ranjan Srivastava

    Abstract: The purpose of the paper is to provide an innovative emerging technology framework for the community to combat epidemic situations. The paper proposes a unique outbreak response system framework based on artificial intelligence and edge computing for citizen-centric services to help track and trace people eluding safety policies like mask detection and social distancing measures in public or workplace set…

    Submitted 3 February, 2022; v1 submitted 30 November, 2021; originally announced November 2021.

    Report number: Volume 3, Issue 2, March 2022

    Journal ref: SN Computer Science, 2022

  47. arXiv:2111.05070

    cs.LG cs.AI cs.DM stat.ME stat.ML

    Universal Lower Bound for Learning Causal DAGs with Atomic Interventions

    Authors: Vibhor Porwal, Piyush Srivastava, Gaurav Sinha

    Abstract: A well-studied challenge that arises in the structure learning problem of causal directed acyclic graphs (DAG) is that using observational data, one can only learn the graph up to a "Markov equivalence class" (MEC). The remaining undirected edges have to be oriented using interventions, which can be very expensive to perform in applications. Thus, the problem of minimizing the number of interventi…

    Submitted 19 May, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Extended version of AISTATS 2022 paper. Added results for multi-node interventions, and shortened title

  48. arXiv:2110.03785

    cs.LG

    Addressing practical challenges in Active Learning via a hybrid query strategy

    Authors: Deepesh Agarwal, Pravesh Srivastava, Sergio Martin-del-Campo, Balasubramaniam Natarajan, Babji Srinivasan

    Abstract: Active Learning (AL) is a powerful tool to address modern machine learning problems with significantly fewer labeled training instances. However, implementation of traditional AL methodologies in practical scenarios is accompanied by multiple challenges due to the inherent assumptions. There are several hindrances, such as unavailability of labels for the AL algorithm at the beginning; unreliable…

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 15 pages, 4 figures, 6 tables

  49. arXiv:2107.13832

    cs.SD cs.LG eess.AS

    Blind Room Parameter Estimation Using Multiple-Multichannel Speech Recordings

    Authors: Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent

    Abstract: Knowing the geometrical and acoustical parameters of a room may benefit applications such as audio augmented reality, speech dereverberation or audio forensics. In this paper, we study the problem of jointly estimating the total surface area, the volume, as well as the frequency-dependent reverberation time and mean surface absorption of a room in a blind fashion, based on two-channel noisy speech…

    Submitted 29 July, 2021; originally announced July 2021.

    Comments: Accepted in WASPAA 2021 (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics)

  50. arXiv:2107.11066

    cs.SD eess.AS

    SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain

    Authors: Pierre-Amaury Grumiaux, Srdan Kitic, Prerak Srivastava, Laurent Girin, Alexandre Guérin

    Abstract: In this work, we propose a novel self-attention based neural network for robust multi-speaker localization from Ambisonics recordings. Starting from a state-of-the-art convolutional recurrent neural network, we investigate the benefit of replacing the recurrent layers by self-attention encoders, inherited from the Transformer architecture. We evaluate these models on synthetic and real-world data,…

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: Accepted to Workshop on Applications of Signal Processing to Audio and Acoustics