Skip to main content

Showing 1–50 of 147 results for author: Pranjal

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3264 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 11 July, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  2. arXiv:2507.05707  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Agentic-R1: Distilled Dual-Strategy Reasoning

    Authors: Weihua Du, Pranjal Aggarwal, Sean Welleck, Yiming Yang

    Abstract: Current long chain-of-thought (long-CoT) models excel at mathematical reasoning but rely on slow and error-prone natural language traces. Tool-augmented agents address arithmetic via code execution, but often falter on complex logical tasks. We introduce a fine-tuning framework, DualDistill, that distills complementary reasoning strategies from multiple teachers into a unified student model. Using… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

    Comments: Preprint. 15 pages. Project available at https://github.com/StigLidu/DualDistill

  3. arXiv:2506.12425  [pdf, ps, other

    cs.DC cs.LG

    Optimizing Federated Learning using Remote Embeddings for Graph Neural Networks

    Authors: Pranjal Naman, Yogesh Simmhan

    Abstract: Graph Neural Networks (GNNs) have experienced rapid advancements in recent years due to their ability to learn meaningful representations from graph data structures. Federated Learning (FL) has emerged as a viable machine learning approach for training a shared model on decentralized data, addressing privacy concerns while leveraging parallelism. Existing methods that address the unique requiremen… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: Preprint of paper in the proceedings of the 30th International European Conference on Parallel and Distributed Computing (Euro-Par)

  4. arXiv:2505.19964  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.GT

    The Limits of Preference Data for Post-Training

    Authors: Eric Zhao, Jessica Dai, Pranjal Awasthi

    Abstract: Recent progress in strengthening the capabilities of large language models has stemmed from applying reinforcement learning to domains with automatically verifiable outcomes. A key question is whether we can similarly use RL to optimize for outcomes in domains where evaluating outcomes inherently requires human feedback; for example, in tasks like deep research and trip planning, outcome evaluatio… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2505.12112  [pdf, other

    cs.DC

    Ripple: Scalable Incremental GNN Inferencing on Large Streaming Graphs

    Authors: Pranjal Naman, Yogesh Simmhan

    Abstract: Most real-world graphs are dynamic in nature, with continuous and rapid updates to the graph topology, and vertex and edge properties. Such frequent updates pose significant challenges for inferencing over Graph Neural Networks (GNNs). Current approaches that perform vertex-wise and layer-wise inferencing are impractical for dynamic graphs as they cause redundant computations, expand to large neig… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: Preprint of paper to appear in the proceedings of the 45th IEEE International Conference on Distributed Computing Systems (ICDCS)

  6. arXiv:2504.10636  [pdf, other

    econ.GN cs.AI stat.ME

    Who is More Bayesian: Humans or ChatGPT?

    Authors: Tianshi Mu, Pranjal Rawat, John Rust, Chengjun Zhang, Qixuan Zhong

    Abstract: We compare the performance of human and artificially intelligent (AI) decision makers in simple binary classification tasks where the optimal decision rule is given by Bayes Rule. We reanalyze choices of human subjects gathered from laboratory experiments conducted by El-Gamal and Grether and Holt and Smith. We confirm that while overall, Bayes Rule represents the single best model for predicting… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 86 pages, 19 figures

  7. arXiv:2503.23465  [pdf, other

    cs.RO

    SparseLoc: Sparse Open-Set Landmark-based Global Localization for Autonomous Navigation

    Authors: Pranjal Paul, Vineeth Bhat, Tejas Salian, Mohammad Omama, Krishna Murthy Jatavallabhula, Naveen Arulselvan, K. Madhava Krishna

    Abstract: Global localization is a critical problem in autonomous navigation, enabling precise positioning without reliance on GPS. Modern global localization techniques often depend on dense LiDAR maps, which, while precise, require extensive storage and computational resources. Recent approaches have explored alternative methods, such as sparse maps and learned features, but they suffer from poor robustne… ▽ More

    Submitted 30 March, 2025; originally announced March 2025.

  8. arXiv:2503.07914  [pdf, other

    cs.AI cs.CL cs.LG

    Demystifying the Accuracy-Interpretability Trade-Off: A Case Study of Inferring Ratings from Reviews

    Authors: Pranjal Atrey, Michael P. Brundage, Min Wu, Sanghamitra Dutta

    Abstract: Interpretable machine learning models offer understandable reasoning behind their decision-making process, though they may not always match the performance of their black-box counterparts. This trade-off between interpretability and model performance has sparked discussions around the deployment of AI, particularly in critical applications where knowing the rationale of decision-making is essentia… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Accepted at DAI Workshop, AAAI-2025

  9. arXiv:2503.06917  [pdf, other

    cs.LG cs.DS stat.ML

    Combinatorial Optimization via LLM-driven Iterated Fine-tuning

    Authors: Pranjal Awasthi, Sreenivas Gollapudi, Ravi Kumar, Kamesh Munagala

    Abstract: We present a novel way to integrate flexible, context-dependent constraints into combinatorial optimization by leveraging Large Language Models (LLMs) alongside traditional algorithms. Although LLMs excel at interpreting nuanced, locally specified requirements, they struggle with enforcing global combinatorial feasibility. To bridge this gap, we propose an iterated fine-tuning framework where algo… ▽ More

    Submitted 13 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

  10. arXiv:2503.05919  [pdf, other

    cs.CL cs.LG

    From Style to Facts: Mapping the Boundaries of Knowledge Injection with Finetuning

    Authors: Eric Zhao, Pranjal Awasthi, Nika Haghtalab

    Abstract: Finetuning provides a scalable and cost-effective means of customizing language models for specific tasks or response styles, with greater reliability than prompting or in-context learning. In contrast, the conventional wisdom is that injecting knowledge via finetuning results in brittle performance and poor generalization. We argue that the dichotomy of "task customization" (e.g., instruction tun… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  11. arXiv:2503.04697  [pdf, other

    cs.CL cs.AI cs.LG

    L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

    Authors: Pranjal Aggarwal, Sean Welleck

    Abstract: Reasoning language models have shown an uncanny ability to improve performance at test-time by ``thinking longer''-that is, by generating longer chain-of-thought sequences and hence using more compute. However, the length of their chain-of-thought reasoning is not controllable, making it impossible to allocate test-time compute to achieve a desired level of performance. We introduce Length Control… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  12. arXiv:2503.02065  [pdf, other

    cs.CR cs.AI cs.IR

    Survey Perspective: The Role of Explainable AI in Threat Intelligence

    Authors: Nidhi Rastogi, Devang Dhanuka, Amulya Saxena, Pranjal Mairal, Le Nguyen

    Abstract: The increasing reliance on AI-based security tools in Security Operations Centers (SOCs) has transformed threat detection and response, yet analysts frequently struggle with alert overload, false positives, and lack of contextual relevance. The inability to effectively analyze AI-generated security alerts lead to inefficiencies in incident response and reduces trust in automated decision-making. I… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 5 pages, SIGIR Symposium on IR in Practice (SIRIP), 2025

  13. arXiv:2502.18525  [pdf, other

    cs.SE cs.LG

    Programming with Pixels: Computer-Use Meets Software Engineering

    Authors: Pranjal Aggarwal, Sean Welleck

    Abstract: Recent advancements in software engineering (SWE) agents have largely followed a $\textit{tool-based paradigm}$, where agents interact with hand-engineered tool APIs to perform specific tasks. While effective for specialized tasks, these methods fundamentally lack generalization, as they require predefined tools for each task and do not scale across programming languages and domains. We introduce… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  14. arXiv:2502.13595  [pdf, ps, other

    cs.CL cs.AI cs.IR

    MMTEB: Massive Multilingual Text Embedding Benchmark

    Authors: Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa , et al. (61 additional authors not shown)

    Abstract: Text embeddings are typically evaluated on a limited set of tasks, which are constrained by language, domain, and task diversity. To address these limitations and provide a more comprehensive evaluation, we introduce the Massive Multilingual Text Embedding Benchmark (MMTEB) - a large-scale, community-driven expansion of MTEB, covering over 500 quality-controlled evaluation tasks across 250+ langua… ▽ More

    Submitted 8 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Accepted for ICLR: https://openreview.net/forum?id=zl3pfz4VCV

  15. arXiv:2502.05352  [pdf, other

    cs.AI cs.DC cs.MA

    ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks

    Authors: Saurabh Jha, Rohan Arora, Yuji Watanabe, Takumi Yanagawa, Yinfang Chen, Jackson Clark, Bhavya Bhavya, Mudit Verma, Harshit Kumar, Hirokuni Kitahara, Noah Zheutlin, Saki Takano, Divya Pathak, Felix George, Xinbo Wu, Bekir O. Turkkan, Gerard Vanloo, Michael Nidd, Ting Dai, Oishik Chatterjee, Pranjal Gupta, Suranjana Samanta, Pooja Aggarwal, Rong Lee, Pavankumar Murali , et al. (18 additional authors not shown)

    Abstract: Realizing the vision of using AI agents to automate critical IT tasks depends on the ability to measure and understand effectiveness of proposed solutions. We introduce ITBench, a framework that offers a systematic methodology for benchmarking AI agents to address real-world IT automation tasks. Our initial release targets three key areas: Site Reliability Engineering (SRE), Compliance and Securit… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  16. arXiv:2502.01839  [pdf, other

    cs.LG cs.AI

    Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

    Authors: Eric Zhao, Pranjal Awasthi, Sreenivas Gollapudi

    Abstract: Sampling-based search, a simple paradigm for utilizing test-time compute, involves generating multiple candidate responses and selecting the best one -- typically by having models self-verify each response for correctness. In this paper, we study the scaling trends governing sampling-based search. Among our findings is that simply scaling up a minimalist implementation of sampling-based search, us… ▽ More

    Submitted 20 February, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

  17. arXiv:2501.18229  [pdf, other

    cs.RO

    GPD: Guided Polynomial Diffusion for Motion Planning

    Authors: Ajit Srikanth, Parth Mahanjan, Kallol Saha, Vishal Mandadi, Pranjal Paul, Pawan Wadhwani, Brojeshwar Bhowmick, Arun Singh, Madhava Krishna

    Abstract: Diffusion-based motion planners are becoming popular due to their well-established performance improvements, stemming from sample diversity and the ease of incorporating new constraints directly during inference. However, a primary limitation of the diffusion process is the requirement for a substantial number of denoising steps, especially when the denoising process is coupled with gradient-based… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  18. arXiv:2501.16039  [pdf, ps, other

    cs.DS cs.CC math.GR

    Complexity of Minimal Faithful Permutation Degree for Fitting-free Groups

    Authors: Michael Levet, Pranjal Srivastava, Dhara Thakkar

    Abstract: In this paper, we investigate the complexity of computing the minimal faithful permutation degree for groups without abelian normal subgroups. When our groups are given as quotients of permutation groups, we establish that this problem is in $\textsf{P}$. Furthermore, in the setting of permutation groups, we obtain an upper bound of $\textsf{NC}$ for this problem. This improves upon the work of Da… ▽ More

    Submitted 28 April, 2025; v1 submitted 27 January, 2025; originally announced January 2025.

  19. arXiv:2412.06176  [pdf, other

    cs.LG cs.AI

    AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement

    Authors: Pranjal Aggarwal, Bryan Parno, Sean Welleck

    Abstract: Automated code generation with large language models has gained significant traction, but there remains no guarantee on the correctness of generated code. We aim to use formal verification to provide mathematical guarantees that the generated code is correct. However, generating formally verified code with LLMs is hindered by the scarcity of training data and the complexity of formal proofs. To ta… ▽ More

    Submitted 8 December, 2024; originally announced December 2024.

  20. arXiv:2411.18981  [pdf, ps, other

    cs.CC

    The Complexity of Order-Finding for ROABPs

    Authors: Vishwas Bhargava, Pranjal Dutta, Sumanta Ghosh, Anamay Tengse

    Abstract: We study the \emph{order-finding problem} for Read-once Oblivious Algebraic Branching Programs (ROABPs). Given a polynomial $f$ and a parameter $w$, the goal is to find an order $σ$ in which $f$ has an ROABP of \emph{width} $w$. We show that this problem is NP-hard in the worst case, even when the input is a constant degree polynomial that is given in its dense representation. We provide a reducti… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

  21. arXiv:2411.17330  [pdf, ps, other

    cs.CC

    Derandomizing Multivariate Polynomial Factoring for Low Degree Factors

    Authors: Pranjal Dutta, Amit Sinhababu, Thomas Thierauf

    Abstract: For a polynomial $f$ from a class $\mathcal{C}$ of polynomials, we show that the problem to compute all the constant degree irreducible factors of $f$ reduces in polynomial time to polynomial identity tests (PIT) for class $\mathcal{C}$ and divisibility tests of $f$ by constant degree polynomials. We apply the result to several classes $\mathcal{C}$ and obtain the constant degree factors in \begin… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

  22. arXiv:2411.04718  [pdf, ps, other

    cs.DS

    Approximate counting of permutation patterns

    Authors: Omri Ben-Eliezer, Slobodan Mitrović, Pranjal Srivastava

    Abstract: We consider the problem of counting the copies of a length-$k$ pattern $σ$ in a sequence $f \colon [n] \to \mathbb{R}$, where a copy is a subset of indices $i_1 < \ldots < i_k \in [n]$ such that $f(i_j) < f(i_\ell)$ if and only if $σ(j) < σ(\ell)$. This problem is motivated by a range of connections and applications in ranking, nonparametric statistics, combinatorics, and fine-grained complexity,… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

  23. arXiv:2411.03444  [pdf, ps, other

    cs.CC math.AG math.RT

    Algebraic metacomplexity and representation theory

    Authors: Maxim van den Berg, Pranjal Dutta, Fulvio Gesmundo, Christian Ikenmeyer, Vladimir Lysikov

    Abstract: In the algebraic metacomplexity framework we prove that the decomposition of metapolynomials into their isotypic components can be implemented efficiently, namely with only a quasipolynomial blowup in the circuit size. We use this to resolve an open question posed by Grochow, Kumar, Saks & Saraf (2017). Our result means that many existing algebraic complexity lower bound proofs can be efficiently… ▽ More

    Submitted 7 February, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: 31 pages. Added Corollary 1.2 adressing an open question from [GKSS]; Introduction is rewritten; some minor fixes

    MSC Class: 68Q15; 20C35; 16S30 ACM Class: F.1.3

  24. arXiv:2410.13960  [pdf, other

    econ.GN cs.AI

    Approximating Auction Equilibria with Reinforcement Learning

    Authors: Pranjal Rawat

    Abstract: Traditional methods for computing equilibria in auctions become computationally intractable as auction complexity increases, particularly in multi-item and dynamic auctions. This paper introduces a self-play based reinforcement learning approach that employs advanced algorithms such as Proximal Policy Optimization and Neural Fictitious Self-Play to approximate Bayes-Nash equilibria. This framework… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  25. arXiv:2410.10739  [pdf, other

    cs.CL

    Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

    Authors: Ishan Jindal, Chandana Badrinath, Pranjal Bharti, Lakkidi Vinay, Sachin Dev Sharma

    Abstract: Large Language Models (LLMs) for public use require continuous pre-training to remain up-to-date with the latest data. The models also need to be fine-tuned with specific instructions to maintain their ability to follow instructions accurately. Typically, LLMs are released in two versions: the Base LLM, pre-trained on diverse data, and the instruction-refined LLM, additionally trained with specifi… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  26. arXiv:2409.09744  [pdf, other

    cs.CR

    Taming the Ransomware Threats: Leveraging Prospect Theory for Rational Payment Decisions

    Authors: Pranjal Sharma

    Abstract: Day by day, the frequency of ransomware attacks on organizations is experiencing a significant surge. High-profile incidents involving major entities like Las Vegas giants MGM Resorts, Caesar Entertainment, and Boeing underscore the profound impact, posing substantial business barriers. When a sudden cyberattack occurs, organizations often find themselves at a loss, with a looming countdown to pay… ▽ More

    Submitted 15 September, 2024; originally announced September 2024.

  27. arXiv:2408.11382  [pdf, other

    cs.CL

    Towards Inducing Long-Context Abilities in Multilingual Neural Machine Translation Models

    Authors: Varun Gumma, Pranjal A. Chitale, Kalika Bali

    Abstract: Neural Machine Translation (NMT) models have traditionally used Sinusoidal Positional Embeddings (PEs), which often struggle to capture long-range dependencies and are inefficient for handling extended context or document-level translation tasks. This work addresses the challenge of transitioning pre-trained NMT models from absolute Sinusoidal PEs to Relative PEs, such as RoPE and ALiBi, without c… ▽ More

    Submitted 9 February, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted at NAACL 2025

  28. arXiv:2406.18700  [pdf, other

    cs.CC

    On Fourier analysis of sparse Boolean functions over certain Abelian groups

    Authors: Sourav Chakraborty, Swarnalipa Datta, Pranjal Dutta, Arijit Ghosh, Swagato Sanyal

    Abstract: Given an Abelian group G, a Boolean-valued function f: G -> {-1,+1}, is said to be s-sparse, if it has at most s-many non-zero Fourier coefficients over the domain G. In a seminal paper, Gopalan et al. proved "Granularity" for Fourier coefficients of Boolean valued functions over Z_2^n, that have found many diverse applications in theoretical computer science and combinatorics. They also studied s… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  29. arXiv:2406.17989  [pdf, ps, other

    cs.LG stat.ML

    Learning Neural Networks with Sparse Activations

    Authors: Pranjal Awasthi, Nishanth Dikkala, Pritish Kamath, Raghu Meka

    Abstract: A core component present in many successful neural network architectures, is an MLP block of two fully connected layers with a non-linear activation in between. An intriguing phenomenon observed empirically, including in transformer architectures, is that, after training, the activations in the hidden layer of this MLP block tend to be extremely sparse on any given input. Unlike traditional forms… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Proceedings of the 37th Conference on Learning Theory (COLT 2024), 20 pages

  30. arXiv:2406.16250  [pdf

    cs.DC cs.OS

    Evaluating Serverless Machine Learning Performance on Google Cloud Run

    Authors: Prerana Khatiwada, Pranjal Dhakal

    Abstract: End-users can get functions-as-a-service from serverless platforms, which promise lower hosting costs, high availability, fault tolerance, and dynamic flexibility for hosting individual functions known as microservices. Machine learning tools are seen to be reliably useful, and the services created using these tools are in increasing demand on a large scale. The serverless platforms are uniquely s… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 5 pages, 12 figures

  31. arXiv:2406.09175  [pdf, other

    cs.CV cs.CL

    ReMI: A Dataset for Reasoning with Multiple Images

    Authors: Mehran Kazemi, Nishanth Dikkala, Ankit Anand, Petar Devic, Ishita Dasgupta, Fangyu Liu, Bahare Fatemi, Pranjal Awasthi, Dee Guo, Sreenivas Gollapudi, Ahmed Qureshi

    Abstract: With the continuous advancement of large language models (LLMs), it is essential to create new benchmarks to effectively evaluate their expanding capabilities and identify areas for improvement. This work focuses on multi-image reasoning, an emerging capability in state-of-the-art LLMs. We introduce ReMI, a dataset designed to assess LLMs' ability to Reason with Multiple Images. This dataset encom… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  32. arXiv:2406.06221  [pdf, other

    cs.PL

    Synchronous Programming with Refinement Types

    Authors: Jiawei Chen, José Luiz Vargas de Mendonça, Bereket Shimels Ayele, Bereket Ngussie Bekele, Shayan Jalili, Pranjal Sharma, Nicholas Wohlfeil, Yicheng Zhang, Jean-Baptiste Jeannin

    Abstract: Cyber-Physical Systems (CPS) consist of software interacting with the physical world, such as robots, vehicles, and industrial processes. CPS are frequently responsible for the safety of lives, property, or the environment, and so software correctness must be determined with a high degree of certainty. To that end, simply testing a CPS is insufficient, as its interactions with the physical world m… ▽ More

    Submitted 4 September, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  33. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (51 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 4 November, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024) Track on Datasets and Benchmarks

  34. arXiv:2406.00179  [pdf, other

    cs.CL cs.AI

    Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation

    Authors: Bernd Bohnet, Kevin Swersky, Rosanne Liu, Pranjal Awasthi, Azade Nova, Javier Snaider, Hanie Sedghi, Aaron T Parisi, Michael Collins, Angeliki Lazaridou, Orhan Firat, Noah Fiedel

    Abstract: We explore the use of long-context capabilities in large language models to create synthetic reading comprehension data from entire books. Previous efforts to construct such datasets relied on crowd-sourcing, but the emergence of transformers with a context size of 1 million or more tokens now enables entirely automatic approaches. Our objective is to test the capabilities of LLMs to analyze, unde… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  35. arXiv:2405.20671  [pdf, other

    cs.LG cs.AI cs.CL

    Position Coupling: Improving Length Generalization of Arithmetic Transformers Using Task Structure

    Authors: Hanseul Cho, Jaeyoung Cha, Pranjal Awasthi, Srinadh Bhojanapalli, Anupam Gupta, Chulhee Yun

    Abstract: Even for simple arithmetic tasks like integer addition, it is challenging for Transformers to generalize to longer sequences than those encountered during training. To tackle this problem, we propose position coupling, a simple yet effective method that directly embeds the structure of the tasks into the positional encoding of a (decoder-only) Transformer. Taking a departure from the vanilla absol… ▽ More

    Submitted 30 October, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted to NeurIPS 2024. 76 pages. 23 figures. 90 tables

  36. arXiv:2404.08555  [pdf, other

    cs.LG cs.AI cs.CL

    RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

    Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

    Abstract: State-of-the-art large language models (LLMs) have become indispensable tools for various tasks. However, training LLMs to serve as effective assistants for humans requires careful consideration. A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hal… ▽ More

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  37. arXiv:2403.20116  [pdf, other

    cs.RO

    LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving

    Authors: Pranjal Paul, Anant Garg, Tushar Choudhary, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Existing Vision-Language models (VLMs) estimate either long-term trajectory waypoints or a set of control actions as a reactive solution for closed-loop planning based on their rich scene comprehension. However, these estimations are coarse and are subjective to their "world understanding" which may generate sub-optimal decisions due to perception errors. In this paper, we introduce LeGo-Drive, wh… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  38. A Machine learning and Empirical Bayesian Approach for Predictive Buying in B2B E-commerce

    Authors: Tuhin Subhra De, Pranjal Singh, Alok Patel

    Abstract: In the context of developing nations like India, traditional business to business (B2B) commerce heavily relies on the establishment of robust relationships, trust, and credit arrangements between buyers and sellers. Consequently, ecommerce enterprises frequently. Established in 2016 with a vision to revolutionize trade in India through technology, Udaan is the countrys largest business to busines… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: Published at the 8th International Conference on Machine Learning and Soft Computing (ICMLSC 2024), Singapore

  39. arXiv:2403.04978  [pdf, other

    cs.LG stat.ML

    Stacking as Accelerated Gradient Descent

    Authors: Naman Agarwal, Pranjal Awasthi, Satyen Kale, Eric Zhao

    Abstract: Stacking, a heuristic technique for training deep residual networks by progressively increasing the number of layers and initializing new layers by copying parameters from older layers, has proven quite successful in improving the efficiency of training deep neural networks. In this paper, we propose a theoretical explanation for the efficacy of stacking: viz., stacking implements a form of Nester… ▽ More

    Submitted 19 February, 2025; v1 submitted 7 March, 2024; originally announced March 2024.

  40. arXiv:2402.16442  [pdf, other

    cs.LG cs.AI cs.CV cs.DC math.OC

    On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions

    Authors: Maximilian Böther, Abraham Sebastian, Pranjal Awasthi, Ana Klimovic, Srikumar Ramalingam

    Abstract: Modern datasets span billions of samples, making training on all available data infeasible. Selecting a high quality subset helps in reducing training costs and enhancing model quality. Submodularity, a discrete analogue of convexity, is commonly used for solving such subset selection problems. However, existing algorithms for optimizing submodular functions are sequential, and the prior distribut… ▽ More

    Submitted 3 April, 2025; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: accepted at MLSys 2025

  41. arXiv:2402.10576  [pdf, ps, other

    cs.CR

    Post-Quantum Cryptography

    Authors: Pranjal, Atul Chaturvedi

    Abstract: In this survey we propose to cover the prose of post-quantum cryptography over classical cryptography. We talk about the various cryptographic methods that are being practiced to safeguard our information. The future of secure communication is expected to be the implementation of quantum-safe cryptographic systems, and that in the post-quantum era, the development of post-quantum cryptography is e… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  42. arXiv:2402.05033  [pdf, other

    cs.LG

    Majority Kernels: An Approach to Leverage Big Model Dynamics for Efficient Small Model Training

    Authors: Hanna Mazzawi, Pranjal Awasthi, Xavi Gonzalvo, Srikumar Ramalingam

    Abstract: Recent breakthroughs and successful deployment of large language and vision models in a constrained environment predominantly follow a two phase approach. First, large models are trained to achieve peak performance, followed by a model shrinking method to meet hardware constraints; Methods like distillation, compression or quantization help leverage the highly performant large models to induce sma… ▽ More

    Submitted 20 November, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  43. arXiv:2401.12097  [pdf, other

    cs.CL

    An Empirical Study of In-context Learning in LLMs for Machine Translation

    Authors: Pranjal A. Chitale, Jay Gala, Raj Dabre

    Abstract: Recent interest has surged in employing Large Language Models (LLMs) for machine translation (MT) via in-context learning (ICL) (Vilar et al., 2023). Most prior studies primarily focus on optimizing translation quality, with limited attention to understanding the specific aspects of ICL that influence the said quality. To this end, we perform the first of its kind, an exhaustive study of in-contex… ▽ More

    Submitted 4 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 Findings

  44. Fixed-parameter debordering of Waring rank

    Authors: Pranjal Dutta, Fulvio Gesmundo, Christian Ikenmeyer, Gorav Jindal, Vladimir Lysikov

    Abstract: Border complexity measures are defined via limits (or topological closures), so that any function which can approximated arbitrarily closely by low complexity functions itself has low border complexity. Debordering is the task of proving an upper bound on some non-border complexity measure in terms of a border complexity measure, thus getting rid of limits. Debordering is at the heart of underst… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 22 pages; accepted at STACS 2024; this is an edited part of the preprint arXiv:2211.07055

    MSC Class: 68Q99 ACM Class: F.1.3

    Journal ref: 41st International Symposium on Theoretical Aspects of Computer Science (STACS 2024), pp. 30:1-30:15

  45. arXiv:2312.10602  [pdf, other

    cs.LG cs.AI cs.CV

    A Weighted K-Center Algorithm for Data Subset Selection

    Authors: Srikumar Ramalingam, Pranjal Awasthi, Sanjiv Kumar

    Abstract: The success of deep learning hinges on enormous data and large models, which require labor-intensive annotations and heavy computation costs. Subset selection is a fundamental problem that can play a key role in identifying smaller portions of the training data, which can then be used to produce similar models as the ones trained with full data. Two prior methods are shown to achieve impressive re… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: data selection, k-center, subset selection,

  46. Homogeneous Algebraic Complexity Theory and Algebraic Formulas

    Authors: Pranjal Dutta, Fulvio Gesmundo, Christian Ikenmeyer, Gorav Jindal, Vladimir Lysikov

    Abstract: We study algebraic complexity classes and their complete polynomials under \emph{homogeneous linear} projections, not just under the usual affine linear projections that were originally introduced by Valiant in 1979. These reductions are weaker yet more natural from a geometric complexity theory (GCT) standpoint, because the corresponding orbit closure formulations do not require the padding of po… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: This is edited part of preprint arXiv:2211.07055

    MSC Class: 68Qxx ACM Class: F.1.3

    Journal ref: 15th Innovations in Theoretical Computer Science Conference (ITCS 2024), pp. 43:1-43:23

  47. arXiv:2311.09735  [pdf, other

    cs.LG cs.IR

    GEO: Generative Engine Optimization

    Authors: Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande

    Abstract: The advent of large language models (LLMs) has ushered in a new paradigm of search engines that use generative models to gather and summarize information to answer user queries. This emerging technology, which we formalize under the unified framework of generative engines (GEs), can generate accurate and personalized responses, rapidly replacing traditional search engines like Google and Bing. Gen… ▽ More

    Submitted 28 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to KDD 2024

  48. arXiv:2310.12963  [pdf, other

    cs.CL cs.AI

    AutoMix: Automatically Mixing Language Models

    Authors: Pranjal Aggarwal, Aman Madaan, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Manaal Faruqui, Mausam

    Abstract: Large language models (LLMs) are now available from cloud API providers in various sizes and configurations. While this diversity offers a broad spectrum of choices, effectively leveraging the options to optimize computational cost and performance remains challenging. In this work, we present Automix, an approach that strategically routes queries to larger LMs, based on the approximate correctness… ▽ More

    Submitted 19 January, 2025; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024). The first two authors contributed equally. Work started and partly done during Aman's internship at Google. This version adds results on additional models and datasets

  49. arXiv:2310.02324  [pdf, other

    cs.RO

    ALT-Pilot: Autonomous navigation with Language augmented Topometric maps

    Authors: Mohammad Omama, Pranav Inani, Pranjal Paul, Sarat Chandra Yellapragada, Krishna Murthy Jatavallabhula, Sandeep Chinchali, Madhava Krishna

    Abstract: We present an autonomous navigation system that operates without assuming HD LiDAR maps of the environment. Our system, ALT-Pilot, relies only on publicly available road network information and a sparse (and noisy) set of crowdsourced language landmarks. With the help of onboard sensors and a language-augmented topometric map, ALT-Pilot autonomously pilots the vehicle to any destination on the roa… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  50. arXiv:2310.00726  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Length-Generalization in Transformers via Task Hinting

    Authors: Pranjal Awasthi, Anupam Gupta

    Abstract: It has been observed in recent years that transformers have problems with length generalization for certain types of reasoning and arithmetic tasks. In particular, the performance of a transformer model trained on tasks (say addition) up to a certain length (e.g., 5 digit numbers) drops sharply when applied to longer instances of the same problem. This work proposes an approach based on task hinti… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.