Skip to main content

Showing 1–21 of 21 results for author: Deoras, A

.
  1. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, AdriĆ  de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  2. arXiv:2506.00307  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Lossless Token Sequence Compression via Meta-Tokens

    Authors: John Harvill, Ziwei Fan, Hao Wang, Yizhou Sun, Hao Ding, Luke Huan, Anoop Deoras

    Abstract: Existing work on prompt compression for Large Language Models (LLM) focuses on lossy methods that try to maximize the retention of semantic information that is relevant to downstream tasks while significantly reducing the sequence length. In this paper, we introduce a task-agnostic lossless compression technique similar to LZ77 that makes it possible to reduce the input token sequence length on av… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: 16 pages, 8 figures

  3. arXiv:2505.09569  [pdf, other

    cs.SE

    MigrationBench: Repository-Level Code Migration Benchmark from Java 8

    Authors: Linbo Liu, Xinle Liu, Qiang Zhou, Lin Chen, Yihan Liu, Hoan Nguyen, Behrooz Omidvar-Tehrani, Xi Shen, Jun Huan, Omer Tripp, Anoop Deoras

    Abstract: With the rapid advancement of powerful large language models (LLMs) in recent years, a wide range of software engineering tasks can now be addressed using LLMs, significantly enhancing productivity and scalability. Numerous benchmark datasets have been developed to evaluate the coding capabilities of these models, while they primarily focus on code generation and issue-resolution tasks. In contras… ▽ More

    Submitted 19 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

  4. arXiv:2504.08703  [pdf, other

    cs.SE

    SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents

    Authors: Muhammad Shihab Rashid, Christian Bock, Yuan Zhuang, Alexander Buchholz, Tim Esler, Simon Valentin, Luca Franceschi, Martin Wistuba, Prabhu Teja Sivaprasad, Woo Jung Kim, Anoop Deoras, Giovanni Zappella, Laurent Callot

    Abstract: Coding agents powered by large language models have shown impressive capabilities in software engineering tasks, but evaluating their performance across diverse programming languages and real-world scenarios remains challenging. We introduce SWE-PolyBench, a new multi-language benchmark for repository-level, execution-based evaluation of coding agents. SWE-PolyBench contains 2110 instances from 21… ▽ More

    Submitted 23 April, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: 20 pages, 6 figures, corrected author name spelling

  5. arXiv:2503.14924  [pdf, other

    cs.SE

    UTFix: Change Aware Unit Test Repairing using LLM

    Authors: Shanto Rahman, Sachit Kuhar, Berk Cirisci, Pranav Garg, Shiqi Wang, Xiaofei Ma, Anoop Deoras, Baishakhi Ray

    Abstract: Software updates, including bug repair and feature additions, are frequent in modern applications but they often leave test suites outdated, resulting in undetected bugs and increased chances of system failures. A recent study by Meta revealed that 14%-22% of software failures stem from outdated tests that fail to reflect changes in the codebase. This highlights the need to keep tests in sync with… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

    Comments: 26 pages, International Conference on Object-oriented Programming, Systems, Languages, and Applications (OOPSLA) 2025

  6. arXiv:2412.04478  [pdf, other

    cs.SE cs.AI

    LibEvolutionEval: A Benchmark and Study for Version-Specific Code Generation

    Authors: Sachit Kuhar, Wasi Uddin Ahmad, Zijian Wang, Nihal Jain, Haifeng Qian, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras

    Abstract: Recent advancements in code completion models have primarily focused on local file contexts. However, these studies do not fully capture the complexity of real-world software development, which often requires the use of rapidly-evolving public libraries. To fill the gap, we introduce LibEvolutionEval, a detailed study requiring an understanding of library evolution to perform in-line code completi… ▽ More

    Submitted 19 November, 2024; originally announced December 2024.

    Journal ref: Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Albuquerque, New Mexico, USA, April 2-7, 2025

  7. arXiv:2410.01103  [pdf, other

    cs.CL cs.AI

    Approximately Aligned Decoding

    Authors: Daniel Melcer, Sujan Gonugondla, Pramuditha Perera, Haifeng Qian, Wen-Hao Chiang, Yanjun Wang, Nihal Jain, Pranav Garg, Xiaofei Ma, Anoop Deoras

    Abstract: It is common to reject undesired outputs of Large Language Models (LLMs); however, current methods to do so require an excessive amount of computation, or severely distort the distribution of outputs. We present a method to balance the distortion of the output distribution with computational efficiency, allowing for the generation of long sequences of text with difficult-to-satisfy constraints, wi… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: 9 pages main, 22 pages total

  8. arXiv:2405.18649  [pdf, other

    cs.CL cs.AI cs.SE

    LeDex: Training LLMs to Better Self-Debug and Explain Code

    Authors: Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras

    Abstract: In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourc… ▽ More

    Submitted 13 February, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: This paper is accepted by The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024)

  9. arXiv:2405.13622  [pdf, other

    cs.CL cs.IR

    Automated Evaluation of Retrieval-Augmented Language Models with Task-Specific Exam Generation

    Authors: Gauthier Guinet, Behrooz Omidvar-Tehrani, Anoop Deoras, Laurent Callot

    Abstract: We propose a new method to measure the task-specific accuracy of Retrieval-Augmented Large Language Models (RAG). Evaluation is performed by scoring the RAG on an automatically-generated synthetic exam composed of multiple choice questions based on the corpus of documents associated with the task. Our method is an automated, cost-efficient, interpretable, and robust strategy to select the optimal… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning (ICML), 29 pages, 12 figures

  10. arXiv:2405.03637  [pdf, other

    cs.LG

    Collage: Light-Weight Low-Precision Strategy for LLM Training

    Authors: Tao Yu, Gaurav Gupta, Karthick Gopalswamy, Amith Mamidala, Hao Zhou, Jeffrey Huynh, Youngsuk Park, Ron Diamant, Anoop Deoras, Luke Huan

    Abstract: Large models training is plagued by the intense compute cost and limited hardware memory. A practical solution is low-precision representation but is troubled by loss in numerical accuracy and unstable training rendering the model less useful. We argue that low-precision floating points can perform well provided the error is properly compensated at the critical locations in the training process. W… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  11. arXiv:2405.01567  [pdf, other

    cs.SE cs.AI

    CodeFort: Robust Training for Code Generation Models

    Authors: Yuhao Zhang, Shiqi Wang, Haifeng Qian, Zijian Wang, Mingyue Shang, Linbo Liu, Sanjay Krishna Gouda, Baishakhi Ray, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras

    Abstract: Code generation models are not robust to small perturbations, which often lead to incorrect generations and significantly degrade the performance of these models. Although improving the robustness of code generation models is crucial to enhancing user experience in real-world applications, existing research efforts do not address this issue. To fill this gap, we propose CodeFort, a framework to im… ▽ More

    Submitted 28 October, 2024; v1 submitted 11 April, 2024; originally announced May 2024.

  12. arXiv:2404.15778  [pdf, other

    cs.LG cs.CL

    BASS: Batched Attention-optimized Speculative Sampling

    Authors: Haifeng Qian, Sujan Kumar Gonugondla, Sungsoo Ha, Mingyue Shang, Sanjay Krishna Gouda, Ramesh Nallapati, Sudipta Sengupta, Xiaofei Ma, Anoop Deoras

    Abstract: Speculative decoding has emerged as a powerful method to improve latency and throughput in hosting large language models. However, most existing implementations focus on generating a single sequence. Real-world generative AI applications often require multiple responses and how to perform speculative decoding in a batched setting while preserving its latency benefits poses non-trivial challenges.… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  13. arXiv:2404.10830  [pdf, other

    cs.CL cs.AI cs.LG

    Fewer Truncations Improve Language Modeling

    Authors: Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto

    Abstract: In large language model training, input documents are typically concatenated together and then split into sequences of equal length to avoid padding tokens. Despite its efficiency, the concatenation approach compromises data integrity -- it inevitably breaks many documents into incomplete pieces, leading to excessive truncations that hinder the model from learning to compose logically coherent and… ▽ More

    Submitted 2 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: ICML 2024

  14. arXiv:2312.14345  [pdf, other

    cs.AI cs.CL cs.HC

    Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

    Authors: Behnam Rahdari, Hao Ding, Ziwei Fan, Yifei Ma, Zhuotong Chen, Anoop Deoras, Branislav Kveton

    Abstract: The unique capabilities of Large Language Models (LLMs), such as the natural language text generation ability, position them as strong candidates for providing explanation for recommendations. However, despite the size of the LLM, most existing models struggle to produce zero-shot explanations reliably. To address this issue, we propose a framework called Logic-Scaffolding, that combines the ideas… ▽ More

    Submitted 17 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: The 17th ACM International Conference on Web Search and Data Mining (WSDM 2024)

  15. arXiv:2310.19251  [pdf, other

    cs.IR cs.AI

    Pre-trained Recommender Systems: A Causal Debiasing Perspective

    Authors: Ziqian Lin, Hao Ding, Nghia Trong Hoang, Branislav Kveton, Anoop Deoras, Hao Wang

    Abstract: Recent studies on pre-trained vision/language models have demonstrated the practical benefit of a new, promising solution-building paradigm in AI where models can be pre-trained on broad data describing a generic task space and then adapted successfully to solve a wide range of downstream tasks, even when training data is severely limited (e.g., in zero- or few-shot learning scenarios). Inspired b… ▽ More

    Submitted 8 January, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 8 pages, WSDM 24

  16. arXiv:2307.06857  [pdf, other

    cs.AI cs.CL cs.LG

    Lightweight reranking for language model generations

    Authors: Siddhartha Jain, Xiaofei Ma, Anoop Deoras, Bing Xiang

    Abstract: Large Language Models (LLMs) can exhibit considerable variation in the quality of their sampled outputs. Reranking and selecting the best generation from the sampled set is a popular way of obtaining strong gains in generation quality. In this paper, we present a novel approach for reranking LLM generations. Unlike other techniques that might involve additional inferences or training a specialized… ▽ More

    Submitted 11 January, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

  17. arXiv:2306.07549  [pdf, other

    cs.LG stat.ML

    Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances

    Authors: Anusha Lalitha, Kousha Kalantari, Yifei Ma, Anoop Deoras, Branislav Kveton

    Abstract: We study the problem of best-arm identification (BAI) in the fixed-budget setting with heterogeneous reward variances. We propose two variance-adaptive BAI algorithms for this setting: SHVar for known reward variances and SHAdaVar for unknown reward variances. Our algorithms rely on non-uniform budget allocations among the arms where the arms with higher reward variances are pulled more often than… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  18. arXiv:2306.03191  [pdf, other

    cs.IR cs.LG

    Personalized Federated Domain Adaptation for Item-to-Item Recommendation

    Authors: Ziwei Fan, Hao Ding, Anoop Deoras, Trong Nghia Hoang

    Abstract: Item-to-Item (I2I) recommendation is an important function in most recommendation systems, which generates replacement or complement suggestions for a particular item based on its semantic similarities to other cataloged items. Given that subsets of items in a recommendation system might be co-interacted with by the same set of customers, graph-based models, such as graph neural networks (GNNs), p… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 14 pages, Accepted by UAI 2023

  19. arXiv:2205.15548  [pdf, other

    stat.ML cs.AI cs.LG

    Robust Projection based Anomaly Extraction (RPE) in Univariate Time-Series

    Authors: Mostafa Rahmani, Anoop Deoras, Laurent Callot

    Abstract: This paper presents a novel, closed-form, and data/computation efficient online anomaly detection algorithm for time-series data. The proposed method, dubbed RPE, is a window-based method and in sharp contrast to the existing window-based methods, it is robust to the presence of anomalies in its window and it can distinguish the anomalies in time-stamp level. RPE leverages the linear structure of… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

  20. arXiv:2203.12598  [pdf, other

    cs.IR cs.LG

    Learning Personalized Item-to-Item Recommendation Metric via Implicit Feedback

    Authors: Trong Nghia Hoang, Anoop Deoras, Tong Zhao, Jin Li, George Karypis

    Abstract: This paper studies the item-to-item recommendation problem in recommender systems from a new perspective of metric learning via implicit feedback. We develop and investigate a personalizable deep metric model that captures both the internal contents of items and how they were interacted with by users. There are two key challenges in learning such model. First, there is no explicit similarity annot… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: AISTATS-22

  21. arXiv:2105.08318  [pdf, other

    cs.LG cs.AI cs.IR

    Zero-Shot Recommender Systems

    Authors: Hao Ding, Yifei Ma, Anoop Deoras, Yuyang Wang, Hao Wang

    Abstract: Performance of recommender systems (RS) relies heavily on the amount of training data available. This poses a chicken-and-egg problem for early-stage products, whose amount of data, in turn, relies on the performance of their RS. On the other hand, zero-shot learning promises some degree of generalization from an old dataset to an entirely new dataset. In this paper, we explore the possibility of… ▽ More

    Submitted 12 October, 2021; v1 submitted 18 May, 2021; originally announced May 2021.