Showing 1–50 of 110 results for author: Murthy, R

  1. arXiv:2505.18011  [pdf, ps, other]

    cs.CL cs.AI

    Training with Pseudo-Code for Instruction Following

    Authors: Prince Kumar, Rudra Murthy, Riyaz Bhat, Danish Contractor

    Abstract: Despite the rapid progress in the capabilities of Large Language Models (LLMs), they continue to have difficulty following relatively simple, unambiguous instructions, especially when compositions are involved. In this paper, we take inspiration from recent work that suggests that models may follow instructions better when they are expressed in pseudo-code. However, writing pseudo-code programs ca…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Under Review

  2. arXiv:2504.04678  [pdf, other]

    cs.NI

    Federated Learning over 5G, WiFi, and Ethernet: Measurements and Evaluation

    Authors: Robert J. Hayek, Joaquin Chung, Kayla Comer, Chandra R. Murthy, Rajkumar Kettimuthu, Igor Kadota

    Abstract: Federated Learning (FL) deployments using IoT devices is an area that is poised to significantly benefit from advances in NextG wireless. In this paper, we deploy a FL application using a 5G-NR Standalone (SA) testbed with open-source and Commercial Off-the-Shelf (COTS) components. The 5G testbed architecture consists of a network of resource-constrained edge devices, namely Raspberry Pi's, and a…

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: 10 pages, 14 figures, 7 tables, conference

  3. arXiv:2502.20616  [pdf, other]

    cs.AI

    PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data

    Authors: Juntao Tan, Liangwei Yang, Zuxin Liu, Zhiwei Liu, Rithesh Murthy, Tulika Manoj Awalgaonkar, Jianguo Zhang, Weiran Yao, Ming Zhu, Shirley Kokane, Silvio Savarese, Huan Wang, Caiming Xiong, Shelby Heinecke

    Abstract: Personalization is critical in AI assistants, particularly in the context of private AI models that work with individual users. A key scenario in this domain involves enabling AI models to access and interpret a user's private data (e.g., conversation history, user-AI interactions, app usage) to understand personal details such as biographical information, preferences, and social connections. Howe…

    Submitted 27 February, 2025; originally announced February 2025.

  4. arXiv:2502.20204  [pdf, other]

    cs.IR cs.CL

    Granite Embedding Models

    Authors: Parul Awasthy, Aashka Trivedi, Yulong Li, Mihaela Bornea, David Cox, Abraham Daniels, Martin Franz, Gabe Goodhart, Bhavani Iyer, Vishwajeet Kumar, Luis Lastras, Scott McCarley, Rudra Murthy, Vignesh P, Sara Rosenthal, Salim Roukos, Jaydeep Sen, Sukriti Sharma, Avirup Sil, Kate Soule, Arafat Sultan, Radu Florian

    Abstract: We introduce the Granite Embedding models, a family of encoder-based embedding models designed for retrieval tasks, spanning dense-retrieval and sparse retrieval architectures, with both English and Multilingual capabilities. This report provides the technical details of training these highly effective 12 layer embedding models, along with their efficient 6 layer distilled counterparts. Extensive…

    Submitted 27 February, 2025; originally announced February 2025.

  5. arXiv:2501.14704  [pdf, other]

    math.AP cs.CV math.NA

    Stroke classification using Virtual Hybrid Edge Detection from in silico electrical impedance tomography data

    Authors: Juan Pablo Agnelli, Fernando S. Moura, Siiri Rautio, Melody Alsaker, Rashmi Murthy, Matti Lassas, Samuli Siltanen

    Abstract: Electrical impedance tomography (EIT) is a non-invasive imaging method for recovering the internal conductivity of a physical body from electric boundary measurements. EIT combined with machine learning has shown promise for the classification of strokes. However, most previous works have used raw EIT voltage data as network inputs. We build upon a recent development which suggested the use of spe…

    Submitted 29 January, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 21 pages, 5 figures

  6. arXiv:2412.00466  [pdf, ps, other]

    cs.IT stat.ML

    A Probably Approximately Correct Analysis of Group Testing Algorithms

    Authors: Sameera Bharadwaja H., Chandra R. Murthy

    Abstract: We consider the problem of identifying the defectives from a population of items via a non-adaptive group testing framework with a random pooling-matrix design. We analyze the sufficient number of tests needed for approximate set identification, i.e., for identifying almost all the defective and non-defective items with high confidence. To this end, we view the group testing problem as a function…

    Submitted 30 November, 2024; originally announced December 2024.

  7. arXiv:2411.13547  [pdf, other]

    cs.SE cs.AI

    SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

    Authors: Shirley Kokane, Ming Zhu, Tulika Awalgaonkar, Jianguo Zhang, Thai Hoang, Akshara Prabhakar, Zuxin Liu, Tian Lan, Liangwei Yang, Juntao Tan, Rithesh Murthy, Weiran Yao, Zhiwei Liu, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong, Silvio Savarese

    Abstract: Evaluating the output of Large Language Models (LLMs) is one of the most critical aspects of building a performant compound AI system. Since the output from LLMs propagate to downstream steps, identifying LLM errors is crucial to system performance. A common task for LLMs in AI systems is tool use. While there are several benchmark environments for evaluating LLMs on this task, they typically only…

    Submitted 20 November, 2024; originally announced November 2024.

  8. arXiv:2411.02538  [pdf, other]

    cs.CL

    MILU: A Multi-task Indic Language Understanding Benchmark

    Authors: Sshubam Verma, Mohammed Safi Ur Rahman Khan, Vishwajeet Kumar, Rudra Murthy, Jaydeep Sen

    Abstract: Evaluating Large Language Models (LLMs) in low-resource and linguistically diverse languages remains a significant challenge in NLP, particularly for languages using non-Latin scripts like those spoken in India. Existing benchmarks predominantly focus on English, leaving substantial gaps in assessing LLM capabilities in these languages. We introduce MILU, a Multi task Indic Language Understanding…

    Submitted 4 February, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

  9. arXiv:2410.18528  [pdf, other]

    cs.AI

    PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

    Authors: Zhiwei Liu, Weiran Yao, Jianguo Zhang, Rithesh Murthy, Liangwei Yang, Zuxin Liu, Tian Lan, Ming Zhu, Juntao Tan, Shirley Kokane, Thai Hoang, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong

    Abstract: We introduce the Principled Reasoning and Acting (PRAct) framework, a novel method for learning and enforcing action principles from trajectory data. Central to our approach is the use of text gradients from a reflection and optimization engine to derive these action principles. To adapt action principles to specific task requirements, we propose a new optimization framework, Reflective Principle…

    Submitted 24 October, 2024; originally announced October 2024.

    Comments: Accepted to SIG CoNLL 2024

  10. arXiv:2410.12972  [pdf, other]

    cs.CL

    KCIF: Knowledge-Conditioned Instruction Following

    Authors: Rudra Murthy, Praveen Venkateswaran, Prince Kumar, Danish Contractor

    Abstract: LLM evaluation benchmarks have traditionally separated the testing of knowledge/reasoning capabilities from instruction following. In this work, we study the interaction between knowledge and instruction following, and observe that LLMs struggle to follow simple answer modifying instructions, and are also distracted by instructions that should have no bearing on the original knowledge task answer.…

    Submitted 23 May, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Under Review

  11. arXiv:2409.05401  [pdf, other]

    cs.IR cs.CL

    Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E5

    Authors: Arkadeep Acharya, Rudra Murthy, Vishwajeet Kumar, Jaydeep Sen

    Abstract: Given the large number of Hindi speakers worldwide, there is a pressing need for robust and efficient information retrieval systems for Hindi. Despite ongoing research, comprehensive benchmarks for evaluating retrieval models in Hindi are lacking. To address this gap, we introduce the Hindi-BEIR benchmark, comprising 15 datasets across seven distinct tasks. We evaluate state-of-the-art multilingua…

    Submitted 25 October, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2408.09437

  12. arXiv:2409.03215  [pdf, other]

    cs.CL cs.AI cs.LG

    xLAM: A Family of Large Action Models to Empower AI Agent Systems

    Authors: Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong

    Abstract: Autonomous agents powered by large language models (LLMs) have attracted significant research interest. However, the open-source community faces many challenges in developing specialized models for agent tasks, driven by the scarcity of high-quality agent datasets and the absence of standard protocols in this area. We introduce and publicly release xLAM, a series of large action models designed fo…

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Technical report for the Salesforce xLAM model series

  13. arXiv:2408.11119  [pdf, other]

    cs.IR cs.CL

    Mistral-SPLADE: LLMs for better Learned Sparse Retrieval

    Authors: Meet Doshi, Vishwajeet Kumar, Rudra Murthy, Vignesh P, Jaydeep Sen

    Abstract: Learned Sparse Retrievers (LSR) have evolved into an effective retrieval strategy that can bridge the gap between traditional keyword-based sparse retrievers and embedding-based dense retrievers. At its core, learned sparse retrievers try to learn the most important semantic keyword expansions from a query and/or document which can facilitate better retrieval with overlapping keyword expansions. L…

    Submitted 21 August, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

  14. arXiv:2408.09437  [pdf, other]

    cs.IR cs.CL

    Hindi-BEIR : A Large Scale Retrieval Benchmark in Hindi

    Authors: Arkadeep Acharya, Rudra Murthy, Vishwajeet Kumar, Jaydeep Sen

    Abstract: Given the large number of Hindi speakers worldwide, there is a pressing need for robust and efficient information retrieval systems for Hindi. Despite ongoing research, there is a lack of comprehensive benchmark for evaluating retrieval models in Hindi. To address this gap, we introduce the Hindi version of the BEIR benchmark, which includes a subset of English BEIR datasets translated to Hindi, e…

    Submitted 18 August, 2024; originally announced August 2024.

  15. arXiv:2408.07060  [pdf, other]

    cs.SE cs.AI cs.CL cs.LG

    Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

    Authors: Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong

    Abstract: Large language model (LLM) agents have shown great potential in solving real-world software engineering (SWE) problems. The most advanced open-source SWE agent can resolve over 27% of real GitHub issues in SWE-Bench Lite. However, these sophisticated agent frameworks exhibit varying strengths, excelling in certain tasks while underperforming in others. To fully harness the diversity of these agent…

    Submitted 13 August, 2024; originally announced August 2024.

  16. arXiv:2408.04661  [pdf, other]

    cs.CL cond-mat.mtrl-sci

    MaterioMiner -- An ontology-based text mining dataset for extraction of process-structure-property entities

    Authors: Ali Riza Durmaz, Akhil Thomas, Lokesh Mishra, Rachana Niranjan Murthy, Thomas Straub

    Abstract: While large language models learn sound statistical representations of the language and information therein, ontologies are symbolic knowledge representations that can complement the former ideally. Research at this critical intersection relies on datasets that intertwine ontologies and text corpora to enable training and comprehensive benchmarking of neurosymbolic models. We present the MaterioMi…

    Submitted 5 August, 2024; originally announced August 2024.

  17. arXiv:2407.21783  [pdf, other]

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, et al. (536 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical…

    Submitted 23 November, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  18. arXiv:2407.21364  [pdf, other]

    cs.IR

    Personalized Multi-task Training for Recommender System

    Authors: Liangwei Yang, Zhiwei Liu, Jianguo Zhang, Rithesh Murthy, Shelby Heinecke, Huan Wang, Caiming Xiong, Philip S. Yu

    Abstract: In the vast landscape of internet information, recommender systems (RecSys) have become essential for guiding users through a sea of choices aligned with their preferences. These systems have applications in diverse domains, such as news feeds, game suggestions, and shopping recommendations. Personalization is a key technique in RecSys, where modern methods leverage representation learning to enco…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 11 pages

  19. arXiv:2407.13522  [pdf, other]

    cs.LG

    INDIC QA BENCHMARK: A Multilingual Benchmark to Evaluate Question Answering capability of LLMs for Indic Languages

    Authors: Abhishek Kumar Singh, Vishwajeet Kumar, Rudra Murthy, Jaydeep Sen, Ashish Mittal, Ganesh Ramakrishnan

    Abstract: Large Language Models (LLMs) perform well on unseen tasks in English, but their abilities in non English languages are less explored due to limited benchmarks and training data. To bridge this gap, we introduce the Indic QA Benchmark, a large dataset for context grounded question answering in 11 major Indian languages, covering both extractive and abstractive tasks. Evaluations of multilingual LLM…

    Submitted 24 February, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

  20. arXiv:2406.18518  [pdf, other]

    cs.CL cs.AI cs.LG cs.SE

    APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

    Authors: Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong

    Abstract: The advancement of function-calling agent models requires diverse, reliable, and high-quality datasets. This paper presents APIGen, an automated data generation pipeline designed to synthesize verifiable high-quality datasets for function-calling applications. We leverage APIGen and collect 3,673 executable APIs across 21 different categories to generate diverse function-calling datasets in a scal…

    Submitted 26 June, 2024; originally announced June 2024.

  21. arXiv:2406.10290  [pdf, other]

    cs.CL cs.AI cs.LG

    MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

    Authors: Rithesh Murthy, Liangwei Yang, Juntao Tan, Tulika Manoj Awalgaonkar, Yilun Zhou, Shelby Heinecke, Sachin Desai, Jason Wu, Ran Xu, Sarah Tan, Jianguo Zhang, Zhiwei Liu, Shirley Kokane, Zuxin Liu, Ming Zhu, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: The deployment of Large Language Models (LLMs) and Large Multimodal Models (LMMs) on mobile devices has gained significant attention due to the benefits of enhanced privacy, stability, and personalization. However, the hardware constraints of mobile devices necessitate the use of models with fewer parameters and model compression techniques like quantization. Currently, there is limited understand…

    Submitted 12 June, 2024; originally announced June 2024.

  22. arXiv:2402.15506  [pdf, other]

    cs.AI cs.CL cs.LG

    AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

    Authors: Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Ming Zhu, Juntao Tan, Thai Hoang, Zuxin Liu, Liangwei Yang, Yihao Feng, Shirley Kokane, Tulika Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong

    Abstract: Autonomous agents powered by large language models (LLMs) have garnered significant research attention. However, fully harnessing the potential of LLMs for agent-based tasks presents inherent challenges due to the heterogeneous nature of diverse data sources featuring multi-turn trajectories. In this paper, we introduce AgentOhana as a comprehensive solution to address these challenges. …

    Submitted 8 November, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Add GitHub repo link at https://github.com/SalesforceAIResearch/xLAM and HuggingFace model link at https://huggingface.co/Salesforce/xLAM-v0.1-r

  23. arXiv:2401.15006  [pdf, other]

    cs.CL cs.AI

    Airavata: Introducing Hindi Instruction-tuned LLM

    Authors: Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan

    Abstract: We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi. Airavata was created by fine-tuning OpenHathi with diverse, instruction-tuning Hindi datasets to make it better suited for assistive tasks. Along with the model, we also share the IndicInstruct dataset, which is a collection of diverse instruction-tuning datasets to enable further research for Indic LLMs. Additional…

    Submitted 26 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Work in progress

  24. Distributed IRSs Always Benefit Every Mobile Operator

    Authors: L. Yashvanth, Chandra R. Murthy

    Abstract: We investigate the impact of multiple distributed intelligent reflecting surfaces (IRSs), which are deployed and optimized by a mobile operator (MO), on the performance of user equipments (UEs) served by other co-existing out-of-band (OOB) MOs that do not control the IRSs. We show that, under round-robin scheduling, in mmWave frequencies, the ergodic sum spectral efficiency (SE) of an OOB MO incre…

    Submitted 13 July, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted for Publication in IEEE Wireless Communications Letters

  25. arXiv:2401.07078  [pdf, other]

    cs.CL

    PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities

    Authors: Settaluri Lakshmi Sravanthi, Meet Doshi, Tankala Pavan Kalyan, Rudra Murthy, Pushpak Bhattacharyya, Raj Dabre

    Abstract: LLMs have demonstrated remarkable capability for understanding semantics, but they often struggle with understanding pragmatics. To demonstrate this fact, we release a Pragmatics Understanding Benchmark (PUB) dataset consisting of fourteen tasks in four pragmatics phenomena, namely, Implicature, Presupposition, Reference, and Deixis. We curated high-quality test sets for each task, consisting of M…

    Submitted 13 January, 2024; originally announced January 2024.

  26. arXiv:2312.01364  [pdf, other]

    cs.IT

    Tradeoff of age-of-information and power under reliability constraint for short-packet communication with block-length adaptation

    Authors: Sudarsanan A. K., Vineeth B. S., Chandra R. Murthy

    Abstract: In applications such as remote estimation and monitoring, update packets are transmitted by power-constrained devices using short-packet codes over wireless networks. Therefore, networks need to be end-to-end optimized using information freshness metrics such as age of information under transmit power and reliability constraints to ensure support for such applications. For short-packet coding, mod…

    Submitted 3 December, 2023; originally announced December 2023.

  27. On the Impact of an IRS on the Out-of-Band Performance in Sub-6 GHz & mmWave Frequencies

    Authors: L. Yashvanth, Chandra R. Murthy

    Abstract: Intelligent reflecting surfaces (IRSs) were introduced to enhance the performance of wireless communication systems. However, from a service provider's viewpoint, a concern with the use of an IRS is its effect on out-of-band (OOB) quality of service. Specifically, if two operators, say X and Y, provide services in a given geographical area using non-overlapping frequency bands, and if operator X u…

    Submitted 10 June, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted for publication in IEEE Transactions on Communications

  28. arXiv:2308.05960  [pdf, other]

    cs.AI

    BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

    Authors: Zhiwei Liu, Weiran Yao, Jianguo Zhang, Le Xue, Shelby Heinecke, Rithesh Murthy, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: The massive successes of large language models (LLMs) encourage the emerging exploration of LLM-augmented Autonomous Agents (LAAs). An LAA is able to generate actions with its core LLM and interact with environments, which facilitates the ability to resolve complex tasks by conditioning on past interactions such as observations and actions. Since the investigation of LAA is still very recent, limi…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Preprint

  29. arXiv:2308.02151  [pdf, other]

    cs.CL cs.AI

    Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

    Authors: Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh Murthy, Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: Recent months have seen the emergence of a powerful new trend in which large language models (LLMs) are augmented to become autonomous language agents capable of performing objective oriented multi-step tasks on their own, rather than merely responding to queries from human users. Most existing language agents, however, are not optimized using environment-specific rewards. Although some agents ena…

    Submitted 5 May, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

  30. arXiv:2307.08962  [pdf, other]

    cs.AI cs.LG

    REX: Rapid Exploration and eXploitation for AI Agents

    Authors: Rithesh Murthy, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Le Xue, Weiran Yao, Yihao Feng, Zeyuan Chen, Akash Gokul, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese

    Abstract: In this paper, we propose an enhanced approach for Rapid Exploration and eXploitation for AI Agents called REX. Existing AutoGPT-style techniques have inherent limitations, such as a heavy reliance on precise descriptions for decision-making, and the lack of a systematic approach to leverage try-and-fail procedures akin to traditional Reinforcement Learning (RL). REX introduces an additional layer…

    Submitted 26 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

  31. arXiv:2305.11790  [pdf, other]

    cs.CL

    Prompting with Pseudo-Code Instructions

    Authors: Mayank Mishra, Prince Kumar, Riyaz Bhat, Rudra Murthy V, Danish Contractor, Srikanth Tamilselvam

    Abstract: Prompting with natural language instructions has recently emerged as a popular method of harnessing the capabilities of large language models. Given the inherent ambiguity present in natural language, it is intuitive to consider the possible advantages of prompting with less ambiguous prompt styles, such as the use of pseudo-code. In this paper we explore if prompting via pseudo-code instruction…

    Submitted 19 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Published in EMNLP 2023 main track

  32. arXiv:2305.06161  [pdf, other]

    cs.CL cs.AI cs.PL cs.SE

    StarCoder: may the source be with you!

    Authors: Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, et al. (42 additional authors not shown)

    Abstract: The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling capabilities and fast large-batch inference enabled by multi-query attention. StarCoderBase is trained on 1 trillion tokens sourced from The Stack, a large colle…

    Submitted 13 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  33. arXiv:2303.01191  [pdf, other]

    cs.CL

    Denoising-based UNMT is more robust to word-order divergence than MASS-based UNMT

    Authors: Tamali Banerjee, Rudra Murthy V, Pushpak Bhattacharyya

    Abstract: We aim to investigate whether UNMT approaches with self-supervised pre-training are robust to word-order divergence between language pairs. We achieve this by comparing two models pre-trained with the same self-supervised pre-training objective. The first model is trained on language pairs with different word-orders, and the second model is trained on the same language pairs with source language r…

    Submitted 2 March, 2023; originally announced March 2023.

  34. Does an IRS Degrade Out-of-Band Performance?

    Authors: L. Yashvanth, Chandra R. Murthy

    Abstract: Intelligent reflecting surfaces (IRSs) were introduced to enhance the performance of wireless systems. However, from a cellular service provider's view, a concern with the use of an IRS is its effect on out-of-band (OOB) quality of service. Specifically, given two operators, say X and Y, providing services in a geographical area using non-overlapping frequency bands, if operator-X uses an IRS to o…

    Submitted 30 June, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted for presentation in IEEE International Workshop on Signal Processing Advances in Wireless Communications (SPAWC) 2023

  35. arXiv:2302.12489  [pdf, other]

    cs.IT eess.SP

    Channel State Information Based User Censoring in Irregular Repetition Slotted Aloha

    Authors: Chirag Ramesh Srivatsa, Chandra R. Murthy

    Abstract: Irregular repetition slotted aloha (IRSA) is a massive random access protocol which can be used to serve a large number of users while achieving a packet loss rate (PLR) close to zero. However, if the number of users is too high, then the system is interference limited and the PLR is close to one. In this paper, we propose a variant of IRSA in the interference limited regime, namely Censored-IRSA…

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted at IEEE ICC 2023

  36. arXiv:2301.01015  [pdf, other]

    cs.CV cs.AI cs.CL

    Semi-Structured Object Sequence Encoders

    Authors: Rudra Murthy V, Riyaz Bhat, Chulaka Gunasekara, Siva Sankalp Patel, Hui Wan, Tejas Indulal Dhamecha, Danish Contractor, Marina Danilevsky

    Abstract: In this paper we explore the task of modeling semi-structured object sequences; in particular, we focus our attention on the problem of developing a structure-aware input representation for such sequences. Examples of such data include user activity on websites, machine logs, and many others. This type of data is often represented as a sequence of sets of key-value pairs over time and can present…

    Submitted 22 May, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

  37. arXiv:2212.10168  [pdf, other]

    cs.CL

    Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages

    Authors: Arnav Mhaske, Harshit Kedia, Sumanth Doddapaneni, Mitesh M. Khapra, Pratyush Kumar, Rudra Murthy V, Anoop Kunchukuttan

    Abstract: We present, Naamapadam, the largest publicly available Named Entity Recognition (NER) dataset for the 11 major Indian languages from two language families. The dataset contains more than 400k sentences annotated with a total of at least 100k entities from three standard entity categories (Person, Location, and, Organization) for 9 out of the 11 languages. The training dataset has been automaticall…

    Submitted 28 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  38. arXiv:2209.02777  [pdf, other]

    cs.IT eess.SP

    Impact of Mobility on Downlink Cell-Free Massive MIMO Systems

    Authors: Abhinav Anand, Chandra R. Murthy, Ribhu Chopra

    Abstract: In this paper, we analyze the achievable downlink spectral efficiency of cell-free massive multiple input multiple output (CF-mMIMO) systems, accounting for the effects of channel aging (caused by user mobility) and pilot contamination. We consider two cases, one where user equipments (UEs) rely on downlink pilots beamformed by the access points (APs) to estimate downlink channel, and another wher…

    Submitted 6 September, 2022; originally announced September 2022.

  39. Performance Analysis of Irregular Repetition Slotted Aloha with Multi-Cell Interference

    Authors: Chirag Ramesh Srivatsa, Chandra R. Murthy

    Abstract: Irregular repetition slotted aloha (IRSA) is a massive random access protocol in which users transmit several replicas of their packet over a frame to a base station. Existing studies have analyzed IRSA in the single-cell (SC) setup, which does not extend to the more practically relevant multi-cell (MC) setup due to the inter-cell interference. In this work, we analyze MC IRSA, accounting for pilo…

    Submitted 28 May, 2022; v1 submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted at IEEE SPAWC 2022

    Journal ref: IEEE 23rd International Workshop on Signal Processing Advances in Wireless Communication (2022), 1-5

  40. arXiv:2204.13743  [pdf, other]

    cs.CL

    HiNER: A Large Hindi Named Entity Recognition Dataset

    Authors: Rudra Murthy, Pallab Bhattacharjee, Rahul Sharnagat, Jyotsana Khatri, Diptesh Kanojia, Pushpak Bhattacharyya

    Abstract: Named Entity Recognition (NER) is a foundational NLP task that aims to provide class labels like Person, Location, Organisation, Time, and Number to words in free text. Named Entities can also be multi-word expressions where the additional I-O-B annotation information helps label them during the NER annotation process. While English and European languages have considerable annotated data for the N…

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted at LREC 2022, 8 pages

  41. Performance Analysis of Intelligent Reflecting Surface Assisted Opportunistic Communications

    Authors: L. Yashvanth, Chandra R. Murthy

    Abstract: Intelligent reflecting surfaces (IRSs) are a promising technology for enhancing coverage and spectral efficiency, both in the sub-6 GHz and the millimeter wave (mmWave) bands. Existing approaches to leverage the benefits of IRS involve the use of a resource-intensive channel estimation step followed by a computationally expensive algorithm to optimize the reflection coefficients at the IRS. In thi…

    Submitted 24 October, 2022; v1 submitted 11 March, 2022; originally announced March 2022.

    Comments: 17 pages, 9 figures

    Journal ref: IEEE Transactions on Signal Processing, vol. 71, pp. 2056-2070, 2023

  42. On the Impact of Channel Estimation on the Design and Analysis of IRSA based Systems

    Authors: Chirag Ramesh Srivatsa, Chandra R. Murthy

    Abstract: Irregular repetition slotted aloha (IRSA) is a distributed grant-free random access protocol where users transmit multiple replicas of their packets to a base station (BS). The BS recovers the packets using successive interference cancellation. In this paper, we first derive channel estimates for IRSA, exploiting the sparsity structure of IRSA transmissions, when non-orthogonal pilots are employed…

    Submitted 23 June, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted at IEEE Transactions on Signal Processing, June 2022

    Journal ref: IEEE Transactions on Signal Processing, Volume 70, June 2022, 4186-4200

  43. User Activity Detection for Irregular Repetition Slotted Aloha based MMTC

    Authors: Chirag Ramesh Srivatsa, Chandra R. Murthy

    Abstract: Irregular repetition slotted aloha (IRSA) is a grant-free random access protocol for massive machine-type communications, where a large number of users sporadically send their data packets to a base station (BS). IRSA is a completely distributed multiple access protocol: in any given frame, a small subset of the users, i.e., the active users, transmit replicas of their packet in randomly selected…

    Submitted 23 June, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: Accepted at IEEE Transactions on Signal Processing, June 2022

    Journal ref: IEEE Transactions on Signal Processing, Volume 70, June 2022, 3616-3631

  44. arXiv:2111.04975  [pdf, other]

    cs.IT eess.SP

    Evaluation Of Orthogonal Chirp Division Multiplexing For Automotive Integrated Sensing And Communications

    Authors: Sangeeta Bhattacharjee, Kumar Vijay Mishra, Ramesh Annavajjala, Chandra R. Murthy

    Abstract: We consider a bistatic vehicular integrated sensing and communications (ISAC) system that employs the recently proposed orthogonal chirp division multiplexing (OCDM) multicarrier waveform. As a stand-alone communications waveform, OCDM has been shown to be robust against the interference in time-frequency selective channels. In a bistatic ISAC, we exploit this property to develop efficient receive…

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Submitted to ICASSP 2022

  45. arXiv:2110.09968  [pdf, ps, other]

    eess.SP cs.IT

    Can Dynamic TDD Enabled Half-Duplex Cell-Free Massive MIMO Outperform Full-Duplex Cellular Massive MIMO?

    Authors: Anubhab Chowdhury, Ribhu Chopra, Chandra R. Murthy

    Abstract: We consider a dynamic time division duplex (DTDD) enabled cell-free massive multiple-input multiple-output (CF-mMIMO) system, where each half-duplex (HD) access point (AP) is scheduled to operate in the uplink (UL) or downlink (DL) mode based on the data demands of the user equipments (UEs), with the goal of maximizing the sum UL-DL spectral efficiency (SE). We develop a new, low complexity, greed…

    Submitted 21 May, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: Accepted, IEEE Transactions on Communications

    Journal ref: IEEE Transactions on Communications, May, 2022

  46. arXiv:2109.10534  [pdf, other]

    cs.CL

    Role of Language Relatedness in Multilingual Fine-tuning of Language Models: A Case Study in Indo-Aryan Languages

    Authors: Tejas Indulal Dhamecha, Rudra Murthy V, Samarth Bharadwaj, Karthik Sankaranarayanan, Pushpak Bhattacharyya

    Abstract: We explore the impact of leveraging the relatedness of languages that belong to the same family in NLP models using multilingual fine-tuning. We hypothesize and validate that multilingual fine-tuning of pre-trained language models can yield better performance on downstream NLP applications, compared to models fine-tuned on individual languages. A first of its kind detailed study is presented to tr…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted in EMNLP 2021

  47. Resilient and Latency-aware Orchestration of Network Slices Using Multi-connectivity in MEC-enabled 5G Networks

    Authors: Prabhu Kaliyammal Thiruvasagam, Abhishek Chakraborty, C Siva Ram Murthy

    Abstract: Network slicing (NS) and multi-access edge computing (MEC) are new paradigms which play key roles in 5G and beyond networks. NS allows network operators (NOs) to divide the available network resources into multiple logical NSs for providing dedicated virtual networks tailored to the specific service/business requirements. MEC enables NOs to provide diverse ultra-low latency services for supporting…

    Submitted 1 July, 2021; originally announced July 2021.

  48. arXiv:2106.09849  [pdf, other]

    cs.NI

    Latency-aware and Survivable Mapping of VNFs in 5G Network Edge Cloud

    Authors: Prabhu Kaliyammal Thiruvasagam, Abhishek Chakraborty, C. Siva Ram Murthy

    Abstract: Network Functions Virtualization (NFV) and Multi-access Edge Computing (MEC) play crucial roles in 5G networks for dynamically provisioning diverse communication services with heterogeneous service requirements. In particular, while NFV improves flexibility and scalability by softwarizing physical network functions as Virtual Network Functions (VNFs), MEC enables to provide delay-sensitive/time-cr…

    Submitted 17 June, 2021; originally announced June 2021.

  49. arXiv:2106.04995  [pdf, other]

    cs.CL cs.LG

    Crosslingual Embeddings are Essential in UNMT for Distant Languages: An English to IndoAryan Case Study

    Authors: Tamali Banerjee, Rudra Murthy V, Pushpak Bhattacharyya

    Abstract: Recent advances in Unsupervised Neural Machine Translation (UNMT) have minimized the gap between supervised and unsupervised machine translation performance for closely related language pairs. However, the situation is very different for distant language pairs. Lack of lexical overlap and low syntactic similarities such as between English and Indo-Aryan languages leads to poor translation quality…

    Submitted 9 June, 2021; originally announced June 2021.

  50. Multiple Support Recovery Using Very Few Measurements Per Sample

    Authors: Lekshmi Ramesh, Chandra R. Murthy, Himanshu Tyagi

    Abstract: In the problem of multiple support recovery, we are given access to linear measurements of multiple sparse samples in $\mathbb{R}^{d}$. These samples can be partitioned into $\ell$ groups, with samples having the same support belonging to the same group. For a given budget of $m$ measurements per sample, the goal is to recover the $\ell$ underlying supports, in the absence of the knowledge of grou…

    Submitted 20 May, 2021; originally announced May 2021.