Skip to main content

Showing 1–36 of 36 results for author: Hazra, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.02554  [pdf, ps, other

    cs.AI cs.LG

    AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

    Authors: Edan Toledo, Karen Hambardzumyan, Martin Josifoski, Rishi Hazra, Nicolas Baldwin, Alexis Audran-Reiss, Michael Kuchnik, Despoina Magka, Minqi Jiang, Alisia Maria Lupidi, Andrei Lupu, Roberta Raileanu, Kelvin Niu, Tatiana Shavrina, Jean-Christophe Gagnon-Audet, Michael Shvartsman, Shagun Sodhani, Alexander H. Miller, Abhishek Charnalia, Derek Dunfield, Carole-Jean Wu, Pontus Stenetorp, Nicola Cancedda, Jakob Nicolaus Foerster, Yoram Bachrach

    Abstract: AI research agents are demonstrating great potential to accelerate scientific progress by automating the design, implementation, and training of machine learning models. We focus on methods for improving agents' performance on MLE-bench, a challenging benchmark where agents compete in Kaggle competitions to solve real-world machine learning problems. We formalize AI research agents as search polic… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: Code: https://github.com/facebookresearch/aira-dojo

  2. arXiv:2505.14469  [pdf, ps, other

    cs.CL cs.AI

    Attributional Safety Failures in Large Language Models under Code-Mixed Perturbations

    Authors: Somnath Banerjee, Pratyush Chatterjee, Shanu Kumar, Sayan Layek, Parag Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: Recent advancements in LLMs have raised significant safety concerns, particularly when dealing with code-mixed inputs and outputs. Our study systematically investigates the increased susceptibility of LLMs to produce unsafe outputs from code-mixed prompts compared to monolingual English prompts. Utilizing explainability methods, we dissect the internal attribution shifts causing model's harmful be… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  3. arXiv:2504.03930  [pdf, other

    cs.AI cs.CC cs.LG

    Have Large Language Models Learned to Reason? A Characterization via 3-SAT Phase Transition

    Authors: Rishi Hazra, Gabriele Venturato, Pedro Zuidberg Dos Martires, Luc De Raedt

    Abstract: Large Language Models (LLMs) have been touted as AI models possessing advanced reasoning abilities. In theory, autoregressive LLMs with Chain-of-Thought (CoT) can perform more serial computations to solve complex reasoning tasks. However, recent studies suggest that, despite this capacity, LLMs do not truly learn to reason but instead fit on statistical features. To study the reasoning capabilitie… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: An updated version of arXiv:2408.07215v2, featuring: (1) inclusion of recent LRMs and recent LLMs, (2) revised conclusions reflecting recent developments, and (3) updated analysis

  4. arXiv:2502.11246  [pdf, other

    cs.IR cs.CL cs.CY

    MemeSense: An Adaptive In-Context Framework for Social Commonsense Driven Meme Moderation

    Authors: Sayantan Adak, Somnath Banerjee, Rajarshi Mandal, Avik Halder, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: Memes present unique moderation challenges due to their subtle, multimodal interplay of images, text, and social context. Standard systems relying predominantly on explicit textual cues often overlook harmful content camouflaged by irony, symbolism, or cultural references. To address this gap, we introduce MemeSense, an adaptive in-context learning framework that fuses social commonsense reasoning… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: Code and data available at: https://github.com/sayantan11995/MemeSense

  5. arXiv:2502.11244  [pdf, other

    cs.CL cs.AI

    Soteria: Language-Specific Functional Parameter Steering for Multilingual Safety Alignment

    Authors: Somnath Banerjee, Sayan Layek, Pratyush Chatterjee, Animesh Mukherjee, Rima Hazra

    Abstract: Ensuring consistent safety across multiple languages remains a significant challenge for large language models (LLMs). We introduce Soteria, a lightweight yet powerful strategy that locates and minimally adjusts the "functional heads" most responsible for harmful content generation in each language. By altering only a fraction of parameters, Soteria drastically reduces policy violations without sa… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  6. arXiv:2501.12128  [pdf, other

    cs.RO cs.HC

    Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions

    Authors: Tim Schreiter, Jens V. Rüppel, Rishi Hazra, Andrey Rudenko, Martin Magnusson, Achim J. Lilienthal

    Abstract: To achieve natural and intuitive interaction with people, HRI frameworks combine a wide array of methods for human perception, intention communication, human-aware navigation and collaborative action. In practice, when encountering unpredictable behavior of people or unexpected states of the environment, these frameworks may lack the ability to dynamically recognize such states, adapt and recover… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: Accepted as a Late-Breaking Report to the 2025, 20th ACM/IEEE International Conference on Human-Robot Interaction (HRI)

  7. arXiv:2501.01872  [pdf, other

    cs.CL

    Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions

    Authors: Rachneet Sachdeva, Rima Hazra, Iryna Gurevych

    Abstract: Large language models, despite extensive alignment with human values and ethical principles, remain vulnerable to sophisticated jailbreak attacks that exploit their reasoning abilities. Existing safety measures often detect overt malicious intent but fail to address subtle, reasoning-driven vulnerabilities. In this work, we introduce POATE (Polar Opposite query generation, Adversarial Template con… ▽ More

    Submitted 9 January, 2025; v1 submitted 3 January, 2025; originally announced January 2025.

    Comments: Our code is publicly available at https://github.com/UKPLab/POATE-attack

  8. arXiv:2410.12880  [pdf, other

    cs.CL cs.AI cs.CY

    Navigating the Cultural Kaleidoscope: A Hitchhiker's Guide to Sensitivity in Large Language Models

    Authors: Somnath Banerjee, Sayan Layek, Hari Shrawgi, Rajarshi Mandal, Avik Halder, Shanu Kumar, Sagnik Basu, Parag Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: As LLMs are increasingly deployed in global applications, the importance of cultural sensitivity becomes paramount, ensuring that users from diverse backgrounds feel respected and understood. Cultural harm can arise when these models fail to align with specific cultural norms, resulting in misrepresentations or violations of cultural values. This work addresses the challenges of ensuring cultural… ▽ More

    Submitted 24 January, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: Accepted at NAACL 2025 (Main track). [Project Page](https://neuralsentinel.github.io/KaleidoCulture/)

  9. arXiv:2408.10589  [pdf, other

    cs.RO cs.HC

    Bidirectional Intent Communication: A Role for Large Foundation Models

    Authors: Tim Schreiter, Rishi Hazra, Jens Rüppel, Andrey Rudenko

    Abstract: Integrating multimodal foundation models has significantly enhanced autonomous agents' language comprehension, perception, and planning capabilities. However, while existing works adopt a \emph{task-centric} approach with minimal human interaction, applying these models to developing assistive \emph{user-centric} robots that can interact and cooperate with humans remains underexplored. This paper… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 2024 33rd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), Workshop: Large Language Models in the RoMan Age

  10. arXiv:2408.07215  [pdf, other

    cs.AI

    Can Large Language Models Reason? A Characterization via 3-SAT

    Authors: Rishi Hazra, Gabriele Venturato, Pedro Zuidberg Dos Martires, Luc De Raedt

    Abstract: Large Language Models (LLMs) have been touted as AI models possessing advanced reasoning abilities. However, recent works have shown that LLMs often bypass true reasoning using shortcuts, sparking skepticism. To study the reasoning capabilities in a principled fashion, we adopt a computational theory perspective and propose an experimental protocol centered on 3-SAT -- the prototypical NP-complete… ▽ More

    Submitted 22 October, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  11. arXiv:2406.12274  [pdf, other

    cs.CL

    SafeInfer: Context Adaptive Decoding Time Safety Alignment for Large Language Models

    Authors: Somnath Banerjee, Sayan Layek, Soham Tripathy, Shanu Kumar, Animesh Mukherjee, Rima Hazra

    Abstract: Safety-aligned language models often exhibit fragile and imbalanced safety mechanisms, increasing the likelihood of generating unsafe content. In addition, incorporating new knowledge through editing techniques to language models can further compromise safety. To address these issues, we propose SafeInfer, a context-adaptive, decoding-time safety alignment strategy for generating safe responses to… ▽ More

    Submitted 14 December, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted at AAAI 2025 (AI Alignment Track). Also selected for Microsoft Academic Partnership Grant (MAPG) 2024

  12. arXiv:2406.11801  [pdf, other

    cs.CL

    Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: Ensuring the safe alignment of large language models (LLMs) with human values is critical as they become integral to applications like translation and question answering. Current alignment methods struggle with dynamic user intentions and complex objectives, making models vulnerable to generating harmful content. We propose Safety Arithmetic, a training-free framework enhancing LLM safety across d… ▽ More

    Submitted 28 October, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: EMNLP 2024 Main. Codes are available at: https://github.com/declare-lab/safety-arithmetic

  13. arXiv:2406.11139  [pdf, other

    cs.CL

    Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance

    Authors: Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee

    Abstract: The integration of pretrained language models (PLMs) like BERT and GPT has revolutionized NLP, particularly for English, but it has also created linguistic imbalances. This paper strategically identifies the need for linguistic equity by examining several knowledge editing techniques in multilingual contexts. We evaluate the performance of models such as Mistral, TowerInstruct, OpenHathi, Tamil-Ll… ▽ More

    Submitted 18 March, 2025; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at NAACL 2025 (Industry track)

  14. arXiv:2406.01309  [pdf, other

    cs.NE cs.AI

    REvolve: Reward Evolution with Large Language Models using Human Feedback

    Authors: Rishi Hazra, Alkis Sygkounas, Andreas Persson, Amy Loutfi, Pedro Zuidberg Dos Martires

    Abstract: Designing effective reward functions is crucial to training reinforcement learning (RL) algorithms. However, this design is non-trivial, even for domain experts, due to the subjective nature of certain tasks that are hard to quantify explicitly. In recent works, large language models (LLMs) have been used for reward generation from natural language task descriptions, leveraging their extensive ins… ▽ More

    Submitted 23 May, 2025; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Published in ICLR 2025. Project page: https://rishihazra.github.io/REvolve

  15. arXiv:2402.16159  [pdf, other

    cs.CL

    DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee

    Abstract: With the AI revolution in place, the trend for building automated systems to support professionals in different domains such as the open source software systems, healthcare systems, banking systems, transportation systems and many others have become increasingly prominent. A crucial requirement in the automation of support tools for such systems is the early identification of named entities, which… ▽ More

    Submitted 20 June, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

    Comments: Accepted at ECML-PKDD 2024 (Long Paper)

  16. arXiv:2402.15302  [pdf, other

    cs.CL cs.CR

    How (un)ethical are instruction-centric responses of LLMs? Unveiling the vulnerabilities of safety guardrails to harmful queries

    Authors: Somnath Banerjee, Sayan Layek, Rima Hazra, Animesh Mukherjee

    Abstract: In this study, we tackle a growing concern around the safety and ethical use of large language models (LLMs). Despite their potential, these models can be tricked into producing harmful or unethical content through various sophisticated methods, including 'jailbreaking' techniques and targeted manipulation. Our work zeroes in on a specific issue: to what extent LLMs can be led astray by asking the… ▽ More

    Submitted 16 November, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted at AAAI Conference on Web and Social Media (ICWSM) 2025. [Dataset](https://huggingface.co/datasets/SoftMINER-Group/TechHazardQA)

  17. arXiv:2401.12671  [pdf, other

    cs.CL

    Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context

    Authors: Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee

    Abstract: In the continuously advancing AI landscape, crafting context-rich and meaningful responses via Large Language Models (LLMs) is essential. Researchers are becoming more aware of the challenges that LLMs with fewer parameters encounter when trying to provide suitable answers to open-ended questions. To address these hurdles, the integration of cutting-edge strategies, augmentation of rich external d… ▽ More

    Submitted 15 October, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted at EMNLP 2024

  18. arXiv:2401.10647  [pdf, other

    cs.CL

    Sowing the Wind, Reaping the Whirlwind: The Impact of Editing Language Models

    Authors: Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria

    Abstract: In the rapidly advancing field of artificial intelligence, the concept of Red-Teaming or Jailbreaking large language models (LLMs) has emerged as a crucial area of study. This approach is especially significant in terms of assessing and enhancing the safety and robustness of these models. This paper investigates the intricate consequences of such modifications through model editing, uncovering a c… ▽ More

    Submitted 16 May, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Accepted at ACL 2024

  19. arXiv:2312.05626  [pdf, other

    cs.SE cs.AI

    Redefining Developer Assistance: Through Large Language Models in Software Ecosystem

    Authors: Somnath Banerjee, Avik Dutta, Sayan Layek, Amruit Sahoo, Sam Conrad Joyce, Rima Hazra

    Abstract: In this paper, we delve into the advancement of domain-specific Large Language Models (LLMs) with a focus on their application in software development. We introduce DevAssistLlama, a model developed through instruction tuning, to assist developers in processing software-related natural language queries. This model, a variant of instruction tuned LLM, is particularly adept at handling intricate tec… ▽ More

    Submitted 15 March, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

    Comments: Under review

  20. arXiv:2309.10811  [pdf, other

    cs.DL cs.CL

    Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science

    Authors: Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

    Abstract: Interdisciplinarity has over the recent years have gained tremendous importance and has become one of the key ways of doing cutting edge research. In this paper we attempt to model the citation flow across three different fields -- Physics (PHY), Mathematics (MA) and Computer Science (CS). For instance, is there a specific pattern in which these fields cite one another? We carry out experiments on… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted at Journal of Physics: Complexity

  21. arXiv:2309.05961  [pdf, other

    cs.SI cs.CL cs.IR cs.LG

    Evaluating the Ebb and Flow: An In-depth Analysis of Question-Answering Trends across Diverse Platforms

    Authors: Rima Hazra, Agnik Saha, Somnath Banerjee, Animesh Mukherjee

    Abstract: Community Question Answering (CQA) platforms steadily gain popularity as they provide users with fast responses to their queries. The swiftness of these responses is contingent on a mixture of query-specific and user-related elements. This paper scrutinizes these contributing factors within the context of six highly popular CQA platforms, identified through their standout answering speed. Our inve… ▽ More

    Submitted 15 March, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted as POSTER

  22. arXiv:2309.05035  [pdf, other

    cs.IR cs.SE cs.SI

    Duplicate Question Retrieval and Confirmation Time Prediction in Software Communities

    Authors: Rima Hazra, Debanjan Saha, Amruit Sahoo, Somnath Banerjee, Animesh Mukherjee

    Abstract: Community Question Answering (CQA) in different domains is growing at a large scale because of the availability of several platforms and huge shareable information among users. With the rapid growth of such online platforms, a massive amount of archived data makes it difficult for moderators to retrieve possible duplicates for a new question and identify and confirm existing question pairs as dupl… ▽ More

    Submitted 5 March, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: Full paper accepted at ASONAM 2023: The 2023 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

  23. arXiv:2308.12682  [pdf, other

    cs.AI

    SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge

    Authors: Rishi Hazra, Pedro Zuidberg Dos Martires, Luc De Raedt

    Abstract: Large Language Models (LLMs) have demonstrated impressive planning abilities due to their vast "world knowledge". Yet, obtaining plans that are both feasible (grounded in affordances) and cost-effective (in plan length), remains a challenge, despite recent progress. This contrasts with heuristic planning methods that employ domain knowledge (formalized in action models such as PDDL) and heuristic… ▽ More

    Submitted 1 January, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted in AAAI 2024. Website: https://rishihazra.github.io/SayCanPay/

  24. arXiv:2304.08349  [pdf, other

    cs.AI cs.LG

    Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach

    Authors: Rishi Hazra, Luc De Raedt

    Abstract: Despite numerous successes in Deep Reinforcement Learning (DRL), the learned policies are not interpretable. Moreover, since DRL does not exploit symbolic relational representations, it has difficulties in coping with structural changes in its environment (such as increasing the number of objects). Relational Reinforcement Learning, on the other hand, inherits the relational representations from s… ▽ More

    Submitted 14 July, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: Abridged version accepted in ECML-PKDD 2023

  25. arXiv:2303.16975  [pdf, other

    cs.CV

    EgoTV: Egocentric Task Verification from Natural Language Task Descriptions

    Authors: Rishi Hazra, Brian Chen, Akshara Rai, Nitin Kamra, Ruta Desai

    Abstract: To enable progress towards egocentric agents capable of understanding everyday tasks specified in natural language, we propose a benchmark and a synthetic dataset called Egocentric Task Verification (EgoTV). The goal in EgoTV is to verify the execution of tasks from egocentric videos based on the natural language description of these tasks. EgoTV contains pairs of videos and their task description… ▽ More

    Submitted 25 September, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted at ICCV 2023

  26. arXiv:2207.00623  [pdf, other

    cs.IR cs.SE

    Is this bug severe? A text-cum-graph based model for bug severity prediction

    Authors: Rima Hazra, Arpit Dwivedi, Animesh Mukherjee

    Abstract: Repositories of large software systems have become commonplace. This massive expansion has resulted in the emergence of various problems in these software platforms including identification of (i) bug-prone packages, (ii) critical bugs, and (iii) severity of bugs. One of the important goals would be to mine these bugs and recommend them to the developers to resolve them. The first step to this is… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted at ECML PKDD 2022, Research and ADS Track

  27. arXiv:2105.05069  [pdf, other

    cs.AI cs.CL

    Zero-Shot Generalization using Intrinsically Motivated Compositional Emergent Protocols

    Authors: Rishi Hazra, Sonu Dixit, Sayambhu Sen

    Abstract: Human language has been described as a system that makes \textit{use of finite means to express an unlimited array of thoughts}. Of particular interest is the aspect of compositionality, whereby, the meaning of a compound language expression can be deduced from the meaning of its constituent parts. If artificial agents can develop compositional communication protocols akin to human language, they… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: Accepted in NAACL 2021 workshop: Visually Grounded Interaction and Language (ViGIL). arXiv admin note: substantial text overlap with arXiv:2012.05011

  28. arXiv:2105.03943  [pdf, other

    cs.CL cs.AI

    gComm: An environment for investigating generalization in Grounded Language Acquisition

    Authors: Rishi Hazra, Sonu Dixit

    Abstract: gComm is a step towards developing a robust platform to foster research in grounded language acquisition in a more challenging and realistic setting. It comprises a 2-d grid environment with a set of agents (a stationary speaker and a mobile listener connected via a communication channel) exposed to a continuous array of tasks in a partially observable setting. The key to solving these tasks lies… ▽ More

    Submitted 15 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

    Comments: Accepted in NAACL 2021 workshop: Visually Grounded Interaction and Language (ViGIL). arXiv admin note: substantial text overlap with arXiv:2012.05011

  29. arXiv:2103.06490   

    cs.CL cs.AI cs.HC cs.LG cs.NE

    Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

    Authors: Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir, Ambedkar Dukkipati

    Abstract: While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation… ▽ More

    Submitted 3 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Two of the authors had published similar manuscripts on arXiv. So withdrawing this one. All further updations will be reflected at arXiv:1911.00234

  30. arXiv:2101.08729  [pdf, other

    cs.IR cs.SE

    Joint Autoregressive and Graph Models for Software and Developer Social Networks

    Authors: Rima Hazra, Hardik Aggarwal, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti

    Abstract: Social network research has focused on hyperlink graphs, bibliographic citations, friend/follow patterns, influence spread, etc. Large software repositories also form a highly valuable networked artifact, usually in the form of a collection of packages, their developers, dependencies among them, and bug reports. This "social network of code" is rarely studied by social network researchers. We intr… ▽ More

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted at ECIR 2021

  31. arXiv:2101.00454  [pdf, other

    cs.DL

    Mining the online infosphere: A survey

    Authors: Sayantan Adak, Souvic Chakraborty, Paramtia Das, Mithun Das, Abhisek Dash, Rima Hazra, Binny Mathew, Punyajoy Saha, Soumya Sarkar, Animesh Mukherjee

    Abstract: The evolution of AI-based system and applications had pervaded everyday life to make decisions that have momentous impact on individuals and society. With the staggering growth of online data, often termed as the Online Infosphere it has become paramount to monitor the infosphere to ensure social good as the AI-based decisions are severely dependent on it. The goal of this survey is to provide a c… ▽ More

    Submitted 2 January, 2021; originally announced January 2021.

    Comments: 29 pages

  32. arXiv:2012.05011  [pdf, other

    cs.CL cs.AI

    Intrinsically Motivated Compositional Language Emergence

    Authors: Rishi Hazra, Sonu Dixit, Sayambhu Sen

    Abstract: Recently, there has been a great deal of research in emergent communication on artificial agents interacting in simulated environments. Recent studies have revealed that, in general, emergent languages do not follow the compositionality patterns of natural language. To deal with this, existing works have proposed a limited channel capacity as an important constraint for learning highly composition… ▽ More

    Submitted 27 January, 2023; v1 submitted 9 December, 2020; originally announced December 2020.

    Comments: abridged version Accepted in NAACL 2021 workshop: Visually Grounded Interaction and Language (ViGIL) titled "gComm: An environment for investigating generalization in Grounded Language Acquisition"

  33. Characterising authors on the extent of their paper acceptance: A case study of the Journal of High Energy Physics

    Authors: Rima Hazra, Aryan, Hardik Aggarwal, Matteo Marsili, Animesh Mukherjee

    Abstract: New researchers are usually very curious about the recipe that could accelerate the chances of their paper getting accepted in a reputed forum (journal/conference). In search of such a recipe, we investigate the profile and peer review text of authors whose papers almost always get accepted at a venue (Journal of High Energy Physics in our current work). We find authors with high acceptance rate a… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: Accepted in JCDL'2020

  34. arXiv:2004.02780  [pdf, other

    cs.MA cs.AI

    Networked Multi-Agent Reinforcement Learning with Emergent Communication

    Authors: Shubham Gupta, Rishi Hazra, Ambedkar Dukkipati

    Abstract: Multi-Agent Reinforcement Learning (MARL) methods find optimal policies for agents that operate in the presence of other learning agents. Central to achieving this is how the agents coordinate. One way to coordinate is by learning to communicate with each other. Can the agents develop a language while learning to perform a common task? In this paper, we formulate and study a MARL problem where coo… ▽ More

    Submitted 9 April, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: An abridged version of this paper has been accepted as a short paper at AAMAS 2020

  35. arXiv:1911.00234  [pdf, other

    cs.LG cs.IR stat.ML

    Active$^2$ Learning: Actively reducing redundancies in Active Learning methods for Sequence Tagging and Machine Translation

    Authors: Rishi Hazra, Parag Dutta, Shubham Gupta, Mohammed Abdul Qaathir, Ambedkar Dukkipati

    Abstract: While deep learning is a powerful tool for natural language processing (NLP) problems, successful solutions to these problems rely heavily on large amounts of annotated samples. However, manually annotating data is expensive and time-consuming. Active Learning (AL) strategies reduce the need for huge volumes of labeled data by iteratively selecting a small number of examples for manual annotation… ▽ More

    Submitted 6 April, 2021; v1 submitted 1 November, 2019; originally announced November 2019.

    Comments: Accepted in NAACL-HLT 2021

  36. arXiv:1908.03793  [pdf, other

    cs.DL

    The rise and rise of interdisciplinary research: Understanding the interaction dynamics of three major fields -- Physics, Mathematics & Computer Science

    Authors: Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

    Abstract: The distinction between sciences is becoming increasingly more artificial -- an approach from one area can be easily applied to the other. More exciting research nowadays is happening perhaps at the interfaces of disciplines like Physics, Mathematics and Computer Science. How do these interfaces emerge and interact? For instance, is there a specific pattern in which these fields cite each other? I… ▽ More

    Submitted 10 August, 2019; originally announced August 2019.

    Comments: This work has been accepted in ICADL-2019 conference