Skip to main content

Showing 1–21 of 21 results for author: Kumarage, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17514  [pdf, ps, other

    cs.AI

    Kaleidoscopic Teaming in Multi Agent Simulations

    Authors: Ninareh Mehrabi, Tharindu Kumarage, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

    Abstract: Warning: This paper contains content that may be inappropriate or offensive. AI agents have gained significant recent attention due to their autonomous tool usage capabilities and their integration in various real-world applications. This autonomy poses novel challenges for the safety of such systems, both in single- and multi-agent scenarios. We argue that existing red teaming or safety evaluat… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  2. arXiv:2505.21784  [pdf, ps, other

    cs.AI cs.CL

    Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation

    Authors: Tharindu Kumarage, Ninareh Mehrabi, Anil Ramakrishna, Xinyan Zhao, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta, Charith Peris

    Abstract: Safety reasoning is a recent paradigm where LLMs reason over safety policies before generating responses, thereby mitigating limitations in existing safety measures such as over-refusal and jailbreak vulnerabilities. However, implementing this paradigm is challenging due to the resource-intensive process of creating high-quality policy-embedded chain-of-thought (CoT) datasets while ensuring reason… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025 (Findings)

  3. arXiv:2504.00389  [pdf, other

    cs.AI

    CyberBOT: Towards Reliable Cybersecurity Education via Ontology-Grounded Retrieval Augmented Generation

    Authors: Chengshuai Zhao, Riccardo De Maria, Tharindu Kumarage, Kumar Satvik Chaudhary, Garima Agrawal, Yiwen Li, Jongchan Park, Yuli Deng, Ying-Chih Chen, Huan Liu

    Abstract: Advancements in large language models (LLMs) have enabled the development of intelligent educational tools that support inquiry-based learning across technical domains. In cybersecurity education, where accuracy and safety are paramount, systems must go beyond surface-level relevance to provide information that is both trustworthy and domain-appropriate. To address this challenge, we introduce Cyb… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  4. arXiv:2503.21888  [pdf, other

    cs.CL cs.AI

    RedditESS: A Mental Health Social Support Interaction Dataset -- Understanding Effective Social Support to Refine AI-Driven Support Tools

    Authors: Zeyad Alghamdi, Tharindu Kumarage, Garima Agrawal, Mansooreh Karami, Ibrahim Almuteb, Huan Liu

    Abstract: Effective mental health support is crucial for alleviating psychological distress. While large language model (LLM)-based assistants have shown promise in mental health interventions, existing research often defines "effective" support primarily in terms of empathetic acknowledgments, overlooking other essential dimensions such as informational guidance, community validation, and tangible coping s… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  5. arXiv:2503.15552  [pdf, other

    cs.CR cs.CL

    Personalized Attacks of Social Engineering in Multi-turn Conversations -- LLM Agents for Simulation and Detection

    Authors: Tharindu Kumarage, Cameron Johnson, Jadie Adams, Lin Ai, Matthias Kirchner, Anthony Hoogs, Joshua Garland, Julia Hirschberg, Arslan Basharat, Huan Liu

    Abstract: The rapid advancement of conversational agents, particularly chatbots powered by Large Language Models (LLMs), poses a significant risk of social engineering (SE) attacks on social media platforms. SE detection in multi-turn, chat-based interactions is considerably more complex than single-instance detection due to the dynamic nature of these conversations. A critical factor in mitigating this thr… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  6. arXiv:2412.14191  [pdf, other

    cs.CY cs.AI

    Ontology-Aware RAG for Improved Question-Answering in Cybersecurity Education

    Authors: Chengshuai Zhao, Garima Agrawal, Tharindu Kumarage, Zhen Tan, Yuli Deng, Ying-Chih Chen, Huan Liu

    Abstract: Integrating AI into education has the potential to transform the teaching of science and technology courses, particularly in the field of cybersecurity. AI-driven question-answering (QA) systems can actively manage uncertainty in cybersecurity problem-solving, offering interactive, inquiry-based learning experiences. Large language models (LLMs) have gained prominence in AI-driven QA systems, offe… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

  7. arXiv:2410.04616  [pdf, other

    cs.CL

    Can LLMs Improve Multimodal Fact-Checking by Asking Relevant Questions?

    Authors: Alimohammad Beigi, Bohan Jiang, Dawei Li, Zhen Tan, Pouya Shaeri, Tharindu Kumarage, Amrita Bhattacharjee, Huan Liu

    Abstract: Traditional fact-checking relies on humans to formulate relevant and targeted fact-checking questions (FCQs), search for evidence, and verify the factuality of claims. While Large Language Models (LLMs) have been commonly used to automate evidence retrieval and factuality verification at scale, their effectiveness for fact-checking is hindered by the absence of FCQ formulation. To bridge this gap,… ▽ More

    Submitted 20 February, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

  8. arXiv:2407.12216  [pdf, other

    cs.IR

    Mindful-RAG: A Study of Points of Failure in Retrieval Augmented Generation

    Authors: Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu

    Abstract: Large Language Models (LLMs) are proficient at generating coherent and contextually relevant text but face challenges when addressing knowledge-intensive queries in domain-specific and factual question-answering tasks. Retrieval-augmented generation (RAG) systems mitigate this by incorporating external knowledge sources, such as structured knowledge graphs (KGs). However, LLMs often struggle to pr… ▽ More

    Submitted 6 October, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  9. arXiv:2406.12263  [pdf, other

    cs.CL

    Defending Against Social Engineering Attacks in the Age of LLMs

    Authors: Lin Ai, Tharindu Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg

    Abstract: The proliferation of Large Language Models (LLMs) poses challenges in detecting and mitigating digital deception, as these models can emulate human conversational patterns and facilitate chat-based social engineering (CSE) attacks. This study investigates the dual capabilities of LLMs as both facilitators and defenders against CSE threats. We develop a novel dataset, SEConvo, simulating CSE scenar… ▽ More

    Submitted 11 October, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  10. arXiv:2404.11036  [pdf, other

    cs.LG cs.CL

    Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Content moderation faces a challenging task as social media's ability to spread hate speech contrasts with its role in promoting global connectivity. With rapidly evolving slang and hate speech, the adaptability of conventional deep learning to the fluid landscape of online dialogue remains limited. In response, causality inspired disentanglement has shown promise by segregating platform specific… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  11. arXiv:2403.08035  [pdf, other

    cs.CL cs.AI

    Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection

    Authors: Tharindu Kumarage, Amrita Bhattacharjee, Joshua Garland

    Abstract: Large language models (LLMs) excel in many diverse applications beyond language generation, e.g., translation, summarization, and sentiment analysis. One intriguing application is in text classification. This becomes pertinent in the realm of identifying hateful or toxic speech -- a domain fraught with challenges and ethical dilemmas. In our study, we have two objectives: firstly, to offer a liter… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  12. arXiv:2403.01152  [pdf, other

    cs.CL cs.AI

    A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

    Authors: Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu

    Abstract: We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text. While these LLMs have revolutionized text generation across various domains, they also pose significant risks to the information ecosystem, such as the potential for generating convincing propaganda, misinformation, and disinformation at scale. This paper offers a review… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  13. arXiv:2311.07914  [pdf, other

    cs.CL cs.LG

    Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey

    Authors: Garima Agrawal, Tharindu Kumarage, Zeyad Alghamdi, Huan Liu

    Abstract: The contemporary LLMs are prone to producing hallucinations, stemming mainly from the knowledge gaps within the models. To address this critical limitation, researchers employ diverse strategies to augment the LLMs by incorporating external knowledge, aiming to reduce hallucinations and enhance reasoning accuracy. Among these strategies, leveraging knowledge graphs as a source of external informat… ▽ More

    Submitted 15 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted Paper in NAACL 2024

  14. arXiv:2310.05095  [pdf, other

    cs.CL cs.AI

    How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts

    Authors: Tharindu Kumarage, Paras Sheth, Raha Moraffah, Joshua Garland, Huan Liu

    Abstract: In recent years, there has been a rapid proliferation of AI-generated text, primarily driven by the release of powerful pre-trained language models (PLMs). To address the issue of misuse associated with AI-generated text, various high-performing detectors have been developed, including the OpenAI detector and the Stanford DetectGPT. In our study, we ask how reliable these detectors are. We answer… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Findings)

  15. arXiv:2309.03992  [pdf, other

    cs.CL cs.AI cs.LG

    ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

    Authors: Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah, Huan Liu

    Abstract: Large language models (LLMs) are increasingly being used for generating text in a variety of use cases, including journalistic news articles. Given the potential malicious nature in which these LLMs can be used to generate disinformation at scale, it is important to build effective detectors for such AI-generated text. Given the surge in development of new LLMs, acquiring labeled training data for… ▽ More

    Submitted 20 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Camera-ready for IJCNLP-AACL 2023 main track

  16. arXiv:2309.03164  [pdf, other

    cs.CL cs.AI

    J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

    Authors: Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu, Joshua Garland

    Abstract: The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape. Among various types of AI-generated text, AI-generated news presents a significant threat as it can be a prominent source of misinformation online. While several recent efforts have focused on detecting AI-generated text in general, these methods require enhanced reliability, given concerns about… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: This Paper is Accepted to The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL 2023)

  17. arXiv:2308.07305  [pdf, other

    cs.CL cs.AI

    Neural Authorship Attribution: Stylometric Analysis on Large Language Models

    Authors: Tharindu Kumarage, Huan Liu

    Abstract: Large language models (LLMs) such as GPT-4, PaLM, and Llama have significantly propelled the generation of AI-crafted text. With rising concerns about their potential misuse, there is a pressing need for AI-generated-text forensics. Neural authorship attribution is a forensic effort, seeking to trace AI-generated text back to its originating LLM. The LLM landscape can be divided into two primary c… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  18. arXiv:2308.02080  [pdf, other

    cs.CL cs.LG

    Causality Guided Disentanglement for Cross-Platform Hate Speech Detection

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Social media platforms, despite their value in promoting open discourse, are often exploited to spread harmful content. Current deep learning and natural language processing models used for detecting this harmful content overly rely on domain-specific terms affecting their capabilities to adapt to generalizable hate speech detection. This is because they tend to focus too narrowly on particular li… ▽ More

    Submitted 10 December, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to WSDM'24

  19. arXiv:2306.08804  [pdf, other

    cs.CL cs.LG

    PEACE: Cross-Platform Hate Speech Detection- A Causality-guided Framework

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Hate speech detection refers to the task of detecting hateful content that aims at denigrating an individual or a group based on their religion, gender, sexual orientation, or other characteristics. Due to the different policies of the platforms, different groups of people express hate in different ways. Furthermore, due to the lack of labeled data in some platforms it becomes challenging to build… ▽ More

    Submitted 8 October, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: ECML PKDD 2023

  20. arXiv:2303.03697  [pdf, other

    cs.CL cs.LG

    Stylometric Detection of AI-Generated Text in Twitter Timelines

    Authors: Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill Trapeznikov, Scott Ruston, Huan Liu

    Abstract: Recent advancements in pre-trained language models have enabled convenient methods for generating human-like text at a large scale. Though these generation capabilities hold great potential for breakthrough applications, it can also be a tool for an adversary to generate misinformation. In particular, social media platforms like Twitter are highly susceptible to AI-generated misinformation. A pote… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  21. arXiv:2302.00102  [pdf, other

    cs.CL cs.LG

    Towards Detecting Harmful Agendas in News Articles

    Authors: Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown

    Abstract: Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread. We argue that while misinformation and disinformation detection have been studied, there has been a lack of investment in the important open challenge of detecting harmful agendas in news articles; identifying harmful agendas is critical to flag news campaigns with the greatest poten… ▽ More

    Submitted 2 August, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Camera-ready for ACL-WASSA 2023. First two authors contributed equally