Skip to main content

Showing 1–8 of 8 results for author: Tripto, N I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.19301  [pdf, other

    cs.CL cs.AI

    Beyond checkmate: exploring the creative chokepoints in AI text

    Authors: Nafis Irtiza Tripto, Saranya Venkatraman, Mahjabin Nahar, Dongwon Lee

    Abstract: Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP) and Artificial Intelligence (AI), unlocking unprecedented capabilities. This rapid advancement has spurred research into various aspects of LLMs, their text generation & reasoning capability, and potential misuse, fueling the necessity for robust detection methods. While numerous prior research has focused on detect… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: 18 pages, single columns, under review at Nature Machine Intelligence

  2. arXiv:2409.03708  [pdf, other

    cs.CL cs.IR

    RAG based Question-Answering for Contextual Response Prediction System

    Authors: Sriram Veturi, Saurabh Vaichal, Reshma Lal Jagadheesh, Nafis Irtiza Tripto, Nian Yan

    Abstract: Large Language Models (LLMs) have shown versatility in various Natural Language Processing (NLP) tasks, including their potential as effective question-answering systems. However, to provide precise and relevant information in response to specific customer queries in industry settings, LLMs require access to a comprehensive knowledge base to avoid hallucinations. Retrieval Augmented Generation (RA… ▽ More

    Submitted 6 September, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

    Comments: Accepted at the 1st Workshop on GenAI and RAG Systems for Enterprise, CIKM'24. 6 pages

  3. arXiv:2406.12665  [pdf, other

    cs.CL cs.AI

    CollabStory: Multi-LLM Collaborative Story Generation and Authorship Analysis

    Authors: Saranya Venkatraman, Nafis Irtiza Tripto, Dongwon Lee

    Abstract: The rise of unifying frameworks that enable seamless interoperability of Large Language Models (LLMs) has made LLM-LLM collaboration for open-ended tasks a possibility. Despite this, there have not been efforts to explore such collaborative writing. We take the next step beyond human-LLM collaboration to explore this multi-LLM scenario by generating the first exclusively LLM-generated collaborativ… ▽ More

    Submitted 10 February, 2025; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to NAACL Findings 2025

  4. Authorship Obfuscation in Multilingual Machine-Generated Text Detection

    Authors: Dominik Macko, Robert Moro, Adaku Uchendu, Ivan Srba, Jason Samuel Lucas, Michiharu Yamashita, Nafis Irtiza Tripto, Dongwon Lee, Jakub Simko, Maria Bielikova

    Abstract: High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection. So far, this was eval… ▽ More

    Submitted 4 October, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Accepted to EMNLP 2024 Findings

    Journal ref: Findings of the Association for Computational Linguistics: EMNLP 2024

  5. arXiv:2311.08374  [pdf, other

    cs.CL

    A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts

    Authors: Nafis Irtiza Tripto, Saranya Venkatraman, Dominik Macko, Robert Moro, Ivan Srba, Adaku Uchendu, Thai Le, Dongwon Lee

    Abstract: In the realm of text manipulation and linguistic transformation, the question of authorship has been a subject of fascination and philosophical inquiry. Much like the Ship of Theseus paradox, which ponders whether a ship remains the same when each of its original planks is replaced, our research delves into an intriguing question: Does a text retain its original authorship when it undergoes numero… ▽ More

    Submitted 6 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: To appear in Association for Computational Linguistics (ACL 2024)

  6. arXiv:2310.16972  [pdf, other

    cs.IR

    The Word2vec Graph Model for Author Attribution and Genre Detection in Literary Analysis

    Authors: Nafis Irtiza Tripto, Mohammed Eunus Ali

    Abstract: Analyzing the writing styles of authors and articles is a key to supporting various literary analyses such as author attribution and genre detection. Over the years, rich sets of features that include stylometry, bag-of-words, n-grams have been widely used to perform such analysis. However, the effectiveness of these features largely depends on the linguistic aspects of a particular language and d… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 12 pages, 6 figures

  7. arXiv:2310.16968  [pdf, other

    cs.CL cs.CY

    Understanding Social Structures from Contemporary Literary Fiction using Character Interaction Graph -- Half Century Chronology of Influential Bengali Writers

    Authors: Nafis Irtiza Tripto, Mohammed Eunus Ali

    Abstract: Social structures and real-world incidents often influence contemporary literary fiction. Existing research in literary fiction analysis explains these real-world phenomena through the manual critical analysis of stories. Conventional Natural Language Processing (NLP) methodologies, including sentiment analysis, narrative summarization, and topic modeling, have demonstrated substantial efficacy in… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 8 pages, 11 figures, 6 pages appendix

  8. arXiv:2310.16746  [pdf, other

    cs.CL

    HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis

    Authors: Nafis Irtiza Tripto, Adaku Uchendu, Thai Le, Mattia Setzu, Fosca Giannotti, Dongwon Lee

    Abstract: Authorship Analysis, also known as stylometry, has been an essential aspect of Natural Language Processing (NLP) for a long time. Likewise, the recent advancement of Large Language Models (LLMs) has made authorship analysis increasingly crucial for distinguishing between human-written and AI-generated texts. However, these authorship analysis tasks have primarily been focused on written texts, not… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 9 pages, EMNLP-23 findings, 5 pages appendix, 6 figures, 17 tables