Skip to main content

Showing 1–6 of 6 results for author: Kannen, N

.
  1. arXiv:2412.06771  [pdf, other

    cs.AI cs.CV cs.LG

    Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty

    Authors: Meera Hahn, Wenjun Zeng, Nithish Kannen, Rich Galt, Kartikeya Badola, Been Kim, Zi Wang

    Abstract: User prompts for generative AI models are often underspecified, leading to sub-optimal responses. This problem is particularly evident in text-to-image (T2I) generation, where users commonly struggle to articulate their precise intent. This disconnect between the user's vision and the model's interpretation often forces users to painstakingly and repeatedly refine their prompts. To address this, w… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

  2. arXiv:2409.17711  [pdf, other

    cs.IR cs.LG

    Efficient Pointwise-Pairwise Learning-to-Rank for News Recommendation

    Authors: Nithish Kannen, Yao Ma, Gerrit J. J. van den Burg, Jean Baptiste Faddoul

    Abstract: News recommendation is a challenging task that involves personalization based on the interaction history and preferences of each user. Recent works have leveraged the power of pretrained language models (PLMs) to directly rank news items by using inference approaches that predominately fall into three categories: pointwise, pairwise, and listwise learning-to-rank. While pointwise methods offer lin… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  3. arXiv:2407.06863  [pdf, other

    cs.CV

    Beyond Aesthetics: Cultural Competence in Text-to-Image Models

    Authors: Nithish Kannen, Arif Ahmad, Marco Andreetto, Vinodkumar Prabhakaran, Utsav Prabhu, Adji Bousso Dieng, Pushpak Bhattacharyya, Shachi Dave

    Abstract: Text-to-Image (T2I) models are being increasingly adopted in diverse global communities where they create visual representations of their unique cultures. Current T2I benchmarks primarily focus on faithfulness, aesthetics, and realism of generated images, overlooking the critical dimension of cultural competence. In this work, we introduce a framework to evaluate cultural competence of T2I models… ▽ More

    Submitted 20 January, 2025; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: NeurIPS 2024 camera-ready version

  4. arXiv:2310.15577  [pdf, other

    cs.CL cs.AI

    CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction

    Authors: Rajdeep Mukherjee, Nithish Kannen, Saurabh Kumar Pandey, Pawan Goyal

    Abstract: Existing works on Aspect Sentiment Triplet Extraction (ASTE) explicitly focus on developing more efficient fine-tuning techniques for the task. Instead, our motivation is to come up with a generic approach that can improve the downstream performances of multiple ABSA tasks simultaneously. Towards this, we present CONTRASTE, a novel pre-training strategy using CONTRastive learning to enhance the AS… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted as a Long Paper at EMNLP 2023 (Findings); 16 pages; Codes: https://github.com/nitkannen/CONTRASTE/

    ACM Class: I.2.7

  5. arXiv:2203.11054  [pdf, other

    cs.CL cs.AI

    Targeted Extraction of Temporal Facts from Textual Resources for Improved Temporal Question Answering over Knowledge Bases

    Authors: Nithish Kannen, Udit Sharma, Sumit Neelam, Dinesh Khandelwal, Shajith Ikbal, Hima Karanam, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) systems have the goal of answering complex natural language questions by reasoning over relevant facts retrieved from Knowledge Bases (KB). One of the major challenges faced by these systems is their inability to retrieve all relevant facts due to factors such as incomplete KB and entity/relation linking errors. In this paper, we address this particular cha… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    ACM Class: I.2.7; I.2.4

  6. arXiv:2112.13237  [pdf, other

    cs.CL cs.AI cs.IR

    CABACE: Injecting Character Sequence Information and Domain Knowledge for Enhanced Acronym and Long-Form Extraction

    Authors: Nithish Kannen, Divyanshu Sheth, Abhranil Chandra, Shubhraneel Pal

    Abstract: Acronyms and long-forms are commonly found in research documents, more so in documents from scientific and legal domains. Many acronyms used in such documents are domain-specific and are very rarely found in normal text corpora. Owing to this, transformer-based NLP models often detect OOV (Out of Vocabulary) for acronym tokens, especially for non-English languages, and their performance suffers wh… ▽ More

    Submitted 25 December, 2021; originally announced December 2021.