Showing 1–8 of 8 results for author: Ferracane, E

  1. arXiv:2406.03487

    cs.CL cs.AI

    Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends

    Authors: Sanjana Ramprasad, Elisa Ferracane, Zachary C. Lipton

    Abstract: Recent advancements in large language models (LLMs) have considerably advanced the capabilities of summarization systems. However, they continue to face concerns about hallucinations. While prior work has evaluated LLMs extensively in news domains, most evaluation of dialogue summarization has focused on BART-based models, leaving a gap in our understanding of their faithfulness. Our work benchmar…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL 2024

  2. arXiv:2306.14907

    cs.CL

    Clickbait Classification and Spoiling Using Natural Language Processing

    Authors: Adhitya Thirumala, Elisa Ferracane

    Abstract: Clickbait is the practice of engineering titles to incentivize readers to click through to articles. Such titles with sensationalized language reveal as little information as possible. Occasionally, clickbait will be intentionally misleading, so natural language processing (NLP) can scan the article and answer the question posed by the clickbait title, or spoil it. We tackle two tasks: classifying…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 7 pages, 2 figures, 3 tables, 1 Appendix (3 Sections)

    ACM Class: I.2.7

  3. arXiv:2210.06356

    cs.CL

    Extractive Question Answering on Queries in Hindi and Tamil

    Authors: Adhitya Thirumala, Elisa Ferracane

    Abstract: Indic languages like Hindi and Tamil are underrepresented in the natural language processing (NLP) field compared to languages like English. Due to this underrepresentation, performance on NLP tasks (such as search algorithms) in Indic languages are inferior to their English counterparts. This difference disproportionately affects those who come from lower socioeconomic statuses because they consu…

    Submitted 26 September, 2022; originally announced October 2022.

    Comments: 8 pages, 1 figure, 1 table, submitted to the Pittsburgh Regional Science and Engineering Fair in the Computer Science and Math Senior Division

    ACM Class: I.2.7

  4. arXiv:2104.04470

    cs.CL

    Did they answer? Subjective acts and intents in conversational discourse

    Authors: Elisa Ferracane, Greg Durrett, Junyi Jessy Li, Katrin Erk

    Abstract: Discourse signals are often implicit, leaving it up to the interpreter to draw the required inferences. At the same time, discourse is embedded in a social context, meaning that interpreters apply their own assumptions and beliefs when resolving these inferences, leading to multiple, valid interpretations. However, current discourse data and frameworks ignore the social aspect, expecting only a si…

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: NAACL 2021

  5. arXiv:2012.07749

    cs.CY cs.CL

    Towards Fairness in Classifying Medical Conversations into SOAP Sections

    Authors: Elisa Ferracane, Sandeep Konam

    Abstract: As machine learning algorithms are more widely deployed in healthcare, the question of algorithmic fairness becomes more critical to examine. Our work seeks to identify and understand disparities in a deployed model that classifies doctor-patient conversations into sections of a medical SOAP note. We employ several metrics to measure disparities in the classifier performance, and find small differ…

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: To be presented at AAAI TAIH Workshop 2021

  6. arXiv:1906.01472

    cs.CL

    Evaluating Discourse in Structured Text Representations

    Authors: Elisa Ferracane, Greg Durrett, Junyi Jessy Li, Katrin Erk

    Abstract: Discourse structure is integral to understanding a text and is helpful in many NLP tasks. Learning latent representations of discourse is an attractive alternative to acquiring expensive labeled discourse data. Liu and Lapata (2018) propose a structured attention mechanism for text classification that derives a tree over a text, akin to an RST discourse tree. We examine this model in detail, and e…

    Submitted 10 June, 2019; v1 submitted 4 June, 2019; originally announced June 2019.

    Comments: ACL 2019

  7. arXiv:1904.06682

    cs.CL

    From News to Medical: Cross-domain Discourse Segmentation

    Authors: Elisa Ferracane, Titan Page, Junyi Jessy Li, Katrin Erk

    Abstract: The first step in discourse analysis involves dividing a text into segments. We annotate the first high-quality small-scale medical corpus in English with discourse segments and analyze how well news-trained segmenters perform on this domain. While we expectedly find a drop in performance, the nature of the segmentation errors suggests some problems can be addressed earlier in the pipeline, while…

    Submitted 14 April, 2019; originally announced April 2019.

    Comments: NAACL DISRPT Workshop 2019

  8. arXiv:1709.02271

    cs.CL

    Leveraging Discourse Information Effectively for Authorship Attribution

    Authors: Su Wang, Elisa Ferracane, Raymond J. Mooney

    Abstract: We explore techniques to maximize the effectiveness of discourse information in the task of authorship attribution. We present a novel method to embed discourse features in a Convolutional Neural Network text classifier, which achieves a state-of-the-art result by a substantial margin. We empirically investigate several featurization methods to understand the conditions under which discourse featu…

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: Accepted at IJCNLP 2017 as a conference paper

    Journal ref: The 8th International Joint Conference on Natural Language Processing (IJCNLP 2017)