Showing 1–50 of 62 results for author: Ting-Hao

  1. arXiv:2506.06561

    cs.CL cs.AI

    LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles

    Authors: Ho Yin 'Sam' Ng, Ting-Yao Hsu, Aashish Anantha Ramakrishnan, Branislav Kveton, Nedim Lipka, Franck Dernoncourt, Dongwon Lee, Tong Yu, Sungchul Kim, Ryan A. Rossi, Ting-Hao 'Kenneth' Huang

    Abstract: Figure captions are crucial for helping readers understand and remember a figure's key message. Many models have been developed to generate these captions, helping authors compose better quality captions more easily. Yet, authors almost always need to revise generic AI-generated captions to match their writing style and the domain's style, highlighting the need for personalization. Despite languag…

    Submitted 17 June, 2025; v1 submitted 6 June, 2025; originally announced June 2025.

    Comments: The LaMP-CAP dataset is publicly available at: https://github.com/Crowd-AI-Lab/lamp-cap

  2. arXiv:2502.11767

    cs.LG cs.CL

    From Selection to Generation: A Survey of LLM-based Active Learning

    Authors: Yu Xia, Subhojyoti Mukherjee, Zhouhang Xie, Junda Wu, Xintong Li, Ryan Aponte, Hanjia Lyu, Joe Barrow, Hongjie Chen, Franck Dernoncourt, Branislav Kveton, Tong Yu, Ruiyi Zhang, Jiuxiang Gu, Nesreen K. Ahmed, Yu Wang, Xiang Chen, Hanieh Deilamsalehy, Sungchul Kim, Zhengmian Hu, Yue Zhao, Nedim Lipka, Seunghyun Yoon, Ting-Hao Kenneth Huang, Zichao Wang, et al. (9 additional authors not shown)

    Abstract: Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs) have been employed not only for selection but also for generating entirely new data instances and providing more cost-effective annotations. Motivated by the incre…

    Submitted 31 May, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: ACL 2025

  3. arXiv:2502.11267

    cs.HC cs.AI cs.CL cs.LG

    Prompting in the Dark: Assessing Human Performance in Prompt Engineering for Data Labeling When Gold Labels Are Absent

    Authors: Zeyu He, Saniya Naphade, Ting-Hao 'Kenneth' Huang

    Abstract: Millions of users prompt large language models (LLMs) for various tasks, but how good are people at prompt engineering? Do users actually get closer to their desired outcome over multiple iterations of their prompts? These questions are crucial when no gold-standard labels are available to measure progress. This paper investigates a scenario in LLM-powered data labeling, "prompting in the dark," w…

    Submitted 16 February, 2025; originally announced February 2025.

    Comments: Accepted By CHI 2025

  4. arXiv:2502.07058

    cs.CL cs.HC

    Using Contextually Aligned Online Reviews to Measure LLMs' Performance Disparities Across Language Varieties

    Authors: Zixin Tang, Chieh-Yang Huang, Tsung-Che Li, Ho Yin Sam Ng, Hen-Hsen Huang, Ting-Hao 'Kenneth' Huang

    Abstract: A language can have different varieties. These varieties can affect the performance of natural language processing (NLP) models, including large language models (LLMs), which are often trained on data from widely spoken varieties. This paper introduces a novel and cost-effective approach to benchmark model performance across language varieties. We argue that international online review platforms,…

    Submitted 20 March, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted by 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), theme track

  5. arXiv:2501.19353

    cs.CL cs.AI cs.CV

    Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SciCap Challenge 2023

    Authors: Ting-Yao E. Hsu, Yi-Li Hsu, Shaurya Rohatgi, Chieh-Yang Huang, Ho Yin Sam Ng, Ryan Rossi, Sungchul Kim, Tong Yu, Lun-Wei Ku, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Since the SciCap dataset's launch in 2021, the research community has made significant progress in generating captions for scientific figures in scholarly articles. In 2023, the first SciCap Challenge took place, inviting global teams to use an expanded SciCap dataset to develop models for captioning diverse figure types across various academic fields. At the same time, text generation models advan…

    Submitted 18 February, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Comments: Accepted to TACL 2025

  6. arXiv:2501.18210

    cs.HC cs.CY cs.IR cs.SI

    Hashtag Re-Appropriation for Audience Control on Recommendation-Driven Social Media Xiaohongshu (rednote)

    Authors: Ruyuan Wan, Lingbo Tong, Tiffany Knearem, Toby Jia-Jun Li, Ting-Hao 'Kenneth' Huang, Qunfang Wu

    Abstract: Algorithms have played a central role in personalized recommendations on social media. However, they also present significant obstacles for content creators trying to predict and manage their audience reach. This issue is particularly challenging for marginalized groups seeking to maintain safe spaces. Our study explores how women on Xiaohongshu (rednote), a recommendation-driven social platform,…

    Submitted 3 March, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

  7. arXiv:2501.06317

    cs.HC cs.AI cs.CL

    Understanding How Paper Writers Use AI-Generated Captions in Figure Caption Writing

    Authors: Ho Yin Ng, Ting-Yao Hsu, Jiyoo Min, Sungchul Kim, Ryan A. Rossi, Tong Yu, Hyunggu Jung, Ting-Hao 'Kenneth' Huang

    Abstract: Figures and their captions play a key role in scientific publications. However, despite their importance, many captions in published papers are poorly crafted, largely due to a lack of attention by paper authors. While prior AI research has explored caption generation, it has mainly focused on reader-centered use cases, where users evaluate generated captions rather than actively integrating them…

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: This paper will appear at AAAI 2025 Workshop (2nd AI4Research Workshop: Towards a Knowledge-grounded Scientific Research Lifecycle)

  8. arXiv:2501.02552

    cs.CL cs.CV

    Multi-LLM Collaborative Caption Generation in Scientific Documents

    Authors: Jaeyoung Kim, Jongho Lee, Hong-Jun Choi, Ting-Yao Hsu, Chieh-Yang Huang, Sungchul Kim, Ryan Rossi, Tong Yu, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang, Sungchul Choi

    Abstract: Scientific figure captioning is a complex task that requires generating contextually appropriate descriptions of visual content. However, existing methods often fall short by utilizing incomplete information, treating the task solely as either an image-to-text or text summarization problem. This limitation hinders the generation of high-quality captions that fully capture the necessary details. Mo…

    Submitted 5 January, 2025; originally announced January 2025.

    Comments: Accepted to AAAI 2025 AI4Research Workshop

  9. arXiv:2410.03457

    cs.CL

    CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds

    Authors: Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao 'Kenneth' Huang

    Abstract: Detecting logical fallacies in texts can help users spot argument flaws, but automating this detection is not easy. Manually annotating fallacies in large-scale, real-world text data to create datasets for developing and validating detection models is costly. This paper introduces CoCoLoFa, the largest known logical fallacy dataset, containing 7,706 comments for 648 news articles, with each commen…

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)

  10. A Learning-based Quadcopter Controller with Extreme Adaptation

    Authors: Dingqi Zhang, Antonio Loquercio, Jerry Tang, Ting-Hao Wang, Jitendra Malik, Mark W. Mueller

    Abstract: This paper introduces a learning-based low-level controller for quadcopters, which adaptively controls quadcopters with significant variations in mass, size, and actuator capabilities. Our approach leverages a combination of imitation learning and reinforcement learning, creating a fast-adapting and general control framework for quadcopters that eliminates the need for precise model estimation or…

    Submitted 8 June, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

    Comments: Accepted for the Transactions on Robotics (T-RO), April 2025

  11. arXiv:2408.06494

    cs.HC cs.CL cs.CV

    What Color Scheme is More Effective in Assisting Readers to Locate Information in a Color-Coded Article?

    Authors: Ho Yin Ng, Zeyu He, Ting-Hao 'Kenneth' Huang

    Abstract: Color coding, a technique assigning specific colors to cluster information types, has proven advantages in aiding human cognitive activities, especially reading and comprehension. The rise of Large Language Models (LLMs) has streamlined document coding, enabling simple automatic text labeling with various schemes. This has the potential to make color-coding more accessible and benefit more users.…

    Submitted 26 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: This paper will appear at IEEE VIS 2024

  12. arXiv:2406.12787

    cs.CL cs.HC

    Generating Educational Materials with Different Levels of Readability using LLMs

    Authors: Chieh-Yang Huang, Jing Wei, Ting-Hao 'Kenneth' Huang

    Abstract: This study introduces the leveled-text generation task, aiming to rewrite educational materials to specific readability levels while preserving meaning. We assess the capability of GPT-3.5, LLaMA-2 70B, and Mixtral 8x7B, to generate content at various readability levels through zero-shot and few-shot prompting. Evaluating 100 processed educational materials reveals that few-shot prompting signific…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: In2Writing 2024

  13. arXiv:2404.17025

    cs.HC

    How Does Conversation Length Impact User's Satisfaction? A Case Study of Length-Controlled Conversations with LLM-Powered Chatbots

    Authors: Shih-Hong Huang, Ya-Fang Lin, Zeyu He, Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang

    Abstract: Users can discuss a wide range of topics with large language models (LLMs), but they do not always prefer solving problems or getting information through lengthy conversations. This raises an intriguing HCI question: How does instructing LLMs to engage in longer or shorter conversations affect conversation quality? In this paper, we developed two Slack chatbots using GPT-4 with the ability to vary…

    Submitted 25 April, 2024; originally announced April 2024.

  14. SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Shih-Hong Huang, Ryan Rossi, Sungchul Kim, Tong Yu, C. Lee Giles, Ting-Hao K. Huang

    Abstract: Crafting effective captions for figures is important. Readers heavily depend on these captions to grasp the figure's message. However, despite a well-developed set of AI technologies for figures and captions, these have rarely been tested for usefulness in aiding caption writing. This paper introduces SciCapenter, an interactive system that puts together cutting-edge AI technologies for scientific…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: CHI EA '24: Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems

  15. arXiv:2402.16795

    cs.HC cs.AI cs.CL cs.LG

    If in a Crowdsourced Data Annotation Pipeline, a GPT-4

    Authors: Zeyu He, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Shaurya Rohatgi, Ting-Hao 'Kenneth' Huang

    Abstract: Recent studies indicated GPT-4 outperforms online crowd workers in data labeling accuracy, notably workers from Amazon Mechanical Turk (MTurk). However, these studies were criticized for deviating from standard crowdsourcing practices and emphasizing individual workers' performances over the whole data-annotation process. This paper compared GPT-4 and an ethical and well-executed MTurk pipeline, w…

    Submitted 28 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted By CHI 2024

  16. arXiv:2311.16521

    cs.HC

    Inspo: Writing with Crowds Alongside AI

    Authors: Chieh-Yang Huang, Sanjana Gautam, Shannon McClellan Brooks, Ya-Fang Lin, Tiffany Knearem, Ting-Hao 'Kenneth' Huang

    Abstract: The use of artificial intelligence (AI) to support creative writing has bloomed in recent years. However, it is less well understood how AI compares to on-demand human support. We explored how writers interact with both AI and crowd worker writing assistants in creative writing. We replicated the interface of the prior crowd-writing system, Heteroglossia, and developed Inspo, a text editor allowin…

    Submitted 19 October, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  17. arXiv:2310.15405

    cs.CL

    GPT-4 as an Effective Zero-Shot Evaluator for Scientific Figure Captions

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Ryan Rossi, Sungchul Kim, C. Lee Giles, Ting-Hao K. Huang

    Abstract: There is growing interest in systems that generate captions for scientific figures. However, assessing these systems' output poses a significant challenge. Human evaluation requires academic expertise and is costly, while automatic evaluation depends on often low-quality author-written captions. This paper investigates using large language models (LLMs) as a cost-effective, reference-free method fo…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To Appear in EMNLP 2023 Findings

  18. arXiv:2310.15129

    cs.CL cs.LG

    Location-Aware Visual Question Generation with Lightweight Models

    Authors: Nicholas Collin Suwono, Justin Chih-Yao Chen, Tun Min Hung, Ting-Hao Kenneth Huang, I-Bin Liao, Yung-Hui Li, Lun-Wei Ku, Shao-Hua Sun

    Abstract: This work introduces a novel task, location-aware visual question generation (LocaVQG), which aims to generate engaging questions from data relevant to a particular geographical location. Specifically, we represent such location-aware information with surrounding images and a GPS coordinate. To tackle this task, we present a dataset generation pipeline that leverages GPT-4 to produce diverse and s…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  19. arXiv:2308.04346

    cs.CL cs.CY

    Unmasking Nationality Bias: A Study of Human Perception of Nationalities in AI-Generated Articles

    Authors: Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao 'Kenneth' Huang, Shomir Wilson

    Abstract: We investigate the potential for nationality biases in natural language processing (NLP) models using human evaluation methods. Biased NLP models can perpetuate stereotypes and lead to algorithmic discrimination, posing a significant challenge to the fairness and justice of AI systems. Our study employs a two-step mixed-methods approach that includes both quantitative and qualitative analysis to i…

    Submitted 8 August, 2023; originally announced August 2023.

  20. arXiv:2306.04820

    cs.CL

    Good Data, Large Data, or No Data? Comparing Three Approaches in Developing Research Aspect Classifiers for Biomedical Papers

    Authors: Shreya Chandrasekhar, Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang

    Abstract: The rapid growth of scientific publications, particularly during the COVID-19 pandemic, emphasizes the need for tools to help researchers efficiently comprehend the latest advancements. One essential part of understanding scientific literature is research aspect classification, which categorizes sentences in abstracts to Background, Purpose, Method, and Finding. In this study, we investigate the i…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: BioNLP workshop 2023

  21. arXiv:2305.09770

    cs.HC cs.AI cs.CL

    ConvXAI: Delivering Heterogeneous AI Explanations via Conversations to Support Human-AI Scientific Writing

    Authors: Hua Shen, Chieh-Yang Huang, Tongshuang Wu, Ting-Hao 'Kenneth' Huang

    Abstract: Despite a surge collection of XAI methods, users still struggle to obtain required AI explanations. Previous research suggests chatbots as dynamic solutions, but the effective design of conversational XAI agents for practical human needs remains under-explored. This paper focuses on Conversational XAI for AI-assisted scientific writing tasks. Drawing from human linguistic theories and formative st…

    Submitted 27 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: CSCW 2023 Demo. ConvXAI system code: https://github.com/huashen218/convxai.git

  22. arXiv:2304.01002

    cs.CL cs.AI cs.HC

    Does Human Collaboration Enhance the Accuracy of Identifying LLM-Generated Deepfake Texts?

    Authors: Adaku Uchendu, Jooyoung Lee, Hua Shen, Thai Le, Ting-Hao 'Kenneth' Huang, Dongwon Lee

    Abstract: Advances in Large Language Models (e.g., GPT-4, LLaMA) have improved the generation of coherent sentences resembling human writing on a large scale, resulting in the creation of so-called deepfake texts. However, this progress poses security and privacy concerns, necessitating effective solutions for distinguishing deepfake texts from human-written ones. Although prior works studied humans' abilit…

    Submitted 9 October, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted at The 11th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2023)

  23. arXiv:2303.17710

    cs.HC cs.CL

    What Types of Questions Require Conversation to Answer? A Case Study of AskReddit Questions

    Authors: Shih-Hong Huang, Chieh-Yang Huang, Ya-Fang Lin, Ting-Hao 'Kenneth' Huang

    Abstract: The proliferation of automated conversational systems such as chatbots, spoken-dialogue systems, and smart speakers, has significantly impacted modern digital life. However, these systems are primarily designed to provide answers to well-defined questions rather than to support users in exploring complex, ill-defined questions. In this paper, we aim to push the boundaries of conversational systems…

    Submitted 3 April, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: To appear in CHI 2023 Late-Breaking Work

  24. arXiv:2302.12324

    cs.CL

    Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization

    Authors: Chieh-Yang Huang, Ting-Yao Hsu, Ryan Rossi, Ani Nenkova, Sungchul Kim, Gromit Yeuk-Yin Chan, Eunyee Koh, Clyde Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Good figure captions help paper readers understand complex scientific figures. Unfortunately, even published papers often have poorly written captions. Automatic caption generation could aid paper writers by providing good starting captions that can be refined for better quality. Prior work often treated figure caption generation as a vision-to-language task. In this paper, we show that it can be…

    Submitted 11 August, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Accepted by INLG-2023

  25. arXiv:2302.09122

    cs.CL cs.HC

    Conveying the Predicted Future to Users: A Case Study of Story Plot Prediction

    Authors: Chieh-Yang Huang, Saniya Naphade, Kavya Laalasa Karanam, Ting-Hao 'Kenneth' Huang

    Abstract: Creative writing is hard: Novelists struggle with writer's block daily. While automatic story generation has advanced recently, it is treated as a "toy task" for advancing artificial intelligence rather than helping people. In this paper, we create a system that produces a short description that narrates a predicted plot using existing story generation approaches. Our goal is to assist writers in…

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: To appear in the AAAI 2023 Workshop- Creative AI Across Modalities

  26. arXiv:2302.02463

    cs.CL cs.AI

    Nationality Bias in Text Generation

    Authors: Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao 'Kenneth' Huang, Shomir Wilson

    Abstract: Little attention is placed on analyzing nationality bias in language models, especially when nationality is highly used as a factor in increasing the performance of social NLP models. This paper examines how a text generation model, GPT-2, accentuates pre-existing societal biases about country-based demonyms. We generate stories using GPT-2 for various nationalities and use sensitivity analysis to…

    Submitted 14 February, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Paper accepted in the 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL2023)

  27. arXiv:2212.03969

    cs.HC

    Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers

    Authors: Shih-Hong Huang, Chieh-Yang Huang, Yuxin Deng, Hua Shen, Szu-Chi Kuan, Ting-Hao 'Kenneth' Huang

    Abstract: Real-time crowd-powered systems, such as Chorus/Evorus, VizWiz, and Apparition, have shown how incorporating humans into automated systems could supplement where the automatic solutions fall short. However, one unspoken bottleneck of applying such architectures to more scenarios is the longer latency of including humans in the loop of automated systems. For the applications that have hard constrai…

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: This document is the extended technical report of the "Too Slow to Be Useful? On Incorporating Humans in the Loop of Smart Speakers" paper by the authors. The paper was accepted by the Works-in-Progress and Demonstration track of the 10th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2022 WiP/Demo) https://youtu.be/iMDsX52VWGY

  28. arXiv:2211.07441

    cs.CL cs.CV cs.LG

    Multi-VQG: Generating Engaging Questions for Multiple Images

    Authors: Min-Hsuan Yeh, Vicent Chen, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Generating engaging content has drawn much recent attention in the NLP community. Asking questions is a natural way to respond to photos and promote awareness. However, most answers to questions in traditional question-answering (QA) datasets are factoids, which reduce individuals' willingness to answer. Furthermore, traditional visual question generation (VQG) confines the source data for questio…

    Submitted 17 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022)

  29. arXiv:2205.09327

    cs.AI cs.CL cs.CV

    Let's Talk! Striking Up Conversations via Conversational Visual Question Generation

    Authors: Shih-Han Chan, Tsai-Lun Yang, Yun-Wei Chu, Chi-Yang Hsu, Ting-Hao Huang, Yu-Shian Chiu, Lun-Wei Ku

    Abstract: An engaging and provocative question can open up a great conversation. In this work, we explore a novel scenario: a conversation agent views a set of the user's photos (for example, from social media platforms) and asks an engaging question to initiate a conversation with the user. The existing vision-to-question models mostly generate tedious and obvious questions, which might not be ideals conve…

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: Accepted as a full talk paper on AAAI-DEEPDIAL'21

  30. arXiv:2204.06382

    cs.HC

    Empathy-Centric Design At Scale

    Authors: Andrea Mauri, Yen-Chia Hsu, Marco Brambilla, Aisling Ann O'Kane, Ting-Hao 'Kenneth' Huang, Himanshu Verma

    Abstract: EmpathiCH aims at bringing together and blend different expertise to develop new research agenda in the context of "Empathy-Centric Design at Scale". The main research question is to investigate how new technologies can contribute to the elicitation of empathy across and within multiple stakeholders at scale; and how empathy can be used to design solutions to societal problems that are not only ef…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: accepted at Workshops at the 2022 CHI Conference on Human Factors in Computing Systems (CHI 2022)

  31. arXiv:2203.08788

    cs.CL cs.AI cs.HC cs.LG

    Are Shortest Rationales the Best Explanations for Human Understanding?

    Authors: Hua Shen, Tongshuang Wu, Wenbo Guo, Ting-Hao 'Kenneth' Huang

    Abstract: Existing self-explaining models typically favor extracting the shortest possible rationales - snippets of an input text "responsible for" corresponding output - to explain the model prediction, with the assumption that shorter rationales are more intuitive to humans. However, this assumption has yet to be validated. Is the shortest rationale indeed the most human-understandable? To answer this que…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: To appear in ACL 2022 main conference

  32. arXiv:2110.11624

    cs.CL cs.AI cs.CV

    SciCap: Generating Captions for Scientific Figures

    Authors: Ting-Yao Hsu, C. Lee Giles, Ting-Hao 'Kenneth' Huang

    Abstract: Researchers use figures to communicate rich, complex information in scientific papers. The captions of these figures are critical to conveying effective messages. However, low-quality figure captions commonly occur in scientific articles and may decrease understanding. In this paper, we propose an end-to-end neural framework to automatically generate informative, high-quality captions for scientif…

    Submitted 25 October, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: To Appear in EMNLP 2021 Findings. The dataset is available at: https://github.com/tingyaohsu/SciCap

  33. Empowering Local Communities Using Artificial Intelligence

    Authors: Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang, Himanshu Verma, Andrea Mauri, Illah Nourbakhsh, Alessandro Bozzon

    Abstract: Artificial Intelligence (AI) is increasingly used to analyze large amounts of data in various practices, such as object recognition. We are specifically interested in using AI-powered systems to engage local communities in developing plans or solutions for pressing societal and environmental concerns. Such local contexts often involve multiple stakeholders with different and even contradictory age…

    Submitted 26 April, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: This manuscript is peer-reviewed and accepted by the Patterns journal

  34. arXiv:2109.00122

    cs.CL

    FinQA: A Dataset of Numerical Reasoning over Financial Data

    Authors: Zhiyu Chen, Wenhu Chen, Charese Smiley, Sameena Shah, Iana Borova, Dylan Langdon, Reema Moussa, Matt Beane, Ting-Hao Huang, Bryan Routledge, William Yang Wang

    Abstract: The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks on general domain, the finance…

    Submitted 7 May, 2022; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  35. arXiv:2106.12027

    cs.CL cs.AI cs.LG

    ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences

    Authors: Yanjun Gao, Ting-hao Huang, Rebecca J. Passonneau

    Abstract: Atomic clauses are fundamental text units for understanding complex sentences. Identifying the atomic sentences within complex sentences is important for applications such as summarization, argument mining, discourse analysis, discourse parsing, and question answering. Previous work mainly relies on rule-based methods dependent on parsing. We propose a new task to decompose each complex sentence i…

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: To appear in the proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL 2021) Main Conference

  36. arXiv:2105.06950

    cs.CL cs.AI

    Plot and Rework: Modeling Storylines for Visual Storytelling

    Authors: Chi-Yang Hsu, Yun-Wei Chu, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Writing a coherent and engaging story is not easy. Creative writers use their knowledge and worldview to put disjointed elements together to form a coherent storyline, and work and rework iteratively toward perfection. Automated visual storytelling (VIST) models, however, make poor use of external knowledge and iterative generation when attempting to create stories. This paper introduces PR-VIST,…

    Submitted 7 July, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: 9 pages, ACL-IJCNLP 2021 Findings

  37. arXiv:2104.05604

    cs.CL

    Semantic Frame Forecast

    Authors: Chieh-Yang Huang, Ting-Hao 'Kenneth' Huang

    Abstract: This paper introduces semantic frame forecast, a task that predicts the semantic frames that will occur in the next 10, 100, or even 1,000 sentences in a running story. Prior work focused on predicting the immediate future of a story, such as one to a few sentences ahead. However, when novelists write long stories, generating a few sentences is not enough to help them gain high-level insight to de…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages, NAACL 2021

  38. arXiv:2103.14973

    cs.CL cs.AI cs.HC cs.LG

    Explaining the Road Not Taken

    Authors: Hua Shen, Ting-Hao 'Kenneth' Huang

    Abstract: It is unclear if existing interpretations of deep neural network models respond effectively to the needs of users. This paper summarizes the common forms of explanations (such as feature attribution, decision rules, or probes) used in over 200 recent papers about natural language processing (NLP), and compares them against user questions collected in the XAI Question Bank. We found that although u…

    Submitted 30 March, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

    Comments: Accepted by The 2021 ACM CHI Workshop on Operationalizing Human-Centered Perspectives in Explainable AI (CHI 2021 HCXAI Workshop). For associated website, see https://human-centered-exnlp.github.io

  39. arXiv:2010.02179

    cs.CL

    Assessing the Helpfulness of Learning Materials with Inference-Based Learner-Like Agent

    Authors: Yun-Hsuan Jen, Chieh-Yang Huang, Mei-Hua Chen, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Many English-as-a-second language learners have trouble using near-synonym words (e.g., small vs. little; briefly vs. shortly) correctly, and often look for example sentences to learn how two nearly synonymous terms differ. Prior work uses hand-crafted scores to recommend sentences but has difficulty in adopting such scores to all the near-synonyms as near-synonyms differ in various ways. We notice…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 9 pages, to appear in EMNLP 2020 as a long paper

  40. arXiv:2008.11721  [pdf, other]

    cs.HC cs.AI cs.LG stat.ML

    How Useful Are the Machine-Generated Interpretations to General Users? A Human Evaluation on Guessing the Incorrectly Predicted Labels

    Authors: Hua Shen, Ting-Hao Kenneth Huang

    Abstract: Explaining to users why automated systems make certain mistakes is important and challenging. Researchers have proposed ways to automatically produce interpretations for deep neural network models. However, it is unclear how useful these interpretations are in helping users figure out why they are getting an error. If an interpretation effectively explains to users how the underlying deep neural n…

    Submitted 27 August, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: Accepted by The 8th AAAI Conference on Human Computation and Crowdsourcing (HCOMP 2020) https://github.com/huashen218/GuessWrongLabel

  41. arXiv:2005.06111  [pdf, other]

    cs.CV

    Project RISE: Recognizing Industrial Smoke Emissions

    Authors: Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang, Ting-Yao Hu, Paul Dille, Sean Prendi, Ryan Hoffman, Anastasia Tsuhlares, Jessica Pachuta, Randy Sargent, Illah Nourbakhsh

    Abstract: Industrial smoke emissions pose a significant concern to human health. Prior works have shown that using Computer Vision (CV) techniques to identify smoke as visual evidence can influence the attitude of regulators and empower citizens to pursue environmental justice. However, existing datasets are not of sufficient quality nor quantity to train the robust CV models needed to support air quality a…

    Submitted 29 April, 2024; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted by AAAI 2021

  42. arXiv:2005.02367  [pdf, other]

    cs.CL cs.HC

    CODA-19: Using a Non-Expert Crowd to Annotate Research Aspects on 10,000+ Abstracts in the COVID-19 Open Research Dataset

    Authors: Ting-Hao 'Kenneth' Huang, Chieh-Yang Huang, Chien-Kuang Cornelia Ding, Yen-Chia Hsu, C. Lee Giles

    Abstract: This paper introduces CODA-19, a human-annotated dataset that codes the Background, Purpose, Method, Finding/Contribution, and Other sections of 10,966 English abstracts in the COVID-19 Open Research Dataset. CODA-19 was created by 248 crowd workers from Amazon Mechanical Turk within 10 days, and achieved labeling quality comparable to that of experts. Each abstract was annotated by nine different…

    Submitted 17 September, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: Accepted by the NLP COVID-19 Workshop at ACL 2020. (The data, code, and model are available at: https://github.com/windx0303/CODA-19)

  43. Heteroglossia: In-Situ Story Ideation with the Crowd

    Authors: Chieh-Yang Huang, Shih-Hong Huang, Ting-Hao 'Kenneth' Huang

    Abstract: Ideation is essential for creative writing. Many authors struggle to come up with ideas throughout the writing process, yet modern writing tools fail to provide on-the-spot assistance for writers when they get stuck. This paper introduces Heteroglossia, an add-on for Google Docs that allows writers to elicit story ideas from the online crowd using their text editors. Writers can share snippets of…

    Submitted 15 January, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: Accepted by CHI 2020. Video Promotion: https://www.youtube.com/watch?v=i0G-tq3d8c0

    ACM Class: H.5; H.4; I.7

  44. arXiv:1912.11936  [pdf, other]

    cs.HC cs.AI cs.SI

    Smell Pittsburgh: Engaging Community Citizen Science for Air Quality

    Authors: Yen-Chia Hsu, Jennifer Cross, Paul Dille, Michael Tasota, Beatrice Dias, Randy Sargent, Ting-Hao 'Kenneth' Huang, Illah Nourbakhsh

    Abstract: Urban air pollution has been linked to various human health concerns, including cardiopulmonary diseases. Communities who suffer from poor air quality often rely on experts to identify pollution sources due to the lack of accessible tools. Taking this into account, we developed Smell Pittsburgh, a system that enables community members to report odors and track where these odors are frequently conc…

    Submitted 20 November, 2020; v1 submitted 26 December, 2019; originally announced December 2019.

    Comments: Accepted by ACM Transactions on Interactive Intelligent Systems in 2020. This is an extended version of arXiv:1810.11143, which was accepted by the ACM IUI 2019 conference. arXiv admin note: substantial text overlap with arXiv:1810.11143

  45. arXiv:1912.01496  [pdf, other]

    cs.CL

    Knowledge-Enriched Visual Storytelling

    Authors: Chao-Chun Hsu, Zi-Yuan Chen, Chi-Yang Hsu, Chih-Chia Li, Tzu-Yuan Lin, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: Stories are diverse and highly personalized, resulting in a large possible output space for story generation. Existing end-to-end approaches produce monotonous stories because they are limited to the vocabulary and knowledge in a single training dataset. This paper introduces KG-Story, a three-stage framework that allows the story generation model to take advantage of external Knowledge Graphs to…

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: AAAI 2020

  46. arXiv:1910.09621  [pdf, other]

    cs.HC cs.CL

    On Automating Conversations

    Authors: Ting-Hao 'Kenneth' Huang

    Abstract: From 2016 to 2018, we developed and deployed Chorus, a system that blends real-time human computation with artificial intelligence (AI) and has real-world, open conversations with users. We took a top-down approach that started with a working crowd-powered system, Chorus, and then created a framework, Evorus, that enables Chorus to automate itself over time. Over our two-year deployment, more than…

    Submitted 24 October, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: An invited position paper at the "Artificial Intelligence and Work: AAAI 2019 Fall Symposium" (AAAI-FSS 2019), Washington, DC, November 7-9, 2019

  47. arXiv:1910.08814  [pdf, ps, other]

    cs.HC

    On Using Chatbots to Promote Smoking Cessation Among Adolescents of Low Socioeconomic Status

    Authors: Patricia Simon, Suchitra Krishnan-Sarin, Ting-Hao 'Kenneth' Huang

    Abstract: Reducing youth tobacco use is critical for improving child health since tobacco use is associated with respiratory problems, and nicotine may interfere with healthy brain development. While tobacco regulation has contributed to declines in cigarette use among youth, these declines have occurred more quickly for youth of high socioeconomic status (SES) compared to youth of low SES. A major barrier…

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: Selected for round-table discussion in Artificial Intelligence and Work: AAAI 2019 Fall Symposium (AAAI FSS 2019)

  48. InstructableCrowd: Creating IF-THEN Rules for Smartphones via Conversations with the Crowd

    Authors: Ting-Hao 'Kenneth' Huang, Amos Azaria, Oscar J. Romero, Jeffrey P. Bigham

    Abstract: Natural language interfaces have become a common part of modern digital life. Chatbots utilize text-based conversations to communicate with users; personal assistants on smartphones such as Google Assistant take direct speech commands from their users; and speech-controlled devices such as Amazon Echo use voice as their only input mode. In this paper, we introduce InstructableCrowd, a crowd-powere…

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: Published at Human Computation (2019) 6:1:113-146

    Journal ref: Human Computation (2019) 6:1:113-146

  49. arXiv:1906.01764  [pdf, other]

    cs.CL cs.AI cs.HC

    Visual Story Post-Editing

    Authors: Ting-Yao Hsu, Chieh-Yang Huang, Yen-Chia Hsu, Ting-Hao 'Kenneth' Huang

    Abstract: We introduce the first dataset for human edits of machine-generated visual stories and explore how these collected edits may be used for the visual story post-editing task. The dataset, VIST-Edit, includes 14,905 human edited versions of 2,981 machine-generated visual stories. The stories were generated by two state-of-the-art visual storytelling models, each aligned to 5 human-edited versions. We…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Accepted by ACL 2019

  50. Dixit: Interactive Visual Storytelling via Term Manipulation

    Authors: Chao-Chun Hsu, Yu-Hua Chen, Zi-Yuan Chen, Hsin-Yu Lin, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

    Abstract: In this paper, we introduce Dixit, an interactive visual storytelling system that the user interacts with iteratively to compose a short story for a photo sequence. The user initiates the process by uploading a sequence of photos. Dixit first extracts text terms from each photo which describe the objects (e.g., boy, bike) or actions (e.g., sleep) in the photo, and then allows the user to add new t…

    Submitted 31 May, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: WWW'19 Demo, demo video: https://www.youtube.com/watch?v=CUu1MOwnveI