Skip to main content

Showing 1–45 of 45 results for author: Fung, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17114  [pdf, ps, other

    cs.AI

    Mathematical Proof as a Litmus Test: Revealing Failure Modes of Advanced Large Reasoning Models

    Authors: Dadi Guo, Jiayu Liu, Zhiyuan Fan, Zhitao He, Haoran Li, Yumeng Wang, Yi R. Fung

    Abstract: Large reasoning models (e.g., R1, o3) have demonstrated remarkable mathematical problem-solving abilities. However, the high reported accuracy of these advanced models on popular datasets, reliance on purely numerical evaluation and potential benchmark leakage, often masks their true reasoning shortcomings. To address this, we propose leveraging the inherent rigor and methodological complexity of… ▽ More

    Submitted 23 June, 2025; v1 submitted 20 June, 2025; originally announced June 2025.

  2. arXiv:2506.06034  [pdf, ps, other

    cs.CL

    MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems?

    Authors: Zhitao He, Zongwei Lyu, Dazhong Chen, Dadi Guo, Yi R. Fung

    Abstract: Numerous theorems, such as those in geometry, are often presented in multimodal forms (e.g., diagrams). Humans benefit from visual reasoning in such settings, using diagrams to gain intuition and guide the proof process. Modern Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in solving a wide range of mathematical problems. However, the potential of MLLMs as Auto… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 29 pages

  3. arXiv:2506.01921  [pdf, ps, other

    cs.CV cs.AI

    MedEBench: Revisiting Text-instructed Image Editing on Medical Domain

    Authors: Minghao Liu, Zhitao He, Zhiyuan Fan, Qingyun Wang, Yi R. Fung

    Abstract: Text-guided image editing has seen rapid progress in natural image domains, but its adaptation to medical imaging remains limited and lacks standardized evaluation. Clinically, such editing holds promise for simulating surgical outcomes, creating personalized teaching materials, and enhancing patient communication. To bridge this gap, we introduce MedEBench, a comprehensive benchmark for evaluatin… ▽ More

    Submitted 4 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Project website: https://mliuby.github.io/MedEBench_Website/

  4. arXiv:2505.23703  [pdf, ps, other

    cs.AI cs.CL

    Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability

    Authors: Ruida Wang, Yuxin Li, Yi R. Fung, Tong Zhang

    Abstract: Enhancing the mathematical reasoning capabilities of LLMs has garnered significant attention in both the mathematical and computer science communities. Recent works have made substantial progress in both Natural Language (NL) reasoning and Formal Language (FL) reasoning by leveraging the potential of pure Reinforcement Learning (RL) methods on base models. However, RL approaches struggle to impart… ▽ More

    Submitted 4 June, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

  5. arXiv:2505.23224  [pdf, ps, other

    cs.CL

    MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration

    Authors: Zhitao He, Sandeep Polisetty, Zhiyuan Fan, Yuchen Huang, Shujin Wu, Yi R. Fung

    Abstract: In recent years, multimodal large language models (MLLMs) have made significant progress but continue to face inherent challenges in multimodal reasoning, which requires multi-level (e.g., perception, reasoning) and multi-granular (e.g., multi-step reasoning chain) advanced inferencing. Prior work on estimating model confidence tends to focus on the overall response for training and calibration, b… ▽ More

    Submitted 5 June, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

    Comments: 18 pages, ACL 2025

  6. arXiv:2505.18822  [pdf, ps, other

    cs.AI cs.CL

    AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting

    Authors: Shijue Huang, Hongru Wang, Wanjun Zhong, Zhaochen Su, Jiazhan Feng, Bowen Cao, Yi R. Fung

    Abstract: Modern large reasoning models demonstrate impressive problem-solving capabilities by employing sophisticated reasoning strategies. However, they often struggle to balance efficiency and effectiveness, frequently generating unnecessarily lengthy reasoning chains for simple problems. In this work, we propose AdaCtrl, a novel framework to support both difficulty-aware adaptive reasoning budget alloca… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  7. arXiv:2504.16727  [pdf, ps, other

    cs.CV cs.AI

    Unveiling the Lack of LVLM Robustness to Fundamental Visual Variations: Why and Path Forward

    Authors: Zhiyuan Fan, Yumeng Wang, Sandeep Polisetty, Yi R. Fung

    Abstract: Large Vision Language Models (LVLMs) excel in various vision-language tasks. Yet, their robustness to visual variations in position, scale, orientation, and context that objects in natural scenes inevitably exhibit due to changes in viewpoint and environment remains largely underexplored. To bridge this gap, we introduce V$^2$R-Bench, a comprehensive benchmark framework for evaluating Visual Varia… ▽ More

    Submitted 2 June, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

    Comments: Accepted to ACL 2025 Findings

  8. arXiv:2504.09897  [pdf, other

    cs.CV

    TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models

    Authors: Jaewoo Lee, Keyang Xuan, Chanakya Ekbote, Sandeep Polisetty, Yi R. Fung, Paul Pu Liang

    Abstract: Multimodal Large Language Models (MLLMs) have shown remarkable versatility in understanding diverse multimodal data and tasks. However, these capabilities come with an increased model scale. While post-training pruning reduces model size in unimodal models, its application to MLLMs often yields limited success. Our analysis discovers that conventional methods fail to account for the unique token a… ▽ More

    Submitted 17 May, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: ACL Findings 2025

  9. arXiv:2504.07316  [pdf, other

    cs.CL

    Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization

    Authors: Shujin Wu, Cheng Qian, Yi R. Fung, Paul Pu Liang, Heng Ji

    Abstract: The growing capabilities of large language models (LLMs) present a key challenge of maintaining effective human oversight. Weak-to-strong generalization (W2SG) offers a promising framework for supervising increasingly capable LLMs using weaker ones. Traditional W2SG methods rely on passive learning, where a weak teacher provides noisy demonstrations to train a strong student. This hinders students… ▽ More

    Submitted 11 April, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

  10. arXiv:2503.19551  [pdf, other

    cs.CL cs.AI

    Scaling Laws of Synthetic Data for Language Models

    Authors: Zeyu Qin, Qingxiu Dong, Xingxing Zhang, Li Dong, Xiaolong Huang, Ziyi Yang, Mahmoud Khademi, Dongdong Zhang, Hany Hassan Awadalla, Yi R. Fung, Weizhu Chen, Minhao Cheng, Furu Wei

    Abstract: Large language models (LLMs) achieve strong performance across diverse tasks, largely driven by high-quality web data used in pre-training. However, recent studies indicate this data source is rapidly depleting. Synthetic data emerges as a promising alternative, but it remains unclear whether synthetic datasets exhibit predictable scalability comparable to raw pre-training data. In this work, we s… ▽ More

    Submitted 26 March, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

    Comments: work in progress

  11. arXiv:2502.16671  [pdf, ps, other

    cs.CL cs.AI cs.CV

    MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

    Authors: Hengzhi Li, Megan Tjandrasuwita, Yi R. Fung, Armando Solar-Lezama, Paul Pu Liang

    Abstract: As AI becomes more closely integrated with peoples' daily activities, socially intelligent AI that can understand and interact seamlessly with humans in daily lives is increasingly important. However, current works in AI social reasoning all rely on language-only or language-dominant approaches to benchmark and training models, resulting in systems that are improving in verbal communication but st… ▽ More

    Submitted 6 June, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

  12. arXiv:2502.16143  [pdf, other

    cs.CL

    The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

    Authors: Yuji Zhang, Sha Li, Cheng Qian, Jiateng Liu, Pengfei Yu, Chi Han, Yi R. Fung, Kathleen McKeown, Chengxiang Zhai, Manling Li, Heng Ji

    Abstract: Hallucination is a persistent challenge in large language models (LLMs), where even with rigorous quality control, models often generate distorted facts. This paradox, in which error generation continues despite high-quality training data, calls for a deeper understanding of the underlying LLM mechanisms. To address it, we propose a novel concept: knowledge overshadowing, where model's dominant kn… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

    Comments: 19 pages, 5 figures

  13. arXiv:2502.12084  [pdf, other

    cs.CL

    VLM2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

    Authors: Jianshu Zhang, Dongyu Yao, Renjie Pi, Paul Pu Liang, Yi R. Fung

    Abstract: Visually linking matching cues is a crucial ability in daily life, such as identifying the same person in multiple photos based on their cues, even without knowing who they are. Despite the extensive knowledge that vision-language models (VLMs) possess, it remains largely unexplored whether they are capable of performing this fundamental task. To address this, we introduce VLM2-Bench, a benchmark… ▽ More

    Submitted 12 May, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Project Page: https://vlm2-bench.github.io/

  14. arXiv:2407.08039  [pdf, other

    cs.CL

    Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models

    Authors: Yuji Zhang, Sha Li, Jiateng Liu, Pengfei Yu, Yi R. Fung, Jing Li, Manling Li, Heng Ji

    Abstract: Hallucination is often regarded as a major impediment for using large language models (LLMs), especially for knowledge-intensive tasks. Even when the training corpus consists solely of true statements, language models still generate hallucinations in the form of amalgamations of multiple facts. We coin this phenomenon as ``knowledge overshadowing'': when we query knowledge from a language model wi… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  15. arXiv:2406.14137  [pdf, other

    cs.CL

    MACAROON: Training Vision-Language Models To Be Your Engaged Partners

    Authors: Shujin Wu, Yi R. Fung, Sha Li, Yixin Wan, Kai-Wei Chang, Heng Ji

    Abstract: Large vision-language models (LVLMs), while proficient in following instructions and responding to diverse questions, invariably generate detailed responses even when questions are ambiguous or unanswerable, leading to hallucinations and bias issues. Thus, it is essential for LVLMs to proactively engage with humans to ask for clarifications or additional information for better responses. In this s… ▽ More

    Submitted 17 October, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: The code will be made public at https://github.com/ShujinWu-0814/MACAROON

  16. arXiv:2403.12027  [pdf, other

    cs.CL cs.AI cs.CV

    From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models

    Authors: Kung-Hsiang Huang, Hou Pong Chan, Yi R. Fung, Haoyi Qiu, Mingyang Zhou, Shafiq Joty, Shih-Fu Chang, Heng Ji

    Abstract: Data visualization in the form of charts plays a pivotal role in data analysis, offering critical insights and aiding in informed decision-making. Automatic chart understanding has witnessed significant advancements with the rise of large foundation models in recent years. Foundation models, such as large language models, have revolutionized various natural language processing tasks and are increa… ▽ More

    Submitted 4 December, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: IEEE Transactions on Knowledge and Data Engineering (TKDE)

  17. arXiv:2402.11943  [pdf, other

    cs.CL

    LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation

    Authors: Keyang Xuan, Li Yi, Fan Yang, Ruochen Wu, Yi R. Fung, Heng Ji

    Abstract: The rise of multimodal misinformation on social platforms poses significant challenges for individuals and societies. Its increased credibility and broader impact compared to textual misinformation make detection complex, requiring robust reasoning across diverse media types and profound knowledge for accurate verification. The emergence of Large Vision Language Model (LVLM) offers a potential sol… ▽ More

    Submitted 20 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  18. arXiv:2402.11060  [pdf, other

    cs.CL cs.AI cs.IR

    Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

    Authors: Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan, Kevin Small, ChengXiang Zhai, Heng Ji

    Abstract: The increasing demand for personalized interactions with large language models (LLMs) calls for methodologies capable of accurately and efficiently identifying user opinions and preferences. Retrieval augmentation emerges as an effective strategy, as it can accommodate a vast number of users without the costs from fine-tuning. Existing research, however, has largely focused on enhancing the retrie… ▽ More

    Submitted 2 February, 2025; v1 submitted 16 February, 2024; originally announced February 2024.

  19. arXiv:2401.00812  [pdf, other

    cs.CL

    If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents

    Authors: Ke Yang, Jiateng Liu, John Wu, Chaoqi Yang, Yi R. Fung, Sha Li, Zixuan Huang, Xu Cao, Xingyao Wang, Yiquan Wang, Heng Ji, Chengxiang Zhai

    Abstract: The prominent large language models (LLMs) of today differ from past language models not only in size, but also in the fact that they are trained on a combination of natural language and formal language (code). As a medium between humans and computers, code translates high-level goals into executable steps, featuring standard syntax, logical consistency, abstraction, and modularity. In this survey… ▽ More

    Submitted 8 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  20. arXiv:2312.10160  [pdf, other

    cs.CL

    Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning

    Authors: Kung-Hsiang Huang, Mingyang Zhou, Hou Pong Chan, Yi R. Fung, Zhenhailong Wang, Lingyu Zhang, Shih-Fu Chang, Heng Ji

    Abstract: Recent advancements in large vision-language models (LVLMs) have led to significant progress in generating natural language descriptions for visual content and thus enhancing various applications. One issue with these powerful models is that they sometimes produce texts that are factually inconsistent with the visual input. While there has been some effort to mitigate such inconsistencies in natur… ▽ More

    Submitted 30 May, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: ACL 2024 Findings

  21. arXiv:2311.09677  [pdf, other

    cs.CL

    R-Tuning: Instructing Large Language Models to Say `I Don't Know'

    Authors: Hanning Zhang, Shizhe Diao, Yong Lin, Yi R. Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, Tong Zhang

    Abstract: Large language models (LLMs) have revolutionized numerous domains with their impressive performance but still face their challenges. A predominant issue is the propensity for these models to generate non-existent facts, a concern termed hallucination. Our research is motivated by the observation that previous instruction tuning methods force the model to complete a sentence no matter whether the m… ▽ More

    Submitted 6 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: NAACL 2024

  22. arXiv:2310.20633  [pdf, other

    cs.CL

    Defining a New NLP Playground

    Authors: Sha Li, Chi Han, Pengfei Yu, Carl Edwards, Manling Li, Xingyao Wang, Yi R. Fung, Charles Yu, Joel R. Tetreault, Eduard H. Hovy, Heng Ji

    Abstract: The recent explosion of performance of large language models (LLMs) has changed the field of Natural Language Processing (NLP) more abruptly and seismically than any other shift in the field's 80-year history. This has resulted in concerns that the field will become homogenized and resource-intensive. The new status quo has put many academic researchers, especially PhD students, at a disadvantage.… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: EMNLP Findings 2023 "Theme Track: Large Language Models and the Future of NLP"

  23. arXiv:2310.13297  [pdf, other

    cs.CL cs.AI cs.LG

    Decoding the Silent Majority: Inducing Belief Augmented Social Graph with Large Language Model for Response Forecasting

    Authors: Chenkai Sun, Jinning Li, Yi R. Fung, Hou Pong Chan, Tarek Abdelzaher, ChengXiang Zhai, Heng Ji

    Abstract: Automatic response forecasting for news media plays a crucial role in enabling content producers to efficiently predict the impact of news releases and prevent unexpected negative outcomes such as social conflict and moral injury. To effectively forecast responses, it is essential to develop measures that leverage the social dynamics and contextual information surrounding individuals, especially i… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Main Conference

  24. arXiv:2309.17428  [pdf, other

    cs.CL cs.AI cs.LG

    CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

    Authors: Lifan Yuan, Yangyi Chen, Xingyao Wang, Yi R. Fung, Hao Peng, Heng Ji

    Abstract: Large language models (LLMs) are often augmented with tools to solve complex tasks. By generating code snippets and executing them through task-specific Application Programming Interfaces (APIs), they can offload certain functions to dedicated external modules, such as image encoding and performing calculations. However, most existing approaches to augment LLMs with tools are constrained by genera… ▽ More

    Submitted 13 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted to ICLR 2024. Code is available at https://github.com/lifan-yuan/CRAFT

  25. arXiv:2305.18641  [pdf, other

    cs.CL cs.CV

    Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs

    Authors: Mingyang Zhou, Yi R. Fung, Long Chen, Christopher Thomas, Heng Ji, Shih-Fu Chang

    Abstract: Building cross-model intelligence that can understand charts and communicate the salient information hidden behind them is an appealing challenge in the vision and language(V+L) community. The capability to uncover the underlined table data of chart figures is a critical key to automatic chart understanding. We introduce ChartT5, a V+L model that learns how to interpret table information from char… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Accepted by Findings of ACL 2023

  26. arXiv:2305.14318  [pdf, other

    cs.CL

    CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models

    Authors: Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji

    Abstract: Large Language Models (LLMs) have made significant progress in utilizing tools, but their ability is limited by API availability and the instability of implicit reasoning, particularly when both planning and execution are involved. To overcome these limitations, we propose CREATOR, a novel framework that enables LLMs to create their own tools using documentation and code realization. CREATOR disen… ▽ More

    Submitted 21 June, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  27. arXiv:2304.08354  [pdf, other

    cs.CL cs.AI cs.LG

    Tool Learning with Foundation Models

    Authors: Yujia Qin, Shengding Hu, Yankai Lin, Weize Chen, Ning Ding, Ganqu Cui, Zheni Zeng, Yufei Huang, Chaojun Xiao, Chi Han, Yi Ren Fung, Yusheng Su, Huadong Wang, Cheng Qian, Runchu Tian, Kunlun Zhu, Shihao Liang, Xingyu Shen, Bokai Xu, Zhen Zhang, Yining Ye, Bowen Li, Ziwei Tang, Jing Yi, Yuzhang Zhu , et al. (16 additional authors not shown)

    Abstract: Humans possess an extraordinary ability to create and utilize tools, allowing them to overcome physical limitations and explore new frontiers. With the advent of foundation models, AI systems have the potential to be equally adept in tool use as humans. This paradigm, i.e., tool learning with foundation models, combines the strengths of specialized tools and foundation models to achieve enhanced a… ▽ More

    Submitted 6 August, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

  28. arXiv:2303.14337  [pdf, other

    cs.CL

    SmartBook: AI-Assisted Situation Report Generation for Intelligence Analysts

    Authors: Revanth Gangi Reddy, Daniel Lee, Yi R. Fung, Khanh Duy Nguyen, Qi Zeng, Manling Li, Ziqi Wang, Clare Voss, Heng Ji

    Abstract: Timely and comprehensive understanding of emerging events is crucial for effective decision-making; automating situation report generation can significantly reduce the time, effort, and cost for intelligence analysts. In this work, we identify intelligence analysts' practices and preferences for AI assistance in situation report generation to guide the design strategies for an effective, trust-bui… ▽ More

    Submitted 27 May, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Preprint

  29. arXiv:2303.13775  [pdf, other

    cs.DC cs.LG

    GSplit: Scaling Graph Neural Network Training on Large Graphs via Split-Parallelism

    Authors: Sandeep Polisetty, Juelin Liu, Kobi Falus, Yi Ren Fung, Seung-Hwan Lim, Hui Guan, Marco Serafini

    Abstract: Graph neural networks (GNNs), an emerging class of machine learning models for graphs, have gained popularity for their superior performance in various graph analytical tasks. Mini-batch training is commonly used to train GNNs on large graphs, and data parallelism is the standard approach to scale mini-batch training across multiple GPUs. One of the major performance costs in GNN training is the l… ▽ More

    Submitted 27 June, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

  30. arXiv:2210.08604  [pdf, other

    cs.CL cs.AI

    NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly

    Authors: Yi R. Fung, Tuhin Chakraborty, Hao Guo, Owen Rambow, Smaranda Muresan, Heng Ji

    Abstract: Norm discovery is important for understanding and reasoning about the acceptable behaviors and potential violations in human communication and interactions. We introduce NormSage, a framework for addressing the novel task of conversation-grounded multi-lingual, multi-cultural norm discovery, based on language model prompting and self-verification. NormSAGE leverages the expressiveness and implicit… ▽ More

    Submitted 13 January, 2024; v1 submitted 16 October, 2022; originally announced October 2022.

  31. arXiv:2203.05967  [pdf, other

    cs.SI cs.CL

    A Weibo Dataset for the 2022 Russo-Ukrainian Crisis

    Authors: Yi R. Fung, Heng Ji

    Abstract: Online social networks such as Twitter and Weibo play an important role in how people stay informed and exchange reactions. Each crisis encompasses a new opportunity to study the portability of models for various tasks (e.g., information extraction, complex event understanding, misinformation detection, etc.), due to differences in domain, entities, and event types. We present the Russia-Ukraine C… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Russia-Ukraine Crisis, Weibo Dataset

  32. arXiv:2112.08544  [pdf, other

    cs.CL cs.AI

    NewsClaims: A New Benchmark for Claim Detection from News with Attribute Knowledge

    Authors: Revanth Gangi Reddy, Sai Chetan, Zhenhailong Wang, Yi R. Fung, Kathryn Conger, Ahmed Elsayed, Martha Palmer, Preslav Nakov, Eduard Hovy, Kevin Small, Heng Ji

    Abstract: Claim detection and verification are crucial for news understanding and have emerged as promising technologies for mitigating misinformation and disinformation in the news. However, most existing work has focused on claim sentence analysis while overlooking additional crucial attributes (e.g., the claimer and the main object associated with the claim). In this work, we present NewsClaims, a new be… ▽ More

    Submitted 23 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted at EMNLP 2022

  33. arXiv:2011.13406  [pdf, other

    cs.CV

    Learning from Lexical Perturbations for Consistent Visual Question Answering

    Authors: Spencer Whitehead, Hui Wu, Yi Ren Fung, Heng Ji, Rogerio Feris, Kate Saenko

    Abstract: Existing Visual Question Answering (VQA) models are often fragile and sensitive to input variations. In this paper, we propose a novel approach to address this issue based on modular networks, which creates two questions related by linguistic perturbations and regularizes the visual reasoning process between them to be consistent during training. We show that our framework markedly improves consis… ▽ More

    Submitted 22 December, 2020; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: 14 pages, 8 figures

  34. COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation

    Authors: Qingyun Wang, Manling Li, Xuan Wang, Nikolaus Parulian, Guangxing Han, Jiawei Ma, Jingxuan Tu, Ying Lin, Haoran Zhang, Weili Liu, Aabhas Chauhan, Yingjun Guan, Bangzheng Li, Ruisong Li, Xiangchen Song, Yi R. Fung, Heng Ji, Jiawei Han, Shih-Fu Chang, James Pustejovsky, Jasmine Rah, David Liem, Ahmed Elsayed, Martha Palmer, Clare Voss , et al. (2 additional authors not shown)

    Abstract: To combat COVID-19, both clinicians and scientists need to digest vast amounts of relevant biomedical knowledge in scientific literature to understand the disease mechanism and related biological functions. We have developed a novel and comprehensive knowledge discovery framework, COVID-KG to extract fine-grained multimedia knowledge elements (entities and their visual chemical structures, relatio… ▽ More

    Submitted 11 May, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 12 pages, Accepted by Proceedings of 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics System Demonstrations, for resources see http://blender.cs.illinois.edu/covid19/, for video see http://159.89.180.81/demo/covid/Covid-KG_DemoVideo.mp4, for slides see https://eaglew.github.io/files/Covid-KG_DemoVideo_with_ethics.pdf

  35. arXiv:1909.06427  [pdf, other

    cs.AI

    Responsive Planning and Recognition for Closed-Loop Interaction

    Authors: Richard G. Freedman, Yi Ren Fung, Roman Ganchin, Shlomo Zilberstein

    Abstract: Many intelligent systems currently interact with others using at least one of fixed communication inputs or preset responses, resulting in rigid interaction experiences and extensive efforts developing a variety of scenarios for the system. Fixed inputs limit the natural behavior of the user in order to effectively communicate, and preset responses prevent the system from adapting to the current s… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

    Comments: Accepted for presentation at the AAAI 2019 Fall Symposium Series, in the symposium for Artificial Intelligence and Human-Robot Interaction for Service Robots in Human Environments

    Report number: AI-HRI/2019/24

  36. arXiv:1906.04231  [pdf, other

    eess.IV cs.CV

    Alzheimer's Disease Brain MRI Classification: Challenges and Insights

    Authors: Yi Ren Fung, Ziqiang Guan, Ritesh Kumar, Joie Yeahuay Wu, Madalina Fiterau

    Abstract: In recent years, many papers have reported state-of-the-art performance on Alzheimer's Disease classification with MRI scans from the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset using convolutional neural networks. However, we discover that when we split that data into training and testing sets at the subject level, we are not able to obtain similar performance, bringing the validit… ▽ More

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: 5 pages, 2 figures, IJCAI ARIAL workshop paper

  37. arXiv:1904.08930  [pdf, other

    cs.LG stat.ML

    FLARe: Forecasting by Learning Anticipated Representations

    Authors: Surya Teja Devarakonda, Joie Yeahuay Wu, Yi Ren Fung, Madalina Fiterau

    Abstract: Computational models that forecast the progression of Alzheimer's disease at the patient level are extremely useful tools for identifying high risk cohorts for early intervention and treatment planning. The state-of-the-art work in this area proposes models that forecast by using latent representations extracted from the longitudinal data across multiple modalities, including volumetric informatio… ▽ More

    Submitted 26 December, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Report number: PMLR 106:53-65

  38. arXiv:1904.07950  [pdf, other

    cs.CV

    A Comprehensive Study of Alzheimer's Disease Classification Using Convolutional Neural Networks

    Authors: Ziqiang Guan, Ritesh Kumar, Yi Ren Fung, Yeahuay Wu, Madalina Fiterau

    Abstract: A plethora of deep learning models have been developed for the task of Alzheimer's disease classification from brain MRI scans. Many of these models report high performance, achieving three-class classification accuracy of up to 95%. However, it is common for these studies to draw performance comparisons between models that are trained on different subsets of a dataset or use varying imaging prepr… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

  39. arXiv:1304.3427  [pdf

    cs.AI

    Metaprobability and Dempster-Shafer in Evidential Reasoning

    Authors: Robert Fung, Chee Yee Chong

    Abstract: Evidential reasoning in expert systems has often used ad-hoc uncertainty calculi. Although it is generally accepted that probability theory provides a firm theoretical foundation, researchers have found some problems with its use as a workable uncertainty calculus. Among these problems are representation of ignorance, consistency of probabilistic judgements, and adjustment of a priori judgements w… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the First Conference on Uncertainty in Artificial Intelligence (UAI1985)

    Report number: UAI-P-1985-PG-76-83

  40. arXiv:1304.1504  [pdf

    cs.AI

    Weighing and Integrating Evidence for Stochastic Simulation in Bayesian Networks

    Authors: Robert Fung, Kuo-Chu Chang

    Abstract: Stochastic simulation approaches perform probabilistic inference in Bayesian networks by estimating the probability of an event based on the frequency that the event occurs in a set of simulation trials. This paper describes the evidence weighting mechanism, for augmenting the logic sampling stochastic simulation algorithm [Henrion, 1986]. Evidence weighting modifies the logic sampling algorithm… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Fifth Conference on Uncertainty in Artificial Intelligence (UAI1989)

    Report number: UAI-P-1989-PG-112-117

  41. arXiv:1304.1138  [pdf

    cs.AI

    Refinement and Coarsening of Bayesian Networks

    Authors: Kuo-Chu Chang, Robert Fung

    Abstract: In almost all situation assessment problems, it is useful to dynamically contract and expand the states under consideration as assessment proceeds. Contraction is most often used to combine similar events or low probability events together in order to reduce computation. Expansion is most often used to make distinctions of interest which have significant probability in order to improve the quali… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-475-482

  42. arXiv:1304.1128  [pdf

    cs.AI

    An Architecture for Probabilistic Concept-Based Information Retrieval

    Authors: Robert Fung, S. L. Crawford, Lee A. Appelbaum, Richard M. Tong

    Abstract: While concept-based methods for information retrieval can provide improved performance over more conventional techniques, they require large amounts of effort to acquire the concepts and their qualitative and quantitative relationships. This paper discusses an architecture for probabilistic concept-based information retrieval which addresses the knowledge acquisition problem. The architecture make… ▽ More

    Submitted 27 March, 2013; originally announced April 2013.

    Comments: Appears in Proceedings of the Sixth Conference on Uncertainty in Artificial Intelligence (UAI1990)

    Report number: UAI-P-1990-PG-392-404

  43. arXiv:1303.5713  [pdf

    cs.AI

    Symbolic Probabilistic Inference with Evidence Potential

    Authors: Kuo-Chu Chang, Robert Fung

    Abstract: Recent research on the Symbolic Probabilistic Inference (SPI) algorithm[2] has focused attention on the importance of resolving general queries in Bayesian networks. SPI applies the concept of dependency-directed backward search to probabilistic inference, and is incremental with respect to both queries and observations. In response to this research we have extended the evidence potential algori… ▽ More

    Submitted 20 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence (UAI1991)

    Report number: UAI-P-1991-PG-82-85

  44. arXiv:1303.5712  [pdf

    cs.AI

    Symbolic Probabilistic Inference with Continuous Variables

    Authors: Kuo-Chu Chang, Robert Fung

    Abstract: Research on Symbolic Probabilistic Inference (SPI) [2, 3] has provided an algorithm for resolving general queries in Bayesian networks. SPI applies the concept of dependency directed backward search to probabilistic inference, and is incremental with respect to both queries and observations. Unlike traditional Bayesian network inferencing algorithms, SPI algorithm is goal directed, performing on… ▽ More

    Submitted 20 March, 2013; originally announced March 2013.

    Comments: Appears in Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence (UAI1991)

    Report number: UAI-P-1991-PG-77-81

  45. arXiv:1302.6807  [pdf

    cs.AI

    Backward Simulation in Bayesian Networks

    Authors: Robert Fung, Brendan del Favero

    Abstract: Backward simulation is an approximate inference technique for Bayesian belief networks. It differs from existing simulation methods in that it starts simulation from the known evidence and works backward (i.e., contrary to the direction of the arcs). The technique's focus on the evidence leads to improved convergence in situations where the posterior beliefs are dominated by the evidence rather… ▽ More

    Submitted 27 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence (UAI1994)

    Report number: UAI-P-1994-PG-227-234