Skip to main content

Showing 1–16 of 16 results for author: Bao, F S

.
  1. arXiv:2505.04847  [pdf, other

    cs.CL cs.AI

    Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards

    Authors: Manveer Singh Tamber, Forrest Sheng Bao, Chenyu Xu, Ge Luo, Suleman Kazi, Minseok Bae, Miaoran Li, Ofer Mendelevitch, Renyi Qu, Jimmy Lin

    Abstract: Hallucinations remain a persistent challenge for LLMs. RAG aims to reduce hallucinations by grounding responses in contexts. However, even when provided context, LLMs still frequently introduce unsupported information or contradictions. This paper presents our efforts to measure LLM hallucinations with a focus on summarization tasks, assessing how often various LLMs introduce hallucinations when s… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  2. arXiv:2410.13210  [pdf, other

    cs.CL cs.AI

    FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs

    Authors: Forrest Sheng Bao, Miaoran Li, Renyi Qu, Ge Luo, Erana Wan, Yujia Tang, Weisi Fan, Manveer Singh Tamber, Suleman Kazi, Vivek Sourabh, Mike Qi, Ruixuan Tu, Chenyu Xu, Matthew Gonzales, Ofer Mendelevitch, Amin Ahmad

    Abstract: Summarization is one of the most common tasks performed by large language models (LLMs), especially in applications like Retrieval-Augmented Generation (RAG). However, existing evaluations of hallucinations in LLM-generated summaries, and evaluations of hallucination detection models both suffer from a lack of diversity and recency in the LLM and LLM families considered. This paper introduces Fait… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  3. arXiv:2301.06622  [pdf, other

    cs.DC eess.SY

    IOPathTune: Adaptive Online Parameter Tuning for Parallel File System I/O Path

    Authors: Md. Hasanur Rashid, Youbiao He, Forrest Sheng Bao, Dong Dai

    Abstract: Parallel file systems contain complicated I/O paths from clients to storage servers. An efficient I/O path requires proper settings of multiple parameters, as the default settings often fail to deliver optimal performance, especially for diverse workloads in the HPC environment. Existing tuning strategies have shortcomings in being adaptive, timely, and flexible. We propose IOPathTune, which adapt… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

  4. arXiv:2301.02410  [pdf, other

    cs.SE cs.PL

    Codepod: A Namespace-Aware, Hierarchical Jupyter for Interactive Development at Scale

    Authors: Hebi Li, Forrest Sheng Bao, Qi Xiao, Jin Tian

    Abstract: Jupyter is a browser-based interactive development environment that has been popular recently. Jupyter models programs in code blocks, and makes it easy to develop code blocks interactively by running the code blocks and attaching rich media output. However, Jupyter provides no support for module systems and namespaces. Code blocks are linear and live in the global namespace; therefore, it is hard… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

  5. arXiv:2212.10013  [pdf, other

    cs.AI cs.CL

    DocAsRef: An Empirical Study on Repurposing Reference-Based Summary Quality Metrics Reference-Freely

    Authors: Forrest Sheng Bao, Ruixuan Tu, Ge Luo, Yinfei Yang, Hebi Li, Minghui Qiu, Youbiao He, Cen Chen

    Abstract: Automated summary quality assessment falls into two categories: reference-based and reference-free. Reference-based metrics, historically deemed more accurate due to the additional information provided by human-written references, are limited by their reliance on human input. In this paper, we hypothesize that the comparison methodologies used by some reference-based metrics to evaluate a system s… ▽ More

    Submitted 26 November, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted into Findings of EMNLP 2023

  6. arXiv:2006.13607  [pdf, other

    cs.AI

    Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks

    Authors: Youbiao He, Forrest Sheng Bao

    Abstract: Circuit routing is a fundamental problem in designing electronic systems such as integrated circuits (ICs) and printed circuit boards (PCBs) which form the hardware of electronics and computers. Like finding paths between pairs of locations, circuit routing generates traces of wires to connect contacts or leads of circuit components. It is challenging because finding paths between dense and massiv… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    ACM Class: F.2.2; I.2.8

  7. arXiv:2005.06546  [pdf

    cs.LG stat.ML

    Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

    Authors: Forrest Sheng Bao, Youbiao He, Jie Liu, Yuanfang Chen, Qian Li, Christina R. Zhang, Lei Han, Baoli Zhu, Yaorong Ge, Shi Chen, Ming Xu, Liu Ouyang

    Abstract: The COVID-19 is sweeping the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wid… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

    ACM Class: I.5.4

  8. arXiv:2005.06377  [pdf, other

    cs.CL cs.IR cs.LG

    SueNes: A Weakly Supervised Approach to Evaluating Single-Document Summarization via Negative Sampling

    Authors: Forrest Sheng Bao, Hebi Li, Ge Luo, Minghui Qiu, Yinfei Yang, Youbiao He, Cen Chen

    Abstract: Canonical automatic summary evaluation metrics, such as ROUGE, focus on lexical similarity which cannot well capture semantics nor linguistic quality and require a reference summary which is costly to obtain. Recently, there have been a growing number of efforts to alleviate either or both of the two drawbacks. In this paper, we present a proof-of-concept study to a weakly supervised summary evalu… ▽ More

    Submitted 5 May, 2022; v1 submitted 13 May, 2020; originally announced May 2020.

    Comments: accepted into NAACL 2022

    ACM Class: I.2.7

  9. arXiv:1910.08925  [pdf, other

    cs.DC cs.AI cs.LG

    RLScheduler: An Automated HPC Batch Job Scheduler Using Reinforcement Learning

    Authors: Di Zhang, Dong Dai, Youbiao He, Forrest Sheng Bao, Bing Xie

    Abstract: Today high-performance computing (HPC) platforms are still dominated by batch jobs. Accordingly, effective batch job scheduling is crucial to obtain high system efficiency. Existing HPC batch job schedulers typically leverage heuristic priority functions to prioritize and schedule jobs. But, once configured and deployed by the experts, such priority functions can hardly adapt to the changes of job… ▽ More

    Submitted 1 September, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Comments: 14 pages; conference accepted

  10. arXiv:1702.07998  [pdf, ps, other

    cs.CL

    Detecting (Un)Important Content for Single-Document News Summarization

    Authors: Yinfei Yang, Forrest Sheng Bao, Ani Nenkova

    Abstract: We present a robust approach for detecting intrinsic sentence importance in news, by training on two corpora of document-summary pairs. When used for single-document summarization, our approach, combined with the "beginning of document" heuristic, outperforms a state-of-the-art summarizer and the beginning-of-article baseline in both automatic and manual evaluations. These results represent an imp… ▽ More

    Submitted 26 February, 2017; originally announced February 2017.

    Comments: Accepted By EACL 2017

  11. arXiv:1009.5268  [pdf, ps, other

    cs.AI

    General Scaled Support Vector Machines

    Authors: Xin Liu, Ying Ding, Forrest Sheng Bao

    Abstract: Support Vector Machines (SVMs) are popular tools for data mining tasks such as classification, regression, and density estimation. However, original SVM (C-SVM) only considers local information of data points on or over the margin. Therefore, C-SVM loses robustness. To solve this problem, one approach is to translate (i.e., to move without rotation or change of shape) the hyperplane according to t… ▽ More

    Submitted 27 September, 2010; originally announced September 2010.

    Comments: 5 pages, 4 figures

    ACM Class: I.5.2

  12. arXiv:0906.0205  [pdf, ps, other

    cs.DS cs.CC

    A Survey of Tree Convex Sets Test

    Authors: Yuanlin Zhang, Forrest Sheng Bao

    Abstract: Tree convex sets refer to a collection of sets such that each set in the collection is a subtree of a tree whose nodes are the elements of these sets. They extend the concept of row convex sets each of which is an interval over a total ordering of the elements of those sets. They have been applied to identify tractable Constraint Satisfaction Problems and Combinatorial Auction Problems. Recently… ▽ More

    Submitted 31 May, 2009; originally announced June 2009.

    Comments: 13 pages, 5 figures, 2 tables

    ACM Class: F.2

  13. arXiv:0904.3808  [pdf, ps, other

    cs.AI cs.CV

    Automated Epilepsy Diagnosis Using Interictal Scalp EEG

    Authors: Forrest Sheng Bao, Jue-Ming Gao, Jing Hu, Donald Y. -C. Lie, Yuanlin Zhang, K. J. Oommen

    Abstract: Approximately over 50 million people worldwide suffer from epilepsy. Traditional diagnosis of epilepsy relies on tedious visual screening by highly trained clinicians from lengthy EEG recording that contains the presence of seizure (ictal) activities. Nowadays, there are many automatic systems that can recognize seizure-related EEG signals to help the diagnosis. However, it is very costly and in… ▽ More

    Submitted 24 April, 2009; v1 submitted 24 April, 2009; originally announced April 2009.

    Comments: 5 pages, 4 figures, 3 tables, based on our IEEE ICTAI'08 paper, submitted to IEEE EMBC'09

    ACM Class: I.5.4; I.2.1

  14. A New Approach to Automated Epileptic Diagnosis Using EEG and Probabilistic Neural Network

    Authors: Forrest Sheng Bao, Donald Yu-Chun Lie, Yuanlin Zhang

    Abstract: Epilepsy is one of the most common neurological disorders that greatly impair patient' daily lives. Traditional epileptic diagnosis relies on tedious visual screening by neurologists from lengthy EEG recording that requires the presence of seizure (ictal) activities. Nowadays, there are many systems helping the neurologists to quickly find interesting segments of the lengthy signal by automatic… ▽ More

    Submitted 4 July, 2008; v1 submitted 21 April, 2008; originally announced April 2008.

    Comments: 5 pages, 6 figures, 1 table, submitted to IEEE ICTAI 2008

    ACM Class: I.5.4; I.2.1

  15. arXiv:0707.4289  [pdf, ps, other

    cs.AI

    A Leaf Recognition Algorithm for Plant Classification Using Probabilistic Neural Network

    Authors: Stephen Gang Wu, Forrest Sheng Bao, Eric You Xu, Yu-Xuan Wang, Yi-Fan Chang, Qiao-Liang Xiang

    Abstract: In this paper, we employ Probabilistic Neural Network (PNN) with image and data processing techniques to implement a general purpose automated leaf recognition algorithm. 12 leaf features are extracted and orthogonalized into 5 principal variables which consist the input vector of the PNN. The PNN is trained by 1800 leaves to classify 32 kinds of plants with an accuracy greater than 90%. Compare… ▽ More

    Submitted 29 July, 2007; originally announced July 2007.

    Comments: 6 pages, 3 figures, 2 tables

    ACM Class: I.5.4

  16. A Novel Model of Working Set Selection for SMO Decomposition Methods

    Authors: Zhendong Zhao, Lei Yuan, Yuxuan Wang, Forrest Sheng Bao, Shunyi Zhang Yanfei Sun

    Abstract: In the process of training Support Vector Machines (SVMs) by decomposition methods, working set selection is an important technique, and some exciting schemes were employed into this field. To improve working set selection, we propose a new model for working set selection in sequential minimal optimization (SMO) decomposition methods. In this model, it selects B as working set without reselectio… ▽ More

    Submitted 5 June, 2007; originally announced June 2007.

    Comments: 8 pages, 12 figures, it was submitted to IEEE International conference of Tools on Artificial Intelligence