Showing 1–6 of 6 results for author: Forbes, G C

Searching in archive cs.
  1. arXiv:2506.21718  [pdf, ps, other]

    cs.LG cs.AI cs.PF cs.SE eess.SY

    Performance Prediction for Large Systems via Text-to-Text Regression

    Authors: Yash Akhauri, Bryan Lewandowski, Cheng-Hsi Lin, Adrian N. Reyes, Grant C. Forbes, Arissa Wongpanich, Bangding Yang, Mohamed S. Abdelfattah, Sagi Perel, Xingyou Song

    Abstract: In many industries, predicting metric outcomes of large systems is a fundamental problem, driven largely by traditional tabular regression. However, such methods struggle on complex systems data in the wild such as configuration files or system logs, where feature engineering is often infeasible. We propose text-to-text regression as a general, scalable alternative. For predicting resource efficie…

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Code can be found at https://github.com/google-deepmind/regress-lm

  2. arXiv:2505.12611  [pdf, other]

    cs.LG

    Action-Dependent Optimality-Preserving Reward Shaping

    Authors: Grant C. Forbes, Jianxun Wang, Leonardo Villalobos-Arias, Arnav Jhala, David L. Roberts

    Abstract: Recent RL research has utilized reward shaping--particularly complex shaping rewards such as intrinsic motivation (IM)--to encourage agent exploration in sparse-reward environments. While often effective, "reward hacking" can lead to the shaping reward being optimized at the expense of the extrinsic reward, resulting in a suboptimal policy. Potential-Based Reward Shaping (PBRS) techniques such a…

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: Extended abstract at AAMAS 2025; full paper at ICML 2025

    ACM Class: I.2.6

  3. arXiv:2410.12197  [pdf, other]

    cs.LG

    Potential-Based Intrinsic Motivation: Preserving Optimality With Complex, Non-Markovian Shaping Rewards

    Authors: Grant C. Forbes, Leonardo Villalobos-Arias, Jianxun Wang, Arnav Jhala, David L. Roberts

    Abstract: Recently there has been a proliferation of intrinsic motivation (IM) reward-shaping methods to learn in complex and sparse-reward environments. These methods can often inadvertently change the set of optimal policies in an environment, leading to suboptimal behavior. Previous work on mitigating the risks of reward shaping, particularly through potential-based reward shaping (PBRS), has not been ap…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: To be submitted to the joint AIJ-JAIR special track for award-winning papers. arXiv admin note: substantial text overlap with arXiv:2402.07411

    ACM Class: I.2.6

  4. arXiv:2405.04039  [pdf, other]

    cs.CL cs.AI cs.LG

    Utilizing GPT to Enhance Text Summarization: A Strategy to Minimize Hallucinations

    Authors: Hassan Shakil, Zeydy Ortiz, Grant C. Forbes

    Abstract: In this research, we use the DistilBERT model to generate extractive summaries and the T5 model to generate abstractive summaries. We also generate hybrid summaries by combining the DistilBERT and T5 models. Central to our research is the implementation of a GPT-based refining process to minimize the common problem of hallucinations that happen in AI-generated summaries. We evaluate unrefined summ…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 9 pages, 3 figures

  5. arXiv:2402.07411  [pdf, other]

    cs.LG

    Potential-Based Reward Shaping For Intrinsic Motivation

    Authors: Grant C. Forbes, Nitish Gupta, Leonardo Villalobos-Arias, Colin M. Potts, Arnav Jhala, David L. Roberts

    Abstract: Recently there has been a proliferation of intrinsic motivation (IM) reward-shaping methods to learn in complex and sparse-reward environments. These methods can often inadvertently change the set of optimal policies in an environment, leading to suboptimal behavior. Previous work on mitigating the risks of reward shaping, particularly through potential-based reward shaping (PBRS), has not been ap…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Extended version of paper appearing in AAMAS 2024

    ACM Class: I.2.6

  6. arXiv:2310.10495  [pdf, other]

    cs.CL

    Metric Ensembles For Hallucination Detection

    Authors: Grant C. Forbes, Parth Katlana, Zeydy Ortiz

    Abstract: Abstractive text summarization has garnered increased interest as of late, in part due to the proliferation of large language models (LLMs). One of the most pressing problems related to generation of abstractive summaries is the need to reduce "hallucinations," information that was not included in the document being summarized, and which may be wholly incorrect. Due to this need, a wide array of m…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 9 pages, 5 figures