Skip to main content

Showing 1–7 of 7 results for author: Foote, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17713  [pdf, other

    cs.AI cs.LG

    AI Alignment with Changing and Influenceable Reward Functions

    Authors: Micah Carroll, Davis Foote, Anand Siththaranjan, Stuart Russell, Anca Dragan

    Abstract: Existing AI alignment approaches assume that preferences are static, which is unrealistic: our preferences change, and may even be influenced by our interactions with AI systems themselves. To clarify the consequences of incorrectly assuming static preferences, we introduce Dynamic Reward Markov Decision Processes (DR-MDPs), which explicitly model preference changes and the AI's influence on them.… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to ICML 2024

  2. arXiv:2402.17747  [pdf, other

    cs.LG cs.AI stat.ML

    When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback

    Authors: Leon Lang, Davis Foote, Stuart Russell, Anca Dragan, Erik Jenner, Scott Emmons

    Abstract: Past analyses of reinforcement learning from human feedback (RLHF) assume that the human evaluators fully observe the environment. What happens when human feedback is based only on partial observations? We formally define two failure cases: deceptive inflation and overjustification. Modeling the human as Boltzmann-rational w.r.t. a belief over trajectories, we prove conditions under which RLHF is… ▽ More

    Submitted 17 November, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Advances in Neural Information Processing Systems 37 (NeurIPS 2024)

  3. arXiv:2003.02978  [pdf, other

    eess.IV cs.DC physics.ao-ph stat.AP

    Fast and Accurate Retrieval of Methane Concentration from Imaging Spectrometer Data Using Sparsity Prior

    Authors: Markus D. Foote, Philip E. Dennison, Andrew K. Thorpe, David R. Thompson, Siraput Jongaramrungruang, Christian Frankenberg, Sarang C. Joshi

    Abstract: The strong radiative forcing by atmospheric methane has stimulated interest in identifying natural and anthropogenic sources of this potent greenhouse gas. Point sources are important targets for quantification, and anthropogenic targets have potential for emissions reduction. Methane point source plume detection and concentration retrieval have been previously demonstrated using data from the Air… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: 13 pages, 11 figures

    Journal ref: IEEE Transactions on Geoscience and Remote Sensing, 2020, pp. 1-13

  4. arXiv:1910.10769  [pdf, other

    eess.IV cs.LG physics.med-ph stat.ML

    Learning Multiparametric Biomarkers for Assessing MR-Guided Focused Ultrasound Treatment of Malignant Tumors

    Authors: Blake E. Zimmerman, Sara Johnson, Henrik Odéen, Jill Shea, Markus D. Foote, Nicole Winkler, Sarang C. Joshi, Allison Payne

    Abstract: Noninvasive MR-guided focused ultrasound (MRgFUS) treatments are promising alternatives to the surgical removal of malignant tumors. A significant challenge is assessing the viability of treated tissue during and immediately after MRgFUS procedures. Current clinical assessment uses the nonperfused volume (NPV) biomarker immediately after treatment from contrast-enhanced MRI. The NPV has variable a… ▽ More

    Submitted 29 September, 2020; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: 11 pages, 12 figures

  5. Development and Validation of a Deep Learning Algorithm for Improving Gleason Scoring of Prostate Cancer

    Authors: Kunal Nagpal, Davis Foote, Yun Liu, Po-Hsuan, Chen, Ellery Wulczyn, Fraser Tan, Niels Olson, Jenny L. Smith, Arash Mohtashamian, James H. Wren, Greg S. Corrado, Robert MacDonald, Lily H. Peng, Mahul B. Amin, Andrew J. Evans, Ankur R. Sangoi, Craig H. Mermel, Jason D. Hipp, Martin C. Stumpe

    Abstract: For prostate cancer patients, the Gleason score is one of the most important prognostic factors, potentially determining treatment independent of the stage. However, Gleason scoring is based on subjective microscopic examination of tumor morphology and suffers from poor reproducibility. Here we present a deep learning system (DLS) for Gleason scoring whole-slide images of prostatectomies. Our syst… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

    Journal ref: Nature Partner Journal Digital Medicine (2019)

  6. Real-Time 2D-3D Deformable Registration with Deep Learning and Application to Lung Radiotherapy Targeting

    Authors: Markus D. Foote, Blake E. Zimmerman, Amit Sawant, Sarang Joshi

    Abstract: Radiation therapy presents a need for dynamic tracking of a target tumor volume. Fiducial markers such as implanted gold seeds have been used to gate radiation delivery but the markers are invasive and gating significantly increases treatment time. Pretreatment acquisition of a respiratory correlated 4DCT allows for determination of accurate motion tracking which is useful in treatment planning. W… ▽ More

    Submitted 25 September, 2019; v1 submitted 22 July, 2018; originally announced July 2018.

    Journal ref: IPMI 2019. Lecture Notes in Computer Science, vol 11492. Springer, Cham (2019)

  7. arXiv:1611.04717  [pdf, other

    cs.AI cs.LG

    #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning

    Authors: Haoran Tang, Rein Houthooft, Davis Foote, Adam Stooke, Xi Chen, Yan Duan, John Schulman, Filip De Turck, Pieter Abbeel

    Abstract: Count-based exploration algorithms are known to perform near-optimally when used in conjunction with tabular reinforcement learning (RL) methods for solving small discrete Markov decision processes (MDPs). It is generally thought that count-based methods cannot be applied in high-dimensional state spaces, since most states will only occur once. Recent deep RL exploration strategies are able to dea… ▽ More

    Submitted 5 December, 2017; v1 submitted 15 November, 2016; originally announced November 2016.

    Comments: 10 pages main text + 10 pages supplementary. Published at NIPS 2017