Skip to main content

Showing 1–27 of 27 results for author: Young, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.10798  [pdf, other

    cs.CL

    Relation Extraction Across Entire Books to Reconstruct Community Networks: The AffilKG Datasets

    Authors: Erica Cai, Sean McQuade, Kevin Young, Brendan O'Connor

    Abstract: When knowledge graphs (KGs) are automatically extracted from text, are they accurate enough for downstream analysis? Unfortunately, current annotated datasets can not be used to evaluate this question, since their KGs are highly disconnected, too small, or overly complex. To address this gap, we introduce AffilKG (https://doi.org/10.5281/zenodo.15427977), which is a collection of six datasets that… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2502.00943  [pdf, other

    cs.CL

    Universal Abstraction: Harnessing Frontier Models to Structure Real-World Data at Scale

    Authors: Cliff Wong, Sam Preston, Qianchu Liu, Zelalem Gero, Jass Bagga, Sheng Zhang, Shrey Jain, Theodore Zhao, Yu Gu, Yanbo Xu, Sid Kiblawi, Roshanthi Weerasinghe, Rom Leidner, Kristina Young, Brian Piening, Carlo Bifulco, Tristan Naumann, Mu Wei, Hoifung Poon

    Abstract: The vast majority of real-world patient information resides in unstructured clinical text, and the process of medical abstraction seeks to extract and normalize structured information from this unstructured input. However, traditional medical abstraction methods can require significant manual efforts that can include crafting rules or annotating training labels, limiting scalability. In this paper… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  3. arXiv:2407.08828  [pdf, other

    quant-ph cs.ET cs.PF

    Benchmarking quantum computers

    Authors: Timothy Proctor, Kevin Young, Andrew D. Baczewski, Robin Blume-Kohout

    Abstract: The rapid pace of development in quantum computing technology has sparked a proliferation of benchmarks for assessing the performance of quantum computing hardware and software. Good benchmarks empower scientists, engineers, programmers, and users to understand a computing system's power, but bad benchmarks can misdirect research and inhibit progress. In this Perspective, we survey the science of… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2405.03878  [pdf, other

    cs.LG cs.AI

    Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

    Authors: Aditya A. Ramesh, Kenny Young, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Temporal credit assignment in reinforcement learning is challenging due to delayed and stochastic outcomes. Monte Carlo targets can bridge long delays between action and consequence but lead to high-variance targets due to stochasticity. Temporal difference (TD) learning uses bootstrapping to overcome variance but introduces a bias that can only be corrected through many iterations. TD($λ$) provid… ▽ More

    Submitted 4 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: ICML 2024 version

  5. arXiv:2404.15980  [pdf, other

    cs.ET cs.DC quant-ph

    Minimizing the Number of Teleportations in Distributed Quantum Computing Using Alloy

    Authors: Ali Ebnenasir, Kieran Young

    Abstract: This paper presents a novel approach for minimizing the number of teleportations in Distributed Quantum Computing (DQC) using formal methods. Quantum teleportation plays a major role in communicating quantum information. As such, it is desirable to perform as few teleportations as possible when distributing a quantum algorithm on a network of quantum machines. Contrary to most existing methods whi… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  6. arXiv:2310.01569  [pdf, other

    cs.AI cs.LG

    Iterative Option Discovery for Planning, by Planning

    Authors: Kenny Young, Richard S. Sutton

    Abstract: Discovering useful temporal abstractions, in the form of options, is widely thought to be key to applying reinforcement learning and planning to increasingly complex domains. Building on the empirical success of the Expert Iteration approach to policy learning used in AlphaZero, we propose Option Iteration, an analogous approach to option discovery. Rather than learning a single strong policy that… ▽ More

    Submitted 22 December, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Fixed incorrect arrows on some figures in the appendix

  7. Introducing CALMED: Multimodal Annotated Dataset for Emotion Detection in Children with Autism

    Authors: Annanda Sousa, Karen Young, Mathieu D'aquin, Manel Zarrouk, Jennifer Holloway

    Abstract: Automatic Emotion Detection (ED) aims to build systems to identify users' emotions automatically. This field has the potential to enhance HCI, creating an individualised experience for the user. However, ED systems tend to perform poorly on people with Autism Spectrum Disorder (ASD). Hence, the need to create ED systems tailored to how people with autism express emotions. Previous works have creat… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Journal ref: HCII 2023: Universal Access in Human-Computer Interaction, Margherita Antona; Constantine Stephanidis, Jul 2023, Copenhagen, Denmark. pp.657-677

  8. Evaluating Privacy Questions From Stack Overflow: Can ChatGPT Compete?

    Authors: Zack Delile, Sean Radel, Joe Godinez, Garrett Engstrom, Theo Brucker, Kenzie Young, Sepideh Ghanavati

    Abstract: Stack Overflow and other similar forums are used commonly by developers to seek answers for their software development as well as privacy-related concerns. Recently, ChatGPT has been used as an alternative to generate code or produce responses to developers' questions. In this paper, we aim to understand developers' privacy challenges by evaluating the types of privacy-related questions asked on S… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Submitted to the 10th International Workshop on Evolving Security & Privacy Requirements Engineering (ESPRE'23) co-located with the 31st IEEE International Requirements Engineering Conference September 4-8, 2023, Leibniz Universität, Hannover, Germany

    Journal ref: 2023 IEEE 31st International Requirements Engineering Conference Workshops (REW)

  9. Learning a quantum computer's capability

    Authors: Daniel Hothem, Kevin Young, Tommie Catanach, Timothy Proctor

    Abstract: Accurately predicting a quantum computer's capability -- which circuits it can run and how well it can run them -- is a foundational goal of quantum characterization and benchmarking. As modern quantum computers become increasingly hard to simulate, we must develop accurate and scalable predictive capability models to help researchers and stakeholders decide which quantum computers to build and us… ▽ More

    Submitted 23 October, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 20 pages, 11 figures, plus appendices

    Report number: SAND2023-02208O

    Journal ref: IEEE Transactions on Quantum Engineering, vol. 5, pp. 1-26, 2024, Art no. 2100526,

  10. arXiv:2211.02976  [pdf, other

    cs.CL cs.DL cs.LG cs.SI

    A Comparison of Automatic Labelling Approaches for Sentiment Analysis

    Authors: Sumana Biswas, Karen Young, Josephine Griffith

    Abstract: Labelling a large quantity of social media data for the task of supervised machine learning is not only time-consuming but also difficult and expensive. On the other hand, the accuracy of supervised machine learning models is strongly related to the quality of the labelled data on which they train, and automatic sentiment labelling techniques could reduce the time and cost of human labelling. We h… ▽ More

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: 12 pages 3 figure, 11th International Conference on Data Science, Technology and Applications, ISBN 978-989-758-583-8, ISSN 2184-285X, pages 312-319

  11. arXiv:2211.02222  [pdf, other

    cs.LG

    The Benefits of Model-Based Generalization in Reinforcement Learning

    Authors: Kenny Young, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

    Abstract: Model-Based Reinforcement Learning (RL) is widely believed to have the potential to improve sample efficiency by allowing an agent to synthesize large amounts of imagined experience. Experience Replay (ER) can be considered a simple kind of model, which has proved effective at improving the stability and efficiency of deep RL. In principle, a learned parametric model could improve on ER by general… ▽ More

    Submitted 10 July, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: Update to ICML version

  12. arXiv:2207.01613  [pdf, other

    cs.LG

    Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions

    Authors: Tian Tian, Kenny Young, Richard S. Sutton

    Abstract: Value iteration (VI) is a foundational dynamic programming method, important for learning and planning in optimal control and reinforcement learning. VI proceeds in batches, where the update to the value of each state must be completed before the next batch of updates can begin. Completing a single batch is prohibitively expensive if the state space is large, rendering VI impractical for many appl… ▽ More

    Submitted 27 November, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  13. arXiv:2110.07700  [pdf, other

    cs.LG cs.AI

    Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units

    Authors: Kenny Young

    Abstract: Training neural networks with discrete stochastic variables presents a unique challenge. Backpropagation is not directly applicable, nor are the reparameterization tricks used in networks with continuous stochastic variables. To address this challenge, we present Hindsight Network Credit Assignment (HNCA), a novel gradient estimation algorithm for networks of discrete stochastic units. HNCA works… ▽ More

    Submitted 16 December, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: To be presented at AAAI 2022

  14. arXiv:2103.02768  [pdf, other

    cs.LG

    Learning to Predict with Supporting Evidence: Applications to Clinical Risk Prediction

    Authors: Aniruddh Raghu, John Guttag, Katherine Young, Eugene Pomerantsev, Adrian V. Dalca, Collin M. Stultz

    Abstract: The impact of machine learning models on healthcare will depend on the degree of trust that healthcare professionals place in the predictions made by these models. In this paper, we present a method to provide people with clinical expertise with domain-relevant evidence about why a prediction should be trusted. We first design a probabilistic model that relates meaningful latent concepts to predic… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: ACM Conference on Health, Learning, and Inference 2021

  15. arXiv:2102.04399  [pdf, other

    cs.LG cs.AI

    How to Stay Curious while Avoiding Noisy TVs using Aleatoric Uncertainty Estimation

    Authors: Augustine N. Mavor-Parker, Kimberly A. Young, Caswell Barry, Lewis D. Griffin

    Abstract: Exploration in environments with sparse rewards is difficult for artificial agents. Curiosity driven learning -- using feed-forward prediction errors as intrinsic rewards -- has achieved some success in these scenarios, but fails when faced with action-dependent noise sources. We present aleatoric mapping agents (AMAs), a neuroscience inspired solution modeled on the cholinergic system of the mamm… ▽ More

    Submitted 5 July, 2024; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: ICML 2022 format. Thesis version with cleaned up code

  16. arXiv:2011.12351  [pdf, other

    cs.LG cs.AI

    Hindsight Network Credit Assignment

    Authors: Kenny Young

    Abstract: We present Hindsight Network Credit Assignment (HNCA), a novel learning method for stochastic neural networks, which works by assigning credit to each neuron's stochastic output based on how it influences the output of its immediate children in the network. We prove that HNCA provides unbiased gradient estimates while reducing variance compared to the REINFORCE estimator. We also experimentally de… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  17. arXiv:2010.15268  [pdf, other

    cs.LG cs.AI

    Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning

    Authors: Kenny Young, Richard S. Sutton

    Abstract: Despite empirical success, the theory of reinforcement learning (RL) with value function approximation remains fundamentally incomplete. Prior work has identified a variety of pathological behaviours that arise in RL algorithms that combine approximate on-policy evaluation and greedification. One prominent example is policy oscillation, wherein an algorithm may cycle indefinitely between policies,… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

  18. arXiv:2004.04742  [pdf

    cs.CY eess.IV

    OPTIMAM Mammography Image Database: a large scale resource of mammography images and clinical data

    Authors: Mark D Halling-Brown, Lucy M Warren, Dominic Ward, Emma Lewis, Alistair Mackenzie, Matthew G Wallis, Louise Wilkinson, Rosalind M Given-Wilson, Rita McAvinchey, Kenneth C Young

    Abstract: A major barrier to medical imaging research and in particular the development of artificial intelligence (AI) is a lack of large databases of medical images which share images with other researchers. Without such databases it is not possible to train generalisable AI algorithms, and large amounts of time and funding is spent collecting smaller datasets at individual research centres. The OPTIMAM i… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

  19. arXiv:1911.08362  [pdf, other

    cs.LG cs.AI

    Variance Reduced Advantage Estimation with $δ$ Hindsight Credit Assignment

    Authors: Kenny Young

    Abstract: Hindsight Credit Assignment (HCA) refers to a recently proposed family of methods for producing more efficient credit assignment in reinforcement learning. These methods work by explicitly estimating the probability that certain actions were taken in the past given present information. Prior work has studied the properties of such methods and demonstrated their behaviour empirically. We extend thi… ▽ More

    Submitted 28 September, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Removed incorrect sentence regarding policy gradients of any 2 different different actions necessarily being negative for softmax parameterization

  20. arXiv:1908.11762  [pdf, other

    quant-ph cs.LG

    Classifying single-qubit noise using machine learning

    Authors: Travis L. Scholten, Yi-Kai Liu, Kevin Young, Robin Blume-Kohout

    Abstract: Quantum characterization, validation, and verification (QCVV) techniques are used to probe, characterize, diagnose, and detect errors in quantum information processors (QIPs). An important component of any QCVV protocol is a mapping from experimental data to an estimate of a property of a QIP. Machine learning (ML) algorithms can help automate the development of QCVV protocols, creating such maps… ▽ More

    Submitted 30 August, 2019; originally announced August 2019.

    Comments: 20 pages (15 main, 5 supplemental), 11 figures, and 5 tables

  21. arXiv:1908.06612  [pdf, other

    cs.LG eess.IV stat.ML

    Deep neural network or dermatologist?

    Authors: Kyle Young, Gareth Booth, Becks Simpson, Reuben Dutton, Sally Shrapnel

    Abstract: Deep learning techniques have proven high accuracy for identifying melanoma in digitised dermoscopic images. A strength is that these methods are not constrained by features that are pre-defined by human semantics. A down-side is that it is difficult to understand the rationale of the model predictions and to identify potential failure modes. This is a major barrier to adoption of deep learning in… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  22. arXiv:1903.03176  [pdf, other

    cs.LG cs.AI

    MinAtar: An Atari-Inspired Testbed for Thorough and Reproducible Reinforcement Learning Experiments

    Authors: Kenny Young, Tian Tian

    Abstract: The Arcade Learning Environment (ALE) is a popular platform for evaluating reinforcement learning agents. Much of the appeal comes from the fact that Atari games demonstrate aspects of competency we expect from an intelligent agent and are not biased toward any particular solution approach. The challenge of the ALE includes (1) the representation learning problem of extracting pertinent informatio… ▽ More

    Submitted 6 June, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

  23. arXiv:1806.00540  [pdf, other

    cs.LG cs.AI stat.ML

    Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling

    Authors: Kenny J. Young, Richard S. Sutton, Shuo Yang

    Abstract: Episodic memory is a psychology term which refers to the ability to recall specific events from the past. We suggest one advantage of this particular type of memory is the ability to easily assign credit to a specific state when remembered information is found to be useful. Inspired by this idea, and the increasing popularity of external memory mechanisms to handle long-term dependencies in deep l… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  24. arXiv:1805.04514  [pdf, other

    cs.LG cs.AI stat.ML

    Metatrace Actor-Critic: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control

    Authors: Kenny Young, Baoxiang Wang, Matthew E. Taylor

    Abstract: Reinforcement learning (RL) has had many successes in both "deep" and "shallow" settings. In both cases, significant hyperparameter tuning is often required to achieve good performance. Furthermore, when nonlinear function approximation is used, non-stationarity in the state representation can lead to learning instability. A variety of techniques exist to combat this --- most notably large experie… ▽ More

    Submitted 24 May, 2019; v1 submitted 10 May, 2018; originally announced May 2018.

  25. arXiv:1801.08287  [pdf, other

    cs.AI

    Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods

    Authors: Craig Sherstan, Brendan Bennett, Kenny Young, Dylan R. Ashley, Adam White, Martha White, Richard S. Sutton

    Abstract: This paper investigates estimating the variance of a temporal-difference learning agent's update target. Most reinforcement learning methods use an estimate of the value function, which captures how good it is for the agent to be in a particular state and is mathematically expressed as the expected sum of discounted future rewards (called the return). These values can be straightforwardly estimate… ▽ More

    Submitted 14 February, 2018; v1 submitted 25 January, 2018; originally announced January 2018.

  26. arXiv:1707.00627  [pdf, ps, other

    cs.AI cs.GT

    A Reverse Hex Solver

    Authors: Kenny Young, Ryan B. Hayward

    Abstract: We present Solrex,an automated solver for the game of Reverse Hex.Reverse Hex, also known as Rex, or Misere Hex, is the variant of the game of Hex in which the player who joins her two sides loses the game. Solrex performs a mini-max search of the state space using Scalable Parallel Depth First Proof Number Search, enhanced by the pruning of inferior moves and the early detection of certain winnin… ▽ More

    Submitted 26 April, 2017; originally announced July 2017.

    Comments: Presented at Computers and Games 2016 Leiden, International Conference on Computers and Games. Springer International Publishing, 2016

  27. arXiv:1604.07097  [pdf, other

    cs.AI

    Neurohex: A Deep Q-learning Hex Agent

    Authors: Kenny Young, Ryan Hayward, Gautham Vasan

    Abstract: DeepMind's recent spectacular success in using deep convolutional neural nets and machine learning to build superhuman level agents --- e.g. for Atari games via deep Q-learning and for the game of Go via Reinforcement Learning --- raises many questions, including to what extent these methods will succeed in other domains. In this paper we consider DQL for the game of Hex: after supervised initiali… ▽ More

    Submitted 25 April, 2016; v1 submitted 24 April, 2016; originally announced April 2016.