Skip to main content

Showing 1–5 of 5 results for author: Sims, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.10546  [pdf, other

    cs.DC cs.AI cs.LG

    Scalable Machine Learning Training Infrastructure for Online Ads Recommendation and Auction Scoring Modeling at Google

    Authors: George Kurian, Somayeh Sardashti, Ryan Sims, Felix Berger, Gary Holt, Yang Li, Jeremiah Willcock, Kaiyuan Wang, Herve Quiroz, Abdulrahman Salem, Julian Grady

    Abstract: Large-scale Ads recommendation and auction scoring models at Google scale demand immense computational resources. While specialized hardware like TPUs have improved linear algebra computations, bottlenecks persist in large-scale systems. This paper proposes solutions for three critical challenges that must be addressed for efficient end-to-end execution in a widely used production infrastructure:… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: 13 pages, 7 figures

    ACM Class: C.0; C.4; I.2.6

  2. arXiv:2303.17508  [pdf, other

    cs.AI cs.CV cs.HC q-bio.NC

    Learning in Factored Domains with Information-Constrained Visual Representations

    Authors: Tailia Malloy, Miao Liu, Matthew D. Riemer, Tim Klinger, Gerald Tesauro, Chris R. Sims

    Abstract: Humans learn quickly even in tasks that contain complex visual information. This is due in part to the efficient formation of compressed representations of visual information, allowing for better generalization and robustness. However, compressed representations alone are insufficient for explaining the high speed of human learning. Reinforcement learning (RL) models that seek to replicate this… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  3. arXiv:2104.10611  [pdf, other

    eess.IV cs.CV cs.LG

    FourierNets enable the design of highly non-local optical encoders for computational imaging

    Authors: Diptodip Deb, Zhenfei Jiao, Ruth Sims, Alex B. Chen, Michael Broxton, Misha B. Ahrens, Kaspar Podgorski, Srinivas C. Turaga

    Abstract: Differentiable simulations of optical systems can be combined with deep learning-based reconstruction networks to enable high performance computational imaging via end-to-end (E2E) optimization of both the optical encoder and the deep decoder. This has enabled imaging applications such as 3D localization microscopy, depth estimation, and lensless photography via the optimization of local optical e… ▽ More

    Submitted 2 November, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Accepted to NeurIPS 2022

  4. arXiv:2011.11517  [pdf, other

    cs.AI

    Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games

    Authors: Tailia Malloy, Tim Klinger, Miao Liu, Matthew Riemer, Gerald Tesauro, Chris R. Sims

    Abstract: This paper introduces an information-theoretic constraint on learned policy complexity in the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) reinforcement learning algorithm. Previous research with a related approach in continuous control experiments suggests that this method favors learning policies that are more robust to changing environment dynamics. The multi-agent game setting nat… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

  5. arXiv:2010.04646  [pdf, other

    cs.LG cs.AI

    Deep RL With Information Constrained Policies: Generalization in Continuous Control

    Authors: Tailia Malloy, Chris R. Sims, Tim Klinger, Miao Liu, Matthew Riemer, Gerald Tesauro

    Abstract: Biological agents learn and act intelligently in spite of a highly limited capacity to process and store information. Many real-world problems involve continuous control, which represents a difficult task for artificial intelligence agents. In this paper we explore the potential learning advantages a natural constraint on information flow might confer onto artificial agents in continuous control… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.