
Showing 1–19 of 19 results for author: Ainsworth, S

Searching in archive cs.
  1. arXiv:2504.01380

    cs.CR cs.AR

    FireGuard: A Generalized Microarchitecture for Fine-Grained Security Analysis on OoO Superscalar Cores

    Authors: Zhe Jiang, Sam Ainsworth, Timothy Jones

    Abstract: High-performance security guarantees rely on hardware support. Generic programmable support for fine-grained instruction analysis has gained broad interest in the literature as a fundamental building block for the security of future processors. Yet, implementation in real out-of-order (OoO) superscalar processors presents tough challenges that cannot be explored in highly abstract simulators. We d…

    Submitted 2 April, 2025; originally announced April 2025.

  2. arXiv:2504.01347

    cs.AR

    MEEK: Re-thinking Heterogeneous Parallel Error Detection Architecture for Real-World OoO Superscalar Processors

    Authors: Zhe Jiang, Minli Liao, Sam Ainsworth, Dean You, Timothy Jones

    Abstract: Heterogeneous parallel error detection is an approach to achieving fault-tolerant processors, leveraging multiple power-efficient cores to re-execute software originally run on a high-performance core. Yet, its complex components, gathering data cross-chip from many parts of the core, raise questions of how to build it into commodity cores without heavy design invasion and extensive re-engineering…

    Submitted 2 April, 2025; originally announced April 2025.

  3. Triangel: A High-Performance, Accurate, Timely On-Chip Temporal Prefetcher

    Authors: Sam Ainsworth, Lev Mukhanov

    Abstract: Temporal prefetching, where correlated pairs of addresses are logged and replayed on repeat accesses, has recently become viable in commercial designs. Arm's latest processors include Correlating Miss Chaining prefetchers, which store such patterns in a partition of the on-chip cache. However, the state-of-the-art on-chip temporal prefetcher in the literature, Triage, features some design inconsis…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To be published at ISCA 2024

  4. arXiv:2309.09112

    cs.PL cs.AR

    Rewriting History: Repurposing Domain-Specific CGRAs

    Authors: Jackson Woodruff, Thomas Koehler, Alexander Brauckmann, Chris Cummins, Sam Ainsworth, Michael F. P. O'Boyle

    Abstract: Coarse-grained reconfigurable arrays (CGRAs) are domain-specific devices promising both the flexibility of FPGAs and the performance of ASICs. However, with restricted domains comes a danger: designing chips that cannot accelerate enough current and future software to justify the hardware cost. We introduce FlexC, the first flexible CGRA compiler, which allows CGRAs to be adapted to operations the…

    Submitted 16 September, 2023; originally announced September 2023.

  5. arXiv:2212.00873

    cs.AR

    CONVOLVE: Smart and seamless design of smart edge processors

    Authors: M. Gomony, F. Putter, A. Gebregiorgis, G. Paulin, L. Mei, V. Jain, S. Hamdioui, V. Sanchez, T. Grosser, M. Geilen, M. Verhelst, F. Zenke, F. Gurkaynak, B. Bruin, S. Stuijk, S. Davidson, S. De, M. Ghogho, A. Jimborean, S. Eissa, L. Benini, D. Soudris, R. Bishnoi, S. Ainsworth, F. Corradi, et al. (3 additional authors not shown)

    Abstract: With the rise of Deep Learning (DL), our world braces for AI in every edge device, creating an urgent need for edge-AI SoCs. This SoC hardware needs to support high throughput, reliable and secure AI processing at Ultra Low Power (ULP), with a very short time to market. With its strong legacy in edge solutions and open processing platforms, the EU is well-positioned to become a leader in this SoC…

    Submitted 2 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  6. Hacky Racers: Exploiting Instruction-Level Parallelism to Generate Stealthy Fine-Grained Timers

    Authors: Haocheng Xiao, Sam Ainsworth

    Abstract: Side-channel attacks pose serious threats to many security models, especially sandbox-based browsers. While transient-execution side channels in out-of-order processors have previously been blamed for vulnerabilities such as Spectre and Meltdown, we show that in fact, the capability of out-of-order execution itself to cause mayhem is far more general. We develop Hacky Racers, a new type o…

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: This paper is accepted at ASPLOS 2023

  7. arXiv:2209.04836

    cs.LG cs.AI

    Git Re-Basin: Merging Models modulo Permutation Symmetries

    Authors: Samuel K. Ainsworth, Jonathan Hayase, Siddhartha Srinivasa

    Abstract: The success of deep learning is due in large part to our ability to solve certain massive non-convex optimization problems with relative ease. Though non-convex optimization is NP-hard, simple algorithms -- often variants of stochastic gradient descent -- exhibit surprising effectiveness in fitting large neural networks in practice. We argue that neural network loss landscapes often contain (nearl…

    Submitted 1 March, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

  8. GhostMinion: A Strictness-Ordered Cache System for Spectre Mitigation

    Authors: Sam Ainsworth

    Abstract: Out-of-order speculation, a technique ubiquitous since the early 1990s, remains a fundamental security flaw. Via attacks such as Spectre and Meltdown, an attacker can trick a victim, in an otherwise entirely correct program, into leaking its secrets through the effects of misspeculated execution, in a way that is entirely invisible to the programmer's model. This has serious implications for appli…

    Submitted 9 September, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

  9. arXiv:2012.06684

    cs.LG stat.ML

    Faster Policy Learning with Continuous-Time Gradients

    Authors: Samuel Ainsworth, Kendall Lowrey, John Thickstun, Zaid Harchaoui, Siddhartha Srinivasa

    Abstract: We study the estimation of policy gradients for continuous-time systems with known dynamics. By reframing policy learning in continuous-time, we show that it is possible to construct a more efficient and accurate gradient estimator. The standard back-propagation through time estimator (BPTT) computes exact gradients for a crude discretization of the continuous-time system. In contrast, we approximate…

    Submitted 24 June, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Journal ref: L4DC 2021

  10. arXiv:1912.07777

    cs.DB cs.LG

    Mosaic: A Sample-Based Database System for Open World Query Processing

    Authors: Laurel Orr, Samuel Ainsworth, Walter Cai, Kevin Jamieson, Magda Balazinska, Dan Suciu

    Abstract: Data scientists have relied on samples to analyze populations of interest for decades. Recently, with the increase in the number of public data repositories, sample data has become easier to access. It has not, however, become easier to analyze. This sample data is arbitrarily biased with an unknown sampling probability, meaning data scientists must manually debias the sample with custom technique…

    Submitted 10 January, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: CIDR 2020

  11. arXiv:1912.01649

    cs.LG stat.ML

    Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation

    Authors: Samuel Ainsworth, Matt Barnes, Siddhartha Srinivasa

    Abstract: In many environments, only a relatively small subset of the complete state space is necessary in order to accomplish a given task. We develop a simple technique using emergency stops (e-stops) to exploit this phenomenon. Using e-stops significantly improves sample complexity by reducing the amount of required exploration, while retaining a performance bound that efficiently trades off the rate of…

    Submitted 3 December, 2019; originally announced December 2019.

    Journal ref: NeurIPS 2019

  12. MuonTrap: Preventing Cross-Domain Spectre-Like Attacks by Capturing Speculative State

    Authors: Sam Ainsworth, Timothy M. Jones

    Abstract: The disclosure of the Spectre speculative-execution attacks in January 2018 has left a severe vulnerability that systems are still struggling with how to patch. The solutions that currently exist tend to have incomplete coverage, perform badly, or have highly undesirable edge cases that cause application domains to break. MuonTrap allows processors to continue to speculate, avoiding significant…

    Submitted 28 April, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

  13. arXiv:1806.09060

    cs.LG stat.ML

    Disentangled VAE Representations for Multi-Aspect and Missing Data

    Authors: Samuel K. Ainsworth, Nicholas J. Foti, Emily B. Fox

    Abstract: Many problems in machine learning and related application areas are fundamentally variants of conditional modeling and sampling across multi-aspect data, either multi-view, multi-modal, or simply multi-group. For example, sampling from the distribution of English sentences conditioned on a given French sentence or sampling audio waveforms conditioned on a given piece of text. Central to many of th…

    Submitted 23 June, 2018; originally announced June 2018.

  14. arXiv:1802.06765

    cs.LG stat.ML

    Interpretable VAEs for nonlinear group factor analysis

    Authors: Samuel Ainsworth, Nicholas Foti, Adrian KC Lee, Emily Fox

    Abstract: Deep generative models have recently yielded encouraging results in producing subjectively realistic samples of complex data. Far less attention has been paid to making these generative models interpretable. In many scenarios, ranging from scientific applications to finance, the observed variables have a natural grouping. It is often of interest to understand systems of interaction amongst these g…

    Submitted 16 February, 2018; originally announced February 2018.

  15. arXiv:1402.0928

    cs.DL

    A Framework for Evaluation of Composite Memento Temporal Coherence

    Authors: Scott G. Ainsworth, Michael L. Nelson, Herbert Van de Sompel

    Abstract: Most archived HTML pages embed other web resources, such as images and stylesheets. Playback of the archived web pages typically provides only the capture date (or Memento-Datetime) of the root resource and not the Memento-Datetime of the embedded resources. In the course of our research, we have discovered that the Memento-Datetime of embedded resources can be up to several years in the future or…

    Submitted 5 October, 2014; v1 submitted 4 February, 2014; originally announced February 2014.

    Comments: 10 pages, 12 figures, 6 tables. Version 3 corrects predicates 13, 14, 16, 17, 19, and 20

    ACM Class: H.3.7

  16. Evaluating Sliding and Sticky Target Policies by Measuring Temporal Drift in Acyclic Walks Through a Web Archive

    Authors: Scott G. Ainsworth, Michael L. Nelson

    Abstract: When a user views an archived page using the archive's user interface (UI), the user selects a datetime to view from a list. The archived web page, if available, is then displayed. From this display, the web archive UI attempts to simulate the web browsing experience by smoothly transitioning between archived pages. During this process, the target datetime changes with each link followed; drifting…

    Submitted 21 September, 2013; originally announced September 2013.

    Comments: 10 pages, JCDL 2013

    ACM Class: H.3.7

  17. arXiv:1212.6177

    cs.DL cs.IR

    How Much of the Web Is Archived?

    Authors: Scott G. Ainsworth, Ahmed AlSum, Hany SalahEldeen, Michele C. Weigle, Michael L. Nelson

    Abstract: Although the Internet Archive's Wayback Machine is the largest and most well-known web archive, there have been a number of public web archives that have emerged in the last several years. With varying resources, audiences and collection development policies, these archives have varying levels of overlap with each other. While individual archives can be measured in terms of number of URIs, number…

    Submitted 5 January, 2013; v1 submitted 26 December, 2012; originally announced December 2012.

    Comments: This is the long version of the short paper by the same title published at JCDL'11. 10 pages, 5 figures, 7 tables. Version 2 includes minor typographical corrections

    ACM Class: H.3.7

  18. arXiv:1003.3661

    cs.DL cs.IR

    An HTTP-Based Versioning Mechanism for Linked Data

    Authors: Herbert Van de Sompel, Robert Sanderson, Michael L. Nelson, Lyudmila L. Balakireva, Harihar Shankar, Scott Ainsworth

    Abstract: Dereferencing a URI returns a representation of the current state of the resource identified by that URI. But, on the Web, representations of prior states of a resource are also available, for example, as resource versions in Content Management Systems or archival resources in Web Archives such as the Internet Archive. This paper introduces a resource versioning mechanism that is fully based on HTT…

    Submitted 18 March, 2010; originally announced March 2010.

    Comments: Proceedings of Linked Data on the Web (LDOW2010), April 27, 2010, Raleigh, USA

    ACM Class: H.3.5

  19. arXiv:0911.1112

    cs.IR cs.DL

    Memento: Time Travel for the Web

    Authors: Herbert Van de Sompel, Michael L. Nelson, Robert Sanderson, Lyudmila L. Balakireva, Scott Ainsworth, Harihar Shankar

    Abstract: The Web is ephemeral. Many resources have representations that change over time, and many of those representations are lost forever. A lucky few manage to reappear as archived resources that carry their own URIs. For example, some content management systems maintain version pages that reflect a frozen prior state of their changing resources. Archives recurrently crawl the web to obtain the actua…

    Submitted 6 November, 2009; v1 submitted 5 November, 2009; originally announced November 2009.

    Comments: 14 pages, 5 figures

    ACM Class: H.3.5; H.3.7