Showing 1–8 of 8 results for author: Holmes, D

Searching in archive cs.
  1. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  2. arXiv:2403.17107  [pdf, other]

    cs.DC

    Design Principles of Dynamic Resource Management for High-Performance Parallel Programming Models

    Authors: Dominik Huber, Martin Schreiber, Martin Schulz, Howard Pritchard, Daniel Holmes

    Abstract: With Dynamic Resource Management (DRM) the resources assigned to a job can be changed dynamically during its execution. From the system's perspective, DRM opens a new level of flexibility in resource allocation and job scheduling and therefore has the potential to improve system efficiency metrics such as the utilization rate, job throughput, energy efficiency, and responsiveness. From the applica…

    Submitted 25 March, 2024; originally announced March 2024.

  3. arXiv:2309.13612  [pdf, ps, other]

    cs.CR cs.ET

    Digital Twins and the Future of their Use Enabling Shift Left and Shift Right Cybersecurity Operations

    Authors: Ahmad Mohsin, Helge Janicke, Surya Nepal, David Holmes

    Abstract: Digital Twins (DTs) optimize operations and monitor performance in Smart Critical Systems (SCS) domains like smart grids and manufacturing. DT-based cybersecurity solutions are in their infancy, lacking a unified strategy to overcome challenges spanning the next three to five decades. These challenges include reliable data accessibility from Cyber-Physical Systems (CPS), operating in unpredictable en…

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: IEEE Submitted Paper: Trust, Privacy and Security in Intelligent Systems, and Applications

  4. arXiv:2304.09837  [pdf, ps, other]

    cs.LG math.PR

    Points of non-linearity of functions generated by random neural networks

    Authors: David Holmes

    Abstract: We consider functions from the real numbers to the real numbers, output by a neural network with 1 hidden activation layer, arbitrary width, and ReLU activation function. We assume that the parameters of the neural network are chosen uniformly at random with respect to various probability distributions, and compute the expected distribution of the points of non-linearity. We use these results to e…

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: 1 figure; comments very welcome

  5. arXiv:2102.10215  [pdf, other]

    cs.IT

    Symbol-Level Synchronisation Channel Modelling With Real-World Application: From Davey-Mackay, Fritchman to Markov

    Authors: Shamin Achari, Daniel Glenn Holmes, Ling Cheng

    Abstract: Errors in realistic channels contain not only substitution errors, but synchronisation errors as well. Moreover, these errors are rarely statistically independent in nature. By extending on the idea of the Fritchman channel model, a novel error category-based methodology in determining channel characteristics is described for memory channels which contain insertion, deletion, and substitution erro…

    Submitted 19 February, 2021; originally announced February 2021.

    Comments: This paper is the preprint version of the research article submitted to the IEEE Access journal. Submission date: February 19, 2021. It is currently under review at IEEE Access

  6. arXiv:2010.11228  [pdf, other]

    cs.RO

    Trip Recovery in Lower-Limb Prostheses using Reachable Sets of Predicted Human Motion

    Authors: Shannon M. Danforth, Patrick D. Holmes, Ram Vasudevan

    Abstract: People with lower-limb loss, the majority of whom use passive prostheses, exhibit a high incidence of falls each year. Powered lower-limb prostheses have the potential to reduce fall rates by actively helping the user recover from a stumble, but the unpredictability of the human response makes it difficult to design controllers that ensure a successful recovery. This paper presents a method calle…

    Submitted 21 October, 2020; originally announced October 2020.

    Comments: 8 pages, 3 figures

  7. arXiv:1909.11762  [pdf, other]

    cs.DC

    Extending the Message Passing Interface (MPI) with User-Level Schedules

    Authors: Derek Schafer, Sheikh Ghafoor, Daniel Holmes, Martin Ruefenacht, Anthony Skjellum

    Abstract: Composability is one of seven reasons for the long-standing and continuing success of MPI. Extending MPI by composing its operations with user-level operations provides useful integration with the progress engine and completion notification methods of MPI. However, the existing extensibility mechanism in MPI (generalized requests) is not widely utilized and has significant drawbacks. MPI can be…

    Submitted 25 September, 2019; originally announced September 2019.

  8. arXiv:1810.11087  [pdf, other]

    cs.RO

    Automated Camera-Based Estimation of Rehabilitation Criteria Following ACL Reconstruction

    Authors: Choong Hee Kim, Shannon M. Danforth, Patrick D. Holmes, Daphna Raz, Darlene Yao, Asheesh Bedi, Ram Vasudevan

    Abstract: Anterior cruciate ligament (ACL) reconstruction necessitates months of rehabilitation, during which a clinician evaluates whether a patient is ready to return to sports or occupation. Due to their time- and cost-intensive nature, these screenings to assess progress are unavailable to many. This paper introduces an automated, markerless, camera-based method for estimating rehabilitation criteria fo…

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: 6 pages