Skip to main content

Showing 1–5 of 5 results for author: O'Reilly, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.04516  [pdf, other

    cs.CV

    An Examination of Wearable Sensors and Video Data Capture for Human Exercise Classification

    Authors: Ashish Singh, Antonio Bevilacqua, Timilehin B. Aderinola, Thach Le Nguyen, Darragh Whelan, Martin O'Reilly, Brian Caulfield, Georgiana Ifrim

    Abstract: Wearable sensors such as Inertial Measurement Units (IMUs) are often used to assess the performance of human exercise. Common approaches use handcrafted features based on domain expertise or automatically extracted features using time series analysis. Multiple sensors are required to achieve high classification accuracy, which is not very practical. These sensors require calibration and synchroniz… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  2. arXiv:2203.01363  [pdf, other

    cs.LG stat.AP

    Faking feature importance: A cautionary tale on the use of differentially-private synthetic data

    Authors: Oscar Giles, Kasra Hosseini, Grigorios Mingas, Oliver Strickson, Louise Bowler, Camila Rangel Smith, Harrison Wilde, Jen Ning Lim, Bilal Mateen, Kasun Amarasinghe, Rayid Ghani, Alison Heppenstall, Nik Lomax, Nick Malleson, Martin O'Reilly, Sebastian Vollmerteke

    Abstract: Synthetic datasets are often presented as a silver-bullet solution to the problem of privacy-preserving data publishing. However, for many applications, synthetic data has been shown to have limited utility when used to train predictive models. One promising potential application of these data is in the exploratory phase of the machine learning workflow, which involves understanding, engineering a… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 27 pages, 8 figures

  3. arXiv:2006.01667  [pdf, other

    cs.LG eess.SP stat.ML

    Interpretable Time Series Classification using Linear Models and Multi-resolution Multi-domain Symbolic Representations

    Authors: Thach Le Nguyen, Severin Gsponer, Iulia Ilie, Martin O'Reilly, Georgiana Ifrim

    Abstract: The time series classification literature has expanded rapidly over the last decade, with many new classification approaches published each year. Prior research has mostly focused on improving the accuracy and efficiency of classifiers, with interpretability being somewhat neglected. This aspect of classifiers has become critical for many application domains and the introduction of the EU GDPR leg… ▽ More

    Submitted 31 May, 2020; originally announced June 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1808.04022

    Journal ref: Data Mining and Knowledge Discovery 33 (2019) 1183-1222

  4. arXiv:1908.08737  [pdf, other

    cs.CR

    Design choices for productive, secure, data-intensive research at scale in the cloud

    Authors: Diego Arenas, Jon Atkins, Claire Austin, David Beavan, Alvaro Cabrejas Egea, Steven Carlysle-Davies, Ian Carter, Rob Clarke, James Cunningham, Tom Doel, Oliver Forrest, Evelina Gabasova, James Geddes, James Hetherington, Radka Jersakova, Franz Kiraly, Catherine Lawrence, Jules Manser, Martin T. O'Reilly, James Robinson, Helen Sherwood-Taylor, Serena Tierney, Catalina A. Vallejos, Sebastian Vollmer, Kirstie Whitaker

    Abstract: We present a policy and process framework for secure environments for productive data science research projects at scale, by combining prevailing data security threat and risk profiles into five sensitivity tiers, and, at each tier, specifying recommended policies for data classification, data ingress, software ingress, data egress, user access, user device control, and analysis environments. By p… ▽ More

    Submitted 15 September, 2019; v1 submitted 23 August, 2019; originally announced August 2019.

  5. arXiv:1505.00062  [pdf, ps, other

    cs.DS

    Multi-probe consistent hashing

    Authors: Ben Appleton, Michael O'Reilly

    Abstract: We describe a consistent hashing algorithm which performs multiple lookups per key in a hash table of nodes. It requires no additional storage beyond the hash table, and achieves a peak-to-average load ratio of 1 + epsilon with just 1 + 1/epsilon lookups per key.

    Submitted 30 April, 2015; originally announced May 2015.

    Comments: 15 pages, 3 figures

    ACM Class: E.2; E.1