
Showing 1–10 of 10 results for author: Darlow, L

Searching in archive cs.
  1. arXiv:2505.05522

    cs.LG cs.AI

    Continuous Thought Machines

    Authors: Luke Darlow, Ciaran Regan, Sebastian Risi, Jeffrey Seely, Llion Jones

    Abstract: Biological brains demonstrate complex neural activity, where the timing and interplay between neurons is critical to how brains process information. Most deep learning architectures simplify neural activity by abstracting away temporal dynamics. In this paper we challenge that paradigm. By incorporating neuron-level processing and synchronization, we can effectively reintroduce neural timing as a…

    Submitted 27 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Technical report accompanied by online project page: https://pub.sakana.ai/ctm/

  2. arXiv:2410.06145

    cs.DC cs.OS cs.PF

    Serverless Cold Starts and Where to Find Them

    Authors: Artjom Joosen, Ahmed Hassan, Martin Asenov, Rajkarn Singh, Luke Darlow, Jianfeng Wang, Qiwen Deng, Adam Barker

    Abstract: This paper releases and analyzes a month-long trace of 85 billion user requests and 11.9 million cold starts from Huawei's serverless cloud platform. Our analysis spans workloads from five data centers. We focus on cold starts and provide a comprehensive examination of the underlying factors influencing the number and duration of cold starts. These factors include trigger types, request synchronic…

    Submitted 8 October, 2024; originally announced October 2024.

    ACM Class: C.4; D.4.7

  3. arXiv:2407.17880

    cs.LG

    DAM: Towards A Foundation Model for Time Series Forecasting

    Authors: Luke Darlow, Qiwen Deng, Ahmed Hassan, Martin Asenov, Rajkarn Singh, Artjom Joosen, Adam Barker, Amos Storkey

    Abstract: It is challenging to scale time series forecasting models such that they forecast accurately for multiple distinct domains and datasets, all with potentially different underlying collection procedures (e.g., sample resolution), patterns (e.g., periodicity), and prediction requirements (e.g., reconstruction vs. forecasting). We call this general task universal forecasting. Existing methods usually…

    Submitted 25 July, 2024; originally announced July 2024.

  4. arXiv:2403.14587

    cs.LG

    An Analysis of Linear Time Series Forecasting Models

    Authors: William Toner, Luke Darlow

    Abstract: Despite their simplicity, linear models perform well at time series forecasting, even when pitted against deeper and more expensive models. A number of variations to the linear model have been proposed, often including some form of feature normalisation that improves model generalisation. In this paper we analyse the sets of functions expressible using these linear model architectures. In so doing…

    Submitted 25 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  5. arXiv:2312.10127

    cs.PF cs.DC cs.LG

    How Does It Function? Characterizing Long-term Trends in Production Serverless Workloads

    Authors: Artjom Joosen, Ahmed Hassan, Martin Asenov, Rajkarn Singh, Luke Darlow, Jianfeng Wang, Adam Barker

    Abstract: This paper releases and analyzes two new Huawei cloud serverless traces. The traces span a period of over 7 months with over 1.4 trillion function invocations combined. The first trace is derived from Huawei's internal workloads and contains detailed per-second statistics for 200 functions running across multiple Huawei cloud data centers. The second trace is a representative workload from Huawei'…

    Submitted 15 December, 2023; originally announced December 2023.

    ACM Class: D.4.7; I.5.1; C.4

    Journal ref: SoCC '23: Proceedings of the 2023 ACM Symposium on Cloud Computing, October 2023, Pages 443-458

  6. arXiv:2011.11486

    cs.LG

    Latent Adversarial Debiasing: Mitigating Collider Bias in Deep Neural Networks

    Authors: Luke Darlow, Stanisław Jastrzębski, Amos Storkey

    Abstract: Collider bias is a harmful form of sample selection bias that neural networks are ill-equipped to handle. This bias manifests itself when the underlying causal signal is strongly correlated with other confounding signals due to the training data collection procedure. In the situation where the confounding signal is easy-to-learn, deep neural networks will latch onto this and the resulting model wi…

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: 10 pages, 4 figures, submitted to AISTATS 2021

  7. arXiv:2003.08821

    cs.CV cs.LG stat.ML

    DHOG: Deep Hierarchical Object Grouping

    Authors: Luke Nicholas Darlow, Amos Storkey

    Abstract: Recently, a number of competitive methods have tackled unsupervised representation learning by maximising the mutual information between the representations produced from augmentations. The resulting representations are then invariant to stochastic augmentation strategies, and can be used for downstream tasks such as clustering or classification. Yet data augmentations preserve many properties of…

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: 15 pages, submitted to ECCV 2020

  8. arXiv:2003.06254

    cs.LG cs.CV stat.ML

    What Information Does a ResNet Compress?

    Authors: Luke Nicholas Darlow, Amos Storkey

    Abstract: The information bottleneck principle (Shwartz-Ziv & Tishby, 2017) suggests that SGD-based training of deep neural networks results in optimally compressed hidden layers, from an information theoretic perspective. However, this claim was established on toy data. The goal of the work we present here is to test whether the information bottleneck principle is applicable to a realistic setting using a…

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: 10 pages + appendices; submitted to ICLR 2019

  9. arXiv:1810.03505

    cs.CV cs.LG stat.ML

    CINIC-10 is not ImageNet or CIFAR-10

    Authors: Luke N. Darlow, Elliot J. Crowley, Antreas Antoniou, Amos J. Storkey

    Abstract: In this brief technical report we introduce the CINIC-10 dataset as a plug-in extended alternative for CIFAR-10. It was compiled by combining CIFAR-10 with images selected and downsampled from the ImageNet database. We present the approach to compiling the dataset, illustrate the example images for different classes, give pixel distributions for each part of the repository, and give some standard…

    Submitted 2 October, 2018; originally announced October 2018.

    Comments: Dataset compilation, 9 pages, 11 figures, technical report

    Report number: EDI-INF-ANC-1802

  10. arXiv:1810.01860

    cs.LG stat.ML

    GINN: Geometric Illustration of Neural Networks

    Authors: Luke N. Darlow, Amos J. Storkey

    Abstract: This informal technical report details the geometric illustration of decision boundaries for ReLU units in a three layer fully connected neural network. The network is designed and trained to predict pixel intensity from an (x, y) input location. The Geometric Illustration of Neural Networks (GINN) tool was built to visualise and track the points at which ReLU units switch from being active to off…

    Submitted 2 October, 2018; originally announced October 2018.

    Comments: 8 pages, 9 figures, technical report

    Report number: EDI-INF-ANC-1901