Skip to main content

Showing 1–50 of 54 results for author: Parnell

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.03780  [pdf, other

    cs.AR cs.AI cs.PL

    GPU Performance Portability needs Autotuning

    Authors: Burkhard Ringlein, Thomas Parnell, Radu Stoica

    Abstract: As LLMs grow in complexity, achieving state-of-the-art performance requires tight co-design across algorithms, software, and hardware. Today's reliance on a single dominant platform limits portability, creates vendor lock-in, and raises barriers for new AI hardware. In this work, we make the case for combining just-in-time (JIT) compilation with kernel parameter autotuning to enable portable LLM i… ▽ More

    Submitted 15 May, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

    Comments: typos, fix grammatical mistakes

  2. arXiv:2504.13116  [pdf, other

    cs.LG stat.ME

    Predicting BVD Re-emergence in Irish Cattle From Highly Imbalanced Herd-Level Data Using Machine Learning Algorithms

    Authors: Niamh Mimnagh, Andrew Parnell, Conor McAloon, Jaden Carlson, Maria Guelbenzu, Jonas Brock, Damien Barrett, Guy McGrath, Jamie Tratalos, Rafael Moral

    Abstract: Bovine Viral Diarrhoea (BVD) has been the focus of a successful eradication programme in Ireland, with the herd-level prevalence declining from 11.3% in 2013 to just 0.2% in 2023. As the country moves toward BVD freedom, the development of predictive models for targeted surveillance becomes increasingly important to mitigate the risk of disease re-emergence. In this study, we evaluate the performa… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  3. arXiv:2504.12397  [pdf, other

    cs.LG cs.AI

    Activated LoRA: Fine-tuned LLMs for Intrinsics

    Authors: Kristjan Greenewald, Luis Lastras, Thomas Parnell, Vraj Shah, Lucian Popa, Giulio Zizzo, Chulaka Gunasekara, Ambrish Rawat, David Cox

    Abstract: Low-Rank Adaptation (LoRA) has emerged as a highly efficient framework for finetuning the weights of large foundation models, and has become the go-to method for data-driven customization of LLMs. Despite the promise of highly customized behaviors and capabilities, switching between relevant LoRAs in a multiturn setting is inefficient, as the key-value (KV) cache of the entire turn history must be… ▽ More

    Submitted 10 June, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

  4. arXiv:2412.01530  [pdf, other

    cs.SD eess.AS stat.AP

    Generative AI-based data augmentation for improved bioacoustic classification in noisy environments

    Authors: Anthony Gibbons, Emma King, Ian Donohue, Andrew Parnell

    Abstract: 1. Obtaining data to train robust artificial intelligence (AI)-based models for species classification can be challenging, particularly for rare species. Data augmentation can boost classification accuracy by increasing the diversity of training data and is cheaper to obtain than expert-labelled data. However, many classic image-based augmentation techniques are not suitable for audio spectrograms… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 18 pages, 3 tables, 5 figures

  5. arXiv:2410.02425  [pdf, other

    cs.DC cs.CL cs.LG

    LLM-Pilot: Characterize and Optimize Performance of your LLM Inference Services

    Authors: Małgorzata Łazuka, Andreea Anghel, Thomas Parnell

    Abstract: As Large Language Models (LLMs) are rapidly growing in popularity, LLM inference services must be able to serve requests from thousands of users while satisfying performance requirements. The performance of an LLM inference service is largely determined by the hardware onto which it is deployed, but understanding of which hardware will deliver on performance requirements remains challenging. In th… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Accepted to the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '24)

  6. arXiv:2407.11927  [pdf, other

    stat.ML cs.LG stat.AP

    Bayesian Causal Forests for Longitudinal Data: Assessing the Impact of Part-Time Work on Growth in High School Mathematics Achievement

    Authors: Nathan McJames, Ann O'Shea, Andrew Parnell

    Abstract: Modelling growth in student achievement is a significant challenge in the field of education. Understanding how interventions or experiences such as part-time work can influence this growth is also important. Traditional methods like difference-in-differences are effective for estimating causal effects from longitudinal data. Meanwhile, Bayesian non-parametric methods have recently become popular… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 25 pages, 7 figures, 3 tables

  7. arXiv:2404.19124  [pdf, other

    cs.CL

    Accelerating Production LLMs with Combined Token/Embedding Speculators

    Authors: Davis Wertheimer, Joshua Rosenkranz, Thomas Parnell, Sahil Suneja, Pavithra Ranganathan, Raghu Ganti, Mudhakar Srivatsa

    Abstract: This technical report describes the design and training of novel speculative decoding draft models, for accelerating the inference speeds of large language models in a production environment. By conditioning draft predictions on both context vectors and sampled tokens, we can train our speculators to efficiently predict high-quality n-grams, which the base model then accepts or rejects. This allow… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Original upload 4/29/24, updated 6/6/24 with additional references to concurrent work

  8. arXiv:2404.15387  [pdf, other

    q-bio.QM cs.LG

    Machine Learning Applied to the Detection of Mycotoxin in Food: A Review

    Authors: Alan Inglis, Andrew Parnell, Natarajan Subramani, Fiona Doohan

    Abstract: Mycotoxins, toxic secondary metabolites produced by certain fungi, pose significant threats to global food safety and public health. These compounds can contaminate a variety of crops, leading to economic losses and health risks to both humans and animals. Traditional lab analysis methods for mycotoxin detection can be time-consuming and may not always be suitable for large-scale screenings. Howev… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 39 pages, 8 figures, review paper

  9. arXiv:2403.13240  [pdf, other

    cs.CL

    SumTra: A Differentiable Pipeline for Few-Shot Cross-Lingual Summarization

    Authors: Jacob Parnell, Inigo Jauregi Unanue, Massimo Piccardi

    Abstract: Cross-lingual summarization (XLS) generates summaries in a language different from that of the input documents (e.g., English to Spanish), allowing speakers of the target language to gain a concise view of their content. In the present day, the predominant approach to this task is to take a performing, pretrained multilingual language model (LM) and fine-tune it for XLS on the language pairs of in… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024

  10. arXiv:2403.03882  [pdf

    cs.CV

    Self and Mixed Supervision to Improve Training Labels for Multi-Class Medical Image Segmentation

    Authors: Jianfei Liu, Christopher Parnell, Ronald M. Summers

    Abstract: Accurate training labels are a key component for multi-class medical image segmentation. Their annotation is costly and time-consuming because it requires domain expertise. This work aims to develop a dual-branch network and automatically improve training labels for multi-class image segmentation. Transfer learning is used to train the network and improve inaccurate weak labels sequentially. The d… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 7 Pages, 3 figures, SPIE Medical Imaging 2024, Computer-aided diagnosis

  11. arXiv:2402.05295  [pdf

    cs.LG cs.AI

    An information theoretic approach to quantify the stability of feature selection and ranking algorithms

    Authors: Alaiz-Rodriguez, R., Parnell, A. C

    Abstract: Feature selection is a key step when dealing with high dimensional data. In particular, these techniques simplify the process of knowledge discovery from the data by selecting the most relevant features out of the noisy, redundant and irrelevant features. A problem that arises in many of these practical applications is that the outcome of the feature selection algorithm is not stable. Thus, small… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  12. arXiv:2401.05294  [pdf, other

    cs.CV

    Enhanced Muscle and Fat Segmentation for CT-Based Body Composition Analysis: A Comparative Study

    Authors: Benjamin Hou, Tejas Sudharshan Mathai, Jianfei Liu, Christopher Parnell, Ronald M. Summers

    Abstract: Purpose: Body composition measurements from routine abdominal CT can yield personalized risk assessments for asymptomatic and diseased patients. In particular, attenuation and volume measures of muscle and fat are associated with important clinical outcomes, such as cardiovascular events, fractures, and death. This study evaluates the reliability of an Internal tool for the segmentation of muscle… ▽ More

    Submitted 12 April, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  13. arXiv:2306.03042  [pdf, other

    cs.LG cs.AI

    SERT: A Transfomer Based Model for Spatio-Temporal Sensor Data with Missing Values for Environmental Monitoring

    Authors: Amin Shoari Nejad, Rocío Alaiz-Rodríguez, Gerard D. McCarthy, Brian Kelleher, Anthony Grey, Andrew Parnell

    Abstract: Environmental monitoring is crucial to our understanding of climate change, biodiversity loss and pollution. The availability of large-scale spatio-temporal data from sources such as sensors and satellites allows us to develop sophisticated models for forecasting and understanding key drivers. However, the data collected from sensors often contain missing values due to faulty equipment or maintena… ▽ More

    Submitted 9 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 11 pages, 7 figures

  14. arXiv:2303.04874  [pdf, other

    stat.ML cs.LG stat.AP

    Bayesian Causal Forests for Multivariate Outcomes: Application to Irish Data From an International Large Scale Education Assessment

    Authors: Nathan McJames, Andrew Parnell, Yong Chen Goh, Ann O'Shea

    Abstract: Bayesian Causal Forests (BCF) is a causal inference machine learning model based on a highly flexible non-parametric regression and classification tool called Bayesian Additive Regression Trees (BART). Motivated by data from the Trends in International Mathematics and Science Study (TIMSS), which includes data on student achievement in both mathematics and science, we present a multivariate extens… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 26 pages, 6 figures

  15. arXiv:2301.03655  [pdf, other

    stat.ML cs.LG

    Bayesian Additive Main Effects and Multiplicative Interaction Models using Tensor Regression for Multi-environmental Trials

    Authors: Antonia A. L. Dos Santos, Danilo A. Sarti, Rafael A. Moral, Andrew C. Parnell

    Abstract: We propose a Bayesian tensor regression model to accommodate the effect of multiple factors on phenotype prediction. We adopt a set of prior distributions that resolve identifiability issues that may arise between the parameters in the model. Simulation experiments show that our method out-performs previous related models and machine learning algorithms under different sample sizes and degrees of… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

  16. arXiv:2212.01457  [pdf, other

    cs.SD eess.AS

    NEAL: An open-source tool for audio annotation

    Authors: Anthony Gibbons, Ian Donohue, Courtney E. Gorman, Emma King, Andrew Parnell

    Abstract: Passive acoustic monitoring is used widely in ecology, biodiversity, and conservation studies. Data sets collected via acoustic monitoring are often extremely large and built to be processed automatically using Artificial Intelligence and Machine learning models, which aim to replicate the work of domain experts. These models, being supervised learning algorithms, need to be trained on high qualit… ▽ More

    Submitted 8 December, 2022; v1 submitted 2 December, 2022; originally announced December 2022.

  17. arXiv:2207.09543  [pdf

    cs.RO cs.HC cs.MA

    Industry Led Use-Case Development for Human-Swarm Operations

    Authors: Jediah R. Clark, Mohammad Naiseh, Joel Fischer, Marise Galvez Trigo, Katie Parnell, Mario Brito, Adrian Bodenmann, Sarvapali D. Ramchurn, Mohammad Divband Soorati

    Abstract: In the domain of unmanned vehicles, autonomous robotic swarms promise to deliver increased efficiency and collective autonomy. How these swarms will operate in the future, and what communication requirements and operational boundaries will arise are yet to be sufficiently defined. A workshop was conducted with 11 professional unmanned-vehicle operators and designers with the objective of identifyi… ▽ More

    Submitted 24 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted at AAAI 2022 Spring Symposium Series (Putting AI in the Critical Loop: Assured Trust and Autonomy in Human-Machine Teams)

  18. arXiv:2207.00011  [pdf, other

    stat.ML cs.LG stat.ME

    Variational Inference for Additive Main and Multiplicative Interaction Effects Models

    Authors: AntÔnia A. L. Dos Santos, Rafael A. Moral, Danilo A. Sarti, Andrew C. Parnell

    Abstract: In plant breeding the presence of a genotype by environment (GxE) interaction has a strong impact on cultivation decision making and the introduction of new crop cultivars. The combination of linear and bilinear terms has been shown to be very useful in modelling this type of data. A widely-used approach to identify GxE is the Additive Main Effects and Multiplicative Interaction Effects (AMMI) mod… ▽ More

    Submitted 29 June, 2022; originally announced July 2022.

  19. arXiv:2204.09437  [pdf, other

    cs.DC cs.LG

    Search-based Methods for Multi-Cloud Configuration

    Authors: Małgorzata Łazuka, Thomas Parnell, Andreea Anghel, Haralampos Pozidis

    Abstract: Multi-cloud computing has become increasingly popular with enterprises looking to avoid vendor lock-in. While most cloud providers offer similar functionality, they may differ significantly in terms of performance and/or cost. A customer looking to benefit from such differences will naturally want to solve the multi-cloud configuration problem: given a workload, which cloud provider should be chos… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Submitted to IEEE Cloud 2022

  20. arXiv:2204.07207  [pdf, other

    stat.ME cs.LG

    Hierarchical Embedded Bayesian Additive Regression Trees

    Authors: Bruna Wundervald, Andrew Parnell, Katarina Domijan

    Abstract: We propose a simple yet powerful extension of Bayesian Additive Regression Trees which we name Hierarchical Embedded BART (HE-BART). The model allows for random effects to be included at the terminal node level of a set of regression trees, making HE-BART a non-parametric alternative to mixed effects models which avoids the need for the user to specify the structure of the random effects in the mo… ▽ More

    Submitted 24 April, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

  21. arXiv:2204.02112  [pdf, other

    stat.ME cs.LG stat.ML

    GP-BART: a novel Bayesian additive regression trees approach using Gaussian processes

    Authors: Mateus Maia, Keefe Murphy, Andrew C. Parnell

    Abstract: The Bayesian additive regression trees (BART) model is an ensemble method extensively and successfully used in regression tasks due to its consistently strong predictive performance and its ability to quantify uncertainty. BART combines "weak" tree models through a set of shrinkage priors, whereby each tree explains a small portion of the variability in the data. However, the lack of smoothness an… ▽ More

    Submitted 14 September, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

  22. arXiv:2203.02894  [pdf, other

    cs.CL

    A Multi-Document Coverage Reward for RELAXed Multi-Document Summarization

    Authors: Jacob Parnell, Inigo Jauregi Unanue, Massimo Piccardi

    Abstract: Multi-document summarization (MDS) has made significant progress in recent years, in part facilitated by the availability of new, dedicated datasets and capacious language models. However, a standing limitation of these models is that they are trained against limited references and with plain maximum-likelihood objectives. As for many other generative tasks, reinforcement learning (RL) offers the… ▽ More

    Submitted 6 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022

  23. arXiv:2108.07636  [pdf, other

    stat.ML cs.LG

    Accounting for shared covariates in semi-parametric Bayesian additive regression trees

    Authors: Estevão B. Prado, Andrew C. Parnell, Keefe Murphy, Nathan McJames, Ann O'Shea, Rafael A. Moral

    Abstract: We propose some extensions to semi-parametric models based on Bayesian additive regression trees (BART). In the semi-parametric BART paradigm, the response variable is approximated by a linear predictor and a BART model, where the linear component is responsible for estimating the main effects and BART accounts for non-specified interactions and non-linearities. Previous semi-parametric models bas… ▽ More

    Submitted 30 July, 2024; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: 48 pages, 8 tables, 10 figures

  24. arXiv:2107.12809  [pdf, other

    cs.LG cs.CE

    Bayesian Optimisation for Sequential Experimental Design with Applications in Additive Manufacturing

    Authors: Mimi Zhang, Andrew Parnell, Dermot Brabazon, Alessio Benavoli

    Abstract: Bayesian optimization (BO) is an approach to globally optimizing black-box objective functions that are expensive to evaluate. BO-powered experimental design has found wide application in materials science, chemistry, experimental physics, drug development, etc. This work aims to bring attention to the benefits of applying BO in designing experiments and to provide a BO manual, covering both metho… ▽ More

    Submitted 8 October, 2023; v1 submitted 27 July, 2021; originally announced July 2021.

  25. arXiv:2106.04080  [pdf, other

    cs.CL

    RewardsOfSum: Exploring Reinforcement Learning Rewards for Summarisation

    Authors: Jacob Parnell, Inigo Jauregi Unanue, Massimo Piccardi

    Abstract: To date, most abstractive summarisation models have relied on variants of the negative log-likelihood (NLL) as their training objective. In some cases, reinforcement learning has been added to train the models with an objective that is closer to their evaluation measures (e.g. ROUGE). However, the reward function to be used within the reinforcement learning approach can play a key role for perform… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: 5th Workshop on Structured Prediction for NLP; held in conjunction with ACL-IJCNLP 2021

  26. arXiv:2106.02208  [pdf, other

    cs.CL

    BERTTune: Fine-Tuning Neural Machine Translation with BERTScore

    Authors: Inigo Jauregi Unanue, Jacob Parnell, Massimo Piccardi

    Abstract: Neural machine translation models are often biased toward the limited translation references seen during training. To amend this form of overfitting, in this paper we propose fine-tuning the models with a novel training objective based on the recently-proposed BERTScore evaluation metric. BERTScore is a scoring function based on contextual embeddings that overcomes the typical limitations of n-gra… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted at ACL 2021

  27. arXiv:2011.07921  [pdf, other

    cs.DB cs.LG

    Towards a General Framework for ML-based Self-tuning Databases

    Authors: Thomas Schmied, Diego Didona, Andreas Döring, Thomas Parnell, Nikolas Ioannou

    Abstract: Machine learning (ML) methods have recently emerged as an effective way to perform automated parameter tuning of databases. State-of-the-art approaches include Bayesian optimization (BO) and reinforcement learning (RL). In this work, we describe our experience when applying these methods to a database not yet studied in this context: FoundationDB. Firstly, we describe the challenges we faced, such… ▽ More

    Submitted 27 April, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

  28. arXiv:2006.09745  [pdf, other

    cs.LG stat.ML

    SnapBoost: A Heterogeneous Boosting Machine

    Authors: Thomas Parnell, Andreea Anghel, Malgorzata Lazuka, Nikolas Ioannou, Sebastian Kurella, Peshal Agarwal, Nikolaos Papandreou, Haralampos Pozidis

    Abstract: Modern gradient boosting software frameworks, such as XGBoost and LightGBM, implement Newton descent in a functional space. At each boosting iteration, their goal is to find the base hypothesis, selected from some base hypothesis class, that is closest to the Newton descent direction in a Euclidean sense. Typically, the base hypothesis class is fixed to be all binary decision trees up to a given d… ▽ More

    Submitted 25 September, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  29. arXiv:2006.07515  [pdf, other

    stat.ML cs.IR cs.LG

    Generalizing Gain Penalization for Feature Selection in Tree-based Models

    Authors: Bruna Wundervald, Andrew Parnell, Katarina Domijan

    Abstract: We develop a new approach for feature selection via gain penalization in tree-based models. First, we show that previous methods do not perform sufficient regularization and often exhibit sub-optimal out-of-sample performance, especially when correlated features are present. Instead, we develop a new gain penalization idea that exhibits a general local-global regularization for tree-based models.… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

    Comments: 13 pages, 2 figures

  30. Bayesian Additive Regression Trees with Model Trees

    Authors: Estevão B. Prado, Rafael A. Moral, Andrew C. Parnell

    Abstract: Bayesian Additive Regression Trees (BART) is a tree-based machine learning method that has been successfully applied to regression and classification problems. BART assumes regularisation priors on a set of trees that work as weak learners and is very flexible for predicting in the presence of non-linearity and high-order interactions. In this paper, we introduce an extension of BART, called Model… ▽ More

    Submitted 10 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: Statistics and Computing 31, 20 (2021)

  31. arXiv:2006.07272  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Stochastic Coordinate Descent

    Authors: Georgios Damaskinos, Celestine Mendler-Dünner, Rachid Guerraoui, Nikolaos Papandreou, Thomas Parnell

    Abstract: In this paper we tackle the challenge of making the stochastic coordinate descent algorithm differentially private. Compared to the classical gradient descent algorithm where updates operate on a single model vector and controlled noise addition to this vector suffices to hide critical information about individuals, stochastic coordinate descent crucially relies on keeping auxiliary information in… ▽ More

    Submitted 14 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

  32. arXiv:1911.07722  [pdf, other

    cs.LG cs.DC stat.ML

    SySCD: A System-Aware Parallel Coordinate Descent Algorithm

    Authors: Nikolas Ioannou, Celestine Mendler-Dünner, Thomas Parnell

    Abstract: In this paper we propose a novel parallel stochastic coordinate descent (SCD) algorithm with convergence guarantees that exhibits strong scalability. We start by studying a state-of-the-art parallel implementation of SCD and identify scalability as well as system-level performance bottlenecks of the respective implementation. We then take a principled approach to develop a new SCD variant which is… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: accepted as a spotlight at NeurIPS 2019, Vancouver, Canada

  33. arXiv:1910.06853  [pdf, other

    cs.LG stat.ML

    Breadth-first, Depth-next Training of Random Forests

    Authors: Andreea Anghel, Nikolas Ioannou, Thomas Parnell, Nikolaos Papandreou, Celestine Mendler-Dünner, Haris Pozidis

    Abstract: In this paper we analyze, evaluate, and improve the performance of training Random Forest (RF) models on modern CPU architectures. An exact, state-of-the-art binary decision tree building algorithm is used as the basis of this study. Firstly, we investigate the trade-offs between using different tree building algorithms, namely breadth-first-search (BFS) and depth-search-first (DFS). We design a n… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  34. arXiv:1909.07218  [pdf, other

    cs.LG stat.ML

    Learning to Tune XGBoost with XGBoost

    Authors: Johanna Sommer, Dimitrios Sarigiannis, Thomas Parnell

    Abstract: In this short paper we investigate whether meta-learning techniques can be used to more effectively tune the hyperparameters of machine learning models using successive halving (SH). We propose a novel variant of the SH algorithm (MeSH), that uses meta-regressors to determine which candidate configurations should be eliminated at each round. We apply MeSH to the problem of tuning the hyperparamete… ▽ More

    Submitted 21 November, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: Accepted for presentation at The 3rd Workshop on Meta-Learning (Meta-Learn 2019), Vancouver, Canada

  35. arXiv:1909.07140  [pdf, other

    cs.LG stat.ML

    Weighted Sampling for Combined Model Selection and Hyperparameter Tuning

    Authors: Dimitrios Sarigiannis, Thomas Parnell, Haris Pozidis

    Abstract: The combined algorithm selection and hyperparameter tuning (CASH) problem is characterized by large hierarchical hyperparameter spaces. Model-free hyperparameter tuning methods can explore such large spaces efficiently since they are highly parallelizable across multiple machines. When no prior knowledge or meta-data exists to boost their performance, these methods commonly sample random configura… ▽ More

    Submitted 21 November, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

    Comments: Accepted for presentation at The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)

  36. arXiv:1909.04885  [pdf, other

    cs.LG cs.DC cs.PF stat.ML

    Addressing Algorithmic Bottlenecks in Elastic Machine Learning with Chicle

    Authors: Michael Kaufmann, Kornilios Kourtis, Celestine Mendler-Dünner, Adrian Schüpbach, Thomas Parnell

    Abstract: Distributed machine learning training is one of the most common and important workloads running on data centers today, but it is rarely executed alone. Instead, to reduce costs, computing resources are consolidated and shared by different applications. In this scenario, elasticity and proper load balancing are vital to maximize efficiency, fairness, and utilization. Currently, most distributed tra… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

  37. arXiv:1906.03474  [pdf, other

    cs.LG

    5 Parallel Prism: A topology for pipelined implementations of convolutional neural networks using computational memory

    Authors: Martino Dazzi, Abu Sebastian, Pier Andrea Francese, Thomas Parnell, Luca Benini, Evangelos Eleftheriou

    Abstract: In-memory computing is an emerging computing paradigm that could enable deeplearning inference at significantly higher energy efficiency and reduced latency. The essential idea is to map the synaptic weights corresponding to each layer to one or more computational memory (CM) cores. During inference, these cores perform the associated matrix-vector multiply operations in place with O(1) time compl… ▽ More

    Submitted 8 June, 2019; originally announced June 2019.

  38. arXiv:1903.09434  [pdf, other

    cs.LG stat.ML

    Sampling Acquisition Functions for Batch Bayesian Optimization

    Authors: Alessandro De Palma, Celestine Mendler-Dünner, Thomas Parnell, Andreea Anghel, Haralampos Pozidis

    Abstract: We present Acquisition Thompson Sampling (ATS), a novel technique for batch Bayesian Optimization (BO) based on the idea of sampling multiple acquisition functions from a stochastic process. We define this process through the dependency of the acquisition functions on a set of model hyper-parameters. ATS is conceptually simple, straightforward to implement and, unlike other batch BO methods, it ca… ▽ More

    Submitted 16 October, 2019; v1 submitted 22 March, 2019; originally announced March 2019.

    Comments: Presented at BNP@NeurIPS 2018

  39. arXiv:1812.09178  [pdf

    cs.LG stat.AP stat.ML

    An Evaluation of Methods for Real-Time Anomaly Detection using Force Measurements from the Turning Process

    Authors: Yuanzhi Huang, Eamonn Ahearne, Szymon Baron, Andrew Parnell

    Abstract: We examined the use of three conventional anomaly detection methods and assess their potential for on-line tool wear monitoring. Through efficient data processing and transformation of the algorithm proposed here, in a real-time environment, these methods were tested for fast evaluation of cutting tools on CNC machines. The three-dimensional force data streams we used were extracted from a turning… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

    MSC Class: 60G35; 62P30; 68T10 ACM Class: G.3; I.5.0; I.6.0

  40. arXiv:1811.02322  [pdf, other

    cs.LG cs.HC stat.ML

    Elastic CoCoA: Scaling In to Improve Convergence

    Authors: Michael Kaufmann, Thomas Parnell, Kornilios Kourtis

    Abstract: In this paper we experimentally analyze the convergence behavior of CoCoA and show, that the number of workers required to achieve the highest convergence rate at any point in time, changes over the course of the training. Based on this observation, we build Chicle, an elastic framework that dynamically adjusts the number of workers based on feedback from the training algorithm, in order to select… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

  41. arXiv:1811.01564  [pdf, other

    cs.LG stat.ML

    Parallel training of linear models without compromising convergence

    Authors: Nikolas Ioannou, Celestine Dünner, Kornilios Kourtis, Thomas Parnell

    Abstract: In this paper we analyze, evaluate, and improve the performance of training generalized linear models on modern CPUs. We start with a state-of-the-art asynchronous parallel training algorithm, identify system-level performance bottlenecks, and apply optimizations that improve data parallelism, cache line locality, and cache line prefetching of the algorithm. These modifications reduce the per-epoc… ▽ More

    Submitted 19 December, 2018; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: Presented at the Workshop on Systems for ML and Open Source Software at NeurIPS 2018

  42. arXiv:1809.04559  [pdf, other

    cs.LG stat.ML

    Benchmarking and Optimization of Gradient Boosting Decision Tree Algorithms

    Authors: Andreea Anghel, Nikolaos Papandreou, Thomas Parnell, Alessandro De Palma, Haralampos Pozidis

    Abstract: Gradient boosting decision trees (GBDTs) have seen widespread adoption in academia, industry and competitive data science due to their state-of-the-art performance in many machine learning tasks. One relative downside to these models is the large number of hyper-parameters that they expose to the end-user. To maximize the predictive power of GBDT models, one must either manually tune the hyper-par… ▽ More

    Submitted 17 January, 2019; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Workshop on Systems for ML and Open Source Software at NeurIPS 2018, Montreal, Canada

  43. arXiv:1803.06333  [pdf, other

    cs.LG cs.AI cs.DC

    Snap ML: A Hierarchical Framework for Machine Learning

    Authors: Celestine Dünner, Thomas Parnell, Dimitrios Sarigiannis, Nikolas Ioannou, Andreea Anghel, Gummadi Ravi, Madhusudanan Kandasamy, Haralampos Pozidis

    Abstract: We describe a new software framework for fast training of generalized linear models. The framework, named Snap Machine Learning (Snap ML), combines recent advances in machine learning systems and algorithms in a nested manner to reflect the hierarchical architecture of modern computing systems. We prove theoretically that such a hierarchical system can accelerate training in distributed environmen… ▽ More

    Submitted 29 November, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: in Proceedings of the Thirty-Second Conference on Neural Information Processing Systems (NeurIPS 2018)

  44. arXiv:1711.07227  [pdf, ps, other

    cs.IR cs.DC cs.DS

    Linear-Complexity Relaxed Word Mover's Distance with GPU Acceleration

    Authors: Kubilay Atasu, Thomas Parnell, Celestine Dünner, Manolis Sifalakis, Haralampos Pozidis, Vasileios Vasileiadis, Michail Vlachos, Cesar Berrospi, Abdel Labbi

    Abstract: The amount of unstructured text-based data is growing every day. Querying, clustering, and classifying this big data requires similarity computations across large sets of documents. Whereas low-complexity similarity metrics are available, attention has been shifting towards more complex methods that achieve a higher accuracy. In particular, the Word Mover's Distance (WMD) method proposed by Kusner… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: To appear in the 2017 IEEE International Conference on Big Data (Big Data 2017) http://cci.drexel.edu/bigdata/bigdata2017/ December 11-14, 2017, Boston, MA, USA

  45. Neuromorphic computing with multi-memristive synapses

    Authors: Irem Boybat, Manuel Le Gallo, S. R. Nandakumar, Timoleon Moraitis, Thomas Parnell, Tomas Tuma, Bipin Rajendran, Yusuf Leblebici, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Neuromorphic computing has emerged as a promising avenue towards building the next generation of intelligent computing systems. It has been proposed that memristive devices, which exhibit history-dependent conductivity modulation, could efficiently represent the synaptic weights in artificial neural networks. However, precise modulation of the device conductance over a wide dynamic range, necessar… ▽ More

    Submitted 24 February, 2019; v1 submitted 17 November, 2017; originally announced November 2017.

    Journal ref: Nature Communications, volume 9, page 2514 (2018)

  46. arXiv:1708.05357  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Efficient Use of Limited-Memory Accelerators for Linear Learning on Heterogeneous Systems

    Authors: Celestine Dünner, Thomas Parnell, Martin Jaggi

    Abstract: We propose a generic algorithmic building block to accelerate training of machine learning models on heterogeneous compute systems. Our scheme allows to efficiently employ compute accelerators such as GPUs and FPGAs for the training of large-scale machine learning models, when the training data exceeds their memory capacity. Also, it provides adaptivity to any system's memory hierarchy in terms of… ▽ More

    Submitted 7 November, 2017; v1 submitted 17 August, 2017; originally announced August 2017.

    MSC Class: 90C25; 68W15; 68W10 ACM Class: G.1.6; C.1.4

  47. Temporal correlation detection using computational phase-change memory

    Authors: Abu Sebastian, Tomas Tuma, Nikolaos Papandreou, Manuel Le Gallo, Lukas Kull, Thomas Parnell, Evangelos Eleftheriou

    Abstract: For decades, conventional computers based on the von Neumann architecture have performed computation by repeatedly transferring data between their processing and their memory units, which are physically separated. As computation becomes increasingly data-centric and as the scalability limits in terms of performance and power are being reached, alternative computing paradigms are searched for in wh… ▽ More

    Submitted 1 June, 2017; originally announced June 2017.

  48. arXiv:1702.07005  [pdf, other

    cs.LG cs.DC

    Large-Scale Stochastic Learning using GPUs

    Authors: Thomas Parnell, Celestine Dünner, Kubilay Atasu, Manolis Sifalakis, Haris Pozidis

    Abstract: In this work we propose an accelerated stochastic learning system for very large-scale applications. Acceleration is achieved by mapping the training algorithm onto massively parallel processors: we demonstrate a parallel, asynchronous GPU implementation of the widely used stochastic coordinate descent/ascent algorithm that can provide up to 35x speed-up over a sequential CPU implementation. In or… ▽ More

    Submitted 22 February, 2017; originally announced February 2017.

    Comments: Accepted for publication in ParLearning 2017: The 6th International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics, Orlando, Florida, May 2017

  49. Understanding and Optimizing the Performance of Distributed Machine Learning Applications on Apache Spark

    Authors: Celestine Dünner, Thomas Parnell, Kubilay Atasu, Manolis Sifalakis, Haralampos Pozidis

    Abstract: In this paper we explore the performance limits of Apache Spark for machine learning applications. We begin by analyzing the characteristics of a state-of-the-art distributed machine learning algorithm implemented in Spark and compare it to an equivalent reference implementation using the high performance computing framework MPI. We identify critical bottlenecks of the Spark framework and carefull… ▽ More

    Submitted 12 December, 2017; v1 submitted 5 December, 2016; originally announced December 2016.

    Comments: To appear in the 2017 IEEE International Conference on Big Data (Big Data 2017), December 11-14, 2017, Boston, MA, USA

  50. arXiv:1604.02071  [pdf, other

    cs.IR

    Scalable and interpretable product recommendations via overlapping co-clustering

    Authors: Reinhard Heckel, Michail Vlachos, Thomas Parnell, Celestine Dünner

    Abstract: We consider the problem of generating interpretable recommendations by identifying overlapping co-clusters of clients and products, based only on positive or implicit feedback. Our approach is applicable on very large datasets because it exhibits almost linear complexity in the input examples and the number of co-clusters. We show, both on real industrial data and on publicly available datasets, t… ▽ More

    Submitted 17 May, 2017; v1 submitted 7 April, 2016; originally announced April 2016.

    Comments: In IEEE International Conference on Data Engineering (ICDE) 2017