Skip to main content

Showing 1–11 of 11 results for author: Ioannou, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2205.11261  [pdf, other

    cs.DC cs.DB

    An Elastic Ephemeral Datastore using Cheap, Transient Cloud Resources

    Authors: Malte Brodmann, Nikolas Ioannou, Bernard Metzler, Jonas Pfefferle, Ana Klimovic

    Abstract: Spot instances are virtual machines offered at 60-90% lower cost that can be reclaimed at any time, with only a short warning period. Spot instances have already been used to significantly reduce the cost of processing workloads in the cloud. However, leveraging spot instances to reduce the cost of stateful cloud applications is much more challenging, as the sudden preemptions lead to data loss. I… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  2. arXiv:2011.07921  [pdf, other

    cs.DB cs.LG

    Towards a General Framework for ML-based Self-tuning Databases

    Authors: Thomas Schmied, Diego Didona, Andreas Döring, Thomas Parnell, Nikolas Ioannou

    Abstract: Machine learning (ML) methods have recently emerged as an effective way to perform automated parameter tuning of databases. State-of-the-art approaches include Bayesian optimization (BO) and reinforcement learning (RL). In this work, we describe our experience when applying these methods to a database not yet studied in this context: FoundationDB. Firstly, we describe the challenges we faced, such… ▽ More

    Submitted 27 April, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

  3. arXiv:2006.09745  [pdf, other

    cs.LG stat.ML

    SnapBoost: A Heterogeneous Boosting Machine

    Authors: Thomas Parnell, Andreea Anghel, Malgorzata Lazuka, Nikolas Ioannou, Sebastian Kurella, Peshal Agarwal, Nikolaos Papandreou, Haralampos Pozidis

    Abstract: Modern gradient boosting software frameworks, such as XGBoost and LightGBM, implement Newton descent in a functional space. At each boosting iteration, their goal is to find the base hypothesis, selected from some base hypothesis class, that is closest to the Newton descent direction in a Euclidean sense. Typically, the base hypothesis class is fixed to be all binary decision trees up to a given d… ▽ More

    Submitted 25 September, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

  4. arXiv:2006.04658  [pdf, other

    cs.DB cs.PF

    Toward a Better Understanding and Evaluation of Tree Structures on Flash SSDs

    Authors: Diego Didona, Nikolas Ioannou, Radu Stoica, Kornilios Kourtis

    Abstract: Solid-state drives (SSDs) are extensively used to deploy persistent data stores, as they provide low latency random access, high write throughput, high data density, and low cost. Tree-based data structures are widely used to build persistent data stores, and indeed they lie at the backbone of many of the data management systems used in production and research today. In this paper, we show that be… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  5. arXiv:2003.04293  [pdf, other

    cs.DC cs.ET

    Compiling Neural Networks for a Computational Memory Accelerator

    Authors: Kornilios Kourtis, Martino Dazzi, Nikolas Ioannou, Tobias Grosser, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Computational memory (CM) is a promising approach for accelerating inference on neural networks (NN) by using enhanced memories that, in addition to storing data, allow computations on them. One of the main challenges of this approach is defining a hardware/software interface that allows a compiler to map NN models for efficient execution on the underlying CM accelerator. This is a non-trivial tas… ▽ More

    Submitted 24 April, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted at SPMA '20

  6. arXiv:2002.11528  [pdf, other

    cs.DC cs.OS

    Safe and Efficient Remote Application Code Execution on Disaggregated NVM Storage with eBPF

    Authors: Kornilios Kourtis, Animesh Trivedi, Nikolas Ioannou

    Abstract: With rapid improvements in NVM storage devices, the performance bottleneck is gradually shifting to the network, thus giving rise to the notion of "data movement wall". To reduce the amount of data movement over the network, researchers have proposed near-data computing by shipping operations and compute-extensions closer to storage devices. However, running arbitrary, user-provided extensions in… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  7. arXiv:1911.07722  [pdf, other

    cs.LG cs.DC stat.ML

    SySCD: A System-Aware Parallel Coordinate Descent Algorithm

    Authors: Nikolas Ioannou, Celestine Mendler-Dünner, Thomas Parnell

    Abstract: In this paper we propose a novel parallel stochastic coordinate descent (SCD) algorithm with convergence guarantees that exhibits strong scalability. We start by studying a state-of-the-art parallel implementation of SCD and identify scalability as well as system-level performance bottlenecks of the respective implementation. We then take a principled approach to develop a new SCD variant which is… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: accepted as a spotlight at NeurIPS 2019, Vancouver, Canada

  8. arXiv:1910.06853  [pdf, other

    cs.LG stat.ML

    Breadth-first, Depth-next Training of Random Forests

    Authors: Andreea Anghel, Nikolas Ioannou, Thomas Parnell, Nikolaos Papandreou, Celestine Mendler-Dünner, Haris Pozidis

    Abstract: In this paper we analyze, evaluate, and improve the performance of training Random Forest (RF) models on modern CPU architectures. An exact, state-of-the-art binary decision tree building algorithm is used as the basis of this study. Firstly, we investigate the trade-offs between using different tree building algorithms, namely breadth-first-search (BFS) and depth-search-first (DFS). We design a n… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  9. arXiv:1811.01564  [pdf, other

    cs.LG stat.ML

    Parallel training of linear models without compromising convergence

    Authors: Nikolas Ioannou, Celestine Dünner, Kornilios Kourtis, Thomas Parnell

    Abstract: In this paper we analyze, evaluate, and improve the performance of training generalized linear models on modern CPUs. We start with a state-of-the-art asynchronous parallel training algorithm, identify system-level performance bottlenecks, and apply optimizations that improve data parallelism, cache line locality, and cache line prefetching of the algorithm. These modifications reduce the per-epoc… ▽ More

    Submitted 19 December, 2018; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: Presented at the Workshop on Systems for ML and Open Source Software at NeurIPS 2018

  10. arXiv:1803.06333  [pdf, other

    cs.LG cs.AI cs.DC

    Snap ML: A Hierarchical Framework for Machine Learning

    Authors: Celestine Dünner, Thomas Parnell, Dimitrios Sarigiannis, Nikolas Ioannou, Andreea Anghel, Gummadi Ravi, Madhusudanan Kandasamy, Haralampos Pozidis

    Abstract: We describe a new software framework for fast training of generalized linear models. The framework, named Snap Machine Learning (Snap ML), combines recent advances in machine learning systems and algorithms in a nested manner to reflect the hierarchical architecture of modern computing systems. We prove theoretically that such a hierarchical system can accelerate training in distributed environmen… ▽ More

    Submitted 29 November, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: in Proceedings of the Thirty-Second Conference on Neural Information Processing Systems (NeurIPS 2018)

  11. Elevating commodity storage with the SALSA host translation layer

    Authors: Nikolas Ioannou, Kornilios Kourtis, Ioannis Koltsidas

    Abstract: To satisfy increasing storage demands in both capacity and performance, industry has turned to multiple storage technologies, including Flash SSDs and SMR disks. These devices employ a translation layer that conceals the idiosyncrasies of their mediums and enables random access. Device translation layers are, however, inherently constrained: resources on the drive are scarce, they cannot be adapte… ▽ More

    Submitted 10 January, 2019; v1 submitted 17 January, 2018; originally announced January 2018.

    Comments: Presented at 2018 IEEE 26th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)