Skip to main content

Showing 1–4 of 4 results for author: Kolodziej, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:1906.04825  [pdf, other

    cs.OH cs.AI

    Customizing Pareto Simulated Annealing for Multi-objective Optimization of Control Cabinet Layout

    Authors: Sabri Pllana, Suejb Memeti, Joanna Kolodziej

    Abstract: Determining the optimal location of control cabinet components requires the exploration of a large configuration space. For real-world control cabinets it is impractical to evaluate all possible cabinet configurations. Therefore, we need to apply methods for intelligent exploration of cabinet configuration space that enable to find a near-optimal configuration without evaluation of all possible co… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Preprint, CSCS22, (C) 2019 IEEE

  2. arXiv:1906.01992  [pdf, other

    cs.DC cs.LG cs.PF

    Performance Modelling of Deep Learning on Intel Many Integrated Core Architectures

    Authors: Andre Viebke, Sabri Pllana, Suejb Memeti, Joanna Kolodziej

    Abstract: Many complex problems, such as natural language processing or visual object detection, are solved using deep learning. However, efficient training of complex deep convolutional neural networks for large data sets is computationally demanding and requires parallel computing resources. In this paper, we present two parameterized performance models for estimation of execution time of training convolu… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Preprint, HPCS. arXiv admin note: substantial text overlap with arXiv:1702.07908

  3. Using Meta-heuristics and Machine Learning for Software Optimization of Parallel Computing Systems: A Systematic Literature Review

    Authors: Suejb Memeti, Sabri Pllana, Alecio Binotto, Joanna Kolodziej, Ivona Brandic

    Abstract: While modern parallel computing systems offer high performance, utilizing these powerful computing resources to the highest possible extent demands advanced knowledge of various hardware architectures and parallel programming models. Furthermore, optimized software execution on parallel computing systems demands consideration of many parameters at compile-time and run-time. Determining the optimal… ▽ More

    Submitted 2 May, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

    Comments: Preprint

  4. arXiv:1704.05316  [pdf, other

    cs.DC cs.PF cs.PL cs.SE

    Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: programming productivity, performance, and energy consumption

    Authors: Suejb Memeti, Lu Li, Sabri Pllana, Joanna Kolodziej, Christoph Kessler

    Abstract: Many modern parallel computing systems are heterogeneous at their node level. Such nodes may comprise general purpose CPUs and accelerators (such as, GPU, or Intel Xeon Phi) that provide high performance with suitable energy-consumption characteristics. However, exploiting the available performance of heterogeneous architectures may be challenging. There are various parallel programming frameworks… ▽ More

    Submitted 18 April, 2017; originally announced April 2017.