Skip to main content

Showing 1–12 of 12 results for author: van Nieuwpoort, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.16001  [pdf, other

    cs.PF

    Multi-Strided Access Patterns to Boost Hardware Prefetching

    Authors: Miguel O. Blom, Kristian F. D. Rietveld, Rob V. van Nieuwpoort

    Abstract: Important memory-bound kernels, such as linear algebra, convolutions, and stencils, rely on SIMD instructions as well as optimizations targeting improved vectorized data traversal and data re-use to attain satisfactory performance. On on temporary CPU architectures, the hardware prefetcher is of key importance for efficient utilization of the memory hierarchy. In this paper, we demonstrate that tr… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: 12 pages, 6 figures, 2 tables

  2. arXiv:2412.08490  [pdf, other

    astro-ph.IM cs.AI

    SuperCode: Sustainability PER AI-driven CO-DEsign

    Authors: P. Chris Broekema, Rob V. van Nieuwpoort

    Abstract: Currently, data-intensive scientific applications require vast amounts of compute resources to deliver world-leading science. The climate emergency has made it clear that unlimited use of resources (e.g., energy) for scientific discovery is no longer acceptable. Future computing hardware promises to be much more energy efficient, but without better optimized software this cannot reach its full pot… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  3. In Serverless, OS Scheduler Choice Costs Money: A Hybrid Scheduling Approach for Cheaper FaaS

    Authors: Yuxuan Zhao, Weikang Weng, Rob van Nieuwpoort, Alexandru Uta

    Abstract: In Function-as-a-Service (FaaS) serverless, large applications are split into short-lived stateless functions. Deploying functions is mutually profitable: users need not be concerned with resource management, while providers can keep their servers at high utilization rates running thousands of functions concurrently on a single machine. It is exactly this high concurrency that comes at a cost. The… ▽ More

    Submitted 13 November, 2024; originally announced November 2024.

    Comments: Accepted at Middleware 2024, author draft made available for timely dissemination

  4. FAIRSECO: An Extensible Framework for Impact Measurement of Research Software

    Authors: Deekshitha, Siamak Farshidi, Jason Maassen, Rena Bakhshi, Rob van Nieuwpoort, Slinger Jansen

    Abstract: The growing usage of research software in the research community has highlighted the need to recognize and acknowledge the contributions made not only by researchers but also by Research Software Engineers. However, the existing methods for crediting research software and Research Software Engineers have proven to be insufficient. In response, we have developed FAIRSECO, an extensible open source… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures 2 tables. Preprint of the IEEE eScience 2023 Conference publication

    ACM Class: D.2; K.6.3; K.7

  5. arXiv:2406.01788  [pdf, other

    cs.SE

    RSMM: A Framework to Assess Maturity of Research Software Project

    Authors: Deekshitha, Rena Bakhshi, Jason Maassen, Carlos Martinez Ortiz, Rob van Nieuwpoort, Slinger Jansen

    Abstract: The organizations and researchers producing research software face a common problem of making their software sustainable beyond funding provided by a single research project. This is addressed by research software engineers through building communities around their software, providing appropriate licensing, creating reliable and reproducible research software, making it sustainable and impactful,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures, 1 table. Submitted to IEEE eScience 2024 Conference

    ACM Class: D.2; K.6.3

  6. The ROAD to discovery: machine learning-driven anomaly detection in radio astronomy spectrograms

    Authors: Michael Mesarcik, Albert-Jan Boonstra, Marco Iacobelli, Elena Ranguelova, Cees de Laat, Rob van Nieuwpoort

    Abstract: As radio telescopes increase in sensitivity and flexibility, so do their complexity and data-rates. For this reason automated system health management approaches are becoming increasingly critical to ensure nominal telescope operations. We propose a new machine learning anomaly detection framework for classifying both commonly occurring anomalies in radio telescopes as well as detecting unknown ra… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Journal ref: A&A 680, A74 (2023)

  7. arXiv:2202.05549  [pdf, other

    cs.DC

    Lightning: Scaling the GPU Programming Model Beyond a Single GPU

    Authors: Stijn Heldens, Pieter Hijma, Ben van Werkhoven, Jason Maassen, Rob. V. van Nieuwpoort

    Abstract: The GPU programming model is primarily aimed at the development of applications that run one GPU. However, this limits the scalability of GPU code to the capabilities of a single GPU in terms of compute power and memory capacity. To scale GPU applications further, a great engineering effort is typically required: work and data must be divided over multiple GPUs by hand, possibly in multiple nodes,… ▽ More

    Submitted 2 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: To be published at 36th IEEE International Parallel & Distributed Processing Symposium (IPDPS)

  8. arXiv:2111.14991  [pdf, other

    cs.LG cs.DC cs.PF math.OC

    Bayesian Optimization for auto-tuning GPU kernels

    Authors: Floris-Jan Willemsen, Rob van Nieuwpoort, Ben van Werkhoven

    Abstract: Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a non-convex search space, using an expensive to evaluate function with unknown derivative. These characteristics make a good candidate for Bayesian Optimization, which has not been applied to this problem before. However, the a… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: In context of the 2021 International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS) at SuperComputing 2021. Pending publication in IEEE Computer Society Technical Consortium on High Performance Computing (TCHPC)

    ACM Class: D.1.3; C.4; G.3; I.6.3

  9. Improving Novelty Detection using the Reconstructions of Nearest Neighbours

    Authors: Michael Mesarcik, Elena Ranguelova, Albert-Jan Boonstra, Rob V. van Nieuwpoort

    Abstract: We show that using nearest neighbours in the latent space of autoencoders (AE) significantly improves performance of semi-supervised novelty detection in both single and multi-class contexts. Autoencoding methods detect novelty by learning to differentiate between the non-novel training class(es) and all other unseen classes. Our method harnesses a combination of the reconstructions of the nearest… ▽ More

    Submitted 28 January, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

  10. arXiv:2009.04755  [pdf, other

    cs.DC

    Rocket: Efficient and Scalable All-Pairs Computations on Heterogeneous Platforms

    Authors: Stijn Heldens, Pieter Hijma, Ben van Werkhoven, Jason Maassen, Henri Bal, Rob van Nieuwpoort

    Abstract: All-pairs compute problems apply a user-defined function to each combination of two items of a given data set. Although these problems present an abundance of parallelism, data reuse must be exploited to achieve good performance. Several researchers considered this problem, either resorting to partial replication with static work distribution or dynamic scheduling with full replication. In contras… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

  11. arXiv:2001.03389  [pdf, other

    astro-ph.IM cs.DC

    Real-Time RFI Mitigation for the Apertif Radio Transient System

    Authors: Alessio Sclocco, Dany Vohl, Rob V. van Nieuwpoort

    Abstract: Current and upcoming radio telescopes are being designed with increasing sensitivity to detect new and mysterious radio sources of astrophysical origin. While this increased sensitivity improves the likelihood of discoveries, it also makes these instruments more susceptible to the deleterious effects of Radio Frequency Interference (RFI). The challenge posed by RFI is exacerbated by the high data-… ▽ More

    Submitted 16 January, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

    Comments: 6 pages, 10 figures. To appear in Proceedings from the 2019 Radio Frequency Interference workshop (RFI 2019), Toulouse, France (23-26 September)

  12. arXiv:1601.05052  [pdf, other

    cs.DC astro-ph.IM

    Auto-Tuning Dedispersion for Many-Core Accelerators

    Authors: Alessio Sclocco, Henri E. Bal, Jason Hessels, Joeri van Leeuwen, Rob V. van Nieuwpoort

    Abstract: In this paper, we study the parallelization of the dedispersion algorithm on many-core accelerators, including GPUs from AMD and NVIDIA, and the Intel Xeon Phi. An important contribution is the computational analysis of the algorithm, from which we conclude that dedispersion is inherently memory-bound in any realistic scenario, in contrast to earlier reports. We also provide empirical proof that,… ▽ More

    Submitted 18 January, 2016; originally announced January 2016.

    Comments: 10 pages, published in the proceedings of IPDPS 2014