Skip to main content

Showing 1–14 of 14 results for author: Malawski, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.12611  [pdf, ps, other

    cs.DC

    Accelerating Cloud-Based Transcriptomics: Performance Analysis and Optimization of the STAR Aligner Workflow

    Authors: Piotr Kica, Sabina Lichołai, Michał Orzechowski, Maciej Malawski

    Abstract: In this work, we explore the Transcriptomics Atlas pipeline adapted for cost-efficient and high-throughput computing in the cloud. We propose a scalable, cloud-native architecture designed for running a resource-intensive aligner -- STAR -- and processing tens or hundreds of terabytes of RNA-sequencing data. We implement multiple optimization techniques that give significant execution time and cos… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: Accepted at ICCS2025

  2. arXiv:2504.05078  [pdf, other

    cs.DC

    Serverless Approach to Running Resource-Intensive STAR Aligner

    Authors: Piotr Kica, Michał Orzechowski, Maciej Malawski

    Abstract: The application of serverless computing for alignment of RNA-sequences can improve many existing bioinformatics workflows by reducing operational costs and execution times. This work analyzes the applicability of serverless services for running the STAR aligner, which is known for its accuracy and large memory requirement. This presents a challenge, as serverless services were designed for light a… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: Accepted at CCGrid2025 conference in the poster format

  3. arXiv:2411.09618  [pdf, other

    physics.med-ph cs.LG

    MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI

    Authors: Nancy R. Newlin, Kurt Schilling, Serge Koudoro, Bramsh Qamar Chandio, Praitayini Kanakaraj, Daniel Moyer, Claire E. Kelly, Sila Genc, Jian Chen, Joseph Yuan-Mou Yang, Ye Wu, Yifei He, Jiawei Zhang, Qingrun Zeng, Fan Zhang, Nagesh Adluru, Vishwesh Nath, Sudhir Pathak, Walter Schneider, Anurag Gade, Yogesh Rathi, Tom Hendriks, Anna Vilanova, Maxime Chamberland, Tomasz Pieciak , et al. (11 additional authors not shown)

    Abstract: White matter alterations are increasingly implicated in neurological diseases and their progression. International-scale studies use diffusion-weighted magnetic resonance imaging (DW-MRI) to qualitatively identify changes in white matter microstructure and connectivity. Yet, quantitative analysis of DW-MRI data is hindered by inconsistencies stemming from varying acquisition protocols. There is a… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2024/019

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)

  4. arXiv:2409.05886  [pdf, other

    cs.DC

    Optimizing STAR Aligner for High Throughput Computing in the Cloud

    Authors: Piotr Kica, Sabina Lichołai, Michał Orzechowski, Maciej Malawski

    Abstract: We propose a scalable, cloud-native architecture designed for Transcriptomics Atlas Pipeline, using a resource-intensive STAR aligner and processing tens or hundreds of terabytes of RNA-seq data. We implement the pipeline using AWS cloud services, introduce performance optimizations and perform experimental evaluation in the cloud. Our optimization techniques result in computational savings thanks… ▽ More

    Submitted 26 August, 2024; originally announced September 2024.

    Comments: Accepted at Cluster2024 conference in the poster format

  5. Serverless Computing for Scientific Applications

    Authors: Maciej Malawski, Bartosz Balis

    Abstract: Serverless computing has become an important model in cloud computing and influenced the design of many applications. Here, we provide our perspective on how the recent landscape of serverless computing for scientific applications looks like. We discuss the advantages and problems with serverless computing for scientific applications, and based on the analysis of existing solutions and approaches,… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Journal ref: in IEEE Internet Computing, vol. 26, no. 4, pp. 53-58, 1 July-Aug. 2022

  6. arXiv:2304.08190  [pdf, other

    cs.DC

    Serverless Approach to Sensitivity Analysis of Computational Models

    Authors: Piotr Kica, Magdalena Otta, Krzysztof Czechowicz, Karol Zając, Piotr Nowakowski, Andrew Narracott, Ian Halliday, Maciej Malawski

    Abstract: Digital twins are virtual representations of physical objects or systems used for the purpose of analysis, most often via computer simulations, in many engineering and scientific disciplines. Recently, this approach has been introduced to computational medicine, within the concept of Digital Twin in Healthcare (DTH). Such research requires verification and validation of its models, as well as the… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted at CCGrid2023 conference

  7. arXiv:2302.03616  [pdf

    cs.LG cs.AI eess.SP

    Can gamification reduce the burden of self-reporting in mHealth applications? A feasibility study using machine learning from smartwatch data to estimate cognitive load

    Authors: Michal K. Grzeszczyk, Paulina Adamczyk, Sylwia Marek, Ryszard Pręcikowski, Maciej Kuś, M. Patrycja Lelujko, Rosmary Blanco, Tomasz Trzciński, Arkadiusz Sitek, Maciej Malawski, Aneta Lisowska

    Abstract: The effectiveness of digital treatments can be measured by requiring patients to self-report their state through applications, however, it can be overwhelming and causes disengagement. We conduct a study to explore the impact of gamification on self-reporting. Our approach involves the creation of a system to assess cognitive load (CL) through the analysis of photoplethysmography (PPG) signals. Th… ▽ More

    Submitted 21 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: Accepted for AMIA 2023

  8. arXiv:2211.00717  [pdf, other

    cs.DC

    Using Unused: Non-Invasive Dynamic FaaS Infrastructure with HPC-Whisk

    Authors: Bartłomiej Przybylski, Maciej Pawlik, Paweł Żuk, Bartłomiej Łagosz, Maciej Malawski, Krzysztof Rzadca

    Abstract: Modern HPC workload managers and their careful tuning contribute to the high utilization of HPC clusters. However, due to inevitable uncertainty it is impossible to completely avoid node idleness. Although such idle slots are usually too short for any HPC job, they are too long to ignore them. Function-as-a-Service (FaaS) paradigm promisingly fills this gap, and can be a good match, as typical Faa… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  9. A Serverless Engine for High Energy Physics Distributed Analysis

    Authors: Jacek Kuśnierz, Vincenzo Eduardo Padulano, Maciej Malawski, Kamil Burkiewicz, Enric Tejedor Saavedra, Pedro Alonso-Jordá, Michael Pitt, Valentina Avati

    Abstract: The Large Hadron Collider (LHC) at CERN has generated in the last decade an unprecedented volume of data for the High-Energy Physics (HEP) field. Scientific collaborations interested in analysing such data very often require computing power beyond a single machine. This issue has been tackled traditionally by running analyses in distributed environments using stateful, managed batch computing syst… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

    Comments: 10 pages, CCGRID 2022

  10. CXR-FL: Deep Learning-Based Chest X-ray Image Analysis Using Federated Learning

    Authors: Filip Ślazyk, Przemysław Jabłecki, Aneta Lisowska, Maciej Malawski, Szymon Płotka

    Abstract: Federated learning enables building a shared model from multicentre data while storing the training data locally for privacy. In this paper, we present an evaluation (called CXR-FL) of deep learning-based models for chest X-ray image analysis using the federated learning method. We examine the impact of federated learning parameters on the performance of central models. Additionally, we show that… ▽ More

    Submitted 8 August, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at International Conference on Computational Science (ICCS) 2022, London

  11. A Community Roadmap for Scientific Workflows Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Ilkay Altintas, Rosa M Badia, Bartosz Balis, Tainã Coleman, Frederik Coppens, Frank Di Natale, Bjoern Enders, Thomas Fahringer, Rosa Filgueira, Grigori Fursin, Daniel Garijo, Carole Goble, Dorran Howell, Shantenu Jha, Daniel S. Katz, Daniel Laney, Ulf Leser, Maciej Malawski, Kshitij Mehta, Loïc Pottier, Jonathan Ozik, J. Luc Peterson , et al. (4 additional authors not shown)

    Abstract: The landscape of workflow systems for scientific applications is notoriously convoluted with hundreds of seemingly equivalent workflow systems, many isolated research claims, and a steep learning curve. To address some of these challenges and lay the groundwork for transforming workflows research and development, the WorkflowsRI and ExaWorks projects partnered to bring the international workflows… ▽ More

    Submitted 8 October, 2021; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2103.09181

  12. Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Tainã Coleman, Dan Laney, Dong Ahn, Shantenu Jha, Dorran Howell, Stian Soiland-Reys, Ilkay Altintas, Douglas Thain, Rosa Filgueira, Yadu Babuji, Rosa M. Badia, Bartosz Balis, Silvina Caino-Lores, Scott Callaghan, Frederik Coppens, Michael R. Crusoe, Kaushik De, Frank Di Natale, Tu M. A. Do, Bjoern Enders, Thomas Fahringer, Anne Fouilloux , et al. (33 additional authors not shown)

    Abstract: Scientific workflows are a cornerstone of modern scientific computing, and they have underpinned some of the most significant discoveries of the last decade. Many of these workflows have high computational, storage, and/or communication demands, and thus must execute on a wide range of large-scale platforms, from large clouds to upcoming exascale HPC platforms. Workflows will play a crucial role i… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

  13. arXiv:2010.11320  [pdf, other

    cs.DC

    Serverless Containers -- rising viable approach to Scientific Workflows

    Authors: Krzysztof Burkat, Maciej Pawlik, Bartosz Balis, Maciej Malawski, Karan Vahi, Mats Rynge, Rafael Ferreira da Silva, Ewa Deelman

    Abstract: Increasing popularity of the serverless computing approach has led to the emergence of new cloud infrastructures working in Container-as-a-Service (CaaS) model like AWS Fargate, Google Cloud Run, or Azure Container Instances. They introduce an innovative approach to running cloud containers where developers are freed from managing underlying resources. In this paper, we focus on evaluating capabil… ▽ More

    Submitted 21 October, 2020; originally announced October 2020.

  14. arXiv:1909.03555  [pdf, other

    cs.DC

    Performance considerations on execution of large scale workflow applications on cloud functions

    Authors: Maciej Pawlik, Kamil Figiela, Maciej Malawski

    Abstract: Function-as-a-Service is a novel type of cloud service used for creating distributed applications and utilizing computing resources. Application developer supplies source code of cloud functions, which are small applications or application components, while the service provider is responsible for provisioning the infrastructure, scaling and exposing a REST style API. This environment seems to be a… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.