Skip to main content

Showing 1–4 of 4 results for author: Bountris, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.13705  [pdf, other

    cs.DC

    Exploring the Potential of Carbon-Aware Execution for Scientific Workflows

    Authors: Kathleen West, Fabian Lehmann, Vasilis Bountris, Ulf Leser, Yehia Elkhatib, Lauritz Thamsen

    Abstract: Scientific workflows are widely used to automate scientific data analysis and often involve processing large quantities of data on compute clusters. As such, their execution tends to be long-running and resource intensive, leading to significant energy consumption and carbon emissions. Meanwhile, a wealth of carbon-aware computing methods have been proposed, yet little work has focused specifica… ▽ More

    Submitted 19 March, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: To appear in the Proceedings of the 25th IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid). Updated with reformatted CC BY footer

  2. arXiv:2408.00411  [pdf, other

    cs.DC

    Low-level I/O Monitoring for Scientific Workflows

    Authors: Joel Witzke, Ansgar Lößer, Vasilis Bountris, Florian Schintke, Björn Scheuermann

    Abstract: While detailed resource usage monitoring is possible on the low-level using proper tools, associating such usage with higher-level abstractions in the application layer that actually cause the resource usage in the first place presents a number of challenges. Suppose a large-scale scientific data analysis workflow is run using a distributed execution environment such as a compute cluster or cloud… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 10 pages, 5 figures, code available under https://github.com/CRC-FONDA/workflow-monitoring

  3. Ponder: Online Prediction of Task Memory Requirements for Scientific Workflows

    Authors: Fabian Lehmann, Jonathan Bader, Ninon De Mecquenem, Xing Wang, Vasilis Bountris, Florian Friederici, Ulf Leser, Lauritz Thamsen

    Abstract: Scientific workflows are used to analyze large amounts of data. These workflows comprise numerous tasks, many of which are executed repeatedly, running the same custom program on different inputs. Users specify resource allocations for each task, which must be sufficient for all inputs to prevent task failures. As a result, task memory allocations tend to be overly conservative, wasting precious c… ▽ More

    Submitted 10 October, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

    Journal ref: 2024 IEEE 20th International Conference on e-Science (e-Science)

  4. Large Language Models to the Rescue: Reducing the Complexity in Scientific Workflow Development Using ChatGPT

    Authors: Mario Sänger, Ninon De Mecquenem, Katarzyna Ewa Lewińska, Vasilis Bountris, Fabian Lehmann, Ulf Leser, Thomas Kosch

    Abstract: Scientific workflow systems are increasingly popular for expressing and executing complex data analysis pipelines over large datasets, as they offer reproducibility, dependability, and scalability of analyses by automatic parallelization on large compute clusters. However, implementing workflows is difficult due to the involvement of many black-box tools and the deep infrastructure stack necessary… ▽ More

    Submitted 6 November, 2023; v1 submitted 3 November, 2023; originally announced November 2023.

    Journal ref: Sänger et. al: A qualitative assessment of using ChatGPT as large language model for scientific workflow development, GigaScience, Volume 13, 2024, giae030