Skip to main content

Showing 1–38 of 38 results for author: Luckow, A

.
  1. arXiv:2505.03399  [pdf, other

    quant-ph

    Typical Machine Learning Datasets as Low-Depth Quantum Circuits

    Authors: Florian J. Kiwit, Bernhard Jobst, Andre Luckow, Frank Pollmann, Carlos A. Riofrío

    Abstract: Quantum machine learning (QML) is an emerging field that investigates the capabilities of quantum computers for learning tasks. While QML models can theoretically offer advantages such as exponential speed-ups, challenges in data loading and the ability to scale to relevant problem sizes have prevented demonstrations of such advantages on practical problems. In particular, the encoding of arbitrar… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  2. Exploring Visual Prompts: Refining Images with Scribbles and Annotations in Generative AI Image Tools

    Authors: Hyerim Park, Malin Eiband, Andre Luckow, Michael Sedlmair

    Abstract: Generative AI (GenAI) tools are increasingly integrated into design workflows. While text prompts remain the primary input method for GenAI image tools, designers often struggle to craft effective ones. Moreover, research has primarily focused on input methods for ideation, with limited attention to refinement tasks. This study explores designers' preferences for three input methods - text prompts… ▽ More

    Submitted 6 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  3. arXiv:2412.18519  [pdf, other

    quant-ph cs.DC cs.ET

    Pilot-Quantum: A Quantum-HPC Middleware for Resource, Workload and Task Management

    Authors: Pradeep Mantha, Florian J. Kiwit, Nishant Saurabh, Shantenu Jha, Andre Luckow

    Abstract: As quantum hardware advances, integrating quantum processing units (QPUs) into HPC environments and managing diverse infrastructure and software stacks becomes increasingly essential. Pilot-Quantum addresses these challenges as a middleware designed to provide unified application-level management of resources and workloads across hybrid quantum-classical environments. It is built on a rigorous ana… ▽ More

    Submitted 28 May, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

  4. arXiv:2409.14183  [pdf, other

    quant-ph cs.ET

    Quantum Computing for Automotive Applications

    Authors: Carlos A. Riofrío, Johannes Klepsch, Jernej Rudi Finžgar, Florian Kiwit, Leonhard Hölscher, Marvin Erdmann, Lukas Müller, Chandan Kumar, Youssef Achari Berrada, Andre Luckow

    Abstract: Quantum computing could impact various industries, with the automotive industry with many computational challenges, from optimizing supply chains and manufacturing to vehicle engineering, being particularly promising. This chapter investigates state-of-the-art quantum algorithms to enhance efficiency, accuracy, and scalability across the automotive value chain. We explore recent advances in quantu… ▽ More

    Submitted 24 March, 2025; v1 submitted 21 September, 2024; originally announced September 2024.

  5. arXiv:2408.02587  [pdf, other

    quant-ph

    Assessing the Requirements for Industry Relevant Quantum Computation

    Authors: Anna M. Krol, Marvin Erdmann, Ewan Munro, Andre Luckow, Zaid Al-Ars

    Abstract: In this paper, we use open-source tools to perform quantum resource estimation to assess the requirements for industry-relevant quantum computation. Our analysis uses the problem of industrial shift scheduling in manufacturing and the Quantum Industrial Shift Scheduling algorithm. We base our figures of merit on current technology, as well as theoretical high-fidelity scenarios for superconducting… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  6. arXiv:2406.17823  [pdf, other

    physics.flu-dyn physics.comp-ph quant-ph

    Quantum-Inspired Fluid Simulation of 2D Turbulence with GPU Acceleration

    Authors: Leonhard Hölscher, Pooja Rao, Lukas Müller, Johannes Klepsch, Andre Luckow, Tobias Stollenwerk, Frank K. Wilhelm

    Abstract: Tensor network algorithms can efficiently simulate complex quantum many-body systems by utilizing knowledge of their structure and entanglement. These methodologies have been adapted recently for solving the Navier-Stokes equations, which describe a spectrum of fluid phenomena, from the aerodynamics of vehicles to weather patterns. Within this quantum-inspired paradigm, velocity is encoded as matr… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  7. Quantum Mini-Apps: A Framework for Developing and Benchmarking Quantum-HPC Applications

    Authors: Nishant Saurabh, Pradeep Mantha, Florian J. Kiwit, Shantenu Jha, Andre Luckow

    Abstract: With the increasing maturity and scale of quantum hardware and its integration into HPC systems, there is a need to develop robust techniques for developing, characterizing, and benchmarking quantum-HPC applications and middleware systems. This requires a better understanding of interaction, coupling, and common execution patterns between quantum and classical workload tasks and components. This p… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 9 pages, 4 figures

  8. arXiv:2404.15153  [pdf, other

    cs.CL cs.AI cs.PF

    Performance Characterization of Expert Router for Scalable LLM Inference

    Authors: Josef Pichlmeier, Philipp Ross, Andre Luckow

    Abstract: Large Language Models (LLMs) have experienced widespread adoption across scientific and industrial domains due to their versatility and utility for diverse tasks. Nevertheless, deploying and serving these models at scale with optimal throughput and latency remains a significant challenge, primarily because of LLMs' high computational and memory demands. Specialized models optimized for specific ta… ▽ More

    Submitted 8 October, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  9. arXiv:2404.12433  [pdf, other

    quant-ph

    Towards Application-Aware Quantum Circuit Compilation

    Authors: Nils Quetschlich, Florian J. Kiwit, Maximilian A. Wolf, Carlos A. Riofrio, Lukas Burgholzer, Andre Luckow, Robert Wille

    Abstract: Quantum computing has made tremendous improvements in both software and hardware that have sparked interest in academia and industry to realize quantum computing applications. To this end, several steps are necessary: The underlying problem must be encoded in a quantum circuit, a suitable device must be selected to execute it, and it must be compiled accordingly. This compilation step has a signif… ▽ More

    Submitted 9 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, minor changes, to be published at IEEE International Conference on Quantum Software (QSW), 2024

  10. Benchmarking Quantum Generative Learning: A Study on Scalability and Noise Resilience using QUARK

    Authors: Florian J. Kiwit, Maximilian A. Wolf, Marwa Marso, Philipp Ross, Jeanette M. Lorenz, Carlos A. Riofrío, Andre Luckow

    Abstract: Quantum computing promises a disruptive impact on machine learning algorithms, taking advantage of the exponentially large Hilbert space available. However, it is not clear how to scale quantum machine learning (QML) to industrial-level applications. This paper investigates the scalability and noise resilience of quantum generative learning applications. We consider the training performance in the… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  11. arXiv:2401.07763  [pdf, other

    quant-ph

    QISS: Quantum Industrial Shift Scheduling Algorithm

    Authors: Anna M. Krol, Marvin Erdmann, Rajesh Mishra, Phattharaporn Singkanipa, Ewan Munro, Marcin Ziolkowski, Andre Luckow, Zaid Al-Ars

    Abstract: In this paper, we show the design and implementation of a quantum algorithm for industrial shift scheduling (QISS), which uses Grover's adaptive search to tackle a common and important class of valuable, real-world combinatorial optimization problems. We give an explicit circuit construction of the Grover's oracle, incorporating the multiple constraints present in the problem, and detail the corre… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  12. arXiv:2312.09733  [pdf, other

    quant-ph cond-mat.mtrl-sci

    Quantum-centric Supercomputing for Materials Science: A Perspective on Challenges and Future Directions

    Authors: Yuri Alexeev, Maximilian Amsler, Paul Baity, Marco Antonio Barroca, Sanzio Bassini, Torey Battelle, Daan Camps, David Casanova, Young Jai Choi, Frederic T. Chong, Charles Chung, Chris Codella, Antonio D. Corcoles, James Cruise, Alberto Di Meglio, Jonathan Dubois, Ivan Duran, Thomas Eckl, Sophia Economou, Stephan Eidenbenz, Bruce Elmegreen, Clyde Fare, Ismael Faro, Cristina Sanz Fernández, Rodrigo Neumann Barros Ferreira , et al. (102 additional authors not shown)

    Abstract: Computational models are an essential tool for the design, characterization, and discovery of novel materials. Hard computational tasks in materials science stretch the limits of existing high-performance supercomputing centers, consuming much of their simulation, analysis, and data resources. Quantum computing, on the other hand, is an emerging technology with the potential to accelerate many of… ▽ More

    Submitted 19 September, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 65 pages, 15 figures; comments welcome

    Journal ref: Future Generation Computer Systems, Volume 160, November 2024, Pages 666-710

  13. arXiv:2308.06608  [pdf, other

    quant-ph cs.DC

    A Conceptual Architecture for a Quantum-HPC Middleware

    Authors: Nishant Saurabh, Shantenu Jha, Andre Luckow

    Abstract: Quantum computing promises potential for science and industry by solving certain computationally complex problems faster than classical computers. Quantum computing systems evolved from monolithic systems towards modular architectures comprising multiple quantum processing units (QPUs) coupled to classical computing nodes (HPC). With the increasing scale, middleware systems that facilitate the eff… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: 12 pages, 3 figures

    ACM Class: D.m

  14. arXiv:2308.04082  [pdf, other

    quant-ph cs.DC cs.LG

    Application-Oriented Benchmarking of Quantum Generative Learning Using QUARK

    Authors: Florian J. Kiwit, Marwa Marso, Philipp Ross, Carlos A. Riofrío, Johannes Klepsch, Andre Luckow

    Abstract: Benchmarking of quantum machine learning (QML) algorithms is challenging due to the complexity and variability of QML systems, e.g., regarding model ansatzes, data sets, training techniques, and hyper-parameters selection. The QUantum computing Application benchmaRK (QUARK) framework simplifies and standardizes benchmarking studies for quantum computing applications. Here, we propose several exten… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: 10 pages, 10 figures

    MSC Class: 81-04 ACM Class: C.4

  15. Workflows Community Summit 2022: A Roadmap Revolution

    Authors: Rafael Ferreira da Silva, Rosa M. Badia, Venkat Bala, Debbie Bard, Peer-Timo Bremer, Ian Buckley, Silvina Caino-Lores, Kyle Chard, Carole Goble, Shantenu Jha, Daniel S. Katz, Daniel Laney, Manish Parashar, Frederic Suter, Nick Tyler, Thomas Uram, Ilkay Altintas, Stefan Andersson, William Arndt, Juan Aznar, Jonathan Bader, Bartosz Balis, Chris Blanton, Kelly Rosa Braghetto, Aharon Brodutch , et al. (80 additional authors not shown)

    Abstract: Scientific workflows have become integral tools in broad scientific computing use cases. Science discovery is increasingly dependent on workflows to orchestrate large and complex scientific experiments that range from execution of a cloud-based data preprocessing pipeline to multi-facility instrument-to-edge-to-HPC computational workflows. Given the changing landscape of scientific computing and t… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Report number: ORNL/TM-2023/2885

  16. A performance characterization of quantum generative models

    Authors: Carlos A. Riofrío, Oliver Mitevski, Caitlin Jones, Florian Krellner, Aleksandar Vučković, Joseph Doetsch, Johannes Klepsch, Thomas Ehmer, Andre Luckow

    Abstract: Quantum generative modeling is a growing area of interest for industry-relevant applications. With the field still in its infancy, there are many competing techniques. This work is an attempt to systematically compare a broad range of these techniques to guide quantum computing practitioners when deciding which models and techniques to use in their applications. We compare fundamentally different… ▽ More

    Submitted 26 March, 2024; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Revised version: Small corrections to figures, additional references, and link to open-source code associated with the project. This version is in line with the version accepted for publication

    Report number: Article No.: 12, Pages 1 - 34

    Journal ref: ACM Transactions on Quantum Computing, Volume 5, Issue 2 (2024)

  17. Quantum Computing Techniques for Multi-Knapsack Problems

    Authors: Abhishek Awasthi, Francesco Bär, Joseph Doetsch, Hans Ehm, Marvin Erdmann, Maximilian Hess, Johannes Klepsch, Peter A. Limacher, Andre Luckow, Christoph Niedermeier, Lilly Palackal, Ruben Pfeiffer, Philipp Ross, Hila Safi, Janik Schönmeier-Kromer, Oliver von Sicard, Yannick Wenger, Karen Wintersperger, Sheir Yarkoni

    Abstract: Optimization problems are ubiquitous in various industrial settings, and multi-knapsack optimization is one recurrent task faced daily by several industries. The advent of quantum computing has opened a new paradigm for computationally intensive tasks, with promises of delivering better and faster solutions for specific classes of problems. This work presents a comprehensive study of quantum compu… ▽ More

    Submitted 28 September, 2023; v1 submitted 13 January, 2023; originally announced January 2023.

    Comments: 20 pages

    Journal ref: Arai, K. (eds) Intelligent Computing. SAI 2023. Lecture Notes in Networks and Systems, vol 739

  18. Exploring privacy-enhancing technologies in the automotive value chain

    Authors: Gonzalo Munilla Garrido, Kaja Schmidt, Christopher Harth-Kitzerow, Johannes Klepsch, Andre Luckow, Florian Matthes

    Abstract: Privacy-enhancing technologies (PETs) are becoming increasingly crucial for addressing customer needs, security, privacy (e.g., enhancing anonymity and confidentiality), and regulatory requirements. However, applying PETs in organizations requires a precise understanding of use cases, technologies, and limitations. This paper investigates several industrial use cases, their characteristics, and th… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Journal ref: 2021 IEEE International Conference on Big Data (Big Data)

  19. arXiv:2206.03651  [pdf, other

    quant-ph cond-mat.dis-nn cs.NE cs.RO math.OC

    Optimization of Robot Trajectory Planning with Nature-Inspired and Hybrid Quantum Algorithms

    Authors: Martin J. A. Schuetz, J. Kyle Brubaker, Henry Montagu, Yannick van Dijk, Johannes Klepsch, Philipp Ross, Andre Luckow, Mauricio G. C. Resende, Helmut G. Katzgraber

    Abstract: We solve robot trajectory planning problems at industry-relevant scales. Our end-to-end solution integrates highly versatile random-key algorithms with model stacking and ensemble techniques, as well as path relinking for solution refinement. The core optimization module consists of a biased random-key genetic algorithm. Through a distinct separation of problem-independent and problem-dependent mo… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: 17 pages, 6 figures

    Journal ref: Phys. Rev. Applied 18, 054045 (2022)

  20. arXiv:2203.12646  [pdf, other

    cs.CR

    CRGC -- A Practical Framework for Constructing Reusable Garbled Circuits

    Authors: Christopher Harth-Kitzerow, Georg Carle, Fan Fei, Andre Luckow, Johannes Klepsch

    Abstract: In this work, we introduce two schemes to construct reusable garbled circuits (RGCs) in the semi-honest setting. Our completely reusable garbled circuit (CRGC) scheme allows the generator (party A) to construct and send an obfuscated boolean circuit along with an encoded input to the evaluator (party B). In contrast to Yao's Garbled Circuit protocol, B can securely evaluate the same CRGC with an a… ▽ More

    Submitted 6 May, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 13 pages, 7 figures

  21. QUARK: A Framework for Quantum Computing Application Benchmarking

    Authors: Jernej Rudi Finžgar, Philipp Ross, Leonhard Hölscher, Johannes Klepsch, Andre Luckow

    Abstract: Quantum computing (QC) is anticipated to provide a speedup over classical HPC approaches for specific problems in optimization, simulation, and machine learning. With the advances in quantum computing toward practical applications, the need to analyze and compare different quantum solutions increases. While different low-level benchmarks for QC exist, these benchmarks do not provide sufficient ins… ▽ More

    Submitted 5 August, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Improved version, as submitted to IEEE QCE22 conference

    Journal ref: IEEE QCE2022

  22. arXiv:2107.11905  [pdf, other

    cs.CR

    Revealing the Landscape of Privacy-Enhancing Technologies in the Context of Data Markets for the IoT: A Systematic Literature Review

    Authors: Gonzalo Munilla Garrido, Johannes Sedlmeir, Ömer Uludağ, Ilias Soto Alaoui, Andre Luckow, Florian Matthes

    Abstract: IoT data markets in public and private institutions have become increasingly relevant in recent years because of their potential to improve data availability and unlock new business models. However, exchanging data in markets bears considerable challenges related to disclosing sensitive information. Despite considerable research focused on different aspects of privacy-enhancing data markets for th… ▽ More

    Submitted 12 July, 2022; v1 submitted 25 July, 2021; originally announced July 2021.

    Comments: 49 pages, 17 figures, 11 tables

  23. arXiv:2104.03374  [pdf, other

    cs.DC

    Pilot-Edge: Distributed Resource Management Along the Edge-to-Cloud Continuum

    Authors: Andre Luckow, Kartik Rattan, Shantenu Jha

    Abstract: Many science and industry IoT applications necessitate data processing across the edge-to-cloud continuum to meet performance, security, cost, and privacy requirements. However, diverse abstractions and infrastructures for managing resources and tasks across the edge-to-cloud scenario are required. We propose Pilot-Edge as a common abstraction for resource management across the edge-to-cloud conti… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: 5 pages, 3 figures

    ACM Class: C.4; C.2.4

  24. arXiv:2104.03368  [pdf, other

    cs.DC

    Exploring Task Placement for Edge-to-Cloud Applications using Emulation

    Authors: Andre Luckow, Kartik Rattan, Shantenu Jha

    Abstract: A vast and growing number of IoT applications connect physical devices, such as scientific instruments, technical equipment, machines, and cameras, across heterogenous infrastructure from the edge to the cloud to provide responsive, intelligent services while complying with privacy and security requirements. However, the integration of heterogeneous IoT, edge, and cloud technologies and the design… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: 5 pages, 2 figures

    ACM Class: C.4

  25. Quantum Computing: Towards Industry Reference Problems

    Authors: Andre Luckow, Johannes Klepsch, Josef Pichlmeier

    Abstract: The complexity is increasing rapidly in many areas of the automotive industry. The design of an automobile involves many different engineering disciplines, e. g., mechanical, electrical, and software engineering. The software of a vehicle comprises millions of lines of code. Further, the manufacturing, logistics, distribution, and sales of a vehicle are highly complex. There is an immense need for… ▽ More

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: This is a pre-print of an article published in DIGITALE WELT Volume 5, issue 2. The final authenticated version is available online at: https://doi.org/10.1007/s42354-021-0335-7

    Journal ref: Digitale Welt 5, 38-45 (2021)

  26. arXiv:2102.07731  [pdf, other

    cs.PF cs.CR cs.DB cs.DC

    An In-Depth Investigation of the Performance Characteristics of Hyperledger Fabric

    Authors: Tobias Guggenberger, Johannes Sedlmeir, Gilbert Fridgen, André Luckow

    Abstract: Private permissioned blockchains are deployed in ever greater numbers to facilitate cross-organizational processes in various industries, particularly in supply chain management. One popular example of this trend is Hyperledger Fabric. Compared to public permissionless blockchains, it promises improved performance and provides certain features that address key requirements of enterprises. However,… ▽ More

    Submitted 3 January, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    ACM Class: H.4; J.1

    Journal ref: Computers & Industrial Engineering (2022), Volume 173, 108716

  27. arXiv:2002.09009  [pdf, other

    cs.DC cs.SE

    Methods and Experiences for Developing Abstractions for Data-intensive, Scientific Applications

    Authors: Andre Luckow, Shantenu Jha

    Abstract: Developing software for scientific applications that require the integration of diverse types of computing, instruments, and data present challenges that are distinct from commercial software. These applications require scale, and the need to integrate various programming and computational models with evolving and heterogeneous infrastructure. Pervasive and effective abstractions for distributed i… ▽ More

    Submitted 25 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: 10 pages, 5 figures

  28. arXiv:1909.06055  [pdf, other

    cs.DC

    Performance Characterization and Modeling of Serverless and HPC Streaming Applications

    Authors: Andre Luckow, Shantenu Jha

    Abstract: Experiment-in-the-Loop Computing (EILC) requires support for numerous types of processing and the management of heterogeneous infrastructure over a dynamic range of scales: from the edge to the cloud and HPC, and intermediate resources. Serverless is an emerging service that combines high-level middleware services, such as distributed execution engines for managing tasks, with low-level infrastruc… ▽ More

    Submitted 13 September, 2019; originally announced September 2019.

  29. arXiv:1801.08648  [pdf, other

    cs.DC

    Pilot-Streaming: A Stream Processing Framework for High-Performance Computing

    Authors: Andre Luckow, George Chantzialexiou, Shantenu Jha

    Abstract: An increasing number of scientific applications rely on stream processing for generating timely insights from data feeds of scientific instruments, simulations, and Internet-of-Thing (IoT) sensors. The development of streaming applications is a complex task and requires the integration of heterogeneous, distributed infrastructure, frameworks, middleware and application components. Different applic… ▽ More

    Submitted 11 November, 2018; v1 submitted 25 January, 2018; originally announced January 2018.

    Comments: 12 pages

  30. arXiv:1801.07630  [pdf, other

    cs.DC

    Task-parallel Analysis of Molecular Dynamics Trajectories

    Authors: Ioannis Paraskevakos, Andre Luckow, Mahzad Khoshlessan, George Chantzialexiou, Thomas E. Cheatham, Oliver Beckstein, Geoffrey C. Fox, Shantenu Jha

    Abstract: Different parallel frameworks for implementing data analysis applications have been proposed by the HPC and Big Data communities. In this paper, we investigate three task-parallel frameworks: Spark, Dask and RADICAL-Pilot with respect to their ability to support data analytics on HPC resources and compare them with MPI. We investigate the data analysis requirements of Molecular Dynamics (MD) simul… ▽ More

    Submitted 10 June, 2018; v1 submitted 23 January, 2018; originally announced January 2018.

  31. Deep Learning in the Automotive Industry: Applications and Tools

    Authors: Andre Luckow, Matthew Cook, Nathan Ashcraft, Edwin Weill, Emil Djerekarov, Bennie Vorster

    Abstract: Deep Learning refers to a set of machine learning techniques that utilize neural networks with many hidden layers for tasks, such as image classification, speech recognition, language understanding. Deep learning has been proven to be very effective in these domains and is pervasively used by many Internet services. In this paper, we describe different automotive uses cases for deep learning in pa… ▽ More

    Submitted 30 April, 2017; originally announced May 2017.

    Comments: 10 pages

  32. arXiv:1611.05487  [pdf, ps, other

    stat.ML cs.DS cs.LG stat.CO

    Algebraic multigrid support vector machines

    Authors: Ehsan Sadrfaridpour, Sandeep Jeereddy, Ken Kennedy, Andre Luckow, Talayeh Razzaghi, Ilya Safro

    Abstract: The support vector machine is a flexible optimization-based technique widely used for classification problems. In practice, its training part becomes computationally expensive on large-scale data sets because of such reasons as the complexity and number of iterations in parameter fitting methods, underlying optimization solvers, and nonlinearity of kernels. We introduce a fast multilevel framework… ▽ More

    Submitted 23 November, 2016; v1 submitted 16 November, 2016; originally announced November 2016.

  33. arXiv:1609.03647  [pdf, other

    cs.DC

    Introducing Distributed Dynamic Data-intensive (D3) Science: Understanding Applications and Infrastructure

    Authors: Shantenu Jha, Daniel S. Katz, Andre Luckow, Omer Rana, Yogesh Simmhan, Neil Chue Hong

    Abstract: A common feature across many science and engineering applications is the amount and diversity of data and computation that must be integrated to yield insights. Data sets are growing larger and becoming distributed; and their location, availability and properties are often time-dependent. Collectively, these characteristics give rise to dynamic distributed data-intensive applications. While "stati… ▽ More

    Submitted 12 September, 2016; originally announced September 2016.

    Comments: 38 pages, 2 figures

  34. arXiv:1602.00345  [pdf, other

    cs.DC

    Hadoop on HPC: Integrating Hadoop and Pilot-based Dynamic Resource Management

    Authors: Andre Luckow, Ioannis Paraskevakos, George Chantzialexiou, Shantenu Jha

    Abstract: High-performance computing platforms such as supercomputers have traditionally been designed to meet the compute demands of scientific applications. Consequently, they have been architected as producers and not consumers of data. The Apache Hadoop ecosystem has evolved to meet the requirements of data processing applications and has addressed many of the limitations of HPC platforms. There exist a… ▽ More

    Submitted 31 January, 2016; originally announced February 2016.

  35. arXiv:1501.05041  [pdf, other

    cs.DC

    Pilot-Abstraction: A Valid Abstraction for Data-Intensive Applications on HPC, Hadoop and Cloud Infrastructures?

    Authors: Andre Luckow, Pradeep Mantha, Shantenu Jha

    Abstract: HPC environments have traditionally been designed to meet the compute demand of scientific applications and data has only been a second order concern. With science moving toward data-driven discoveries relying more on correlations in data to form scientific hypotheses, the limitations of HPC approaches become apparent: Architectural paradigms such as the separation of storage and compute are not o… ▽ More

    Submitted 20 January, 2015; originally announced January 2015.

    Comments: Submitted to HPDC 2015, 12 pages, 9 figures

    ACM Class: C.1.4; C.2.4; D.1.3; D.2.12

  36. arXiv:1403.1528  [pdf, other

    cs.DC

    A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures

    Authors: Shantenu Jha, Judy Qiu, Andre Luckow, Pradeep Mantha, Geoffrey C. Fox

    Abstract: Scientific problems that depend on processing large amounts of data require overcoming challenges in multiple areas: managing large-scale data distribution, co-placement and scheduling of data with compute resources, and storing and transferring large volumes of data. We analyze the ecosystems of the two prominent paradigms for data-intensive applications, hereafter referred to as the high-perform… ▽ More

    Submitted 22 June, 2014; v1 submitted 6 March, 2014; originally announced March 2014.

    Comments: 8 pages, 2 figures

  37. arXiv:1301.6228  [pdf, other

    cs.DC

    Pilot-Data: An Abstraction for Distributed Data

    Authors: Andre Luckow, Mark Santcroos, Ashley Zebrowski, Shantenu Jha

    Abstract: Scientific problems that depend on processing large amounts of data require overcoming challenges in multiple areas: managing large-scale data distribution, controlling co-placement and scheduling of data with compute resources, and storing, transferring, and managing large volumes of data. Although there exist multiple approaches to addressing each of these challenges, an integrative approach is… ▽ More

    Submitted 18 November, 2013; v1 submitted 26 January, 2013; originally announced January 2013.

    ACM Class: C.2.4

  38. arXiv:1207.6644  [pdf, other

    cs.DC

    P*: A Model of Pilot-Abstractions

    Authors: Andre Luckow, Mark Santcroos, Ole Weidner, Andre Merzky, Pradeep Mantha, Shantenu Jha

    Abstract: Pilot-Jobs support effective distributed resource utilization, and are arguably one of the most widely-used distributed computing abstractions - as measured by the number and types of applications that use them, as well as the number of production distributed cyberinfrastructures that support them. In spite of broad uptake, there does not exist a well-defined, unifying conceptual model of Pilot-Jo… ▽ More

    Submitted 27 July, 2012; originally announced July 2012.

    Comments: 10 pages