Showing 1–5 of 5 results for author: Marroquín, R

  1. arXiv:2004.03488  [pdf, other]

    cs.DB

    Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms

    Authors: Dimitrios Koutsoukos, Ingo Müller, Renato Marroquín, Ana Klimovic, Gustavo Alonso

    Abstract: The enormous quantity of data produced every day together with advances in data analytics has led to a proliferation of data management and analysis systems. Typically, these systems are built around highly specialized monolithic operators optimized for the underlying hardware. While effective in the short term, such an approach makes the operators cumbersome to port and adapt, which is increasing…

    Submitted 29 September, 2021; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted at PVLDB vol. 14

  2. arXiv:2004.01908  [pdf, other]

    cs.DB cs.DC cs.PL

    The Collection Virtual Machine: An Abstraction for Multi-Frontend Multi-Backend Data Analysis

    Authors: Ingo Müller, Renato Marroquín, Dimitrios Koutsoukos, Mike Wawrzoniak, Sabir Akhadov, Gustavo Alonso

    Abstract: Getting the best performance from the ever-increasing number of hardware platforms has been a recurring challenge for data processing systems. In recent years, the advent of data science with its increasingly numerous and complex types of analytics has made this challenge even more difficult. In practice, system designers are overwhelmed by the number of combinations and typically implement only o…

    Submitted 8 April, 2020; v1 submitted 4 April, 2020; originally announced April 2020.

    Comments: This paper is currently under review at DaMoN'20

  3. Lambada: Interactive Data Analytics on Cold Data using Serverless Cloud Infrastructure

    Authors: Ingo Müller, Renato Marroquín, Gustavo Alonso

    Abstract: The promise of ultimate elasticity and operational simplicity of serverless computing has recently led to an explosion of research in this area. In the context of data analytics, the concept sounds appealing, but due to the limitations of current offerings, there is no consensus yet on whether or not this approach is technically and economically viable. In this paper, we identify interactive data…

    Submitted 2 December, 2019; originally announced December 2019.

    Report number: https://doi.org/10.3929/ethz-b-000413183

  4. Pay One, Get Hundreds for Free: Reducing Cloud Costs through Shared Query Execution

    Authors: Renato Marroquín, Ingo Müller, Darko Makreshanski, Gustavo Alonso

    Abstract: Cloud-based data analysis is nowadays common practice because of the lower system management overhead as well as the pay-as-you-go pricing model. The pricing model, however, is not always suitable for query processing as heavy use results in high costs. For example, in query-as-a-service systems, where users are charged per processed byte, collections of queries accessing the same data frequently…

    Submitted 1 September, 2018; originally announced September 2018.

    Journal ref: Proceedings of the ACM Symposium on Cloud Computing (SoCC) 2018, pages 439-450

  5. arXiv:1703.04290  [pdf, other]

    cs.DB

    MTBase: Optimizing Cross-Tenant Database Queries

    Authors: Lucas Braun, Renato Marroquin, Kai-En Tsay, Donald Kossmann

    Abstract: In the last decade, many business applications have moved into the cloud. In particular, the "database-as-a-service" paradigm has become mainstream. While existing multi-tenant data management systems focus on single-tenant query processing, we believe that it is time to rethink how queries can be processed across multiple tenants in such a way that we do not only gain more valuable insights, but…

    Submitted 13 March, 2017; originally announced March 2017.