Skip to main content

Showing 1–23 of 23 results for author: Interlandi, M

.
  1. arXiv:2506.10092  [pdf, ps, other

    cs.DB

    GPU Acceleration of SQL Analytics on Compressed Data

    Authors: Zezhou Huang, Krystian Sakowski, Hans Lehnert, Wei Cui, Carlo Curino, Matteo Interlandi, Marius Dumitru, Rathijit Sen

    Abstract: GPUs are uniquely suited to accelerate (SQL) analytics workloads thanks to their massive compute parallelism and High Bandwidth Memory (HBM) -- when datasets fit in the GPU HBM, performance is unparalleled. Unfortunately, GPU HBMs remain typically small when compared with lower-bandwidth CPU main memory. Besides brute-force scaling across many GPUs, current solutions to accelerate queries on large… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2506.09226  [pdf, ps, other

    cs.DB cs.DC cs.PF

    Terabyte-Scale Analytics in the Blink of an Eye

    Authors: Bowen Wu, Wei Cui, Carlo Curino, Matteo Interlandi, Rathijit Sen

    Abstract: For the past two decades, the DB community has devoted substantial research to take advantage of cheap clusters of machines for distributed data analytics -- we believe that we are at the beginning of a paradigm shift. The scaling laws and popularity of AI models lead to the deployment of incredibly powerful GPU clusters in commercial data centers. Compared to CPU-only solutions, these clusters de… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2411.14331  [pdf, other

    cs.DB

    Data Formats in Analytical DBMSs: Performance Trade-offs and Future Directions

    Authors: Chunwei Liu, Anna Pavlenko, Matteo Interlandi, Brandon Haynes

    Abstract: This paper evaluates the suitability of Apache Arrow, Parquet, and ORC as formats for subsumption in an analytical DBMS. We systematically identify and explore the high-level features that are important to support efficient querying in modern OLAP DBMSs and evaluate the ability of each format to support these features. We find that each format has trade-offs that make it more or less suitable for… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  4. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  5. arXiv:2302.00734  [pdf, other

    cs.DB cs.AR

    Revisiting Query Performance in GPU Database Systems

    Authors: Jiashen Cao, Rathijit Sen, Matteo Interlandi, Joy Arulraj, Hyesoon Kim

    Abstract: GPUs offer massive compute parallelism and high-bandwidth memory accesses. GPU database systems seek to exploit those capabilities to accelerate data analytics. Although modern GPUs have more resources (e.g., higher DRAM bandwidth) than ever before, judicious choices for query processing that avoid wasteful resource allocations are still advantageous. Database systems can save GPU runtime costs th… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  6. arXiv:2211.02753  [pdf, other

    cs.DB cs.LG

    The Tensor Data Platform: Towards an AI-centric Database System

    Authors: Apurva Gandhi, Yuki Asada, Victor Fu, Advitya Gemawat, Lihao Zhang, Rathijit Sen, Carlo Curino, Jesús Camacho-Rodríguez, Matteo Interlandi

    Abstract: Database engines have historically absorbed many of the innovations in data processing, adding features to process graph data, XML, object oriented, and text among many others. In this paper, we make the case that it is time to do the same for AI -- but with a twist! While existing approaches have tried to achieve this by integrating databases with external ML tools, in this paper we claim that ac… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted for publication at The Conference on Innovative Data Systems Research (CIDR) 2023

  7. Deploying a Steered Query Optimizer in Production at Microsoft

    Authors: Wangda Zhang, Matteo Interlandi, Paul Mineiro, Shi Qiao, Nasim Ghazanfari Karlen Lie, Marc Friedman, Rafah Hosn, Hiren Patel, Alekh Jindal

    Abstract: Modern analytical workloads are highly heterogeneous and massively complex, making generic query optimizers untenable for many customers and scenarios. As a result, it is important to specialize these optimizers to instances of the workloads. In this paper, we continue a recent line of work in steering a query optimizer towards better plans for a given workload, and make major strides in pushing p… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Journal ref: Proceedings of the 2022 International Conference on Management of Data 2022 Jun 10 (pp. 2299-2311)

  8. Share the Tensor Tea: How Databases can Leverage the Machine Learning Ecosystem

    Authors: Yuki Asada, Victor Fu, Apurva Gandhi, Advitya Gemawat, Lihao Zhang, Dong He, Vivek Gupta, Ehi Nosakhare, Dalitso Banda, Rathijit Sen, Matteo Interlandi

    Abstract: We demonstrate Tensor Query Processor (TQP): a query processor that automatically compiles relational operators into tensor programs. By leveraging tensor runtimes such as PyTorch, TQP is able to: (1) integrate with ML tools (e.g., Pandas for data ingestion, Tensorboard for visualization); (2) target different hardware (e.g., CPU, GPU) and software (e.g., browser) backends; and (3) end-to-end acce… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Journal ref: PVLDB, 15(12): 3598 - 3601, 2022

  9. End-to-end Optimization of Machine Learning Prediction Queries

    Authors: Kwanghyun Park, Karla Saur, Dalitso Banda, Rathijit Sen, Matteo Interlandi, Konstantinos Karanasos

    Abstract: Prediction queries are widely used across industries to perform advanced analytics and draw insights from data. They include a data processing part (e.g., for joining, filtering, cleaning, featurizing the datasets) and a machine learning (ML) part invoking one or more trained models to perform predictions. These parts have so far been optimized in isolation, leaving significant opportunities for o… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  10. arXiv:2204.11131  [pdf, other

    cs.LG cs.AI cs.DB

    Data Debugging with Shapley Importance over End-to-End Machine Learning Pipelines

    Authors: Bojan Karlaš, David Dao, Matteo Interlandi, Bo Li, Sebastian Schelter, Wentao Wu, Ce Zhang

    Abstract: Developing modern machine learning (ML) applications is data-centric, of which one fundamental challenge is to understand the influence of data quality to ML training -- "Which training examples are 'guilty' in making the trained ML model predictions inaccurate or unfair?" Modeling data influence for ML training has attracted intensive interest over the last decade, and one popular framework is to… ▽ More

    Submitted 26 April, 2022; v1 submitted 23 April, 2022; originally announced April 2022.

  11. arXiv:2203.01877  [pdf, other

    cs.DB cs.AI cs.LG

    Query Processing on Tensor Computation Runtimes

    Authors: Dong He, Supun Nakandala, Dalitso Banda, Rathijit Sen, Karla Saur, Kwanghyun Park, Carlo Curino, Jesús Camacho-Rodríguez, Konstantinos Karanasos, Matteo Interlandi

    Abstract: The huge demand for computation in artificial intelligence (AI) is driving unparalleled investments in hardware and software systems for AI. This leads to an explosion in the number of specialized hardware devices, which are now offered by major cloud vendors. By hiding the low-level complexity through a tensor-based interface, tensor computation runtimes (TCRs) such as PyTorch allow data scientis… ▽ More

    Submitted 9 February, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

    Journal ref: Proceedings of the VLDB Endowment, 15(11): 2811 - 2825, 2022

  12. arXiv:2110.02313  [pdf, other

    cs.DB cs.AI cs.LG

    Phoebe: A Learning-based Checkpoint Optimizer

    Authors: Yiwen Zhu, Matteo Interlandi, Abhishek Roy, Krishnadhan Das, Hiren Patel, Malay Bag, Hitesh Sharma, Alekh Jindal

    Abstract: Easy-to-use programming interfaces paired with cloud-scale processing engines have enabled big data system users to author arbitrarily complex analytical jobs over massive volumes of data. However, as the complexity and scale of analytical jobs increase, they encounter a number of unforeseen problems, hotspots with large intermediate data on temporary storage, longer job recovery time after failur… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of the VLDB Endowment 14 (11), 2505-2518, 2021

  13. arXiv:2010.04804  [pdf, other

    cs.LG

    A Tensor Compiler for Unified Machine Learning Prediction Serving

    Authors: Supun Nakandala, Karla Saur, Gyeong-In Yu, Konstantinos Karanasos, Carlo Curino, Markus Weimer, Matteo Interlandi

    Abstract: Machine Learning (ML) adoption in the enterprise requires simpler and more efficient software infrastructure---the bespoke solutions typical in large web companies are simply untenable. Model scoring, the process of obtaining predictions from a trained model over new data, is a primary contributor to infrastructure complexity and cost as models are trained once but used many times. In this paper w… ▽ More

    Submitted 19 October, 2020; v1 submitted 9 October, 2020; originally announced October 2020.

  14. arXiv:1912.09536  [pdf, other

    cs.LG cs.DC stat.ML

    Data Science through the looking glass and what we found there

    Authors: Fotis Psallidas, Yiwen Zhu, Bojan Karlas, Matteo Interlandi, Avrilia Floratou, Konstantinos Karanasos, Wentao Wu, Ce Zhang, Subru Krishnan, Carlo Curino, Markus Weimer

    Abstract: The recent success of machine learning (ML) has led to an explosive growth both in terms of new systems and algorithms built in industry and academia, and new applications built by an ever-growing community of data science (DS) practitioners. This quickly shifting panorama of technologies and applications is challenging for builders and practitioners alike to follow. In this paper, we set out to c… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

  15. arXiv:1911.00231  [pdf, other

    cs.DB cs.LG

    Extending Relational Query Processing with ML Inference

    Authors: Konstantinos Karanasos, Matteo Interlandi, Doris Xin, Fotis Psallidas, Rathijit Sen, Kwanghyun Park, Ivan Popivanov, Supun Nakandal, Subru Krishnan, Markus Weimer, Yuan Yu, Raghu Ramakrishnan, Carlo Curino

    Abstract: The broadening adoption of machine learning in the enterprise is increasing the pressure for strict governance and cost-effective performance, in particular for the common and consequential steps of model storage and inference. The RDBMS provides a natural starting point, given its mature infrastructure for fast data access and processing, along with support for enterprise features (e.g., encrypti… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

  16. arXiv:1909.00084  [pdf, other

    cs.DB cs.DC cs.LG

    Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML

    Authors: Ashvin Agrawal, Rony Chatterjee, Carlo Curino, Avrilia Floratou, Neha Gowdal, Matteo Interlandi, Alekh Jindal, Kostantinos Karanasos, Subru Krishnan, Brian Kroth, Jyoti Leeka, Kwanghyun Park, Hiren Patel, Olga Poppe, Fotis Psallidas, Raghu Ramakrishnan, Abhishek Roy, Karla Saur, Rathijit Sen, Markus Weimer, Travis Wright, Yiwen Zhu

    Abstract: Machine learning (ML) has proven itself in high-value web applications such as search ranking and is emerging as a powerful tool in a much broader range of enterprise scenarios including voice recognition and conversational understanding for customer support, autotuning for videoconferencing, intelligent feedback loops in large-scale sysops, manufacturing and autonomous vehicle management, complex… ▽ More

    Submitted 27 December, 2019; v1 submitted 30 August, 2019; originally announced September 2019.

  17. arXiv:1906.03822  [pdf, other

    cs.LG stat.ML

    Making Classical Machine Learning Pipelines Differentiable: A Neural Translation Approach

    Authors: Gyeong-In Yu, Saeed Amizadeh, Sehoon Kim, Artidoro Pagnoni, Byung-Gon Chun, Markus Weimer, Matteo Interlandi

    Abstract: Classical Machine Learning (ML) pipelines often comprise of multiple ML models where models, within a pipeline, are trained in isolation. Conversely, when training neural network models, layers composing the neural models are simultaneously trained using backpropagation. We argue that the isolated training scheme of ML pipelines is sub-optimal, since it cannot jointly optimize multiple components.… ▽ More

    Submitted 12 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  18. Machine Learning at Microsoft with ML .NET

    Authors: Zeeshan Ahmed, Saeed Amizadeh, Mikhail Bilenko, Rogan Carr, Wei-Sheng Chin, Yael Dekel, Xavier Dupre, Vadim Eksarevskiy, Eric Erhardt, Costin Eseanu, Senja Filipi, Tom Finley, Abhishek Goswami, Monte Hoover, Scott Inglis, Matteo Interlandi, Shon Katzenberger, Najeeb Kazmi, Gleb Krivosheev, Pete Luferenko, Ivan Matantsev, Sergiy Matusevych, Shahab Moradi, Gani Nazirov, Justin Ormont , et al. (9 additional authors not shown)

    Abstract: Machine Learning is transitioning from an art and science into a technology available to every developer. In the near future, every application on every platform will incorporate trained models to encode data-based decisions that would be impossible for developers to author. This presents a significant engineering challenge, since currently data science and modeling are largely decoupled from stan… ▽ More

    Submitted 15 May, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

  19. arXiv:1812.06411  [pdf, other

    cs.IT

    Coded Elastic Computing

    Authors: Yaoqing Yang, Matteo Interlandi, Pulkit Grover, Soummya Kar, Saeed Amizadeh, Markus Weimer

    Abstract: Cloud providers have recently introduced new offerings whereby spare computing resources are accessible at discounts compared to on-demand computing. Exploiting such opportunity is challenging inasmuch as such resources are accessed with low-priority and therefore can elastically leave (through preemption) and join the computation at any time. In this paper, we design a new technique called coded… ▽ More

    Submitted 26 May, 2019; v1 submitted 16 December, 2018; originally announced December 2018.

    Comments: Some preliminary results of the paper have been presented at the Workshop on Systems for ML and Open Source Software at NeurIPS 2018 (without conference proceedings). An updated conference version will appear in ISIT 2019

  20. arXiv:1810.06115  [pdf, other

    cs.LG cs.DC stat.ML

    PRETZEL: Opening the Black Box of Machine Learning Prediction Serving Systems

    Authors: Yunseong Lee, Alberto Scolari, Byung-Gon Chun, Marco Domenico Santambrogio, Markus Weimer, Matteo Interlandi

    Abstract: Machine Learning models are often composed of pipelines of transformations. While this design allows to efficiently execute single model components at training time, prediction serving has different requirements such as low latency, high throughput and graceful performance degradation under heavy load. Current prediction serving systems consider models as black boxes, whereby prediction-time-speci… ▽ More

    Submitted 14 October, 2018; originally announced October 2018.

    Comments: 16 pages, 14 figures, 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2018

  21. arXiv:1807.02957  [pdf, other

    cs.DB cs.LO cs.PL

    Scaling-Up Reasoning and Advanced Analytics on BigData

    Authors: Tyson Condie, Ariyam Das, Matteo Interlandi, Alexander Shkapsky, Mohan Yang, Carlo Zaniolo

    Abstract: BigDatalog is an extension of Datalog that achieves performance and scalability on both Apache Spark and multicore systems to the point that its graph analytics outperform those written in GraphX. Looking back, we see how this realizes the ambitious goal pursued by deductive database researchers beginning forty years ago: this is the goal of combining the rigor and power of logic in expressing que… ▽ More

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP)

  22. arXiv:1707.05681  [pdf, other

    cs.DB

    Fixpoint Semantics and Optimization of Recursive Datalog Programs with Aggregates

    Authors: Carlo Zaniolo, Mohan Yang, Matteo Interlandi, Ariyam Das, Alexander Shkapsky, Tyson Condie

    Abstract: A very desirable Datalog extension investigated by many researchers in the last thirty years consists in allowing the use of the basic SQL aggregates min, max, count and sum in recursive rules. In this paper, we propose a simple comprehensive solution that extends the declarative least-fixpoint semantics of Horn Clauses, along with the optimization techniques used in the bottom-up implementation a… ▽ More

    Submitted 21 July, 2017; v1 submitted 18 July, 2017; originally announced July 2017.

    Comments: Paper presented at the 33nd International Conference on Logic Programming (ICLP 2017), Melbourne, Australia, August 28 to September 1, 2017. 16 pages, LaTeX (arXiv:1707.05681)

  23. arXiv:1405.7264  [pdf, ps, other

    cs.DB

    A Datalog-based Computational Model for Coordination-free, Data-Parallel Systems

    Authors: Matteo Interlandi, Letizia Tanca

    Abstract: Cloud computing refers to maximizing efficiency by sharing computational and storage resources, while data-parallel systems exploit the resources available in the cloud to perform parallel transformations over large amounts of data. In the same line, considerable emphasis has been recently given to two apparently disjoint research topics: data-parallel, and eventually consistent, distributed syste… ▽ More

    Submitted 8 July, 2018; v1 submitted 28 May, 2014; originally announced May 2014.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP)