Skip to main content

Showing 1–7 of 7 results for author: Cetintemel, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.00600  [pdf, ps, other

    cs.DB cs.AI cs.CL

    Semantic Integrity Constraints: Declarative Guardrails for AI-Augmented Data Processing Systems

    Authors: Alexander W. Lee, Justin Chan, Michael Fu, Nicolas Kim, Akshay Mehta, Deepti Raghavan, Ugur Cetintemel

    Abstract: AI-augmented data processing systems (DPSs) integrate large language models (LLMs) into query pipelines, allowing powerful semantic operations on structured and unstructured data. However, the reliability (a.k.a. trust) of these systems is fundamentally challenged by the potential for LLMs to produce errors, limiting their adoption in critical domains. To help address this reliability bottleneck,… ▽ More

    Submitted 1 June, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

  2. arXiv:1909.06182  [pdf, other

    cs.DB

    DBPal: Weak Supervision for Learning a Natural Language Interface to Databases

    Authors: Nathaniel Weir, Andrew Crotty, Alex Galakatos, Amir Ilkhechi, Shekar Ramaswamy, Rohin Bhushan, Ugur Cetintemel, Prasetya Utama, Nadja Geisler, Benjamin Hättasch, Steffen Eger, Carsten Binnig

    Abstract: This paper describes DBPal, a new system to translate natural language utterances into SQL statements using a neural machine translation model. While other recent approaches use neural machine translation to implement a Natural Language Interface to Databases (NLIDB), existing techniques rely on supervised learning with manually curated training data, which results in substantial overhead for supp… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: arXiv admin note: text overlap with arXiv:1804.00401

  3. arXiv:1804.00401  [pdf, other

    cs.DB cs.CL cs.HC

    An End-to-end Neural Natural Language Interface for Databases

    Authors: Prasetya Utama, Nathaniel Weir, Fuat Basik, Carsten Binnig, Ugur Cetintemel, Benjamin Hättasch, Amir Ilkhechi, Shekar Ramaswamy, Arif Usta

    Abstract: The ability to extract insights from new data sets is critical for decision making. Visual interactive tools play an important role in data exploration since they provide non-technical users with an effective way to visually compose queries and comprehend the results. Natural language has recently gained traction as an alternative query interface to databases with the potential to enable non-exper… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

  4. arXiv:1608.05678  [pdf, ps, other

    cs.DB

    Revisiting Reuse in Main Memory Database Systems

    Authors: Kayhan Dursun, Carsten Binnig, Ugur Cetintemel, Tim Kraska

    Abstract: Reusing intermediates in databases to speed-up analytical query processing has been studied in the past. Existing solutions typically require intermediate results of individual operators to be materialized into temporary tables to be considered for reuse in subsequent queries. However, these approaches are fundamentally ill-suited for use in modern main memory databases. The reason is that modern… ▽ More

    Submitted 19 August, 2016; originally announced August 2016.

    Comments: 13 Pages, 11 Figures

  5. arXiv:1503.01143  [pdf, other

    cs.DB

    S-Store: Streaming Meets Transaction Processing

    Authors: John Meehan, Nesime Tatbul, Stan Zdonik, Cansu Aslantas, Ugur Cetintemel, Jiang Du, Tim Kraska, Samuel Madden, David Maier, Andrew Pavlo, Michael Stonebraker, Kristin Tufte, Hao Wang

    Abstract: Stream processing addresses the needs of real-time applications. Transaction processing addresses the coordination and safety of short atomic computations. Heretofore, these two modes of operation existed in separate, stove-piped systems. In this work, we attempt to fuse the two computational paradigms in a single system called S-Store. In this way, S-Store can simultaneously accommodate OLTP and… ▽ More

    Submitted 10 March, 2015; v1 submitted 3 March, 2015; originally announced March 2015.

  6. arXiv:1406.6667  [pdf, other

    cs.DB

    Tupleware: Redefining Modern Analytics

    Authors: Andrew Crotty, Alex Galakatos, Kayhan Dursun, Tim Kraska, Ugur Cetintemel, Stan Zdonik

    Abstract: There is a fundamental discrepancy between the targeted and actual users of current analytics frameworks. Most systems are designed for the data and infrastructure of the Googles and Facebooks of the world---petabytes of data distributed across large cloud deployments consisting of thousands of cheap commodity machines. Yet, the vast majority of users operate clusters ranging from a few to a few d… ▽ More

    Submitted 30 July, 2014; v1 submitted 25 June, 2014; originally announced June 2014.

  7. arXiv:1101.5805  [pdf, ps, other

    cs.DB cs.DS cs.LG

    The VC-Dimension of Queries and Selectivity Estimation Through Sampling

    Authors: Matteo Riondato, Mert Akdere, Ugur Cetintemel, Stanley B. Zdonik, Eli Upfal

    Abstract: We develop a novel method, based on the statistical concept of the Vapnik-Chervonenkis dimension, to evaluate the selectivity (output cardinality) of SQL queries - a crucial step in optimizing the execution of large scale database and data-mining operations. The major theoretical contribution of this work, which is of independent interest, is an explicit bound to the VC-dimension of a range space… ▽ More

    Submitted 11 August, 2011; v1 submitted 30 January, 2011; originally announced January 2011.

    Comments: 20 pages, 3 figures

    ACM Class: H.2.4; G.3