Skip to main content

Showing 1–2 of 2 results for author: Demiralp, Ç

Searching in archive stat. Search in all archives.
.
  1. arXiv:2009.01444  [pdf, other

    cs.LG cs.CL cs.DB cs.HC stat.ML

    Data Programming by Demonstration: A Framework for Interactively Learning Labeling Functions

    Authors: Sara Evensen, Chang Ge, Dongjin Choi, Çağatay Demiralp

    Abstract: Data programming is a programmatic weak supervision approach to efficiently curate large-scale labeled training data. Writing data programs (labeling functions) requires, however, both programming literacy and domain expertise. Many subject matter experts have neither programming proficiency nor time to effectively write data programs. Furthermore, regardless of one's expertise in coding or machin… ▽ More

    Submitted 15 September, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

  2. arXiv:1905.10688  [pdf, other

    cs.LG cs.DB cs.IR stat.ML

    Sherlock: A Deep Learning Approach to Semantic Data Type Detection

    Authors: Madelon Hulsebos, Kevin Hu, Michiel Bakker, Emanuel Zgraggen, Arvind Satyanarayan, Tim Kraska, Çağatay Demiralp, César Hidalgo

    Abstract: Correctly detecting the semantic type of data columns is crucial for data science tasks such as automated data cleaning, schema matching, and data discovery. Existing data preparation and analysis systems rely on dictionary lookups and regular expression matching to detect semantic types. However, these matching-based approaches often are not robust to dirty data and only detect a limited number o… ▽ More

    Submitted 25 May, 2019; originally announced May 2019.

    Comments: KDD'19