Skip to main content

Showing 1–1 of 1 results for author: Tailor, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.00293  [pdf, ps, other

    cs.DB cs.LG

    Illuminating Patterns of Divergence: DataDios SmartDiff for Large-Scale Data Difference Analysis

    Authors: Aryan Poduri, Yashwant Tailor

    Abstract: Data engineering workflows require reliable differencing across files, databases, and query outputs, yet existing tools falter under schema drift, heterogeneous types, and limited explainability. SmartDiff is a unified system that combines schema-aware mapping, type-specific comparators, and parallel execution. It aligns evolving schemas, compares structured and semi-structured data (strings, numb… ▽ More

    Submitted 29 August, 2025; originally announced September 2025.

    Comments: 10 pages, 4 figures

    ACM Class: I.2.6; H.2.8; D.1.3