Showing 1–50 of 61 results for author: Jagadish, V

  1. arXiv:2506.05587  [pdf, ps, other]

    cs.AI cs.CL cs.DB cs.LG

    MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark

    Authors: Junjie Xing, Yeye He, Mengyu Zhou, Haoyu Dong, Shi Han, Lingjiao Chen, Dongmei Zhang, Surajit Chaudhuri, H. V. Jagadish

    Abstract: Tables and table-based use cases play a crucial role in many important real-world applications, such as spreadsheets, databases, and computational notebooks, which traditionally require expert-level users like data engineers, data analysts, and database administrators to operate. Although LLMs have shown remarkable progress in working with tables (e.g., in spreadsheet and database copilot scenario…

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2501.06846  [pdf, ps, other]

    quant-ph

    On the eternal non-Markovianity of qubit maps

    Authors: Vinayak Jagadish, R. Srikanth

    Abstract: As is well known, unital Pauli maps can be eternally non-CP-divisible. In contrast, here we show that in the case of non-unital maps, eternal non-Markovianity in the non-unital part is ruled out. In the unital case, the eternal non-Markovianity can be obtained by a convex combination of two dephasing semigroups, but not all three of them. We study these results and the ramifications arising from t…

    Submitted 12 January, 2025; originally announced January 2025.

    Comments: 4 pages

  3. arXiv:2412.09788  [pdf, other]

    cs.DB

    OpenForge: Probabilistic Metadata Integration

    Authors: Tianji Cong, Fatemeh Nargesian, Junjie Xing, H. V. Jagadish

    Abstract: Modern data stores increasingly rely on metadata for enabling diverse activities such as data cataloging and search. However, metadata curation remains a labor-intensive task, and the broader challenge of metadata maintenance -- ensuring its consistency, usefulness, and freshness -- has been largely overlooked. In this work, we tackle the problem of resolving relationships among metadata concepts…

    Submitted 12 December, 2024; originally announced December 2024.

  4. arXiv:2408.00513  [pdf, other]

    cs.LG

    VecAug: Unveiling Camouflaged Frauds with Cohort Augmentation for Enhanced Detection

    Authors: Fei Xiao, Shaofeng Cai, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Meihui Zhang

    Abstract: Fraud detection presents a challenging task characterized by ever-evolving fraud patterns and scarce labeled data. Existing methods predominantly rely on graph-based or sequence-based approaches. While graph-based approaches connect users through shared entities to capture structural information, they remain vulnerable to fraudsters who can disrupt or manipulate these connections. In contrast, seq…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted by KDD 2024

  5. arXiv:2406.14015  [pdf, other]

    cs.LG

    CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics

    Authors: Qingpeng Cai, Kaiping Zheng, H. V. Jagadish, Beng Chin Ooi, James Yip

    Abstract: Cohort studies are of significant importance in the field of healthcare analysis. However, existing methods typically involve manual, labor-intensive, and expert-driven pattern definitions or rely on simplistic clustering techniques that lack medical relevance. Automating cohort studies with interpretable patterns has great potential to facilitate healthcare analysis but remains an unmet need in p…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures

  6. arXiv:2405.00301  [pdf, other]

    cs.CL

    Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression

    Authors: Farima Fatahi Bayat, Xin Liu, H. V. Jagadish, Lu Wang

    Abstract: Large language models (LLMs) can generate long-form and coherent text, yet they often hallucinate facts, which undermines their reliability. To mitigate this issue, inference-time methods steer LLM representations toward the "truthful directions" previously learned for truth elicitation. However, applying these truthful directions with the same intensity fails to generalize across different query…

    Submitted 6 June, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings (Long paper)

  7. arXiv:2402.01071  [pdf, other]

    cs.LG cs.CY cs.DB

    Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of Minorities

    Authors: Mahdi Erfanian, H. V. Jagadish, Abolfazl Asudeh

    Abstract: The potential harms of the under-representation of minorities in training data, particularly in multi-modal settings, are a well-recognized concern. While there has been extensive effort in detecting such under-representation, resolution has remained a challenge. With recent advancements in generative AI, large language models and foundation models have emerged as versatile tools across various dom…

    Submitted 1 February, 2024; originally announced February 2024.

  8. arXiv:2310.07736  [pdf, other]

    cs.DB cs.LG

    Observatory: Characterizing Embeddings of Relational Tables

    Authors: Tianji Cong, Madelon Hulsebos, Zhenjie Sun, Paul Groth, H. V. Jagadish

    Abstract: Language models and specialized table embedding models have recently demonstrated strong performance on many tasks over tabular data. Researchers and practitioners are keen to leverage these models in many new application contexts; but limited understanding of the strengths and weaknesses of these models, and the table representations they generate, makes the process of finding a suitable model fo…

    Submitted 27 January, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Camera ready of VLDB 2024

  9. arXiv:2309.07856  [pdf, other]

    cs.DB

    SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions

    Authors: Yin Lin, Bolin Ding, H. V. Jagadish, Jingren Zhou

    Abstract: Before applying data analytics or machine learning to a data set, a vital step is usually the construction of an informative set of features from the data. In this paper, we present SMARTFEAT, an efficient automated feature engineering tool to assist data users, even non-experts, in constructing useful features. Leveraging the power of Foundation Models (FMs), our approach enables the creation of…

    Submitted 13 December, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  10. Experimental realization of quantum non-Markovianity through the convex mixing of Pauli semigroups on an NMR quantum processor

    Authors: Vaishali Gulati, Vinayak Jagadish, R. Srikanth, Kavita Dorai

    Abstract: This experimental study aims to investigate the convex combinations of Pauli semigroups with arbitrary mixing parameters to determine whether the resulting dynamical map exhibits Markovian or non-Markovian behavior. Specifically, we consider the cases of equal as well as unequal mixing of two Pauli semigroups, and demonstrate that the resulting map is always non-Markovian. Additionally, we study t…

    Submitted 26 April, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: 9 pages, 8 figures

    Journal ref: Phys. Rev. A 109, 042419 (2024)

  11. Noninvertibility and non-Markovianity of quantum dynamical maps

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: We identify two broad types of noninvertibilities in quantum dynamical maps, one necessarily associated with CP indivisibility and one not so. We study the production of (non-)Markovian, invertible maps by the process of mixing noninvertible Pauli maps, and quantify the fraction of the same. The memory kernel perspective appears to be less transparent on the issue of invertibility than the approac…

    Submitted 14 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 7 pages, 2 figures

    Journal ref: Phys. Rev. A 108, 042202 (2023)

  12. Petz recovery maps for qudit quantum channels

    Authors: Lea Lautenbacher, Vinayak Jagadish, Francesco Petruccione, Nadja K. Bernardes

    Abstract: This study delves into the efficacy of the Petz recovery map within the context of two paradigmatic quantum channels: dephasing and amplitude-damping. While prior investigations have predominantly focused on qubits, our research extends this inquiry to higher-dimensional systems. We introduce a novel, state-independent framework based on the Choi-Jamiołkowski isomorphism to evaluate the performanc…

    Submitted 22 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: 10 pages, 10 figures, V2

    Journal ref: Physics Letters A, Volume 512, 2024, 129583

  13. arXiv:2301.04901  [pdf, other]

    cs.DB cs.IR

    Pylon: Semantic Table Union Search in Data Lakes

    Authors: Tianji Cong, Fatemeh Nargesian, H. V. Jagadish

    Abstract: The large size and fast growth of data repositories, such as data lakes, have spurred the need for data discovery to help analysts find related data. The problem has become challenging as (i) a user typically does not know what datasets exist in an enormous data repository; and (ii) there is usually a lack of a unified data model to capture the interrelationships between heterogeneous datasets from…

    Submitted 13 January, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Version submitted to the third round of ICDE 2023 on October 8, 2022

  14. arXiv:2301.00719  [pdf, other]

    cs.LG cs.DB

    Detection of Groups with Biased Representation in Ranking

    Authors: Jinyang Li, Yuval Moskovitch, H. V. Jagadish

    Abstract: Real-life tools for decision-making in many critical domains are based on ranking results. With the increasing awareness of algorithmic fairness, recent works have presented measures for fairness in ranking. Many of those definitions consider the representation of different "protected groups", in the top-$k$ ranked items, for any reasonable $k$. Given the protected groups, confirming algorithmic…

    Submitted 6 July, 2023; v1 submitted 30 December, 2022; originally announced January 2023.

  15. arXiv:2212.14155  [pdf, other]

    cs.DB

    WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses

    Authors: Tianji Cong, James Gale, Jason Frantz, H. V. Jagadish, Çağatay Demiralp

    Abstract: Data discovery is a major challenge in enterprise data analysis: users often struggle to find data relevant to their analysis goals or even to navigate through data across data sources, each of which may easily contain thousands of tables. One common user need is to discover tables joinable with a given table. This need is particularly critical because join is a ubiquitous operation in data analys…

    Submitted 2 January, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: CIDR'23

  16. arXiv:2211.06793  [pdf, other]

    cs.DB cs.LG

    Reinforcement Learning Enhanced Weighted Sampling for Accurate Subgraph Counting on Fully Dynamic Graph Streams

    Authors: Kaixin Wang, Cheng Long, Da Yan, Jie Zhang, H. V. Jagadish

    Abstract: As the popularity of graph data increases, there is a growing need to count the occurrences of subgraph patterns of interest, for a variety of applications. Many graphs are massive in scale and also fully dynamic (with insertions and deletions of edges), rendering exact computation of these counts to be infeasible. Common practice is, instead, to use a small set of edges as a sample to estimate th…

    Submitted 12 November, 2022; originally announced November 2022.

    Comments: 17 pages, 5 figures. Accepted by ICDE'23

  17. arXiv:2208.01613  [pdf, other]

    cs.DB cs.HC

    Principles of Query Visualization

    Authors: Wolfgang Gatterbauer, Cody Dunne, H. V. Jagadish, Mirek Riedewald

    Abstract: Query Visualization (QV) is the problem of transforming a given query into a graphical representation that helps humans understand its meaning. This task is notably different from designing a Visual Query Language (VQL) that helps a user compose a query. This article discusses the principles of relational query visualization and its potential for simplifying user interactions with relational data.

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: 20 pages, 12 figures, preprint for IEEE Data Engineering Bulletin

  18. arXiv:2205.02880  [pdf, other]

    cs.CL

    CompactIE: Compact Facts in Open Information Extraction

    Authors: Farima Fatahi Bayat, Nikita Bhutani, H. V. Jagadish

    Abstract: A major drawback of modern neural OpenIE systems and benchmarks is that they prioritize high coverage of information in extractions over compactness of their constituents. This severely limits the usefulness of OpenIE extractions in many downstream tasks. The utility of extractions can be improved if extractions are compact and share constituents. To this end, we study the problem of identifying c…

    Submitted 9 June, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 main conference (Long paper)

  19. arXiv:2203.11852  [pdf, other]

    cs.DB cs.LG

    Representation Bias in Data: A Survey on Identification and Resolution Techniques

    Authors: Nima Shahbazi, Yin Lin, Abolfazl Asudeh, H. V. Jagadish

    Abstract: Data-driven algorithms are only as good as the data they work with, while data sets, especially social data, often fail to represent minorities adequately. Representation Bias in data can happen due to various reasons ranging from historical discrimination to selection and sampling biases in the data acquisition and preparation methods. Given that "bias in, bias out", one cannot expect AI-based so…

    Submitted 18 March, 2023; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Just Accepted ACM Comput. Surv. (March 2023)

  20. Measure of invertible dynamical maps under convex combinations of noninvertible dynamical maps

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: We study the convex combinations of the $(d+1)$ generalized Pauli dynamical maps in a Hilbert space of dimension $d$. For certain choices of the decoherence function, the maps are noninvertible and they remain under convex combinations as well. For the case of dynamical maps characterized by the decoherence function $(1-e^{-ct})/n$ with the decoherence parameter $n$ and decay factor $c$, we evalua…

    Submitted 28 July, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: 6 pages, 1 figure

    Journal ref: Phys. Rev. A 106, 012438 (2022)

  21. Noninvertibility as a requirement for creating a semigroup under convex combinations of channels

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: We study the conditions under which a semigroup is obtained upon convex combinations of channels. In particular, we study the set of Pauli and generalized Pauli channels. We find that mixing only semigroups can never produce a semigroup. Counter-intuitively, we find that for a convex combination to yield a semigroup, most of the input channels have to be noninvertible.

    Submitted 10 March, 2022; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 5 pages

    Journal ref: Phys. Rev. A 105, 032422 (2022)

  22. ARM-Net: Adaptive Relation Modeling Network for Structured Data

    Authors: Shaofeng Cai, Kaiping Zheng, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Meihui Zhang

    Abstract: Relational databases are the de facto standard for storing and querying structured data, and extracting insights from structured data requires advanced analytics. Deep neural networks (DNNs) have achieved super-human prediction performance in particular data types, e.g., images. However, existing DNNs may not produce meaningful results when applied to structured data. The reason is that there are…

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 14 pages, 11 figures, 5 tables, published as a conference paper in ACM SIGMOD 2020

  23. arXiv:2012.12292  [pdf, other]

    quant-ph

    Initial entanglement, entangling unitaries, and completely positive maps

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: The problem of conditions on the initial correlations between the system and the environment that lead to completely positive (CP) or not-completely positive (NCP) maps has been studied by various authors. Two lines of study may be discerned: one concerned with families of initial correlations that induce CP dynamics under the application of an arbitrary joint unitary on the system and environment…

    Submitted 15 April, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: 5 pages, 2 figures

  24. arXiv:2010.16340  [pdf, other]

    cs.DB

    Patterns Count-Based Labels for Datasets

    Authors: Yuval Moskovitch, H. V. Jagadish

    Abstract: Counts of attribute-value combinations are central to the profiling of a dataset, particularly in determining fitness for use and in eliminating bias and unfairness. While counts of individual attribute values may be stored in some dataset profiles, there are too many combinations of attributes for it to be practical to store counts for each combination. In this paper, we develop the notion of sto…

    Submitted 7 November, 2020; v1 submitted 30 October, 2020; originally announced October 2020.

    Comments: ICDE2021

  25. arXiv:2010.08807  [pdf, other]

    cs.DB

    MithraDetective: A System for Cherry-picked Trendlines Detection

    Authors: Yoko Nagafuchi, Yin Lin, Kaushal Mamgain, Abolfazl Asudeh, H. V. Jagadish, You, Wu, Cong Yu

    Abstract: Given a data set, misleading conclusions can be drawn from it by cherry-picking selected samples. One important class of conclusions is a trend derived from a data set of values over time. Our goal is to evaluate whether the 'trends' described by the extracted samples are representative of the true situation represented in the data. We demonstrate MithraDetective, a system to compute a support sco…

    Submitted 17 October, 2020; originally announced October 2020.

  26. Compressed Sensing Tomography for qudits in Hilbert spaces of non-power-of-two dimensions

    Authors: Revanth Badveli, Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: The techniques of low-rank matrix recovery were adapted for Quantum State Tomography (QST) previously by D. Gross et al. [Phys. Rev. Lett. 105, 150401 (2010)], where they consider the tomography of $n$ spin-$1/2$ systems. For the density matrix of dimension $d = 2^n$ and rank $r$ with $r \ll 2^n$, it was shown that randomly chosen Pauli measurements of the order $O(dr \log(d)^2)$ are enough to ful…

    Submitted 6 July, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 6 pages, 2 figures

    Journal ref: Phys. Rev. A 101, 062328 (2020)

  27. arXiv:2004.11375  [pdf]

    cs.DB cs.HC cs.LO

    QueryVis: Logic-based diagrams help users understand complicated SQL queries faster

    Authors: Aristotelis Leventidis, Jiahui Zhang, Cody Dunne, Wolfgang Gatterbauer, H. V. Jagadish, Mirek Riedewald

    Abstract: Understanding the meaning of existing SQL queries is critical for code maintenance and reuse. Yet SQL can be hard to read, even for expert users or the original creator of a query. We conjecture that it is possible to capture the logical intent of queries in automatically-generated visual diagrams that can help users understand the meaning of queries faster and more accurately than SQL text…

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: Full version of paper appearing in SIGMOD 2020

  28. Duoquest: A Dual-Specification System for Expressive SQL Queries

    Authors: Christopher Baik, Zhongjun Jin, Michael Cafarella, H. V. Jagadish

    Abstract: Querying a relational database is difficult because it requires users to know both the SQL language and be familiar with the schema. On the other hand, many users possess enough domain familiarity or expertise to describe their desired queries by alternative means. For such users, two major alternatives to writing SQL are natural language interfaces (NLIs) and programming-by-example (PBE). Both of…

    Submitted 16 March, 2020; originally announced March 2020.

    Comments: Technical Report, 16 pages. Shorter version to be published in SIGMOD 2020

  29. Dynamics of quantum correlations in a Qubit-Oscillator system interacting via a dissipative bath

    Authors: Revanth Badveli, Vinayak Jagadish, S. Akshaya, R. Srikanth, Francesco Petruccione

    Abstract: The entanglement dynamics in a bipartite system consisting of a qubit and a harmonic oscillator interacting only through their coupling with the same bath is studied. The considered model assumes that the qubit is coupled to the bath via the Jaynes-Cummings interaction, whilst the position of the oscillator is coupled to the position of the bath via a dipole interaction. We give a microscopic deri…

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 15 pages, 8 figures, Accepted in Open Systems & Information Dynamics for publication

    Journal ref: Open Systems & Information Dynamics 27, 2050004 (2020)

  30. Convex combinations of CP-divisible Pauli channels that are not semigroups

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: We study the memory property of the channels obtained by convex combinations of Markovian channels that are not necessarily quantum dynamical semigroups (QDSs). In particular, we characterize the geometry of the region of (non-)Markovian channels obtained by the convex combination of the three Pauli channels, as a function of deviation from the semigroup form in a family of channels. The regions a…

    Submitted 29 September, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: 5 pages, 4 figures

    Journal ref: Physics Letters A 384(35) 126907 (2020)

  31. arXiv:1911.10073  [pdf, other]

    cs.LG cs.AI stat.ML

    Responsible Scoring Mechanisms Through Function Sampling

    Authors: Abolfazl Asudeh, H. V. Jagadish

    Abstract: Human decision-makers often receive assistance from data-driven algorithmic systems that provide a score for evaluating objects, including individuals. The scores are generated by a function (mechanism) that takes a set of features as input and generates a score. The scoring functions are either machine-learned or human-designed and can be used for different decision purposes such as ranking or cla…

    Submitted 22 November, 2019; originally announced November 2019.

  32. Convex Combinations of Pauli Semigroups: Geometry, Measure and an Application

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: Finite-time Markovian channels, unlike their infinitesimal counterparts, do not form a convex set. As a particular instance of this observation, we consider the problem of mixing the three Pauli channels, conservatively assumed to be quantum dynamical semigroups, and fully characterize the resulting "Pauli simplex." We show that neither the set of non-Markovian (completely positive indivisible)…

    Submitted 1 June, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: 5 pages, 1 figure

    Journal ref: Phys. Rev. A 101, 062304 (2020)

  33. Measure of not-completely-positive qubit maps: the general case

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: We show that the set of not-completely-positive (NCP) maps is unbounded, unless further assumptions are made. This is done by first proposing a reasonable definition of a valid NCP map, which is nontrivial because NCP maps may lack a full positivity domain. The definition is motivated by specific examples. We prove that for valid NCP maps, the eigenvalue spectrum of the corresponding dynamical mat…

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: 11 pages, 1 figure

    Journal ref: Phys. Rev. A 100, 012336 (2019)

  34. arXiv:1906.08986  [pdf, other]

    cs.DB cs.DC cs.LG

    Database Meets Deep Learning: Challenges and Opportunities

    Authors: Wei Wang, Meihui Zhang, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Kian-Lee Tan

    Abstract: Deep learning has recently become very popular on account of its incredible success in many complex data-driven applications, such as image classification and speech recognition. The database community has worked on data-driven applications for many years, and therefore should be playing a lead role in supporting this new wave. However, databases and deep learning are different in terms of both te…

    Submitted 18 January, 2020; v1 submitted 21 June, 2019; originally announced June 2019.

    Comments: The first version of this paper has appeared in SIGMOD Record. In this (third) version, we extend it to include the recent developments in this field and references to recent work (especially for section 3.2 and section 4.2)

    Journal ref: ACM SIGMOD Record, Volume 45 Issue 2, June 2016, Pages 17-22

  35. arXiv:1903.00172  [pdf, other]

    cs.CL

    Open Information Extraction from Question-Answer Pairs

    Authors: Nikita Bhutani, Yoshihiko Suhara, Wang-Chiew Tan, Alon Halevy, H. V. Jagadish

    Abstract: Open Information Extraction (OpenIE) extracts meaningful structured tuples from free-form text. Most previous work on OpenIE considers extracting data from one sentence at a time. We describe NeurON, a system for extracting tuples from question-answer pairs. Since real questions and answers often contain precisely the information that users care about, such information is particularly desirable to…

    Submitted 6 April, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

    Comments: NAACL 2019

  36. An Invitation to Quantum Channels

    Authors: Vinayak Jagadish, Francesco Petruccione

    Abstract: Open quantum systems have become an active area of research, owing to their potential applications in many different fields ranging from computation to biology. Here, we review the formalism of dynamical maps used to represent the time evolution of open quantum systems and discuss the various representations and properties of the same, with many examples.

    Submitted 3 February, 2019; originally announced February 2019.

    Comments: 14 pages, 6 figures

    Journal ref: Quanta 2018; 7: 54-67

  37. Measure of positive and not completely positive single-qubit Pauli maps

    Authors: Vinayak Jagadish, R. Srikanth, Francesco Petruccione

    Abstract: The time evolution of an initially uncorrelated system is governed by a completely positive (CP) map. More generally, the system may contain initial (quantum) correlations with an environment, in which case the system evolves according to a not-completely positive (NCP) map. It is an interesting question what the relative measure is for these two types of maps within the set of positive maps. Afte…

    Submitted 3 February, 2019; originally announced February 2019.

    Comments: 10 pages, Accepted in Phys. Rev. A for publication

    Journal ref: Phys. Rev. A 99, 022321 (2019)

  38. Bridging the Semantic Gap with SQL Query Logs in Natural Language Interfaces to Databases

    Authors: Christopher Baik, H. V. Jagadish, Yunyao Li

    Abstract: A critical challenge in constructing a natural language interface to database (NLIDB) is bridging the semantic gap between a natural language query (NLQ) and the underlying data. Two specific ways this challenge exhibits itself are through keyword mapping and join path inference. Keyword mapping is the task of mapping individual keywords in the original NLQ to database elements (such as relations,…

    Submitted 31 January, 2019; originally announced February 2019.

    Comments: Accepted to IEEE International Conference on Data Engineering (ICDE) 2019

  39. arXiv:1812.07658  [pdf, other]

    cs.DB

    Demonstration of a Multiresolution Schema Mapping System

    Authors: Zhongjun Jin, Christopher Baik, Michael Cafarella, H. V. Jagadish, Yuze Lou

    Abstract: Enterprise databases usually contain large and complex schemas. Authoring complete schema mapping queries in this case requires deep knowledge about the source and target schemas and is thereby very challenging to programmers. Sample-driven schema mapping allows the user to describe the schema mapping using data records. However, real data records are still harder to specify than other useful insi…

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 4 pages, 5 figures, CIDR 2019

    Journal ref: 9th Biennial Conference on Innovative Data Systems Research (CIDR 2019)

  40. Assessing and Remedying Coverage for a Given Dataset

    Authors: Abolfazl Asudeh, Zhongjun Jin, H. V. Jagadish

    Abstract: Data analysis impacts virtually every aspect of our society today. Often, this analysis is performed on an existing dataset, possibly collected through a process that the data scientists had limited control over. The existing data analyzed may not include the complete universe, but it is expected to cover the diversity of items in the universe. Lack of adequate coverage in the dataset can result i…

    Submitted 23 February, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: in ICDE 2019

  41. arXiv:1809.04017  [pdf, other]

    cs.DB

    Reducing Uncertainty of Schema Matching via Crowdsourcing with Accuracy Rates

    Authors: Chen Jason Zhang, Lei Chen, H. V. Jagadish, Mengchen Zhang, Yongxin Tong

    Abstract: Schema matching is a central challenge for data integration systems. Inspired by the popularity and the success of crowdsourcing platforms, we explore the use of crowdsourcing to reduce the uncertainty of schema matching. Since crowdsourcing platforms are most effective for simple questions, we assume that each Correspondence Correctness Question (CCQ) asks the crowd to decide whether a given corr…

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: 15 pages

  42. arXiv:1807.00071  [pdf]

    cs.GL cs.DL

    GOTO Rankings Considered Helpful

    Authors: Emery Berger, Stephen M. Blackburn, Carla Brodley, H. V. Jagadish, Kathryn S. McKinley, Mario A. Nascimento, Minjeong Shin, Lexing Xie

    Abstract: Rankings are a fact of life. Whether or not one likes them, they exist and are influential. Within academia, and in computer science in particular, rankings not only capture our attention but also widely influence people who have a limited understanding of computing science research, including prospective students, university administrators, and policy-makers. In short, rankings matter. This posit…

    Submitted 24 April, 2019; v1 submitted 29 June, 2018; originally announced July 2018.

    Comments: Accepted, to appear in Communications of the ACM

  43. On Obtaining Stable Rankings

    Authors: Abolfazl Asudeh, H. V. Jagadish, Gerome Miklau, Julia Stoyanovich

    Abstract: Decision making is challenging when there is more than one criterion to consider. In such cases, it is common to assign a goodness score to each item as a weighted sum of its attribute values and rank them accordingly. Clearly, the ranking obtained depends on the weights used for this summation. Ideally, one would want the ranked order not to change if the weights are changed slightly. We call thi…

    Submitted 18 December, 2018; v1 submitted 29 April, 2018; originally announced April 2018.

    Journal ref: Abolfazl Asudeh, H. V. Jagadish, Gerome Miklau, Julia Stoyanovich. On Obtaining Stable Rankings. PVLDB, 12(3): 237-250, 2018

  44. arXiv:1804.09997  [pdf, other]

    cs.AI cs.DB

    PANDA: Facilitating Usable AI Development

    Authors: Jinyang Gao, Wei Wang, Meihui Zhang, Gang Chen, H. V. Jagadish, Guoliang Li, Teck Khim Ng, Beng Chin Ooi, Sheng Wang, Jingren Zhou

    Abstract: Recent advances in artificial intelligence (AI) and machine learning have created a general perception that AI could be used to solve complex problems, and in some situations over-hyped as a tool that can be so easily used. Unfortunately, the barrier to realization of mass adoption of AI on various business domains is too high because most domain experts have no background in AI. Developing AI app…

    Submitted 26 April, 2018; originally announced April 2018.

  45. arXiv:1803.00701  [pdf, other]

    cs.DB

    CLX: Towards verifiable PBE data transformation

    Authors: Zhongjun Jin, Michael Cafarella, H. V. Jagadish, Sean Kandel, Michael Minar, Joseph M. Hellerstein

    Abstract: Effective data analytics on data collected from the real world usually begins with a notoriously expensive pre-processing step of data transformation and wrangling. Programming By Example (PBE) systems have been proposed to automatically infer transformations using simple examples that users provide as hints. However, an important usability issue - verification - limits the effective use of such P…

    Submitted 12 August, 2019; v1 submitted 1 March, 2018; originally announced March 2018.

    Comments: 16 pages

  46. RRR: Rank-Regret Representative

    Authors: Abolfazl Asudeh, Azade Nazi, Nan Zhang, Gautam Das, H. V. Jagadish

    Abstract: Selecting the best items in a dataset is a common task in data exploration. However, the concept of "best" lies in the eyes of the beholder: different users may consider different attributes more important, and hence arrive at different rankings. Nevertheless, one can remove "dominated" items and create a "representative" subset of the data set, comprising the "best items" in it. A Pareto-optimal…

    Submitted 3 March, 2018; v1 submitted 28 February, 2018; originally announced February 2018.

  47. Designing Fair Ranking Schemes

    Authors: Abolfazl Asudeh, H. V. Jagadish, Julia Stoyanovich, Gautam Das

    Abstract: Items from a database are often ranked based on a combination of multiple criteria. A user may have the flexibility to accept combinations that weigh these criteria differently, within limits. On the other hand, this choice of weights can greatly affect the fairness of the produced ranking. In this paper, we develop a system that helps users choose criterion weights that lead to greater fairness.…

    Submitted 4 January, 2018; v1 submitted 27 December, 2017; originally announced December 2017.

  48. Non-Markovian evolution: a quantum walk perspective

    Authors: Pradeep Kumar, Subhashish Banerjee, R. Srikanth, Vinayak Jagadish, Francesco Petruccione

    Abstract: Quantum non-Markovianity of a quantum noisy channel manifests typically as information backflow, characterized by the departure of the intermediate map from complete positivity, though we indicate certain noisy channels that don't exhibit this behavior. In complex systems, non-Markovianity becomes more involved on account of subsystem dynamics. Here we study various facets of non-Markovian evoluti…

    Submitted 8 January, 2019; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: Accepted version

    Journal ref: Open Systems & Information Dynamics 25, 1850014 (2018)

  49. arXiv:1703.08004  [pdf, other]

    quant-ph

    Non-Markovian Dynamics of Discrete-Time Quantum Walks

    Authors: Subhashish Banerjee, N. Pradeep Kumar, R. Srikanth, Vinayak Jagadish, Francesco Petruccione

    Abstract: In the case of the discrete time coined quantum walk the reduced dynamics of the coin shows non-Markovian recurrence features due to information back-flow from the position degree of freedom. Here we study how this non-Markovian behavior is modified in the presence of open system dynamics. In the process, we obtain useful insights into the nature of non-Markovian physics. In particular, we show th…

    Submitted 23 March, 2017; originally announced March 2017.

    Comments: 5 pages, 4 figures

  50. arXiv:1610.04789  [pdf, other]

    cs.DB

    Bsmooth: Learning from user feedback to disambiguate query terms in interactive data retrieval

    Authors: Bernardo Gonçalves, H. V. Jagadish

    Abstract: There is great interest in supporting imprecise queries (e.g., keyword search or natural language queries) over databases today. To support such queries, the database system is typically required to disambiguate parts of the user-specified query against the database, using whatever resources are intrinsically available to it (the database schema, data values distributions, natural language models…

    Submitted 26 April, 2017; v1 submitted 15 October, 2016; originally announced October 2016.

    Comments: 30 pages, 10 figures, 5 tables

    ACM Class: H.2; H.3