Skip to main content

Showing 1–11 of 11 results for author: Dumancic, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2007.05758  [pdf

    cs.LG stat.ML

    Feature Interactions in XGBoost

    Authors: Kshitij Goyal, Sebastijan Dumancic, Hendrik Blockeel

    Abstract: In this paper, we investigate how feature interactions can be identified to be used as constraints in the gradient boosting tree models using XGBoost's implementation. Our results show that accurate identification of these constraints can help improve the performance of baseline XGBoost model significantly. Further, the improvement in the model structure can also lead to better interpretability.

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: 7 pages, 2 Figures

  2. arXiv:2004.09931  [pdf, other

    cs.AI cs.LG stat.ML

    Knowledge Refactoring for Inductive Program Synthesis

    Authors: Sebastijan Dumancic, Tias Guns, Andrew Cropper

    Abstract: Humans constantly restructure knowledge to use it more efficiently. Our goal is to give a machine learning system similar abilities so that it can learn more efficiently. We introduce the \textit{knowledge refactoring} problem, where the goal is to restructure a learner's knowledge base to reduce its size and to minimise redundancy in it. We focus on inductive logic programming, where the knowledg… ▽ More

    Submitted 24 November, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: 7 pages, 6 figures

  3. arXiv:1903.12577  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Relational Representations with Auto-encoding Logic Programs

    Authors: Sebastijan Dumancic, Tias Guns, Wannes Meert, Hendrik Blockeel

    Abstract: Deep learning methods capable of handling relational data have proliferated over the last years. In contrast to traditional relational learning methods that leverage first-order logic for representing such data, these deep learning methods aim at re-representing symbolic relational data in Euclidean spaces. They offer better scalability, but can only numerically approximate relational structures a… ▽ More

    Submitted 24 March, 2020; v1 submitted 29 March, 2019; originally announced March 2019.

    Comments: 8 pages,4 figures, paper + supplement, published at IJCAI

  4. arXiv:1806.11391  [pdf, other

    cs.AI cs.LG stat.ML

    A Comparative Study of Distributional and Symbolic Paradigms for Relational Learning

    Authors: Sebastijan Dumancic, Alberto Garcia-Duran, Mathias Niepert

    Abstract: Many real-world domains can be expressed as graphs and, more generally, as multi-relational knowledge graphs. Though reasoning and learning with knowledge graphs has traditionally been addressed by symbolic approaches, recent methods in (deep) representation learning has shown promising results for specialized tasks such as knowledge base completion. These approaches abandon the traditional symbol… ▽ More

    Submitted 24 March, 2020; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: corrected version: incorrect evaluation fixed; IJCAI 2019

  5. arXiv:1805.00779  [pdf, other

    stat.ML cs.AI cs.LG

    COBRAS-TS: A new approach to Semi-Supervised Clustering of Time Series

    Authors: Toon Van Craenendonck, Wannes Meert, Sebastijan Dumancic, Hendrik Blockeel

    Abstract: Clustering is ubiquitous in data analysis, including analysis of time series. It is inherently subjective: different users may prefer different clusterings for a particular dataset. Semi-supervised clustering addresses this by allowing the user to provide examples of instances that should (not) be in the same cluster. This paper studies semi-supervised clustering in the context of time series. We… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

  6. arXiv:1803.11060  [pdf, other

    cs.LG cs.AI stat.ML

    COBRAS: Fast, Iterative, Active Clustering with Pairwise Constraints

    Authors: Toon Van Craenendonck, Sebastijan Dumančić, Elia Van Wolputte, Hendrik Blockeel

    Abstract: Constraint-based clustering algorithms exploit background knowledge to construct clusterings that are aligned with the interests of a particular user. This background knowledge is often obtained by allowing the clustering system to pose pairwise queries to the user: should these two elements be in the same cluster or not? Active clustering methods aim to minimize the number of queries needed to ob… ▽ More

    Submitted 29 March, 2018; originally announced March 2018.

  7. arXiv:1801.09955  [pdf, other

    cs.AI cs.LG stat.ML

    COBRA: A Fast and Simple Method for Active Clustering with Pairwise Constraints

    Authors: Toon Van Craenendonck, Sebastijan Dumancic, Hendrik Blockeel

    Abstract: Clustering is inherently ill-posed: there often exist multiple valid clusterings of a single dataset, and without any additional information a clustering system has no way of knowing which clustering it should produce. This motivates the use of constraints in clustering, as they allow users to communicate their interests to the clustering system. Active constraint-based clustering algorithms selec… ▽ More

    Submitted 30 January, 2018; originally announced January 2018.

    Comments: Presented at IJCAI 2017

  8. arXiv:1705.05785  [pdf, other

    cs.AI cs.LG stat.ML

    Demystifying Relational Latent Representations

    Authors: Sebastijan Dumančić, Hendrik Blockeel

    Abstract: Latent features learned by deep learning approaches have proven to be a powerful tool for machine learning. They serve as a data abstraction that makes learning easier by capturing regularities in data explicitly. Their benefits motivated their adaptation to relational learning context. In our previous work, we introduce an approach that learns relational latent features by means of clustering ins… ▽ More

    Submitted 29 September, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

    Comments: 12 pages, 8 figures; accepted to ILP 2017

  9. arXiv:1606.08660  [pdf, ps, other

    stat.ML cs.LG cs.LO

    Theory reconstruction: a representation learning view on predicate invention

    Authors: Sebastijan Dumancic, Wannes Meert, Hendrik Blockeel

    Abstract: With this positional paper we present a representation learning view on predicate invention. The intention of this proposal is to bridge the relational and deep learning communities on the problem of predicate invention. We propose a theory reconstruction approach, a formalism that extends autoencoder approach to representation learning to the relational settings. Our intention is to start a discu… ▽ More

    Submitted 29 June, 2016; v1 submitted 28 June, 2016; originally announced June 2016.

    Comments: 3 pages, StaRAI 2016 submission

  10. Clustering-Based Relational Unsupervised Representation Learning with an Explicit Distributed Representation

    Authors: Sebastijan Dumancic, Hendrik Blockeel

    Abstract: The goal of unsupervised representation learning is to extract a new representation of data, such that solving many different tasks becomes easier. Existing methods typically focus on vectorized data and offer little support for relational data, which additionally describe relationships among instances. In this work we introduce an approach for relational unsupervised representation learning. View… ▽ More

    Submitted 8 March, 2017; v1 submitted 28 June, 2016; originally announced June 2016.

    Comments: 8 pages, 1 figure, 2 tables, StaRAI 2016 submission, final version

  11. arXiv:1604.08934  [pdf, other

    stat.ML cs.AI cs.LG

    An expressive dissimilarity measure for relational clustering using neighbourhood trees

    Authors: Sebastijan Dumancic, Hendrik Blockeel

    Abstract: Clustering is an underspecified task: there are no universal criteria for what makes a good clustering. This is especially true for relational data, where similarity can be based on the features of individuals, the relationships between them, or a mix of both. Existing methods for relational clustering have strong and often implicit biases in this respect. In this paper, we introduce a novel simil… ▽ More

    Submitted 7 March, 2017; v1 submitted 29 April, 2016; originally announced April 2016.

    Comments: 9 pages, 3 figures, 4 tables, submitted to ECMLPKDD 2017