Showing 1–14 of 14 results for author: Todorovski, L

  1. arXiv:2408.11515  [pdf, other]

    cs.AI

    Quantifying Behavioural Distance Between Mathematical Expressions

    Authors: Sebastian Mežnar, Sašo Džeroski, Ljupčo Todorovski

    Abstract: Existing symbolic regression methods organize the space of candidate mathematical expressions primarily based on their syntactic, structural similarity. However, this approach overlooks crucial equivalences between expressions that arise from mathematical symmetries, such as commutativity, associativity, and distribution laws for arithmetic operations. Consequently, expressions with similar errors…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 15 pages, 10 figures, 1 table, 2 appendices

    MSC Class: 68T01

    ACM Class: I.1.1; I.2.0

  2. arXiv:2310.16005  [pdf, other]

    cs.LG

    MLFMF: Data Sets for Machine Learning for Mathematical Formalization

    Authors: Andrej Bauer, Matej Petković, Ljupčo Todorovski

    Abstract: We introduce MLFMF, a collection of data sets for benchmarking recommendation systems used to support formalization of mathematics with proof assistants. These systems help humans identify which previous entries (theorems, constructions, datatypes, and postulates) are relevant in proving a new theorem or carrying out a new construction. Each data set is derived from a library of formalized mathema…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  3. Efficient Generator of Mathematical Expressions for Symbolic Regression

    Authors: Sebastian Mežnar, Sašo Džeroski, Ljupčo Todorovski

    Abstract: We propose an approach to symbolic regression based on a novel variational autoencoder for generating hierarchical structures, HVAE. It combines simple atomic units with shared weights to recursively encode and decode the individual nodes in the hierarchy. Encoding is performed bottom-up and decoding top-down. We empirically show that HVAE can be trained efficiently with small corpora of mathemati…

    Submitted 10 September, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 35 pages, 11 tables, 7 multi-part figures, Machine learning (Springer) and journal track of ECML/PKDD 2023

    ACM Class: I.2.0; I.2.6

    Journal ref: Mach Learn (2023)

  4. arXiv:2212.00751  [pdf, other]

    cs.FL cs.LG

    P(Expression|Grammar): Probability of deriving an algebraic expression with a probabilistic context-free grammar

    Authors: Urh Primožič, Ljupčo Todorovski, Matej Petković

    Abstract: Probabilistic context-free grammars have a long-term record of use as generative models in machine learning and symbolic regression. When used for symbolic regression, they generate algebraic expressions. We define the latter as equivalence classes of strings derived by grammar and address the problem of calculating the probability of deriving a given expression with a given grammar. We show that…

    Submitted 2 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  5. Boosting the Performance of Quantum Annealers using Machine Learning

    Authors: Jure Brence, Dragan Mihailović, Viktor Kabanov, Ljupčo Todorovski, Sašo Džeroski, Jaka Vodeb

    Abstract: Noisy intermediate-scale quantum (NISQ) devices are spearheading the second quantum revolution. Of these, quantum annealers are the only ones currently offering real world, commercial applications on as many as 5000 qubits. The size of problems that can be solved by quantum annealers is limited mainly by errors caused by environmental noise and intrinsic imperfections of the processor. We address…

    Submitted 7 March, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

  6. arXiv:2109.12331  [pdf]

    cs.SI cs.LG

    Predicting Hidden Links and Missing Nodes in Scale-Free Networks with Artificial Neural Networks

    Authors: Rakib Hassan Pran, Ljupco Todorovski

    Abstract: There are many networks in real life which exist in the form of scale-free networks, such as the World Wide Web, protein-protein interaction networks, semantic networks, airline networks, interbank payment networks, etc. If we want to analyze these networks, it is really necessary to understand the properties of scale-free networks. By using the properties of scale-free networks, we can identify any type o…

    Submitted 25 September, 2021; originally announced September 2021.

  7. arXiv:2106.15411  [pdf, other]

    cs.LG cs.AI

    Explaining the Performance of Multi-label Classification Methods with Data Set Properties

    Authors: Jasmin Bogatinovski, Ljupčo Todorovski, Sašo Džeroski, Dragi Kocev

    Abstract: Meta learning generalizes the empirical experience with different learning tasks and holds promise for providing important empirical insight into the behaviour of machine learning algorithms. In this paper, we present a comprehensive meta-learning study of data sets and methods for multi-label classification (MLC). MLC is a practically relevant machine learning task where each example is labelled…

    Submitted 28 June, 2021; originally announced June 2021.

  8. arXiv:2102.07113  [pdf, other]

    cs.LG cs.AI cs.CC

    Comprehensive Comparative Study of Multi-Label Classification Methods

    Authors: Jasmin Bogatinovski, Ljupčo Todorovski, Sašo Džeroski, Dragi Kocev

    Abstract: Multi-label classification (MLC) has recently received increasing interest from the machine learning community. Several studies provide reviews of methods and datasets for MLC and a few provide empirical comparisons of MLC methods. However, they are limited in the number of methods and datasets considered. This work provides a comprehensive empirical study of a wide range of MLC methods on a pleth…

    Submitted 16 February, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

  9. Probabilistic Grammars for Equation Discovery

    Authors: Jure Brence, Ljupčo Todorovski, Sašo Džeroski

    Abstract: Equation discovery, also known as symbolic regression, is a type of automated modeling that discovers scientific laws, expressed in the form of equations, from observed data and expert knowledge. Deterministic grammars, such as context-free grammars, have been used to limit the search spaces in equation discovery by providing hard constraints that specify which equations to consider and which not.…

    Submitted 22 March, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

    Comments: Submitted to Knowledge-Based Systems, Elsevier. 28 pages + 13 pages appendix. 7 figures

    ACM Class: I.2.4; I.2.6; I.1.1; I.1.3; G.3

  10. arXiv:1907.00821  [pdf, other]

    cs.LG eess.SY math.DS stat.ML

    Equation Discovery for Nonlinear System Identification

    Authors: Nikola Simidjievski, Ljupčo Todorovski, Juš Kocijan, Sašo Džeroski

    Abstract: Equation discovery methods enable modelers to combine domain-specific knowledge and system identification to construct models most suitable for a selected modeling task. The method described and evaluated in this paper can be used as a nonlinear system identification method for gray-box modeling. It consists of two interlaced parts of modeling that are computer-aided. The first performs computer-a…

    Submitted 1 July, 2019; originally announced July 2019.

  11. arXiv:1906.09088  [pdf, other]

    cs.LG math.DS math.OC stat.ML

    Meta-Model Framework for Surrogate-Based Parameter Estimation in Dynamical Systems

    Authors: Žiga Lukšič, Jovan Tanevski, Sašo Džeroski, Ljupčo Todorovski

    Abstract: The central task in modeling complex dynamical systems is parameter estimation. This task involves numerous evaluations of a computationally expensive objective function. Surrogate-based optimization introduces a computationally efficient predictive model that approximates the value of the objective function. The standard approach involves learning a surrogate from training examples that correspon…

    Submitted 18 December, 2019; v1 submitted 21 June, 2019; originally announced June 2019.

  12. arXiv:1902.03896  [pdf, other]

    math.DS cs.LG cs.SI physics.soc-ph stat.ML

    Reconstructing dynamical networks via feature ranking

    Authors: Marc G. Leguia, Zoran Levnajic, Ljupco Todorovski, Bernard Zenko

    Abstract: Empirical data on real complex systems are becoming increasingly available. Parallel to this is the need for new methods of reconstructing (inferring) the topology of networks from time-resolved observations of their node-dynamics. The methods based on physical insights often rely on strong assumptions about the properties and dynamics of the scrutinized network. Here, we use the insights from mac…

    Submitted 26 August, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Journal ref: Chaos 29, 093107 (2019)

  13. arXiv:1712.03100  [pdf, other]

    physics.soc-ph cs.SI nlin.CD

    Decoupling approximation robustly reconstructs directed dynamical networks

    Authors: Nikola Simidjievski, Jovan Tanevski, Bernard Zenko, Zoran Levnajic, Ljupco Todorovski, Saso Dzeroski

    Abstract: Methods for reconstructing the topology of complex networks from time-resolved observations of node dynamics are gaining relevance across scientific disciplines. Of biggest practical interest are methods that make no assumptions about properties of the dynamics, and can cope with noisy, short and incomplete trajectories. Ideal reconstruction in such a scenario requires an exhaustive approach of sim…

    Submitted 7 November, 2018; v1 submitted 8 December, 2017; originally announced December 2017.

    Journal ref: New J. Phys. 20 (11), 113003 (2018)

  14. arXiv:1707.01321  [pdf, ps, other]

    cs.CL

    The Influence of Feature Representation of Text on the Performance of Document Classification

    Authors: Sanda Martinčić-Ipšić, Tanja Miličić, Ljupčo Todorovski

    Abstract: In this paper we perform a comparative analysis of three models for feature representation of text documents in the context of document classification. In particular, we consider the most often used family of models bag-of-words, recently proposed continuous space models word2vec and doc2vec, and the model based on the representation of text documents as language networks. While the bag-of-word mo…

    Submitted 5 July, 2017; originally announced July 2017.