Skip to main content

Showing 1–17 of 17 results for author: Klemen, M

Searching in archive cs. Search in all archives.
.
  1. Neural spell-checker: Beyond words with synthetic data generation

    Authors: Matej Klemen, Martin Božič, Špela Arhar Holdt, Marko Robnik-Šikonja

    Abstract: Spell-checkers are valuable tools that enhance communication by identifying misspelled words in written texts. Recent improvements in deep learning, and in particular in large language models, have opened new opportunities to improve traditional spell-checkers with new functionalities that not only assess spelling correctness but also the suitability of a word for a given context. In our work, we… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: Camera-ready version. Accepted to TSD 2024

  2. Code-mixed Sentiment and Hate-speech Prediction

    Authors: Anjali Yadav, Tanya Garg, Matej Klemen, Matej Ulcar, Basant Agarwal, Marko Robnik Sikonja

    Abstract: Code-mixed discourse combines multiple languages in a single text. It is commonly used in informal discourse in countries with several official languages, but also in many other countries in combination with English or neighboring languages. As recently large language models have dominated most natural language processing tasks, we investigated their performance in code-mixed settings for relevant… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  3. A Machine Learning-based Approach for Solving Recurrence Relations and its use in Cost Analysis of Logic Programs

    Authors: Louis Rustenholz, Maximiliano Klemen, Miguel Ángel Carreira-Perpiñán, Pedro López-García

    Abstract: Automatic static cost analysis infers information about the resources used by programs without actually running them with concrete data, and presents such information as functions of input data sizes. Most of the analysis tools for logic programs (and many for other languages), as CiaoPP, are based on setting up recurrence relations representing (bounds on) the computational cost of predicates, an… ▽ More

    Submitted 29 August, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP). Extended, revised version of our work published in ICLP (Klemen et al. 2023, arXiv:2309.07259). arXiv admin note: text overlap with arXiv:2309.07259

    Journal ref: Theory and Practice of Logic Programming 24 (2024) 1163-1207

  4. Solving Recurrence Relations using Machine Learning, with Application to Cost Analysis

    Authors: Maximiliano Klemen, Miguel Á. Carreira-Perpiñán, Pedro Lopez-Garcia

    Abstract: Automatic static cost analysis infers information about the resources used by programs without actually running them with concrete data, and presents such information as functions of input data sizes. Most of the analysis tools for logic programs (and other languages) are based on setting up recurrence relations representing (bounds on) the computational cost of predicates, and solving them to fi… ▽ More

    Submitted 30 August, 2023; originally announced September 2023.

    Comments: In Proceedings ICLP 2023, arXiv:2308.14898

    ACM Class: F.3.2; F.3.1; F.4.1; I.2.6; D.1.6

    Journal ref: EPTCS 385, 2023, pp. 155-168

  5. arXiv:2208.10228  [pdf, other

    cs.CL cs.LG q-bio.BM

    Review of Natural Language Processing in Pharmacology

    Authors: Dimitar Trajanov, Vangel Trajkovski, Makedonka Dimitrieva, Jovana Dobreva, Milos Jovanovik, Matej Klemen, Aleš Žagar, Marko Robnik-Šikonja

    Abstract: Natural language processing (NLP) is an area of artificial intelligence that applies information technologies to process the human language, understand it to a certain degree, and use it in various applications. This area has rapidly developed in the last few years and now employs modern variants of deep neural networks to extract relevant patterns from large text corpora. The main objective of th… ▽ More

    Submitted 26 January, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: 42 pages, 2 figures, 7 tables

    ACM Class: J.3; A.1

  6. arXiv:2111.07119  [pdf, other

    cs.CL

    Extracting and filtering paraphrases by bridging natural language inference and paraphrasing

    Authors: Matej Klemen, Marko Robnik-Šikonja

    Abstract: Paraphrasing is a useful natural language processing task that can contribute to more diverse generated or translated texts. Natural language inference (NLI) and paraphrasing share some similarities and can benefit from a joint approach. We propose a novel methodology for the extraction of paraphrasing datasets from NLI datasets and cleaning existing paraphrasing datasets. Our approach is based on… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  7. Regular Path Clauses and Their Application in Solving Loops

    Authors: Bishoksan Kafle, John P. Gallagher, Manuel V. Hermenegildo, Maximiliano Klemen, Pedro López-García, José F. Morales

    Abstract: A well-established approach to reasoning about loops during program analysis is to capture the effect of a loop by extracting recurrences from the loop; these express relationships between the values of variables, or program properties such as cost, on successive loop iterations. Recurrence solvers are capable of computing closed forms for some recurrences, thus deriving precise relationships capt… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: In Proceedings HCVS 2021, arXiv:2109.03988

    ACM Class: B.5.2

    Journal ref: EPTCS 344, 2021, pp. 22-35

  8. Enhancing deep neural networks with morphological information

    Authors: Matej Klemen, Luka Krsnik, Marko Robnik-Šikonja

    Abstract: Deep learning approaches are superior in NLP due to their ability to extract informative features and patterns from languages. The two most successful neural architectures are LSTM and transformers, used in large pretrained language models such as BERT. While cross-lingual approaches are on the rise, most current NLP techniques are designed and applied to English, and less-resourced languages are… ▽ More

    Submitted 1 March, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: Updated version, accepted to Natural Language Engineering

  9. arXiv:2008.02931  [pdf, other

    cs.PL cs.LO cs.SC

    From Big-Step to Small-Step Semantics and Back with Interpreter Specialisation

    Authors: John P. Gallagher, Manuel Hermenegildo, Bishoksan Kafle, Maximiliano Klemen, Pedro López García, José Morales

    Abstract: We investigate representations of imperative programs as constrained Horn clauses. Starting from operational semantics transition rules, we proceed by writing interpreters as constrained Horn clause programs directly encoding the rules. We then specialise an interpreter with respect to a given source program to achieve a compilation of the source language to Horn clauses (an instance of the first… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: In Proceedings VPT/HCVS 2020, arXiv:2008.02483

    Journal ref: EPTCS 320, 2020, pp. 50-64

  10. arXiv:1907.13272  [pdf, ps, other

    cs.PL

    Towards a General Framework for Static Cost Analysis of Parallel Logic Programs

    Authors: Maximiliano Klemen, Pedro Lopez-Garcia, John P. Gallagher, Jose F. Morales, Manuel V. Hermenegildo

    Abstract: The estimation and control of resource usage is now an important challenge in an increasing number of computing systems. In particular, requirements on timing and energy arise in a wide variety of applications such as internet of things, cloud computing, health, transportation, and robots. At the same time, parallel computing, with (heterogeneous) multi-core platforms in particular, has become the… ▽ More

    Submitted 30 July, 2019; originally announced July 2019.

    Comments: 19 pages, 3 tables; submitted to ICLP'19, accepted as technical communication

    Report number: CLIP-1/2019.0

  11. arXiv:1906.09468  [pdf, other

    cs.SI cs.LG

    Predicting kills in Game of Thrones using network properties

    Authors: Jaka Stavanja, Matej Klemen, Lovro Šubelj

    Abstract: TV series such as HBO's Game of Thrones have seen a high number of dedicated followers, mostly due to the dramatic murders of the most important characters. In our work, we try to predict killer and victim pairs using data about previous kills and additional metadata. We construct a network where two character nodes are linked if one killed the other and use a link prediction framework to evaluate… ▽ More

    Submitted 28 August, 2020; v1 submitted 22 June, 2019; originally announced June 2019.

    Comments: 8 pages, 4 figures. This article is a replacement for the previous version arXiv:1906.09468v1, which tweaks methods, recalculates results and tries additional approaches. It also provides some additional information about the field of link prediction

    Journal ref: Uporabna Informatika Vol 28 No 2 (2020), 55-65

  12. arXiv:1804.02380  [pdf, other

    cs.PL

    An Approach to Static Performance Guarantees for Programs with Run-time Checks

    Authors: Maximiliano Klemen, Nataliia Stulova, Pedro Lopez-Garcia, José F. Morales, Manuel V. Hermenegildo

    Abstract: Instrumenting programs for performing run-time checking of properties, such as regular shapes, is a common and useful technique that helps programmers detect incorrect program behaviors. This is specially true in dynamic languages such as Prolog. However, such run-time checks inevitably introduce run-time overhead (in execution time, memory, energy, etc.). Several approaches have been proposed for… ▽ More

    Submitted 6 April, 2018; originally announced April 2018.

    Comments: 15 pages, 3 tables; submitted to ICLP'18, accepted as technical communication

    Report number: CLIP-1/2018.0

  13. arXiv:1803.04451  [pdf, other

    cs.PL cs.LO

    Interval-based Resource Usage Verification by Translation into Horn Clauses and an Application to Energy Consumption

    Authors: Pedro Lopez-Garcia, Luthfi Darmawan, Maximiliano Klemen, Umer Liqat, Francisco Bueno, Manuel V. Hermenegildo

    Abstract: Many applications require conformance with specifications that constrain the use of resources, such as execution time, energy, bandwidth, etc. We have presented a configurable framework for static resource usage verification where specifications can include lower and upper bound, data size-dependent resource usage functions. To statically check such specifications, our framework infers the same ty… ▽ More

    Submitted 12 March, 2018; originally announced March 2018.

    Comments: Under consideration for publication in Theory and Practice of Logic Programming (TPLP)

  14. arXiv:1608.02780  [pdf, ps, other

    cs.PL cs.DC

    A General Framework for Static Profiling of Parametric Resource Usage

    Authors: Pedro Lopez-Garcia, Maximiliano Klemen, Umer Liqat, Manuel V. Hermenegildo

    Abstract: Traditional static resource analyses estimate the total resource usage of a program, without executing it. In this paper we present a novel resource analysis whose aim is instead the static profiling of accumulated cost, i.e., to discover, for selected parts of the program, an estimate or bound of the resource usage accumulated in each of those parts. Traditional resource analyses are parametric i… ▽ More

    Submitted 17 October, 2016; v1 submitted 9 August, 2016; originally announced August 2016.

    Comments: Paper presented at the 32nd International Conference on Logic Programming (ICLP 2016), New York City, USA, 16-21 October 2016, 22 pages, LaTeX

  15. ENTRA: Whole-Systems Energy Transparency

    Authors: Kerstin Eder, John P. Gallagher, Pedro Lopez-Garcia, Henk Muller, Zorana Bankovic, Kyriakos Georgiou, Remy Haemmerle, Manuel V. Hermenegildo, Bishoksan Kafle, Steve Kerrison, Maja Kirkeby, Maximiliano Klemen, Xueliang Li, Umer Liqat, Jeremy Morse, Morten Rhiger, Mads Rosendahl

    Abstract: Promoting energy efficiency to a first class system design goal is an important research challenge. Although more energy-efficient hardware can be designed, it is software that controls the hardware; for a given system the potential for energy savings is likely to be much greater at the higher levels of abstraction in the system stack. Thus the greatest savings are expected from energy-aware softw… ▽ More

    Submitted 18 June, 2016; v1 submitted 13 June, 2016; originally announced June 2016.

    Comments: Revised preprint submitted to MICPRO on 27 May 2016, 23 pages, 3 figures

  16. arXiv:1512.09369  [pdf, other

    cs.PL cs.DC cs.LO

    Towards Energy Consumption Verification via Static Analysis

    Authors: Pedro Lopez-Garcia, Remy Haemmerle, Maximiliano Klemen, Umer Liqat, Manuel V. Hermenegildo

    Abstract: In this paper we leverage an existing general framework for resource usage verification and specialize it for verifying energy consumption specifications of embedded programs. Such specifications can include both lower and upper bounds on energy usage, and they can express intervals within which energy usage is to be certified to be within such bounds. The bounds of the intervals can be given in g… ▽ More

    Submitted 31 December, 2015; originally announced December 2015.

    Comments: Presented at HIP3ES, 2015 (arXiv: 1501.03064)

    Report number: HIP3ES/2015/04 ACM Class: F.3.2; D.3.4; D.2.8

  17. arXiv:1501.03064   

    cs.DC

    Proceedings of the Workshop on High Performance Energy Efficient Embedded Systems (HIP3ES) 2015

    Authors: Francisco Corbera, Andrés Rodríguez, Rafael Asenjo, Angeles Navarro, Antonio Vilches, Maria Garzaran, Ismat Chaib Draa, Jamel Tayeb, Smail Niar, Mikael Desertot, Daniel Gregorek, Robert Schmidt, Alberto Garcia-Ortiz, Pedro Lopez-Garcia, Rémy Haemmerlé, Maximiliano Klemen, Umer Liqat, Manuel V. Hermenegildo, Radim Vavřík, Albert Saà-Garriga, David Castells-Rufas, Jordi Carrabina

    Abstract: Proceedings of the Workshop on High Performance Energy Efficient Embedded Systems (HIP3ES) 2015. Amsterdam, January 21st. Collocated with HIPEAC 2015 Conference.

    Submitted 13 January, 2015; originally announced January 2015.