Showing 1–8 of 8 results for author: Stankevičius, L

  1. arXiv:2408.08073  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Extracting Sentence Embeddings from Pretrained Transformer Models

    Authors: Lukas Stankevičius, Mantas Lukoševičius

    Abstract: Pre-trained transformer models shine in many natural language processing tasks and therefore are expected to bear the representation of the input sentence or text meaning. These sentence-level embeddings are also important in retrieval-augmented generation. But do commonly used plain averaging or prompt templates sufficiently capture and represent the underlying meaning? After providing a comprehe…

    Submitted 20 February, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: Postprint update

    MSC Class: 68T07; 68T50; 68T05 ACM Class: I.2.6; I.2.7

    Journal ref: Appl. Sci. 2024, 14(19), 8887

  2. arXiv:2407.19914  [pdf]

    cs.CL cs.IR cs.LG

    Sentiment Analysis of Lithuanian Online Reviews Using Large Language Models

    Authors: Brigita Vileikytė, Mantas Lukoševičius, Lukas Stankevičius

    Abstract: Sentiment analysis is a widely researched area within Natural Language Processing (NLP), attracting significant interest due to the advent of automated solutions. Despite this, the task remains challenging because of the inherent complexity of languages and the subjective nature of sentiments. It is even more challenging for less-studied and less-resourced languages such as Lithuanian. Our review…

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted at the 29th International Conference on Information Society and University Studies (IVUS 2024)

    MSC Class: 68T07; 68T50; 68T05 ACM Class: I.2.6; I.2.7

  3. arXiv:2403.04914  [pdf]

    cs.CE stat.OT

    Improving the Equation of Exchange for Cryptoasset Valuation Using Empirical Data

    Authors: Stylianos Kampakis, Melody Yuan, Oritsebawo Paul Ikpobe, Linas Stankevicius

    Abstract: In the evolving domain of cryptocurrency markets, accurate token valuation remains a critical aspect influencing investment decisions and policy development. Whilst the prevailing equation of exchange pricing model offers a quantitative valuation approach based on the interplay between token price, transaction volume, supply, and either velocity or holding time, it exhibits intrinsic shortcomings.…

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2203.09963  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Towards Lithuanian grammatical error correction

    Authors: Lukas Stankevičius, Mantas Lukoševičius

    Abstract: Everyone wants to write beautiful and correct text, yet the lack of language skills, experience, or hasty typing can result in errors. By employing the recent advances in transformer architectures, we construct a grammatical error correction model for Lithuanian, the language rich in archaic features. We compare subword and byte-level approaches and share our best trained model, achieving F…

    Submitted 18 March, 2022; originally announced March 2022.

    MSC Class: 68T07; 68T50; 68T05 ACM Class: I.2.6; I.2.7

  5. arXiv:2201.13242  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Correcting diacritics and typos with a ByT5 transformer model

    Authors: Lukas Stankevičius, Mantas Lukoševičius, Jurgita Kapočiūtė-Dzikienė, Monika Briedienė, Tomas Krilavičius

    Abstract: Due to the fast pace of life and online communications and the prevalence of English and the QWERTY keyboard, people tend to forgo using diacritics and make typographical errors (typos) when typing in other languages. Restoring diacritics and correcting spelling is important for proper language use and the disambiguation of texts for both humans and downstream algorithms. However, both of these probl…

    Submitted 18 March, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    MSC Class: 68T07; 68T50; 68T05 ACM Class: I.2.6; I.2.7

    Journal ref: Appl. Sci. 2022, 12(5), 2636

  6. Generating abstractive summaries of Lithuanian news articles using a transformer model

    Authors: Lukas Stankevičius, Mantas Lukoševičius

    Abstract: In this work, we train the first monolingual Lithuanian transformer model on a relatively large corpus of Lithuanian news articles and compare various output decoding algorithms for abstractive news summarization. We achieve an average ROUGE-2 score of 0.163; the generated summaries are coherent and look impressive at first glance. However, some of them contain misleading information that is not so easy…

    Submitted 22 June, 2021; v1 submitted 23 April, 2021; originally announced May 2021.

    Comments: Accepted in ICIST 2021

    MSC Class: 68T07; 68T50; 68T05 ACM Class: I.2.6; I.2.7

    Journal ref: International Conference on Information and Software Technologies - ICIST 2021, Communications in Computer and Information Science, vol 1486 (2021) 341-352

  7. arXiv:2004.03461  [pdf, other]

    cs.IR cs.CL cs.LG

    Testing pre-trained Transformer models for Lithuanian news clustering

    Authors: Lukas Stankevičius, Mantas Lukoševičius

    Abstract: A recent introduction of Transformer deep learning architecture made breakthroughs in various natural language processing tasks. However, non-English languages could not leverage such new opportunities with the English text pre-trained models. This changed with research focusing on multilingual models, where less-spoken languages are the main beneficiaries. We compare pre-trained multilingual BERT…

    Submitted 3 April, 2020; originally announced April 2020.

    Comments: Submission accepted at https://ivus.ktu.edu/

    MSC Class: 68T05 ACM Class: I.2.6

    Journal ref: Proceedings of the Information Society and University Studies 2020, pp. 46-53, vol. 2698, CEUR, Kaunas, 2020, ISSN: 1613-0073

  8. Patterning of diamond like carbon films for sensor applications using silicon containing thermoplastic resist (SiPol) as a hard mask

    Authors: D. Virganavičius, V. J. Cadarso, R. Kirchner, L. Stankevičius, T. Tamulevičius, S. Tamulevičius, H. Schift

    Abstract: Patterning of diamond-like carbon (DLC) and DLC:metal nanocomposites is of interest for an increasing number of applications. We demonstrate a nanoimprint lithography process based on silicon containing thermoplastic resist combined with plasma etching for straightforward patterning of such films. A variety of different structures with few hundred nanometer feature size and moderate aspect ratios…

    Submitted 3 April, 2019; originally announced April 2019.

    Comments: 24 pages, 9 figures

    Journal ref: Applied Surface Science, Volume 385, 1 November 2016, Pages 145-152