Skip to main content

Showing 1–3 of 3 results for author: Proskura, P

.
  1. arXiv:2505.15443  [pdf, other

    cs.CL stat.ML

    AdUE: Improving uncertainty estimation head for LoRA adapters in LLMs

    Authors: Artem Zabolotnyi, Roman Makarov, Mile Mitrovic, Polina Proskura, Oleg Travkin, Roman Alferov, Alexey Zaytsev

    Abstract: Uncertainty estimation remains a critical challenge in adapting pre-trained language models to classification tasks, particularly under parameter-efficient fine-tuning approaches such as adapters. We introduce AdUE1, an efficient post-hoc uncertainty estimation (UE) method, to enhance softmax-based estimates. Our approach (1) uses a differentiable approximation of the maximum function and (2) appl… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 9 pages, 1 figure

  2. Beyond Simple Averaging: Improving NLP Ensemble Performance with Topological-Data-Analysis-Based Weighting

    Authors: Polina Proskura, Alexey Zaytsev

    Abstract: In machine learning, ensembles are important tools for improving the model performance. In natural language processing specifically, ensembles boost the performance of a method due to multiple large models available in open source. However, existing approaches mostly rely on simple averaging of predictions by ensembles with equal weights for each model, ignoring differences in the quality and conf… ▽ More

    Submitted 28 January, 2025; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2024 IEEE 11th International Conference on Data Science and Advanced Analytics (DSAA), San Diego, CA, USA, 2024, pp. 1-8

  3. arXiv:1905.10805  [pdf, other

    stat.AP cs.LG eess.SP physics.data-an

    Usage of multiple RTL features for Earthquake prediction

    Authors: P. Proskura, A. Zaytsev, I. Braslavsky, E. Egorov, E. Burnaev

    Abstract: We construct a classification model that predicts if an earthquake with the magnitude above a threshold will take place at a given location in a time range 30-180 days from a given moment of time. A common approach is to use expert forecasts based on features like Region-Time-Length (RTL) characteristics. The proposed approach uses machine learning on top of multiple RTL features to take into acco… ▽ More

    Submitted 26 May, 2019; originally announced May 2019.

    Comments: 13 pages, 3 figures, 3 tables

    Journal ref: Proceedings of the International Conference on Computational Science and Applications (ICCSA-2019), 2019