Showing 1–2 of 2 results for author: Stanko, S

Searching in archive cs.
  1. arXiv:2506.03725  [pdf, ps, other]

    cs.LG math.OC

    Sign-SGD is the Golden Gate between Multi-Node to Single-Node Learning: Significant Boost via Parameter-Free Optimization

    Authors: Daniil Medyakov, Sergey Stanko, Gleb Molodtsov, Philip Zmushko, Grigoriy Evseev, Egor Petrov, Aleksandr Beznosikov

    Abstract: Quite recently, large language models have made a significant breakthrough across various disciplines. However, training them is an extremely resource-intensive task, even for major players with vast computing resources. One of the methods gaining popularity in light of these challenges is Sign-SGD. This method can be applied both as a memory-efficient approach in single-node training and as a gra…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 58 pages, 5 figures, 5 tables

  2. arXiv:1605.04655  [pdf, ps, other]

    cs.CL cs.LG cs.NE

    Joint Learning of Sentence Embeddings for Relevance and Entailment

    Authors: Petr Baudis, Silvestr Stanko, Jan Sedivy

    Abstract: We consider the problem of Recognizing Textual Entailment within an Information Retrieval context, where we must simultaneously determine the relevancy as well as degree of entailment for individual pieces of evidence to determine a yes/no answer to a binary natural language question. We compare several variants of neural networks for sentence embeddings in a setting of decision-making based on…

    Submitted 22 June, 2016; v1 submitted 16 May, 2016; originally announced May 2016.

    Comments: repl4nlp workshop at ACL Berlin 2016