Skip to main content

Showing 1–4 of 4 results for author: Fainman, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05017  [pdf, other

    cs.CL cs.LG

    Controlling Summarization Length Through EOS Token Weighting

    Authors: Zeno Belligoli, Emmanouil Stergiadis, Eran Fainman, Ilya Gusev

    Abstract: Controlling the length of generated text can be crucial in various text-generation tasks, including summarization. Existing methods often require complex model alterations, limiting compatibility with pre-trained models. We address these limitations by developing a simple approach for controlling the length of automatic text summaries by increasing the importance of correctly predicting the EOS to… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2407.00787  [pdf, other

    cs.IR cs.LG

    Enhancing Travel Decision-Making: A Contrastive Learning Approach for Personalized Review Rankings in Accommodations

    Authors: Reda Igebaria, Eran Fainman, Sarai Mizrachi, Moran Beladev, Fengjun Wang

    Abstract: User-generated reviews significantly influence consumer decisions, particularly in the travel domain when selecting accommodations. This paper contribution comprising two main elements. Firstly, we present a novel dataset of authentic guest reviews sourced from a prominent online travel platform, totaling over two million reviews from 50,000 distinct accommodations. Secondly, we propose an innovat… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2310.14817  [pdf, other

    cs.LG

    Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities

    Authors: Fengjun Wang, Moran Beladev, Ofri Kleinfeld, Elina Frayerman, Tal Shachar, Eran Fainman, Karen Lastmann Assaraf, Sarai Mizrachi, Benjamin Wang

    Abstract: Multi-label text classification is a critical task in the industry. It helps to extract structured information from large amount of textual data. We propose Text to Topic (Text2Topic), which achieves high multi-label classification performance by employing a Bi-Encoder Transformer architecture that utilizes concatenation, subtraction, and multiplication of embeddings on both text and topic. Text2T… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

  4. arXiv:1903.05382  [pdf, other

    cs.LG cs.AI stat.ML

    Online Budgeted Learning for Classifier Induction

    Authors: Eran Fainman, Bracha Shapira, Lior Rokach, Yisroel Mirsky

    Abstract: In real-world machine learning applications, there is a cost associated with sampling of different features. Budgeted learning can be used to select which feature-values to acquire from each instance in a dataset, such that the best model is induced under a given constraint. However, this approach is not possible in the domain of online learning since one may not retroactively acquire feature-valu… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.