Skip to main content

Showing 1–2 of 2 results for author: Alshanik, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2201.06592  [pdf, other

    cs.IR

    Proactive Query Expansion for Streaming Data Using External Source

    Authors: Farah Alshanik, Amy Apon, Yuheng Du, Alexander Herzog, Ilya Safro

    Abstract: Query expansion is the process of reformulating the original query by adding relevant words. Choosing which terms to add in order to improve the performance of the query expansion methods or to enhance the quality of the retrieved results is an important aspect of any information retrieval system. Adding words that can positively impact the quality of the search query or are informative enough pla… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

  2. arXiv:2012.02294  [pdf, other

    cs.IR cs.LG

    Accelerating Text Mining Using Domain-Specific Stop Word Lists

    Authors: Farah Alshanik, Amy Apon, Alexander Herzog, Ilya Safro, Justin Sybrandt

    Abstract: Text preprocessing is an essential step in text mining. Removing words that can negatively impact the quality of prediction algorithms or are not informative enough is a crucial storage-saving technique in text indexing and results in improved computational efficiency. Typically, a generic stop word list is applied to a dataset regardless of the domain. However, many common words are different fro… ▽ More

    Submitted 18 November, 2020; originally announced December 2020.