Showing 1–1 of 1 results for author: Sýkora, L

Searching in archive cs.
  1. arXiv:2409.07132

    cs.LG cs.CL

    LLM-based feature generation from text for interpretable machine learning

    Authors: Vojtěch Balek, Lukáš Sýkora, Vilém Sklenák, Tomáš Kliegr

    Abstract: Existing text representations such as embeddings and bag-of-words are not suitable for rule learning due to their high dimensionality and absent or questionable feature-level interpretability. This article explores whether large language models (LLMs) could address this by extracting a small number of interpretable features from text. We demonstrate this process on two datasets (CORD-19 and M17+)…

    Submitted 11 September, 2024; originally announced September 2024.