Skip to main content

Showing 1–1 of 1 results for author: Dunaiski, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.19119  [pdf, other

    cs.IR cs.LG

    Introducing Three New Benchmark Datasets for Hierarchical Text Classification

    Authors: Jaco du Toit, Herman Redelinghuys, Marcel Dunaiski

    Abstract: Hierarchical Text Classification (HTC) is a natural language processing task with the objective to classify text documents into a set of classes from a structured class hierarchy. Many HTC approaches have been proposed which attempt to leverage the class hierarchy information in various ways to improve classification performance. Machine learning-based classification approaches require large amoun… ▽ More

    Submitted 28 November, 2024; originally announced November 2024.

    Comments: 16 pages, 11 figures

    ACM Class: I.2.7