Skip to main content

Showing 1–1 of 1 results for author: Renz, I

Searching in archive cs. Search in all archives.
.
  1. Domain and Language Independent Feature Extraction for Statistical Text Categorization

    Authors: Thomas Bayer, Ingrid Renz, Michael Stein, Ulrich Kressel

    Abstract: A generic system for text categorization is presented which uses a representative text corpus to adapt the processing steps: feature extraction, dimension reduction, and classification. Feature extraction automatically learns features from the corpus by reducing actual word forms using statistical information of the corpus and general linguistic knowledge. The dimension of feature vector is then… ▽ More

    Submitted 2 July, 1996; originally announced July 1996.

    Comments: 12 pages, TeX file, 9 Postscript figures, uses epsf.sty

    Journal ref: proceedings of workshop on language engineering for document analysis and recognition - ed. by L. Evett and T. Rose, part of the AISB 1996 Workshop Series, April 96, Sussex University, England, 21-32 (ISBN 0 905 488628)