Probabilistic Event Categorization

Wiebe, Janyce; Bruce, Rebecca; Duan, Lei

Computation and Language

arXiv:cmp-lg/9710008 (cmp-lg)

[Submitted on 30 Oct 1997 (v1), last revised 31 Oct 1997 (this version, v2)]

Title:Probabilistic Event Categorization

Authors:Janyce Wiebe (New Mexico State University), Rebecca Bruce (Southern Methodist University), Lei Duan (Sony Corporation)

View PDF

Abstract: This paper describes the automation of a new text categorization task. The categories assigned in this task are more syntactically, semantically, and contextually complex than those typically assigned by fully automatic systems that process unseen test data. Our system for assigning these categories is a probabilistic classifier, developed with a recent method for formulating a probabilistic model from a predefined set of potential features. This paper focuses on feature selection. It presents a number of fully automatic features. It identifies and evaluates various approaches to organizing collocational properties into features, and presents the results of experiments covarying type of organization and type of property. We find that one organization is not best for all kinds of properties, so this is an experimental parameter worth investigating in NLP systems. In addition, the results suggest a way to take advantage of properties that are low frequency but strongly indicative of a class. The problems of recognizing and organizing the various kinds of contextual information required to perform a linguistically complex categorization task have rarely been systematically investigated in NLP.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:cmp-lg/9710008
	(or arXiv:cmp-lg/9710008v2 for this version)
	https://doi.org/10.48550/arXiv.cmp-lg/9710008
Journal reference:	Recent Advances in Natural Language Processing (RANLP-97), European Commission, DG XIII, Tzigov Chark, Bulgaria, September 1997, pp. 163--170.

Submission history

From: Janyce Wiebe [view email]
[v1] Thu, 30 Oct 1997 21:59:07 UTC (13 KB)
[v2] Fri, 31 Oct 1997 01:32:32 UTC (13 KB)

Computation and Language

Title:Probabilistic Event Categorization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computation and Language

Title:Probabilistic Event Categorization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators