-
Segmentation of Expository Texts by Hierarchical Agglomerative Clustering
Abstract: We propose a method for segmentation of expository texts based on hierarchical agglomerative clustering. The method uses paragraphs as the basic segments for identifying hierarchical discourse structure in the text, applying lexical similarity between them as the proximity test. Linear segmentation can be induced from the identified structure through application of two simple rules. However the… ▽ More
Submitted 26 September, 1997; originally announced September 1997.
Comments: 7 pages, Latex2e, 4 postscript figures
Journal ref: RANLP'97, Bulgaria