Publication | Open Access
Multi-Paragraph Segmentation of Expository Text
Citations: 392
References: 1
Year: 1994
Natural Language Processing, Applied Linguistics, Engineering, Discourse Structure, Multi-paragraph Segmentation, Corpus Linguistics, Major Subtopic Boundaries, Computational Linguistics, Text Segmentation, Discourse Analysis, Subtopic Structure, Language Studies, Text Processing, Content Analysis, Linguistics, Text Mining, Automatic Summarization, Expository Texts
This paper describes TextTiling, an algorithm for partitioning expository texts into coherent multi-paragraph discourse units that reflect the subtopic structure of the texts. The algorithm uses domain-independent lexical frequency and distribution information to recognize the interactions of multiple simultaneous themes. Two fully implemented versions of the algorithm are described and shown to produce segmentations that correspond well to human judgments of the major subtopic boundaries of thirteen lengthy texts.
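The abstract does not spell out how lexical frequency and distribution are turned into boundaries, so the following is only a minimal sketch of the general idea, not the paper's implementation. It assumes fixed-size token blocks, cosine similarity between the word counts of adjacent blocks, and a plain similarity threshold; the function names (`tokenize`, `gap_similarities`, `candidate_boundaries`) and the parameters `block_size` and `threshold` are hypothetical choices made for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic runs only (an assumed, simplistic tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two word-count vectors stored as Counters."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def gap_similarities(tokens, block_size=20):
    """Compare the block of tokens before each gap with the block after it.

    Returns a list of (token_position, similarity) pairs, one per gap.
    block_size is an illustrative value, not one taken from the paper.
    """
    gaps = []
    for i in range(block_size, len(tokens) - block_size + 1, block_size):
        left = Counter(tokens[i - block_size:i])
        right = Counter(tokens[i:i + block_size])
        gaps.append((i, cosine(left, right)))
    return gaps


def candidate_boundaries(gaps, threshold=0.1):
    """Flag gaps whose lexical similarity falls below an assumed threshold."""
    return [pos for pos, sim in gaps if sim < threshold]
```

A low similarity between neighbouring blocks suggests that the vocabulary has shifted, which is the kind of signal a subtopic boundary would produce. A fuller treatment would likely remove stop words and snap boundaries to paragraph breaks; both are omitted here for brevity.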