Concepedia

Publication | Closed Access

Tagging sentence boundaries

57

Citations

8

References

2000

Year

Andrei Mikheev

Unknown Venue

Abstract

In this paper we tackle sentence boundary disam- biguation through a part-of-speech (POS) tagging framework. We describe necessary changes in text tokenization and the implementation of a POS tagger and provide results of an evaluation of this system on two corpora. We also describe an extension of the traditional POS tagging by combining it with the document-centered approach to proper name identification and abbreviation handling. This made the resulting system robust to domain and topic shifts.

References

YearCitations

Page 1