Concepedia

Publication | Closed Access

Partially Supervised Classification of Text Documents

516

Citations

12

References

2002

Year

Abstract

We investigate the following problem: Given a set of documents of a particular topic or classÈ, and a large setÅof mixed documents that contains documents from classÈand other types of documents, identify the documents from classÈinÅ. The key feature of this problem is that there is no labeled non-Èdocument, which makes traditional machine learning techniques inapplicable, as they all need labeled documents of both classes. We call this problem partially supervised classification. In this paper, we show that this problem can be posed as a constrained optimization problem and that under appropriate conditions, solutions to the constrained optimization problem will give good solutions to the partially supervised classification problem. We present a novel technique to solve the problem and demonstrate the effectiveness of the technique through extensive experimentation. 1.

References

YearCitations

Page 1