Publication | Closed Access
Partially Supervised Classification of Text Documents
516
Citations
12
References
2002
Year
Unknown Venue
We investigate the following problem: Given a set of documents of a particular topic or classÈ, and a large setÅof mixed documents that contains documents from classÈand other types of documents, identify the documents from classÈinÅ. The key feature of this problem is that there is no labeled non-Èdocument, which makes traditional machine learning techniques inapplicable, as they all need labeled documents of both classes. We call this problem partially supervised classification. In this paper, we show that this problem can be posed as a constrained optimization problem and that under appropriate conditions, solutions to the constrained optimization problem will give good solutions to the partially supervised classification problem. We present a novel technique to solve the problem and demonstrate the effectiveness of the technique through extensive experimentation. 1.
| Year | Citations | |
|---|---|---|
Page 1
Page 1