Publication | Closed Access
Supervised clustering with support vector machines
211
Citations
16
References
2005
Year
Unknown Venue
EngineeringMachine LearningCorpus LinguisticsText MiningNatural Language ProcessingSupport Vector MachineClassification MethodInformation RetrievalData ScienceData MiningPattern RecognitionComputational LinguisticsDocument ClassificationClustering AlgorithmSupport Vector MachinesLanguage StudiesDesirable ClusteringsDocument ClusteringNews Article ClusteringKnowledge DiscoveryComputer ScienceVector Space ModelTopic ModelLinguisticsSemantic Similarity
Supervised clustering is the problem of training a clustering algorithm to produce desirable clusterings: given sets of items and complete clusterings over these sets, we learn how to cluster future sets of items. Example applications include noun-phrase coreference clustering, and clustering news articles by whether they refer to the same topic. In this paper we present an SVM algorithm that trains a clustering algorithm by adapting the item-pair similarity measure. The algorithm may optimize a variety of different clustering functions to a variety of clustering performance measures. We empirically evaluate the algorithm for noun-phrase and news article clustering.
| Year | Citations | |
|---|---|---|
Page 1
Page 1