Publication | Closed Access
PageSim: A Novel Link-Based Similarity Measure for the World Wide Web
45
Citations
17
References
2006
Year
Unknown Venue
Ranking AlgorithmEngineeringLearning To RankSemantic WebLink PredictionText MiningComputational Social ScienceInformation RetrievalData ScienceData MiningLink AnalysisPagesim ScoresSocial Network AnalysisKnowledge DiscoveryWebometricsComputer ScienceSearch Engine DesignPagerank Score PropagationWeb MiningNetwork SciencePagesim ModelBusinessSimilarity SearchSemantic Similarity
The requirement for measuring the similarity between Web pages arises in many applications on the Web, such as Web searching engine and Web document classification. According to the unique characteristics of the Web, which are huge, rapidly growing, high dynamic, and untrustworthy, we propose a novel link-based similarity measure called PageSim. Based on the strategy of PageRank score propagation, PageSim is efficient, scalable, stable, and "fairly" robust, and therefore is applicable to the Web. We present intuitions behind the PageSim model, and outline the model with mathematical definitions. We also suggest the pruning technique for efficient computation of PageSim scores, and conduct experiments to illustrate the effectiveness and specialities of PageSim
| Year | Citations | |
|---|---|---|
Page 1
Page 1