Publication | Closed Access
Detecting visually similar Web pages
98
Citations
69
References
2010
Year
Gestalt TheoryEngineeringSimilarity MeasureInformation SecurityInformation ForensicsVisual SimilarityText MiningGestalt PrinciplesSpam FilteringComputational Social ScienceImage AnalysisInformation RetrievalData ScienceData MiningPattern RecognitionSimilar Web PagesKnowledge DiscoveryComputer ScienceImage SimilaritySocial Engineering (Security)PhishingSimilarity Search
We propose a novel approach for detecting visual similarity between two Web pages. The proposed approach applies Gestalt theory and considers a Web page as a single indivisible entity. The concept of supersignals, as a realization of Gestalt principles, supports our contention that Web pages must be treated as indivisible entities. We objectify, and directly compare, these indivisible supersignals using algorithmic complexity theory. We illustrate our approach by applying it to the problem of detecting phishing scams. Via a large-scale, real-world case study, we demonstrate that 1) our approach effectively detects similar Web pages; and 2) it accuractely distinguishes legitimate and phishing pages.
| Year | Citations | |
|---|---|---|
Page 1
Page 1