Publication | Closed Access
High-performance content-based phishing attack detection
44
Citations
22
References
2011
Year
Unknown Venue
Source Code ChangesSpam FilteringSource CodeInternet SecurityEngineeringInformation RetrievalData ScienceData MiningInformation SecurityThreat DetectionTargeted AttackInformation ForensicsComputer ScienceDetection RateSocial Engineering (Security)PhishingData SecurityCryptography
Phishers continue to alter the source code of the web pages used in their attacks to mimic changes to legitimate websites of spoofed organizations and to avoid detection by phishing countermeasures. Manipulations can be as subtle as source code changes or as apparent as adding or removing significant content. To appropriately respond to these changes to phishing campaigns, a cadre of file matching algorithms is implemented to detect phishing websites based on their content, employing a custom data set consisting of 17,992 phishing attacks targeting 159 different brands. The results of the experiments using a variety of different content-based approaches demonstrate that some can achieve a detection rate of greater than 90% while maintaining a low false positive rate.
| Year | Citations | |
|---|---|---|
Page 1
Page 1