Publication | Closed Access
Detecting Phishing Websites and Targets Based on URLs and Webpage Links
32
Citations
19
References
2018
Year
Unknown Venue
Abuse DetectionEngineeringMachine LearningFeature MatrixSearch OperatorText MiningNatural Language ProcessingSpam FilteringTargeted AttackInformation RetrievalData ScienceData MiningPattern RecognitionAutomatic ClassificationThreat DetectionKnowledge DiscoveryComputer ScienceDeep LearningWebpage LinksSocial Engineering (Security)Textual ContentPhishing
In this paper, we propose to extract features from URLs and webpage links to detect phishing websites and their targets. In addition to the basic features of a given URL, such as length, suspicious characters, number of dots, a feature matrix is also constructed from these basic features of the links in the given URL's webpage. Furthermore, certain statistical features are extracted from each column of the feature matrix, such as mean, median, and variance. Lexical features are also extracted from the given URL, the links and content in its webpage, such as title and textual content. A number of machine learning models have been investigated for phishing detection, among which Deep Forest model shows competitive performance, achieving a true positive rate of 98.3% and a false alarm rate of 2.6%. In particular, we design an effective strategy based on search operator via search engines to find the phishing targets, which achieves an accuracy of 93.98%.
| Year | Citations | |
|---|---|---|
Page 1
Page 1