Publication | Closed Access
Classifying phishing URLs using recurrent neural networks
211
Citations
26
References
2017
Year
Unknown Venue
Natural Language ProcessingAbuse DetectionSpam FilteringEngineeringInformation RetrievalMachine LearningData ScienceData MiningThreat DetectionMachine Learning ModelRandom Forest ClassifierKnowledge DiscoveryRecurrent Neural NetworksMachine Learning ModelsComputer SciencePhishingRecurrent Neural NetworkText Mining
As the technical skills and costs associated with the deployment of phishing attacks decrease, we are witnessing an unprecedented level of scams that push the need for better methods to proactively detect phishing threats. In this work, we explored the use of URLs as input for machine learning models applied for phishing site prediction. In this way, we compared a feature-engineering approach followed by a random forest classifier against a novel method based on recurrent neural networks. We determined that the recurrent neural network approach provides an accuracy rate of 98.7% even without the need of manual feature creation, beating by 5% the random forest method. This means it is a scalable and fast-acting proactive detection system that does not require full content analysis.
| Year | Citations | |
|---|---|---|
Page 1
Page 1