Publication | Closed Access
Exploring linguistic features for web spam detection
Citations: 61
References: 12
Year: 2008
Unknown Venue
Spam-detection Task, Abuse Detection, Communication, Corpus Linguistics, Text Mining, Natural Language Processing, Applied Linguistics, Spam Filtering, Information Retrieval, Computational Linguistics, Language Engineering, Document Classification, Certain Linguistic Features, Language Studies, Content Analysis, Linguistic Features, Knowledge Discovery, Language Technology, Web Spam Detection, Language Corpus, Arts, Phishing, Linguistics
We study the usability of linguistic features in the Web spam classification task. The features were computed on two Web spam corpora, Webspam-Uk2006 and Webspam-Uk2007, and we make them publicly available to other researchers. Preliminary analysis suggests that certain linguistic features may be useful for the spam-detection task when combined with features studied elsewhere.