Publication | Closed Access
Mining e-mail content for author identification forensics
Year: 2001 | Citations: 484 | References: 30
Abuse Detection, Engineering, Information Forensics, Writer Identification, Corpus Linguistics, Journalism, Text Mining, E-mail Documents, Natural Language Processing, Spam Filtering, Forensic Search, Information Retrieval, Data Science, Data Mining, E-mail Content, Document Classification, Content Analysis, Knowledge Discovery, Author Profiling, E-mail Content Mining, Arts
We describe an investigation into e-mail content mining for author identification, or authorship attribution, for the purpose of forensic investigation. We focus on the ability to discriminate between authors both when e-mail topics are aggregated and across different e-mail topics. An extended set of e-mail document features, including structural characteristics and linguistic patterns, was derived and used together with a Support Vector Machine learning algorithm to mine the e-mail content. Experiments using a number of e-mail documents generated by different authors on a set of topics gave promising results for both aggregated and multi-topic author categorisation.
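To make the feature-extraction idea in the abstract concrete, the following is a minimal sketch of how a few structural and linguistic features might be computed from one e-mail body. It is an illustration under stated assumptions, not the authors' feature set: the function name `extract_features`, the feature names, and the short `FUNCTION_WORDS` list are all hypothetical, and the Support Vector Machine step is omitted.

```python
import re
from statistics import mean

# Hypothetical, illustrative function-word list; the paper's actual
# feature set is not given in the abstract.
FUNCTION_WORDS = ("the", "of", "and", "to", "in", "a", "is", "that")


def extract_features(email_body: str) -> dict:
    """Compute a few illustrative stylometric features for one e-mail body."""
    lines = email_body.splitlines()
    words = re.findall(r"[A-Za-z']+", email_body)
    lowered = [w.lower() for w in words]
    n_words = len(words)

    features = {
        # Structural characteristics (assumed examples)
        "num_lines": len(lines),
        "blank_line_ratio": (
            sum(1 for line in lines if not line.strip()) / len(lines)
            if lines else 0.0
        ),
        "has_greeting": float(
            bool(re.match(r"\s*(hi|hello|dear)\b", email_body, re.IGNORECASE))
        ),
        # Linguistic patterns (assumed examples)
        "avg_word_length": mean(len(w) for w in words) if words else 0.0,
        "vocab_richness": len(set(lowered)) / n_words if n_words else 0.0,
    }
    # Relative frequency of each function word
    for fw in FUNCTION_WORDS:
        features[f"fw_{fw}"] = lowered.count(fw) / n_words if n_words else 0.0
    return features
```

Each e-mail would yield one such feature dictionary. The values, in a fixed key order, could then be passed as a numeric vector to any SVM implementation, with the author as the class label.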