Publication | Open Access
Recognizing and Imitating Programmer Style: Adversaries in Program Authorship Attribution
30
Citations
18
References
2018
Year
EngineeringInformation SecuritySoftware EngineeringSource Code AnalysisInformation ForensicsWriter IdentificationCommunicationSoftware AnalysisProgram Authorship AttributionData ScienceAdversarial Machine LearningPotential TacticsOwn IdentityAutomatic ProgrammingAuthor ProfilingProgramming StyleComputer ScienceSoftware DesignData SecurityQualitative AnalysisProgram AnalysisAttack ModelSoftware Testing
Abstract Source code attribution classifiers have recently become powerful. We consider the possibility that an adversary could craft code with the intention of causing a misclassification, i.e., creating a forgery of another author’s programming style in order to hide the forger’s own identity or blame the other author. We find that it is possible for a non-expert adversary to defeat such a system. In order to inform the design of adversarially resistant source code attribution classifiers, we conduct two studies with C/C++ programmers to explore the potential tactics and capabilities both of such adversaries and, conversely, of human analysts doing source code authorship attribution. Through the quantitative and qualitative analysis of these studies, we (1) evaluate a state-of-the-art machine classifier against forgeries, (2) evaluate programmers as human analysts/forgery detectors, and (3) compile a set of modifications made to create forgeries. Based on our analyses, we then suggest features that future source code attribution systems might incorporate in order to be adversarially resistant.
| Year | Citations | |
|---|---|---|
Page 1
Page 1