Publication | Closed Access
User-representative feature selection for keystroke dynamics
25
Citations
4
References
2011
Year
Unknown Venue
EngineeringMachine LearningContinuous User AuthenticationBiometricsFeature SelectionCharacters SequencesIntelligent SystemsUser-representative Feature SelectionText MiningInformation RetrievalData ScienceData MiningPattern RecognitionCharacter SequencesRobot LearningInput DeviceFeature EngineeringKnowledge DiscoveryComputer ScienceFeature ConstructionHuman-computer Interaction
Continuous user authentication with keystroke dynamics uses characters sequences as features. Since users can type characters in any order, it is imperative to find character sequences (n-graphs) that are representative of user typing behavior. The contemporary feature selection approaches do not guarantee selecting frequently-typed features which may cause less accurate statistical user-representation. Furthermore, the selected features do not inherently reflect user typing behavior. We propose four statistical-based feature selection techniques that mitigate limitations of existing approaches. The first technique selects the most frequently occurring features. The other three consider different user typing behaviors by selecting: n-graphs that are typed quickly; n-graphs that are typed with consistent time; and n-graphs that have large time variance among users. We use Gunetti's keystroke dataset and k-means clustering algorithm for our experiments. The results show that among the proposed techniques, the most-frequent feature selection technique can effectively find user-representative features. We further substantiate our results by comparing the most-frequent feature selection technique with three existing approaches (popular Italian words, common n-graphs, and least frequent n-graphs). We find that it performs better than the existing approaches after selecting a certain number of most-frequent n-graphs.
| Year | Citations | |
|---|---|---|
Page 1
Page 1