Publication | Closed Access
Rapid and parallel content screening for detecting transformed data exposure
14
Citations
15
References
2015
Year
Unknown Venue
EngineeringInformation SecurityData-centric SecurityData PreparationInformation ForensicsParallel ContentUnpredictable Leak PatternsSoftware AnalysisHardware SecurityString-searching AlgorithmData ScienceData MiningString ProcessingManagementData IntegrationData ManagementStatisticsData ModelingKnowledge DiscoveryData PrivacyComputer ScienceOrganizational SecurityData ManipulationData-intensive ComputingData SecurityProgram AnalysisData TreatmentData Transformation (Computing)Massive Data ProcessingBig Data
The leak of sensitive data on computer systems poses a serious threat to organizational security. Organizations need to identify the exposure of sensitive data by screening the content in storage and transmission, i.e., to detect sensitive information being stored or transmitted in the clear. However, detecting the exposure of sensitive information is challenging due to data transformation in the content. Transformations (such as insertion, deletion) result in highly unpredictable leak patterns. Existing automata-based string matching algorithms are impractical for detecting transformed data leaks because of its formidable complexity when modeling the required regular expressions. We design two new algorithms for detecting long and inexact data leaks. Our system achieves high detection accuracy in recognizing transformed leaks compared with the state-of-the-art inspection methods. We parallelize our prototype on graphics processing unit and demonstrate the strong scalability of our data leak detection solution analyzing big data.
| Year | Citations | |
|---|---|---|
Page 1
Page 1