Publication | Closed Access
The Rules of Redaction: Identify, Protect, Review (and Repeat)
Year: 2009
Citations: 39
References: 9
Engineering, Law, Information Forensics, Confidentiality, Linguistic Content Analysis, Technology Law, Digital Evidence, Corpus Linguistics, Journalism, Text Mining, Natural Language Processing, Customer Review, Information Retrieval, Document Engineering, Document Analysis, Computational Linguistics, Language Studies, Content Analysis, Data Privacy, Sensitive Portions, Information Extraction, Content Similarity Detection, Sensitive Content, Text Processing, Linguistics
Frequent data leak reports in the press attest to the difficulty of identifying and protecting sensitive content. Redaction is particularly challenging because it seeks to protect documents by selectively removing sensitive portions of them, rather than by quarantining or encrypting the whole document. The authors review current redaction practice and technology and describe a prototype system that supports the natural redaction workflow and addresses some limitations of current technology. Their system supports all phases of the redaction process through the use of linguistic content analysis, an interactive user interface, and inference detection algorithms.