Publication | Open Access
An Online Algorithm for Lightweight Grammar-Based Compression
34
Citations
19
References
2012
Year
Lossy CompressionLightweight Grammar-based CompressionEngineeringGrammar-based CompressionCorpus LinguisticsNatural Language ProcessingSyntaxComputational LinguisticsRestricted CfgGrammarLanguage StudiesMinimum Grammar SizeLossless CompressionVariable-length CodeMachine TranslationComputer ScienceGrammar InductionData CompressionLinguistics
Grammar-based compression is a well-studied technique to construct a context-free grammar (CFG) deriving a given text uniquely. In this work, we propose an online algorithm for grammar-based compression. Our algorithm guarantees O(log2 n)- approximation ratio for the minimum grammar size, where n is an input size, and it runs in input linear time and output linear space. In addition, we propose a practical encoding, which transforms a restricted CFG into a more compact representation. Experimental results by comparison with standard compressors demonstrate that our algorithm is especially effective for highly repetitive text.
| Year | Citations | |
|---|---|---|
Page 1
Page 1