Publication | Closed Access
Compression by induction of hierarchical grammars
82
Citations
9
References
2002
Year
Unknown Venue
Natural Language ProcessingSyntaxEngineeringHierarchical GrammarsAutomated ReasoningGrammatical FormalismComputational LinguisticsLanguage Modeling (Natural Language Processing)Symbol SequencesGrammatical Induction TechniqueFormal SyntaxGrammarComputer ScienceLanguage StudiesUnification GrammarGrammar InductionLinguisticsMachine Translation
The paper describes a technique that constructs models of symbol sequences in the form of small, human-readable, hierarchical grammars. The grammars are both semantically plausible and compact. The technique can induce structure from a variety of different kinds of sequence, and examples are given of models derived from English text, C source code and a sequence of terminal control codes. It explains the grammatical induction technique, demonstrates its application to three very different sequences, evaluates its compression performance, and concludes by briefly discussing its use as a method for knowledge acquisition.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1