Publication | Closed Access
The TreeBanker: a Tool for Supervised Training of Parsed Corpora
60
Citations
0
References
1997
Year
Unknown Venue
I describe the TreeBanker, a graphical tool for the supervised training involved in domain customization of the disambiguation component of a speech- or languageunderstanding system. The TreeBanker presents a user, who need not be a system expert, with a range of properties that distinguish competing analyses for an utterance and that are relatively easy to judge. This allows training on a corpus to be completed in far less time, and with far less expertise, than would be needed if analyses were inspected directly: it becomes possible for a corpus of about 20,000 sentences of the complexity of those in the ATIS corpus to be judged in around three weeks of work by a linguistically aware non-expert. 1 Introduction In a language understanding system where full, linguistically-motivated analyses of utterances are desired, the linguistic analyser needs to generate possible semantic representations and then choose the one most likely to be correct. If the analyser is a component of a pipeli...