Publication | Closed Access
The identification of stages in diachronic data: variability-based neighbour clustering
209
Citations
11
References
2008
Year
EngineeringSymbolic Data AnalysisVerbal ComplementationCorpus LinguisticsText MiningNatural Language ProcessingApplied LinguisticsLanguage DocumentationData ScienceData MiningText SegmentationComputational LinguisticsLanguage EngineeringLanguage StudiesStatisticsMachine TranslationDocument ClusteringComputational LexicologyKnowledge DiscoveryTemporal Pattern RecognitionFunctional Data AnalysisDiachronic DataStructure DiscoveryLanguage CorpusLinguisticsData PointsData ModelingCase Studies
In this paper, we introduce a data-driven bottom-up clustering method for the identification of stages in diachronic corpus data that differ from each other quantitatively. Much like regular approaches to hierarchical clustering, it is based on identifying and merging the most cohesive groups of data points, but, unlike regular approaches to clustering, it allows for the merging of temporally adjacent data, thus, in effect, preserving the chronological order. We exemplify the method with two case studies, one on verbal complementation of shall, the other on the development of the perfect in English.
| Year | Citations | |
|---|---|---|
Page 1
Page 1