Publication | Closed Access
A Corpus Factory for Many Languages
80
Citations
16
References
2010
Year
Unknown Venue
For many languages there are no large, general-language corpora available. Until the web, all but the richest institutions could do little but shake their heads in dismay as corpus-building was long, slow and expensive. But with the advent of the Web it can be highly automated and thereby fast and inexpensive. We have developed a `corpus factory ' where we build large corpora. In this paper we describe the method we use, and how it has worked, and how various problems were
| Year | Citations | |
|---|---|---|
Page 1
Page 1