Publication | Open Access
Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts
Citations: 3K
References: 75
Year: 2013
Automated Text Methods, Engineering, Political Texts, Document Analysis, Linguistics, Political Agenda, Political Process, New Methods, Social Sciences, Political Polarization, Political Behavior, Political Communication, Content Analysis, Political Science, Automated Text Analysis, Text Mining, Fact Checking
Political science research relies on textual data, but the high cost of manual analysis limits its use. Automated text analysis promises to cut that cost, yet it introduces pitfalls and requires careful, problem-specific validation. The authors guide researchers through the automated analysis of political texts: they survey a wide range of methods, offer guidance on validating model output, and correct misconceptions in the literature. They show that in many instances the methods have already delivered part of their promise, and they argue that methodologists must contribute new methods and new validation techniques before automated text analysis can become a standard tool.
Politics and political conflict often occur in the written and spoken word. Scholars have long recognized this, but the massive costs of analyzing even moderately sized collections of texts have hindered their use in political science research. Here lies the promise of automated text analysis: it substantially reduces the costs of analyzing large collections of text. We provide a guide to this exciting new area of research and show how, in many instances, the methods have already obtained part of their promise. But there are pitfalls to using automated methods—they are no substitute for careful thought and close reading and require extensive and problem-specific validation. We survey a wide range of new methods, provide guidance on how to validate the output of the models, and clarify misconceptions and errors in the literature. To conclude, we argue that for automated text methods to become a standard tool for political scientists, methodologists must contribute new methods and new methods of validation.