Publication | Open Access
Predicting judicial decisions of the European Court of Human Rights: a Natural Language Processing perspective
729
Citations
31
References
2016
Year
EngineeringMachine LearningTextual EntailmentSemanticsLanguage ProcessingText MiningNatural Language ProcessingForensic LinguisticsData ScienceComputational LinguisticsData ResourcesDocument ClassificationCorpus AnalysisLanguage StudiesNatural LanguageNlp TaskLanguage TechnologyHuman RightsKnowledge DiscoveryInformation ExtractionEuropean CourtTopical ContentJudicial DecisionsLinguistics
Recent advances in NLP and ML enable predictive models that uncover patterns in judicial decisions, offering lawyers and judges a tool to quickly identify cases and extract fact‑driven decision factors, consistent with legal realism. This paper presents the first systematic study on predicting the outcome of cases tried by the European Court of Human Rights based solely on textual content. The authors formulate a binary classification task using case text as input, represented by N‑grams and topic models, to predict whether the court found a convention violation. The models achieve an average accuracy of 79 %, with formal facts being the strongest predictor and topical content also contributing significantly.
Recent advances in Natural Language Processing and Machine Learning provide us with the tools to build predictive models that can be used to unveil patterns driving judicial decisions. This can be useful, for both lawyers and judges, as an assisting tool to rapidly identify cases and extract patterns which lead to certain decisions. This paper presents the first systematic study on predicting the outcome of cases tried by the European Court of Human Rights based solely on textual content. We formulate a binary classification task where the input of our classifiers is the textual content extracted from a case and the target output is the actual judgment as to whether there has been a violation of an article of the convention of human rights. Textual information is represented using contiguous word sequences, i.e., N-grams, and topics. Our models can predict the court’s decisions with a strong accuracy (79% on average). Our empirical analysis indicates that the formal facts of a case are the most important predictive factor. This is consistent with the theory of legal realism suggesting that judicial decision-making is significantly affected by the stimulus of the facts. We also observe that the topical content of a case is another important feature in this classification task and explore this relationship further by conducting a qualitative analysis.
| Year | Citations | |
|---|---|---|
Page 1
Page 1