Publication | Open Access
LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models
Year: 2023 | Citations: 123 | References: 76
Large language models are increasingly used in law, raising the question of which types of legal reasoning they can perform. This paper introduces LegalBench, a benchmark of 162 tasks covering six types of legal reasoning, and reports an empirical evaluation of 20 open-source and commercial LLMs. LegalBench was built collaboratively through an interdisciplinary process that collected tasks designed and hand-crafted by legal professionals. Because these experts led construction, each task measures a reasoning capability that is either practically useful or of interest to lawyers. The paper also maps popular frameworks for describing legal reasoning onto LegalBench tasks, giving lawyers and LLM developers a common vocabulary.
The advent of large language models (LLMs) and their adoption by the legal community have given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisciplinary process, in which we collected tasks designed and hand-crafted by legal professionals. Because these subject matter experts took a leading role in construction, tasks either measure legal reasoning capabilities that are practically useful, or measure reasoning skills that lawyers find interesting. To enable cross-disciplinary conversations about LLMs in the law, we additionally show how popular legal frameworks for describing legal reasoning, which distinguish between its many forms, correspond to LegalBench tasks, thus giving lawyers and LLM developers a common vocabulary. This paper describes LegalBench, presents an empirical evaluation of 20 open-source and commercial LLMs, and illustrates the types of research explorations LegalBench enables.