Publication | Open Access
Benchmarking Human–AI collaboration for common evidence appraisal tools
20
Citations
17
References
2024
Year
Current LLMs alone appraised evidence worse than humans. Human-AI collaboration may reduce workload for the second human rater for the assessment of reporting (PRISMA) and methodological rigor (AMSTAR) but not for complex tasks such as PRECIS-2.
| Year | Citations | |
|---|---|---|
Page 1
Page 1