Concepedia

Publication | Open Access

Benchmarking Human–AI collaboration for common evidence appraisal tools

Citations: 20

References: 17

Year: 2024

Abstract

Used alone, current LLMs appraised evidence worse than humans. Human–AI collaboration may reduce the workload of the second human rater when assessing reporting (PRISMA) and methodological rigor (AMSTAR), but not for complex tasks such as PRECIS-2.
