Publication | Open Access
Large language model uncertainty proxies: discrimination and calibration for medical diagnosis and treatment
Citations: 35 | References: 13 | Year: 2024
Of the uncertainty proxies evaluated, sample consistency (SC) is the most effective method for estimating LLM uncertainty. SC by sentence embedding can effectively estimate uncertainty if the user has a set of reference cases with which to recalibrate the results, while SC by GPT annotation is more effective if the user has no reference cases and requires accurate raw calibration. Our results confirm that LLMs are consistently over-confident when verbalizing their confidence (confidence elicitation, CE).
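As a rough illustration of the ideas in this summary, the sketch below scores SC by sentence embedding as the mean pairwise cosine similarity among embeddings of repeated answers to the same question, and measures calibration with a standard binned expected calibration error. The function names, the choice of mean pairwise cosine similarity, and the ten-bin default are assumptions made for illustration; the summary does not specify the authors' exact procedure, and the embeddings are assumed to come from some sentence-embedding model run beforehand.

```python
import numpy as np


def sample_consistency(embeddings):
    """Hypothetical SC score: mean pairwise cosine similarity between the
    embeddings of several answers sampled for the same question."""
    x = np.asarray(embeddings, dtype=float)
    if x.ndim != 2 or x.shape[0] < 2:
        raise ValueError("need at least two embedding vectors")
    # Normalise rows so that dot products are cosine similarities.
    x = x / np.linalg.norm(x, axis=1, keepdims=True)
    sims = x @ x.T
    # Keep each unordered pair once, excluding self-similarity.
    upper = np.triu_indices(x.shape[0], k=1)
    return float(sims[upper].mean())


def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: weighted mean gap between accuracy and mean confidence."""
    conf = np.asarray(confidences, dtype=float)
    hit = np.asarray(correct, dtype=float)
    # Assign each confidence in [0, 1] to one of n_bins equal-width bins.
    bins = np.clip((conf * n_bins).astype(int), 0, n_bins - 1)
    ece = 0.0
    for b in range(n_bins):
        mask = bins == b
        if mask.any():
            ece += mask.mean() * abs(hit[mask].mean() - conf[mask].mean())
    return float(ece)
```

A high consistency score paired with a high ECE would correspond to the over-confidence pattern described above; recalibrating against reference cases (for example, by fitting a mapping from raw scores to observed accuracy) is one way such a gap might be reduced.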