dgv

Paper notes: Ye et al. (2024)


These are notes for the paper by Ye et al. (2024) titled Benchmarking LLMs via Uncertainty Quantification.

image

Free form text generation

— maybe we should attempt using CP techniques?

NOTE

Good:

  • Different tasks
  • Conformal predictions — formal framework

Limitations:

  • Entangled sources of uncertainty
  • Focuses on multiple choice questions

arxiv.org

aclanthology.org