ncbi/MedCalc-Bench
Viewer
•
Updated
•
11.6k
•
38
•
1
Evaluating Large Language Models for Medical Calculations
Note The most up-to-date version (same as v1.2) of MedCalc-Bench. We recommend using this dataset for most cases (e.g., training and evaluating your LLMs).
Note The most up-to-date version of MedCalc-Bench. We recommend using this dataset for most cases (e.g., training and evaluating your LLMs). Release notes: https://github.com/ncbi-nlp/MedCalc-Bench/releases/tag/version-1.2
Note [Deprecated] Release notes: https://github.com/ncbi-nlp/MedCalc-Bench/releases/tag/version-1.1
Note [Deprecated] This is the original version of MedCalc-Bench, associated with our NeurIPS publication. Please only use this for reproducibility purposes.