MedCalc-Bench

ncbi 's Collections

MedCPT

updated 3 days ago

Evaluating Large Language Models for Medical Calculations

ncbi/MedCalc-Bench

Viewer • Updated 3 days ago • 11.6k • 38 • 1

Note The most up-to-date version (same as v1.2) of MedCalc-Bench. We recommend using this dataset for most cases (e.g., training and evaluating your LLMs).
ncbi/MedCalc-Bench-v1.2

Viewer • Updated 3 days ago • 11.6k • 298 • 1

Note The most up-to-date version of MedCalc-Bench. We recommend using this dataset for most cases (e.g., training and evaluating your LLMs). Release notes: https://github.com/ncbi-nlp/MedCalc-Bench/releases/tag/version-1.2
ncbi/MedCalc-Bench-v1.1

Viewer • Updated 20 days ago • 11.7k • 121 • 1

Note [Deprecated] Release notes: https://github.com/ncbi-nlp/MedCalc-Bench/releases/tag/version-1.1
ncbi/MedCalc-Bench-v1.0

Viewer • Updated 20 days ago • 11.1k • 492 • 1

Note [Deprecated] This is the original version of MedCalc-Bench, associated with our NeurIPS publication. Please only use this for reproducibility purposes.