News
Eval Protocol (EP) EP is an open specification, Python SDK, and pytest wrapper that provides a standardized way to write evaluations for large language model (LLM) applications. Start with simple ...
python evaluate_2021_LA.py Score_LA.txt ./keys eval python evaluate_2021_DF.py Score_DF.txt ./keys eval ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results