LLM_Evaluator

A simple program to evaluate large language models.

Recommended Requirements

  • Python 3.8
  • torch 1.13.1+cu117
  • transformers 4.33.2
  • accelerate 0.26.1
  • tqdm 4.66.1
  • openai 1.10.0

Other Required Files

  • Download the GLM model and place it in the ./THUDM/chatglm-6b folder
  • Download the GLM2 model and place it in the ./THUDM/chatglm2-6b folder
  • A fine-tuned LoRA model can be placed in the ./lora folder and applied to ChatGLM2
  • A fine-tuned P-Tuning model can be placed in the ./ptuning folder and applied to ChatGLM
  • Data in C-Eval format should be placed in the ./data folder; the file names correspond to subject_name in eval.py
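Since the data files follow the C-Eval layout (CSV rows with an id, a question, four choices A–D, and the gold answer label), loading one subject file can be sketched as below. The sample row and the `load_subject` helper are illustrative assumptions, not code from this repository:

```python
import csv
import io

# A hypothetical C-Eval-style row: id, question, four choices, gold answer label.
SAMPLE = (
    "id,question,A,B,C,D,answer\n"
    "0,1+1 equals which of the following?,1,2,3,4,B\n"
)

def load_subject(fileobj):
    """Read C-Eval-format rows into a list of dicts keyed by column name."""
    return list(csv.DictReader(fileobj))

rows = load_subject(io.StringIO(SAMPLE))
print(rows[0]["question"])  # the question text
print(rows[0]["answer"])    # gold choice label
```

In practice the file object would come from opening ./data/<subject_name>_val.csv or similar, matching the subject_name used in eval.py.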

Run

python eval.py --model_name chatglm --cuda_device 0 --finetune ptuning1

Arguments

  • --model_name: model name; one of chatglm or chatglm2
  • --cuda_device: GPU index
  • --finetune: name of the fine-tuned model, i.e. the folder name under ./lora or ./ptuning
  • --few_shot: evaluate with a small number of examples (few-shot, optional)
  • --ntrain: number of few-shot examples (optional)
  • --cot: use chain-of-thought prompting (optional)
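The command-line interface above can be mirrored with a minimal argparse sketch. This is an illustration of the flags, not the actual parser in eval.py; the defaults chosen here are assumptions:

```python
import argparse

def build_parser():
    # Mirrors the README's flags; defaults are assumptions, not eval.py's.
    p = argparse.ArgumentParser(description="Evaluate an LLM on C-Eval-style data.")
    p.add_argument("--model_name", choices=["chatglm", "chatglm2"], required=True)
    p.add_argument("--cuda_device", type=int, default=0)   # GPU index
    p.add_argument("--finetune", default=None)             # folder under ./lora or ./ptuning
    p.add_argument("--few_shot", action="store_true")      # few-shot evaluation
    p.add_argument("--ntrain", type=int, default=5)        # number of few-shot examples
    p.add_argument("--cot", action="store_true")           # chain-of-thought prompting
    return p

# Parse the same arguments as the example run command above.
args = build_parser().parse_args(
    ["--model_name", "chatglm", "--cuda_device", "0", "--finetune", "ptuning1"]
)
print(args.model_name, args.cuda_device, args.finetune)
```

Flags not passed keep their defaults, so --few_shot and --cot behave as opt-in switches.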