A simple program to evaluate large language model.

You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

Go to file

PeterAlbus c06f9a3684 增加大模型评分模块以及问答数据集处理模块（半成品）。		9 months ago
evaluators	增加大模型评分模块以及问答数据集处理模块（半成品）。	9 months ago
scoring	增加大模型评分模块以及问答数据集处理模块（半成品）。	9 months ago
.gitignore	Init commit. Add Evaluators and support ChatGLM/ChatGLM2.	10 months ago
README.md	增加大模型评分模块以及问答数据集处理模块（半成品）。	9 months ago
eval.py	增加大模型评分模块以及问答数据集处理模块（半成品）。	9 months ago
generate_eval_text.py	Init commit. Add Evaluators and support ChatGLM/ChatGLM2.	10 months ago

LLM_Evaluator

A simple program to evaluate large language model.

需求其余文件

python eval.py --model_name chatglm --cuda_device 0 --finetune ptuning1