介绍大规模语言模型(LLM)的评估方法与实践经验
参考资料
HuggingFace 评估手册:Evaluation Guidebook - a Hugging Face Space by OpenEvalsarrow-up-right
越狱攻击:Jailbreaking LLMs: A Comprehensive Guide (With Examples) | Promptfooarrow-up-right
安全十大基准:Top 10 Open Datasets for LLM Safety, Toxicity & Bias Evaluation | Promptfooarrow-up-right
Last updated 46 minutes ago
Was this helpful?