Shared References

← Back to browse

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

2023 article liu2023trustworthy
Authors
Yang Liu, Yuanshun Yao, Jean-Francois Ton, Xiaoying Zhang, Ruocheng Guo, Hao Cheng, Yegor Klochkov, Muhammad Faaiz Taufiq, Hang Li
Venue
arXiv preprint arXiv:2308.05374