LLM
Alignment faking in large language models • Paper • 2412.14093 • Published Dec 18, 2024
fka/awesome-chatgpt-prompts