Go offline with the Player FM app!
Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models
Manage episode 481800689 series 3524393
The paper introduces SAGE, an evaluation framework for assessing LLMs' social cognition through simulated emotional responses, revealing significant performance gaps among models in empathetic dialogue.
https://arxiv.org/abs//2505.02847
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2243 episodes
Manage episode 481800689 series 3524393
The paper introduces SAGE, an evaluation framework for assessing LLMs' social cognition through simulated emotional responses, revealing significant performance gaps among models in empathetic dialogue.
https://arxiv.org/abs//2505.02847
YouTube: https://www.youtube.com/@ArxivPapers
TikTok: https://www.tiktok.com/@arxiv_papers
Apple Podcasts: https://podcasts.apple.com/us/podcast/arxiv-papers/id1692476016
Spotify: https://podcasters.spotify.com/pod/show/arxiv-papers
2243 episodes
All episodes
×Welcome to Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.