Allen Ai2 (Medium)
· Open Source
Digital Socrates: Evaluating LLMs through Explanation Critiques
Blog written by Yuling GuLooking for an interpretable explanation evaluation tool that can automatically characterize the explanation capabilities of modern LLMs? Meet Digital Socrates at ACL 2024!A better way of evaluating explanationsWhile large language models (LLMs) can provide explanations along with their answers