Cataloguing LLM evaluations
The paper proposes a taxonomy of the LLM evaluation landscape, comprising of five categories: General Capabilities, Domain Specific Capabilities, Safety and Trustworthiness, Extreme Risks, and Undesirable Use Cases. Read more
You might also like
-
Tech Salon recap: listen more and shift away from Western-centric framing to better address online violence against women and girls
-
Savita Bailur joins MTI as a Core Collaborator
-
Reviewing Mirca Madianou’s new book, “Technocolonialism: When Tech for Good is Harmful”
-
Welcoming our new AI+Africa Lead for the NLP-CoP: Vari Matimba