Latest post

Event Recap: Tests of Large and Small Language Models on Common Evaluation Tasks

On March 12, 2025, the NLP Community of Practice’s Sandbox Working Group hosted a webinar featuring Gerard Atkinson, Director at ARTD Consultants (Australia), who presented findings from his research comparing the performance of various language models on standard evaluation tasks, such as qualitative text analysis and the use of rubrics to assess documents.

Read more

MERL TECH News