标签:GenAI evaluation

AutoArena

分类: AI Developer Tools AI Testing Large Language Models (LLMs] AIOpensourcemodels

Open-source tool for automated head-to-head evaluation of GenAI systems using LLM judges.