[xwiki-notifications] [xwiki-contrib/ai-llm-benchmark] 3fe4cc: LLMAI-61: Implement an evaluation framework