[xwiki-notifications] [xwiki-contrib/ai-llm-benchmark] da3df3: LLMAI-61: Implement an evaluation framework