[xwiki-notifications] [xwiki-contrib/ai-llm-benchmark] a43361: LLMAI-61: Implement an evaluation framework