Evaluate the output language of the LLM in the benchmark

Issue created

Michael Hamann created this issue on 27/May/24 15:22

Summary:	Evaluate the output language of the LLM in the benchmark
Issue Type:	Improvement
Affects Versions:	0.4
Assignee:	Unassigned
Created:	27/May/24 15:22
Priority:	Major
Reporter:	Michael Hamann
Description:	The benchmark should evaluate the output language of the LLM and ensure it corresponds to the language of the question (as opposed to the language of the provided context). We should also try to improve the performance of the models by explicitly prompting them to use the question's language.
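
A minimal sketch of how the check described above might look, assuming the benchmark can supply (question, answer) pairs. All names here (`detect_language`, `language_match_rate`, `with_language_instruction`) are hypothetical and not part of the existing benchmark code. The stop-word detector is only a toy stand-in for a real language-identification library, which would be passed in through the `detect` parameter.

```python
from typing import Callable, Iterable, Optional, Tuple

# Toy stop-word lists: an assumption for illustration only, standing in
# for a real language-identification library.
STOPWORDS = {
    "en": {"the", "is", "and", "of", "to", "in", "how", "what"},
    "fr": {"le", "la", "les", "est", "et", "de", "dans", "comment"},
    "de": {"der", "die", "das", "ist", "und", "von", "wie", "nicht"},
}


def detect_language(text: str) -> Optional[str]:
    """Return the language whose stop words occur most often, or None."""
    words = [w.strip(".,;:!?\"'()") for w in text.lower().split()]
    scores = {
        lang: sum(w in stop for w in words)
        for lang, stop in STOPWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def language_match_rate(
    samples: Iterable[Tuple[str, str]],
    detect: Callable[[str], Optional[str]] = detect_language,
) -> float:
    """Fraction of (question, answer) pairs whose detected languages agree.

    Pairs where the question language cannot be detected count as misses.
    """
    pairs = list(samples)
    if not pairs:
        return 0.0
    matches = 0
    for question, answer in pairs:
        question_lang = detect(question)
        if question_lang is not None and question_lang == detect(answer):
            matches += 1
    return matches / len(pairs)


def with_language_instruction(question: str) -> str:
    """Append an explicit instruction to answer in the question's language."""
    return (
        f"{question}\n\n"
        "Answer in the same language as the question above, "
        "even if the provided context is in a different language."
    )
```

Comparing `language_match_rate` on answers generated with and without `with_language_instruction` would give a rough indication of whether the explicit prompt helps.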

Changes by Michael Hamann on 27/May/24 15:22

Fix Version:	0.4
Assignee:	Paul Pantiru
