Branch: refs/heads/master
Home: https://github.com/xwiki-contrib/ai-llm-benchmark
Commit: 000fbaaaebcb1a31488369f87a453aafea98e269
https://github.com/xwiki-contrib/ai-llm-benchmark/commit/000fbaaaebcb1a3148…
Author: Paul Pantiru <paul.pantiru(a)xwiki.com>
Date: 2024-11-21 (Thu, 21 Nov 2024)
Changed paths:
A evaluation_results_graphics/en_only/RAG-qa_AnswerRelevancy_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_AnswerRelevancy_box_plot.png
A evaluation_results_graphics/en_only/RAG-qa_ContextualPrecision_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_ContextualPrecision_box_plot.png
A evaluation_results_graphics/en_only/RAG-qa_ContextualRecall_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_ContextualRecall_box_plot.png
A evaluation_results_graphics/en_only/RAG-qa_Correctness_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_Correctness_box_plot.png
A evaluation_results_graphics/en_only/RAG-qa_CustomContextualRelevancy_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_CustomContextualRelevancy_box_plot.png
A evaluation_results_graphics/en_only/RAG-qa_Faithfulness_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_Faithfulness_box_plot.png
A evaluation_results_graphics/en_only/RAG-qa_grouped_bar_chart.png
A evaluation_results_graphics/en_only/RAG-qa_overall_score_box_plot.png
A evaluation_results_graphics/en_only/average_average_power_draw_grouped_chart.png
A evaluation_results_graphics/en_only/average_energy_consumption_grouped_chart.png
A evaluation_results_graphics/en_only/average_energy_per_input_token_grouped_chart.png
A evaluation_results_graphics/en_only/average_energy_per_output_token_grouped_chart.png
A evaluation_results_graphics/en_only/average_energy_per_total_token_grouped_chart.png
A evaluation_results_graphics/en_only/average_power_draw_chart.png
A evaluation_results_graphics/en_only/correctness_comparison_bar_chart.png
A evaluation_results_graphics/en_only/model_average_power_chart.png
A evaluation_results_graphics/en_only/summarization_Alignment_bar_chart.png
A evaluation_results_graphics/en_only/summarization_Alignment_box_plot.png
A evaluation_results_graphics/en_only/summarization_Coverage_bar_chart.png
A evaluation_results_graphics/en_only/summarization_Coverage_box_plot.png
A evaluation_results_graphics/en_only/summarization_grouped_bar_chart.png
A evaluation_results_graphics/en_only/text_generation_grouped_bar_chart.png
A evaluation_results_graphics/en_only/text_generation_score_bar_chart.png
A evaluation_results_graphics/en_only/text_generation_score_box_plot.png
A reports/report_20241121_172428_en_only/evaluation_report_20241121_172428.pdf
A reports/report_20241121_172428_en_only/model_outputs_20241121_172429.pdf
Log Message:
-----------
New benchmark execution with 8k context window for the Ollama models and new correctness metric [English results only]
Commit: b81b6bf6ec8f4da75041f75b07be0ec1d7fa8caf
https://github.com/xwiki-contrib/ai-llm-benchmark/commit/b81b6bf6ec8f4da750…
Author: Paul Pantiru <paul.pantiru(a)xwiki.com>
Date: 2024-11-21 (Thu, 21 Nov 2024)
Changed paths:
M scripts/evaluation_scripts/eval_rag_qa.py
M scripts/output_generation/collect_model_responses.py
M scripts/results_visualization/generate_plots.py
M scripts/results_visualization/generate_report.py
Log Message:
-----------
Added correctness metric, exponential backoff for model requests, and related changes to plot generation and reports.
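
For readers unfamiliar with the technique, the following is a minimal sketch of exponential backoff around a model request. It is not the code from collect_model_responses.py: the function name call_with_backoff, its parameters, and the default values are all assumptions chosen for illustration.

```python
import random
import time


def call_with_backoff(request_fn, max_retries=5, base_delay=1.0,
                      max_delay=30.0, sleep=time.sleep):
    """Call request_fn, retrying with exponentially growing delays.

    Hypothetical helper: the names and defaults are illustrative only,
    not taken from the repository.
    """
    for attempt in range(max_retries):
        try:
            return request_fn()
        except Exception:
            # Give up and re-raise once the retry budget is exhausted.
            if attempt == max_retries - 1:
                raise
            # Delay doubles on every failed attempt, capped at max_delay.
            delay = min(max_delay, base_delay * 2 ** attempt)
            # Add up to 10% random jitter so parallel clients do not
            # retry in lockstep.
            sleep(delay + random.uniform(0, delay * 0.1))
```

A caller would wrap the actual request in a zero-argument callable, for example call_with_backoff(lambda: send_request(prompt)), where send_request is likewise a placeholder name. Catching a narrower exception type than Exception (such as a rate-limit or timeout error) is usually preferable when the client library exposes one.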
Commit: 8ad42f3f2b2340e2119d45e3453df7a484739597
https://github.com/xwiki-contrib/ai-llm-benchmark/commit/8ad42f3f2b2340e211…
Author: Paul Pantiru <paul.pantiru(a)xwiki.com>
Date: 2024-11-21 (Thu, 21 Nov 2024)
Changed paths:
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o-mini/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_GPT4o/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_claude3_5_sonet/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_command-r_35B_Q4/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_gemma2_9B_Q4/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_402b/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_llama3_1_8b_Q4/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral-nemo_12b_Q4/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mistral2_large/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_mixtral-8x22b/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_medium-128k_14b_Q4/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_phi3_mini-128k_4b_Q4/qa_033_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_001_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_002_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_003_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_004_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_005_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_006_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_007_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_008_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_009_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_010_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_011_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_012_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_013_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_014_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_015_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_016_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_017_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_018_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_019_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_020_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_021_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_022_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_023_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_024_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_025_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_026_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_027_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_028_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_029_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_030_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_031_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_032_result.json
M evaluation_results/RAG-qa/AI.Models.qa_qwen2_7b_Q4/qa_033_result.json
M evaluation_results/average_power_consumption.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_001_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_002_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_003_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_004_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_005_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_006_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_007_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_008_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_009_result.json
M evaluation_results/summarization/AI.Models.GPT4o-mini/summ_010_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_001_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_002_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_003_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_004_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_005_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_006_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_007_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_008_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_009_result.json
M evaluation_results/summarization/AI.Models.GPT4o/summ_010_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_001_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_002_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_003_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_004_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_005_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_006_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_007_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_008_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_009_result.json
M evaluation_results/summarization/AI.Models.claude3_5_sonet/summ_010_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.command-r_35B_Q4/summ_010_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.gemma2_9B_Q4/summ_010_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_001_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_002_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_003_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_004_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_005_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_006_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_007_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_008_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_009_result.json
M evaluation_results/summarization/AI.Models.llama3_1_402b/summ_010_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.llama3_1_8b_Q4/summ_010_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.mistral-nemo_12b_Q4/summ_010_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_001_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_002_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_003_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_004_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_005_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_006_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_007_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_008_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_009_result.json
M evaluation_results/summarization/AI.Models.mistral2_large/summ_010_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_001_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_002_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_003_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_004_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_005_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_006_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_007_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_008_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_009_result.json
M evaluation_results/summarization/AI.Models.mixtral-8x22b/summ_010_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.phi3_medium-128k_14b_Q4/summ_010_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.phi3_mini-128k_4b_Q4/summ_010_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_001_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_002_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_003_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_004_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_005_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_006_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_007_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_008_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_009_result.json
M evaluation_results/summarization/AI.Models.qwen2_7b_Q4/summ_010_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.GPT4o-mini/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.GPT4o/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.claude3_5_sonet/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.command-r_35B_Q4/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.gemma2_9B_Q4/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_402b/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.llama3_1_8b_Q4/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.mistral-nemo_12b_Q4/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.mistral2_large/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.mixtral-8x22b/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.phi3_medium-128k_14b_Q4/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.phi3_mini-128k_4b_Q4/text_gen_010_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_001_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_002_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_003_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_004_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_005_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_006_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_007_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_008_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_009_result.json
M evaluation_results/text_generation/AI.Models.qwen2_7b_Q4/text_gen_010_result.json
M evaluation_results_graphics/RAG-qa_AnswerRelevancy_bar_chart.png
M evaluation_results_graphics/RAG-qa_AnswerRelevancy_box_plot.png
M evaluation_results_graphics/RAG-qa_ContextualPrecision_bar_chart.png
M evaluation_results_graphics/RAG-qa_ContextualPrecision_box_plot.png
M evaluation_results_graphics/RAG-qa_ContextualRecall_bar_chart.png
M evaluation_results_graphics/RAG-qa_ContextualRecall_box_plot.png
A evaluation_results_graphics/RAG-qa_Correctness_bar_chart.png
A evaluation_results_graphics/RAG-qa_Correctness_box_plot.png
A evaluation_results_graphics/RAG-qa_CustomContextualRelevancy_bar_chart.png
A evaluation_results_graphics/RAG-qa_CustomContextualRelevancy_box_plot.png
M evaluation_results_graphics/RAG-qa_Faithfulness_bar_chart.png
M evaluation_results_graphics/RAG-qa_Faithfulness_box_plot.png
M evaluation_results_graphics/RAG-qa_grouped_bar_chart.png
M evaluation_results_graphics/RAG-qa_overall_score_box_plot.png
M evaluation_results_graphics/average_average_power_draw_grouped_chart.png
M evaluation_results_graphics/average_energy_consumption_grouped_chart.png
M evaluation_results_graphics/average_energy_per_input_token_grouped_chart.png
M evaluation_results_graphics/average_energy_per_output_token_grouped_chart.png
M evaluation_results_graphics/average_energy_per_total_token_grouped_chart.png
M evaluation_results_graphics/average_power_draw_chart.png
A evaluation_results_graphics/correctness_comparison_bar_chart.png
M evaluation_results_graphics/model_average_power_chart.png
M evaluation_results_graphics/summarization_Alignment_bar_chart.png
M evaluation_results_graphics/summarization_Alignment_box_plot.png
M evaluation_results_graphics/summarization_Coverage_bar_chart.png
M evaluation_results_graphics/summarization_Coverage_box_plot.png
M evaluation_results_graphics/summarization_grouped_bar_chart.png
M evaluation_results_graphics/text_generation_grouped_bar_chart.png
M evaluation_results_graphics/text_generation_score_bar_chart.png
M evaluation_results_graphics/text_generation_score_box_plot.png
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_001.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_002.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_003.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_004.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_005.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_006.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_007.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_008.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_009.json
M output/AI.Models.GPT4o-mini/tasks/summarization/summ_010.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_002.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_003.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_004.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_005.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_006.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_007.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_008.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_009.json
M output/AI.Models.GPT4o-mini/tasks/text_generation/text_gen_010.json
M output/AI.Models.GPT4o/tasks/summarization/summ_001.json
M output/AI.Models.GPT4o/tasks/summarization/summ_002.json
M output/AI.Models.GPT4o/tasks/summarization/summ_003.json
M output/AI.Models.GPT4o/tasks/summarization/summ_004.json
M output/AI.Models.GPT4o/tasks/summarization/summ_005.json
M output/AI.Models.GPT4o/tasks/summarization/summ_006.json
M output/AI.Models.GPT4o/tasks/summarization/summ_007.json
M output/AI.Models.GPT4o/tasks/summarization/summ_008.json
M output/AI.Models.GPT4o/tasks/summarization/summ_009.json
M output/AI.Models.GPT4o/tasks/summarization/summ_010.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_003.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_004.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_005.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_006.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_007.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_008.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_009.json
M output/AI.Models.GPT4o/tasks/text_generation/text_gen_010.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_001.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_002.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_003.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_004.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_007.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_008.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_009.json
M output/AI.Models.claude3_5_sonet/tasks/summarization/summ_010.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_002.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_003.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_004.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_005.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_006.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_007.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_008.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_009.json
M output/AI.Models.claude3_5_sonet/tasks/text_generation/text_gen_010.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_001.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_002.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_003.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_004.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_005.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_006.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_007.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_008.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_009.json
M output/AI.Models.command-r_35B_Q4/tasks/summarization/summ_010.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.command-r_35B_Q4/tasks/text_generation/text_gen_010.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_001.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_002.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_003.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_004.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_005.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_006.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_007.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_008.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_009.json
M output/AI.Models.gemma2_9B_Q4/tasks/summarization/summ_010.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.gemma2_9B_Q4/tasks/text_generation/text_gen_010.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_001.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_002.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_003.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_004.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_005.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_006.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_007.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_008.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_009.json
M output/AI.Models.llama3_1_402b/tasks/summarization/summ_010.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_002.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_003.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_004.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_005.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_006.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_007.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_008.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_009.json
M output/AI.Models.llama3_1_402b/tasks/text_generation/text_gen_010.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_001.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_002.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_003.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_004.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_005.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_006.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_007.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_008.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_009.json
M output/AI.Models.llama3_1_8b_Q4/tasks/summarization/summ_010.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.llama3_1_8b_Q4/tasks/text_generation/text_gen_010.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_001.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_002.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_003.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_004.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_005.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_006.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_007.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_008.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_009.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/summarization/summ_010.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.mistral-nemo_12b_Q4/tasks/text_generation/text_gen_010.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_001.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_002.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_003.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_004.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_005.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_006.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_007.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_008.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_009.json
M output/AI.Models.mistral2_large/tasks/summarization/summ_010.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_001.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_002.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_003.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_004.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_005.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_006.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_007.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_008.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_009.json
M output/AI.Models.mistral2_large/tasks/text_generation/text_gen_010.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_001.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_002.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_003.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_004.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_005.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_006.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_007.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_008.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_009.json
M output/AI.Models.mixtral-8x22b/tasks/summarization/summ_010.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_002.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_003.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_004.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_005.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_006.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_007.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_008.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_009.json
M output/AI.Models.mixtral-8x22b/tasks/text_generation/text_gen_010.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_001.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_002.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_003.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_004.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_005.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_006.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_007.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_008.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_009.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/summarization/summ_010.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.phi3_medium-128k_14b_Q4/tasks/text_generation/text_gen_010.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_001.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_002.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_003.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_004.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_005.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_006.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_007.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_008.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_009.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/summarization/summ_010.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.phi3_mini-128k_4b_Q4/tasks/text_generation/text_gen_010.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_GPT4o-mini/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_GPT4o/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_claude3_5_sonet/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_command-r_35B_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_gemma2_9B_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_llama3_1_402b/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_llama3_1_8b_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_mistral-nemo_12b_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_mistral2_large/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_mixtral-8x22b/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_phi3_medium-128k_14b_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_phi3_mini-128k_4b_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_001.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_002.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_003.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_004.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_005.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_006.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_007.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_008.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_009.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_010.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_011.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_012.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_013.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_014.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_015.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_016.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_017.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_018.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_019.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_020.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_021.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_022.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_023.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_024.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_025.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_026.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_027.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_028.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_029.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_030.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_031.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_032.json
M output/AI.Models.qa_qwen2_7b_Q4/tasks/RAG-qa/qa_033.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_001.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_002.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_003.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_004.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_005.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_006.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_007.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_008.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_009.json
M output/AI.Models.qwen2_7b_Q4/tasks/summarization/summ_010.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_001.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_002.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_003.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_004.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_005.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_006.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_007.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_008.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_009.json
M output/AI.Models.qwen2_7b_Q4/tasks/text_generation/text_gen_010.json
A reports/report_20241121_164244/evaluation_report_20241121_164244.pdf
A reports/report_20241121_164244/model_outputs_20241121_164247.pdf
Log Message:
-----------
New benchmark execution with 8k context window for the ollama models and new correctness metric
Compare: https://github.com/xwiki-contrib/ai-llm-benchmark/compare/826378343cea...8a…
To unsubscribe from these emails, change your notification settings at https://github.com/xwiki-contrib/ai-llm-benchmark/settings/notifications
Branch: refs/heads/master
Home: https://github.com/xwiki/xwiki-platform
Commit: 5dad14db7117b173d2936a3f93f1f68fe0687a6c
https://github.com/xwiki/xwiki-platform/commit/5dad14db7117b173d2936a3f93f1…
Author: Vincent Massol <vincent(a)massol.net>
Date: 2024-11-21 (Thu, 21 Nov 2024)
Changed paths:
M xwiki-platform-core/xwiki-platform-rendering/xwiki-platform-rendering-async/xwiki-platform-rendering-async-macro/src/test/java/org/xwiki/rendering/async/IntegrationTests.java
Log Message:
-----------
[Misc] Convert another JUnit4-based rendering test to JUnit5, removing the jmock usage
To unsubscribe from these emails, change your notification settings at https://github.com/xwiki/xwiki-platform/settings/notifications
Branch: refs/heads/stable-15.10.x
Home: https://github.com/xwiki/xwiki-platform
Commit: 585d27090dbe0364999922adcdca5548d81da539
https://github.com/xwiki/xwiki-platform/commit/585d27090dbe0364999922adcdca…
Author: Thomas Mortagne <thomas.mortagne(a)gmail.com>
Date: 2024-11-21 (Thu, 21 Nov 2024)
Changed paths:
M xwiki-platform-core/xwiki-platform-rendering/xwiki-platform-rendering-xwiki/src/main/java/org/xwiki/rendering/internal/resolver/AbstractResourceReferenceEntityReferenceResolver.java
M xwiki-platform-core/xwiki-platform-rendering/xwiki-platform-rendering-xwiki/src/test/java/org/xwiki/rendering/internal/resolver/DefaultResourceReferenceEntityReferenceResolverTest.java
Log Message:
-----------
XWIKI-22673: Syntax [[...]] in terminal page breaks rendering and indexing of content
(cherry picked from commit 358cc4ce3224387006c0c469a33fe9487a70e761)