In the LLM benchmark, we need to measure the energy consumption of the different tasks. To do this, we should measure energy consumption on the inference server and attribute it to the individual tasks in proportion to their running time. Since an exact per-task measurement seems hard, we should probably work with average values and derive estimates of the energy consumed per input and output token. We should also compare our measurements with publicly reported performance figures, in particular for parallel requests: when a publication reports a certain throughput in tokens per second on a given GPU, we can combine that figure with the GPU's maximum power consumption to derive an upper bound on the energy consumed per token.
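
As a rough sketch of both ideas (all function names, field names, and numbers below are illustrative assumptions, not part of any existing benchmark code), the attribution of measured server energy by running time and the upper bound derived from a published throughput figure could look like this:

import sys

def attribute_energy(total_energy_j, tasks):
    """Split the total measured server energy across tasks in proportion to their
    running time, then estimate joules per token for each task.

    tasks: list of dicts with 'runtime_s', 'input_tokens', 'output_tokens'.
    Adds 'energy_j' and 'j_per_token' estimates to each task and returns the list.
    """
    total_runtime = sum(t["runtime_s"] for t in tasks)
    for t in tasks:
        # Assumption: energy scales with running time; this ignores load differences.
        t["energy_j"] = total_energy_j * t["runtime_s"] / total_runtime
        t["j_per_token"] = t["energy_j"] / (t["input_tokens"] + t["output_tokens"])
    return tasks

def max_energy_per_token(gpu_max_power_w, reported_tokens_per_s):
    """Upper bound on energy per token: if the GPU never draws more than its maximum
    power, energy per token is at most P_max / throughput (joules per token)."""
    return gpu_max_power_w / reported_tokens_per_s

if __name__ == "__main__":
    # Illustrative values: a publication reporting 1500 tokens/s on a 700 W GPU
    bound = max_energy_per_token(700.0, 1500.0)
    print(f"Upper bound: {bound:.3f} J/token")
    sys.exit(0)

The proportional split is only a first approximation; comparing its per-token estimates against the upper bound from reported throughput gives a sanity check on both numbers.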