Running OpenDC results in four output files. The first file ([Task](#task)) contains metrics related to the jobs being executed.
The second file ([Host](#host)) contains all metrics related to the hosts on which jobs can be executed. The third file ([Power Source](#power-source))
contains all metrics related to the power sources that power the hosts. Finally, the fourth file ([Service](#service))
contains metrics describing the overall performance. An experiment in OpenDC has
### Task
The task output file contains all metrics related to the tasks that are being executed.
| Metric | DataType | Unit | Summary |
|--------------------|----------|-----------|-------------------------------------------------------------------------------|
| timestamp | int64 | ms | Timestamp of the sample since the start of the workload |
| absolute timestamp | int64 | ms | The absolute timestamp based on the given workload |
| server_id | binary | string | The id of the server determined during runtime |
| server_name | binary | string | The name of the server provided by the Trace |
| host_id | binary | string | The id of the host on which the server is hosted or `null` if it has no host. |
| mem_capacity | int64 | Mb | |
| cpu_count | int32 | count | |
| cpu_limit | double | MHz | The capacity of the CPUs of the host on which the server is running. |
| cpu_time_active | int64 | ms | The duration that a CPU was active in the server. |
| cpu_time_idle | int64 | ms | The duration that a CPU was idle in the server. |
| cpu_time_steal | int64 | ms | The duration that a vCPU wanted to run, but no capacity was available. |
| cpu_time_lost | int64 | ms | The duration of CPU time that was lost due to interference. |
| uptime | int64 | ms | The uptime of the host since last sample. |
| downtime | int64 | ms | The downtime of the host since last sample. |
| provision_time | int64 | ms | The time for which the server was enqueued for the scheduler. |
| boot_time | int64 | ms | The time a task got booted. |
| boot_time_absolute | int64 | ms | The absolute time a task got booted. |
| creation_time | int64 | ms | The time at which the task was created by the ComputeService |
| finish_time | int64 | ms | The time at which the task was finished (either completed or terminated) |
| task_state | String | TaskState | The status of the Task |
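
The timing columns above can be combined into per-task durations. The following is a minimal sketch, assuming `creation_time`, `boot_time`, and `finish_time` share the same time base and that a task which has not yet booted or finished carries a missing value. The function name `task_durations` and the sample row are hypothetical and not part of OpenDC.

```python
from typing import Optional


def task_durations(row: dict) -> dict:
    """Derive wait, run, and turnaround durations (ms) from one task row.

    Assumption: all three timestamps use the same time base (ms).
    """
    created: int = row["creation_time"]
    booted: Optional[int] = row.get("boot_time")
    finished: Optional[int] = row.get("finish_time")

    # Time between creation and boot.
    wait = booted - created if booted is not None else None
    # Time between boot and finish.
    run = finished - booted if booted is not None and finished is not None else None
    # Time between creation and finish.
    turnaround = finished - created if finished is not None else None

    return {"wait_ms": wait, "run_ms": run, "turnaround_ms": turnaround}


# Hypothetical sample row, not real OpenDC output.
sample_task = {"creation_time": 1_000, "boot_time": 4_000, "finish_time": 64_000}
durations = task_durations(sample_task)
```

Returning `None` for durations that cannot be computed yet keeps unfinished tasks distinguishable from tasks with a zero duration.
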
### Host
The host output file contains all metrics related to the hosts being run.
| Metric | DataType | Unit | Summary |
|--------------------|----------|------------|-------------------------------------------------------------------------------------------------|
| timestamp | int64 | ms | Timestamp of the sample |
| absolute timestamp | int64 | ms | The absolute timestamp based on the given workload |
| host_id | binary | string | The id of the host given by OpenDC |
| cpu_count | int32 | count | The number of available CPU cores |
| mem_capacity | int64 | Mb | The amount of available memory |
| guests_terminated | int32 | count | The number of guests that are in a terminated state. |
| guests_running | int32 | count | The number of guests that are in a running state. |
| guests_error | int32 | count | The number of guests that are in an error state. |
| guests_invalid | int32 | count | The number of guests that are in an unknown state. |
| cpu_limit | double | MHz | The capacity of the CPUs in the host. |
| cpu_usage | double | MHz | The usage of all CPUs in the host. |
| cpu_demand | double | MHz | The demand of all vCPUs of the guests |
| cpu_utilization | double | ratio | The CPU utilization of the host, calculated by dividing cpu_usage by cpu_limit |
| cpu_time_active | int64 | ms | The duration that a CPU was active in the host. |
| cpu_time_idle | int64 | ms | The duration that a CPU was idle in the host. |
| cpu_time_steal | int64 | ms | The duration that a vCPU wanted to run, but no capacity was available. |
| cpu_time_lost | int64 | ms | The duration of CPU time that was lost due to interference. |
| power_draw | double | Watt | The current power draw of the host. |
| energy_usage | double | Joule (Ws) | The total energy consumption of the host since last sample. |
| uptime | int64 | ms | The uptime of the host since last sample. |
| downtime | int64 | ms | The downtime of the host since last sample. |
| boot_time | int64 | ms | The time a host got booted. |
| boot_time_absolute | int64 | ms | The absolute time a host got booted. |
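
Two of the host metrics lend themselves to simple post-processing: `cpu_utilization` is defined as `cpu_usage` divided by `cpu_limit`, and `energy_usage` is reported per sample in joules, so summing it over samples gives the total energy. The sketch below illustrates both on hypothetical rows. The helper names and the choice to return `0.0` for a zero `cpu_limit` are assumptions, not OpenDC behavior.

```python
JOULES_PER_KWH = 3_600_000  # 1 kWh = 1000 W * 3600 s


def cpu_utilization(cpu_usage_mhz: float, cpu_limit_mhz: float) -> float:
    """Ratio of used to available CPU capacity."""
    if cpu_limit_mhz <= 0:
        # Assumption: treat a host without capacity as 0% utilized.
        return 0.0
    return cpu_usage_mhz / cpu_limit_mhz


def total_energy_kwh(rows: list) -> float:
    """Sum the per-sample energy (joules) and convert to kWh."""
    return sum(r["energy_usage"] for r in rows) / JOULES_PER_KWH


# Hypothetical samples for a single host, not real OpenDC output.
host_rows = [
    {"cpu_usage": 1200.0, "cpu_limit": 4800.0, "energy_usage": 54_000.0},
    {"cpu_usage": 2400.0, "cpu_limit": 4800.0, "energy_usage": 72_000.0},
]

utilizations = [cpu_utilization(r["cpu_usage"], r["cpu_limit"]) for r in host_rows]
energy_kwh = total_energy_kwh(host_rows)
```
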
### Power Source
The power source output file contains all metrics related to the power sources that power the hosts.
| Metric | DataType | Unit | Summary |
|--------------------|----------|------------|------------------------------------------------------------------------------------------|
| timestamp | int64 | ms | Timestamp of the sample |
| absolute timestamp | int64 | ms | The absolute timestamp based on the given workload |
| hosts_connected | int | Count | The number of hosts connected to the power source (WARNING: does not work at the moment) |
| power_draw | double | Watt | The current power draw of the power source. |
| energy_usage | double | Joule (Ws) | The total energy consumption of the power source since last sample. |
| carbon_intensity | double | gCO2/kWh | The amount of carbon that is emitted when using a unit of energy |
| carbon_emission | double | gram | The amount of carbon emitted since the previous sample |
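
The relation between `energy_usage`, `carbon_intensity`, and `carbon_emission` can be sketched as a unit conversion followed by a multiplication. This example assumes that the carbon intensity is expressed in grams of CO2 per kilowatt-hour and that energy is given in joules. It illustrates the arithmetic only and is not taken from the OpenDC implementation; the function name is hypothetical.

```python
JOULES_PER_KWH = 3_600_000  # 1 kWh = 1000 W * 3600 s


def carbon_emission_grams(energy_usage_joule: float,
                          carbon_intensity_g_per_kwh: float) -> float:
    """Carbon emitted (grams) for a given energy use and carbon intensity.

    Assumption: intensity is in gCO2 per kWh, energy is in joules.
    """
    energy_kwh = energy_usage_joule / JOULES_PER_KWH
    return energy_kwh * carbon_intensity_g_per_kwh


# Hypothetical sample: 1.8 MJ (0.5 kWh) at 400 gCO2/kWh.
emitted = carbon_emission_grams(1_800_000.0, 400.0)
```
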
### Service
The service output file contains metrics providing an overview of the performance.
| Metric | DataType | Unit | Summary |
|--------------------|----------|-------|------------------------------------------------------------------------|
| timestamp | int64 | ms | Timestamp of the sample |
| absolute timestamp | int64 | ms | The absolute timestamp based on the given workload |
| hosts_up | int32 | count | The number of hosts that are up at this instant. |
| hosts_down | int32 | count | The number of hosts that are down at this instant. |
| tasks_total | int32 | count | The total number of tasks. |
| tasks_pending | int32 | count | The number of tasks that are pending to be scheduled. |
| tasks_active | int32 | count | The number of tasks that are currently active. |
| tasks_terminated | int32 | count | The number of tasks that have been terminated. |
| tasks_completed | int32 | count | The number of tasks that have completed. |
| attempts_success | int32 | count | The scheduling attempts that were successful. |
| attempts_failure | int32 | count | The scheduling attempts that were unsuccessful due to client error. |
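
As an illustration of how the service metrics can be summarized, the sketch below derives a scheduling success ratio from the two attempt counters listed above. The function name and the sample row are hypothetical, and the ratio only accounts for the attempts reported in these two columns.

```python
from typing import Optional


def scheduling_success_ratio(row: dict) -> Optional[float]:
    """Share of scheduling attempts that succeeded, or None without attempts."""
    attempts = row["attempts_success"] + row["attempts_failure"]
    if attempts == 0:
        # No attempts recorded yet, so the ratio is undefined.
        return None
    return row["attempts_success"] / attempts


# Hypothetical service sample, not real OpenDC output.
service_row = {"attempts_success": 45, "attempts_failure": 5}
ratio = scheduling_success_ratio(service_row)
```
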