
Add additional PromQL operators to synthetic load #747

Merged
merged 5 commits into prometheus:master on Oct 20, 2024

Conversation

kushalShukla-web (Contributor)

This PR enhances the synthetic load generation by incorporating additional PromQL operators in the 6_loadgen.yaml file (a sketch of the kind of entry involved follows the list below):

  • Added binary arithmetic operators (joins) to cover more complex query types.
  • Included logical operators (and, or, unless) for better query testing coverage.
  • Added topk function to test query performance with ranked results.
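A hedged sketch (not the PR's exact content) of the kind of query group these operators end up in; only keys visible in the review hunks below are used, and the interval and expressions are illustrative:

    interval: 10s
    type: instant
    queries:
    # binary arithmetic with a scalar
    - expr: rate(node_cpu_seconds_total[5m]) * 5
    # logical operator (union of two selections)
    - expr: node_cpu_seconds_total{mode="nice"} or node_cpu_seconds_total{mode="idle"}
    # topk over an aggregation
    - expr: topk(10, sum by (instance) (rate(node_cpu_seconds_total[5m])))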

@kushalShukla-web (Contributor, Author)

Solves #705

@bboreham (Member) left a comment

Hi, I commented on a few, but they are all likely problematic.
I recommend you change tack to use metrics exposed by the fake webserver, since there are a lot of them and prombench can control them.

interval: 10s
type: instant
queries:
- expr: sum(node_cpu_seconds_total)/sum(container_memory_rss)
bboreham (Member) commented:
This is not terribly good as a test of operator performance, since it matches one series against one other series, both of which have no labels.
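An illustrative alternative (not from the PR): keeping per-series labels on one side forces real vector matching, for example dividing each per-mode CPU rate by its instance total with many-to-one matching; this assumes the standard node_exporter cpu/mode/instance labels:

    # share of each CPU mode within its instance's total
    rate(node_cpu_seconds_total[5m])
      / on (instance) group_left
    sum by (instance) (rate(node_cpu_seconds_total[5m]))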

type: instant
queries:
- expr: sum(node_cpu_seconds_total)/sum(container_memory_rss)
- expr: rate(node_cpu_seconds_total[5m]) * 5
bboreham (Member) commented:
This is better, at 256 series when tested.

queries:
- expr: sum(node_cpu_seconds_total)/sum(container_memory_rss)
- expr: rate(node_cpu_seconds_total[5m]) * 5
- expr: sum(go_gc_heap_goal_bytes)/sum(loadgen_query_duration_seconds_created)
bboreham (Member) commented:
This has the same structural problem as the first one.

interval: 10s
type: instant
queries:
- expr: node_cpu_seconds_total{mode="nice"} and node_cpu_seconds_total{namespace="default"}
bboreham (Member) commented:
Nodes don't have a namespace label, so this short-circuits and returns nothing.
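A hedged variant (illustrative, assuming node_exporter's standard cpu label) where both sides select series that actually exist, so the and intersection is non-empty; with set operators the full label sets must match, so only series with both mode="nice" and cpu="0" survive:

    node_cpu_seconds_total{mode="nice"} and node_cpu_seconds_total{cpu="0"}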

@kushalShukla-web (Contributor, Author)

> Hi, I commented on a few, but they are all likely problematic. I recommend you change tack to use metrics exposed by the fake webserver, since there are a lot of them and prombench can control them.

Actually, I have tested this on one of the pull requests where prombench was running.

@bboreham (Member)

OK, but my recommendation remains the same.

@kushalShukla-web force-pushed the queries branch 4 times, most recently from fcab2db to a16c6db on September 21, 2024 at 04:26
Replaced Old metrics with the new ones
@bboreham (Member)

> I have replaced some of the existing metrics with new ones from PromBench, such as:

How many of those did you count?

@kushalShukla-web (Contributor, Author)

Hi @bboreham, all the queries return more than 2000 series, and the codelab metric has 52,000 different variations.

updated metrics with some heavy count
@bboreham (Member) left a comment

Thanks, this is getting better.

I am still interested in thinking about the cardinality you are expecting for each operator in each query.

At the end of the day there should be a balance across different kinds of load, so we can justify that prombench is a realistic test, and also we want the prometheus under load to be able to keep up.

type: instant
queries:
- expr: topk(2000, sum(rate(go_gc_duration_seconds_count[5m])) by (instance, job))
- expr: topk(10000, sum(codelab_api_request_duration_seconds_bucket) by (method,job))
bboreham (Member) commented:
topk(10000, …) is not realistic; nobody is going to scroll down 10,000 lines of screen output to find something.
k should be more like 10, or perhaps 100.
Also I don't think there are 10,000 combinations of method and job.
Also it is not valid to sum histogram buckets.
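Hedged sketches of the kind of rework this comment points toward (illustrative only, not the exact queries that landed): a small k over an aggregated rate, and histogram_quantile over the buckets instead of summing them:

    # top 10 method/job pairs by request rate
    topk(10, sum by (method, job) (rate(codelab_api_request_duration_seconds_count[5m])))

    # 90th-percentile latency per method, using the buckets the intended way
    histogram_quantile(0.9, sum by (le, method) (rate(codelab_api_request_duration_seconds_bucket[5m])))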

Comment on lines 64 to 67
- expr: codelab_api_request_duration_seconds_bucket{method="GET"} or codelab_api_request_duration_seconds_bucket{method="POST"}
- expr: codelab_api_request_duration_seconds_sum{status="200"} or codelab_api_request_duration_seconds_sum{status="500"}
- expr: codelab_api_request_duration_seconds_bucket{status="200"} and codelab_api_request_duration_seconds_bucket{method="GET"}
- expr: codelab_api_request_duration_seconds_count{method="POST"} and codelab_api_request_duration_seconds_count{status="500"}
bboreham (Member) commented:
I don't see much point in doing multiple expressions that are essentially the same.
or is different from and, but beyond those you could use /, taking the ratio of errors to all requests, for instance.
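For instance, an error-to-total ratio along those lines (illustrative; assumes the status label seen in the hunk above carries HTTP response codes):

    # fraction of requests that returned 500
    sum(rate(codelab_api_request_duration_seconds_count{status="500"}[5m]))
      /
    sum(rate(codelab_api_request_duration_seconds_count[5m]))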

bboreham (Member) commented:
I realized a couple of things since this comment: you have / in "arithmetic operation" above, but this and is never going to return anything because the labels on each side are different. We want the benchmark queries to make sense.
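One hedged way such an and can return results when the two sides carry different label sets is to restrict the match to labels both sides share with on(...); the metrics and labels here are illustrative of the mechanism rather than a query worth benchmarking:

    # keep API request series only for targets that also expose Go GC metrics
    codelab_api_request_duration_seconds_count{status="500"}
      and on (job, instance)
    go_gc_duration_seconds_count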

kushalShukla-web and others added 3 commits October 10, 2024 14:55
Slow down arithmetic_operation and logic_operator; take out a few
queries to avoid overloading the server.

Stop querying `_bucket` series directly; those should be used by
`histogram_quantile` or similar.

Use more realistic `k` parameters to `topk`.

Signed-off-by: Bryan Boreham <[email protected]>
For balance, to retain about the same overall load on the server as
before.

Signed-off-by: Bryan Boreham <[email protected]>
@bboreham (Member)

I trimmed down the newly-added queries a bit:

  • Slowed down arithmetic_operation and logic_operator.
  • Took out a few queries to avoid overloading the server.
  • Stopped querying _bucket series directly; those should be used by histogram_quantile or similar.
  • Used more realistic k parameters for topk.

I also trimmed down some pre-existing queries for balance, to retain about the same overall load on the server as before.

@bboreham merged commit 1bba995 into prometheus:master on Oct 20, 2024
6 checks passed