Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce a new operation to benchmark the shard changes recovery api #722

Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
39 commits
Select commit Hold shift + click to select a range
9e2cc41
feature: new synthetic source recovery challenge
salvatore-campagna Dec 17, 2024
3832110
fix: improve challenge description
salvatore-campagna Dec 17, 2024
112da4f
fix: remove underscore
salvatore-campagna Dec 17, 2024
67d45b7
fix: non-default challenge
salvatore-campagna Dec 17, 2024
f6aaf85
fix: default false
salvatore-campagna Dec 17, 2024
e2a6d71
fix: challenge name
salvatore-campagna Dec 17, 2024
790d1a5
Merge branch 'master' into feature/synthetic-source-recovery-challenge
salvatore-campagna Jan 10, 2025
75e06c3
fix: default security shard changes recovery target
salvatore-campagna Jan 14, 2025
27a172a
fix: default tsdb shard changes recovery target
salvatore-campagna Jan 14, 2025
617c34c
fix: default logs shard changes recovery target
salvatore-campagna Jan 14, 2025
e801b1c
fix: improve elastic/logs shard recovery
salvatore-campagna Jan 15, 2025
3067434
fix: simplify logs shard changes challenge
salvatore-campagna Jan 15, 2025
1e362d8
fix: replace challenge with new operation in existing challenge
salvatore-campagna Jan 15, 2025
0708b06
fix: move shard-recovery operation out of if statement
salvatore-campagna Jan 15, 2025
98e1bce
fix: p_recovery_target instead of recovery_target
salvatore-campagna Jan 15, 2025
c3cd16e
Merge branch 'master' into feature/synthetic-source-recovery-challenge
elasticmachine Jan 15, 2025
162450a
fix: gate execution of shard-recovery operation
salvatore-campagna Jan 15, 2025
7c8ace3
fix: update readme with include_shard_recovery_op param
salvatore-campagna Jan 15, 2025
23c1075
Revert "fix: update readme with include_shard_recovery_op param"
salvatore-campagna Jan 20, 2025
22da06f
Revert "fix: gate execution of shard-recovery operation"
salvatore-campagna Jan 20, 2025
7af8cff
Revert "fix: p_recovery_target instead of recovery_target"
salvatore-campagna Jan 20, 2025
421cd94
Revert "fix: move shard-recovery operation out of if statement"
salvatore-campagna Jan 20, 2025
3099736
Revert "fix: replace challenge with new operation in existing challenge"
salvatore-campagna Jan 20, 2025
8c01702
fix: add retention elase and remove max bucket setting
salvatore-campagna Jan 20, 2025
5bb4b58
fix: missing comma
salvatore-campagna Jan 20, 2025
a5b83cc
fix: use variable name with default
salvatore-campagna Jan 20, 2025
3d51874
fix: reduce warmup and iterations
salvatore-campagna Jan 20, 2025
cc9efd9
fix: increase warmup and iterations
salvatore-campagna Jan 20, 2025
40aef67
fix: serverless markers
salvatore-campagna Jan 20, 2025
c052cef
fix: move misplaced comma
salvatore-campagna Jan 21, 2025
4977014
fix: move misplaced comma
salvatore-campagna Jan 21, 2025
7c48efd
fix: index mode only for tsdb
salvatore-campagna Jan 21, 2025
4ab1173
fix: remove post ingest sleep
salvatore-campagna Jan 21, 2025
e83c1de
fix: remove condition on serverless operator
salvatore-campagna Jan 21, 2025
8dd4f73
fix: move add-retention-lease before indexing
salvatore-campagna Jan 22, 2025
00eb801
fix: remove misplaced commas
salvatore-campagna Jan 22, 2025
9111582
fix: tsdb missing comma
salvatore-campagna Jan 22, 2025
1d5e905
fix: further simplify synthetic-source-recovery challenge
salvatore-campagna Jan 22, 2025
0422faf
fix: typo target instead of tagret
salvatore-campagna Jan 22, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions elastic/logs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -223,6 +223,11 @@ The following parameters are available:
* `synthetic_source_keep` (default: unset): If specified, configures the `index.mapping.synthetic_source_keep` index setting.
* `source_mode` (default: unset) - Specifies the source mode to be used.
* `use_synthetic_source_recovery` (default: unset): Whether synthetic recovery source will be used.
* `recovery_target` (required) - The target index or data stream for fetching shard changes via the recovery API.
* `recovery_from_seq_no` (default: `0`) - The sequence number from which to start fetching translog operations.
* `recovery_poll_timeout` (default: `1m`) - The maximum time to wait for additional translog operations before returning an empty result.
* `recovery_max_batch_size` (default: `32MB`) - The maximum estimated size for the batch of translog operations to return.
* `recovery_max_operations_count` (default: `1048576`) - The maximum number of translog operations to return in a single batch.

### Data Download Parameters

Expand Down
54 changes: 54 additions & 0 deletions elastic/logs/challenges/synthetic-source-recovery.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,54 @@
{% import "rally.helpers" as rally %}
{
"name": "synthetic-source-recovery",
"description": "Indexes logs for a specified time period and volume per day and then runs the recovery api fetching translog operations for a given target index.",
"default": false,
"schedule": [
{% include "tasks/index-setup.json" %},
{# serverless-shard-recovery-marker-start #}{%- if build_flavor != "serverless" -%}
{
"name": "add-retention-lease",
"operation": {
"operation-type": "raw-request",
"path": {{ "/%s/seq_no/add_retention_lease" | format(p_recovery_target) | tojson }},
"method": "PUT"
},
"warmup-iterations": 0,
"iterations": 1
},
{%- endif -%}{# serverless-shard-recovery-marker-end #}
{
"name": "bulk-index",
"operation": {
"operation-type": "raw-bulk",
"param-source": "processed-source",
"time-format": "milliseconds",
"profile": "fixed_interval",
"init-load": true,
"bulk-size": {{ p_bulk_size }},
"detailed-results": true
},
"clients": {{ p_bulk_indexing_clients }},
"ignore-response-error-level": "{{error_level | default('non-fatal')}}"
}
{# serverless-shard-recovery-marker-start #}{%- if build_flavor != "serverless" -%}
,
{
"name": "shard-recovery",
"operation": {
"operation-type": "raw-request",
"path": {{ "/%s/ccr/shard_changes" | format(p_recovery_target) | tojson }},
"method": "GET",
"request-params": {
"from_seq_no": {{ p_recovery_from_seq_no | tojson }},
"max_batch_size": {{ p_recovery_max_batch_size | tojson }},
"poll_timeout": {{ p_recovery_poll_timeout | tojson }},
"max_operations_count": {{ p_recovery_max_operations_count | tojson }}
}
},
"warmup-iterations": 10,
"iterations": 20
}
{%- endif -%}{# serverless-shard-recovery-marker-end #}
]
}
6 changes: 6 additions & 0 deletions elastic/logs/track.json
Original file line number Diff line number Diff line change
Expand Up @@ -60,6 +60,12 @@
{% set p_dsl_default_rollover = (dsl_default_rollover | default(false) ) %}
{% set p_skip_fleet_globals = (skip_fleet_globals | default(false) ) %}

{% set p_recovery_target = (recovery_target | default('logs-k8-application.log-default') ) %}
martijnvg marked this conversation as resolved.
Show resolved Hide resolved
{% set p_recovery_from_seq_no = (recovery_seq_no | default(0) ) %}
{% set p_recovery_max_batch_size = (recovery_max_batch_size | default ("32MB") ) %}
{% set p_recovery_poll_timeout = (recovery_poll_timeout | default("1m")) %}
{% set p_recovery_max_operations_count = (recovery_max_operations_count | default(1048576)) %}

{% set p_worker_threads_enabled = (worker_threads_enabled | default(true)) %}

{% set es_version = "7.13.2" %}
Expand Down
5 changes: 5 additions & 0 deletions elastic/security/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -87,6 +87,11 @@ The following parameters are available:
* `synthetic_source_keep` (default: unset): If specified, configures the `index.mapping.synthetic_source_keep` index setting.
* `source_mode` (default: unset) - Specifies the source mode to be used.
* `use_synthetic_source_recovery` (default: unset): Whether synthetic recovery source will be used.
* `recovery_target` (required) - The target index or data stream for fetching shard changes via the recovery API.
* `recovery_from_seq_no` (default: `0`) - The sequence number from which to start fetching translog operations.
* `recovery_poll_timeout` (default: `1m`) - The maximum time to wait for additional translog operations before returning an empty result.
* `recovery_max_batch_size` (default: `32MB`) - The maximum estimated size for the batch of translog operations to return.
* `recovery_max_operations_count` (default: `1048576`) - The maximum number of translog operations to return in a single batch.

### Data Generation Parameters

Expand Down
53 changes: 53 additions & 0 deletions elastic/security/challenges/synthetic-source-recovery.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,53 @@
{% import "rally.helpers" as rally %}
{
"name": "synthetic-source-recovery",
"description": "Indexes security data for a specified time period and volume per day and then runs the recovery api fetching translog operations for a given target index.",
"default": false,
"schedule": [
{% include "tasks/index-setup.json" %},
{# serverless-shard-recovery-marker-start #}{%- if build_flavor != "serverless" -%}
{
"name": "add-retention-lease",
"operation": {
"operation-type": "raw-request",
"path": {{ "/%s/seq_no/add_retention_lease" | format(p_recovery_target) | tojson }},
"method": "PUT"
},
"warmup-iterations": 0,
"iterations": 1
},
{%- endif -%}{# serverless-shard-recovery-marker-end #}
{
"name": "bulk-index",
"operation": {
"operation-type": "raw-bulk",
"param-source": "processed-source",
"time-format": "milliseconds",
"profile": "fixed_interval",
"detailed-results": true,
"bulk-size": {{ (bulk_size | default(50)) }}
},
"ingest-percentage": {{ ingest_percentage | default(100) }},
"clients": {{ p_bulk_indexing_clients }}
}
{# serverless-shard-recovery-marker-start #}{%- if build_flavor != "serverless" -%}
,
{
"name": "shard-recovery",
"operation": {
"operation-type": "raw-request",
"path": {{ "/%s/ccr/shard_changes" | format(p_recovery_target) | tojson }},
"method": "GET",
"request-params": {
"from_seq_no": {{ p_recovery_from_seq_no | tojson }},
"max_batch_size": {{ p_recovery_max_batch_size | tojson }},
"poll_timeout": {{ p_recovery_poll_timeout | tojson }},
"max_operations_count": {{ p_recovery_max_operations_count | tojson }}
}
},
"warmup-iterations": 10,
"iterations": 20
}
{%- endif -%}{# serverless-shard-recovery-marker-end #}
]
}
5 changes: 5 additions & 0 deletions elastic/security/track.json
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,11 @@
{% set p_number_of_replicas = (number_of_replicas | default(1)) %}
{% set p_skip_delete_component_template = (skip_delete_component_template | default(false) ) %}
{% set p_logs_endpoint_from_kibana = (logs_endpoint_from_kibana | default(false) ) %}
{% set p_recovery_from_seq_no = (recovery_seq_no | default(0) ) %}
{% set p_recovery_target = ( recovery_target | default('filebeat-default') ) %}
{% set p_recovery_max_batch_size = (recovery_max_batch_size | default ("32MB") ) %}
{% set p_recovery_poll_timeout = (recovery_poll_timeout | default("1m")) %}
{% set p_recovery_max_operations_count = (recovery_max_operations_count | default(1048576)) %}
{% set p_include_non_serverless_index_settings = (include_non_serverless_index_settings | default(build_flavor != "serverless")) %}
{% set p_integration_ratios = (integration_ratios | default({
"auditbeat": {
Expand Down
5 changes: 5 additions & 0 deletions tsdb/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -205,6 +205,11 @@ This track allows to overwrite the following parameters using `--track-params`:
* `document_ids`: documentd IDs to use for search, get and mget apis in the `low-latency` challenge. If empty, a default set of 4 values is used.
* `synthetic_source_keep` (default: unset): If specified, configures the `index.mapping.synthetic_source_keep` index setting.
* `use_synthetic_source_recovery` (default: unset): Whether synthetic recovery source will be used.
* `recovery_target` (required) - The target index or data stream for fetching shard changes via the recovery API.
* `recovery_from_seq_no` (default: `0`) - The sequence number from which to start fetching translog operations.
* `recovery_poll_timeout` (default: `1m`) - The maximum time to wait for additional translog operations before returning an empty result.
* `recovery_max_batch_size` (default: `32MB`) - The maximum estimated size for the batch of translog operations to return.
* `recovery_max_operations_count` (default: `1048576`) - The maximum number of translog operations to return in a single batch.

### License

Expand Down
65 changes: 65 additions & 0 deletions tsdb/challenges/synthetic-source-recovery.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,65 @@
{
"name": "synthetic-source-recovery",
"description": "Indexes the whole document corpus and then runs the recovery api fetching translog operations for a given target index.",
"default": false,
"schedule": [
{
"operation": "delete-index"
},
{
"operation": {
"operation-type": "create-index",
"settings": {{index_settings | default({}) | tojson}}
}
},
{
"name": "check-cluster-health",
"operation": {
"operation-type": "cluster-health",
"index": "tsdb",
"request-params": {
"wait_for_status": "{{cluster_health | default('green')}}",
"wait_for_no_relocating_shards": "true"
},
"retry-until-success": true
}
},
{# serverless-shard-recovery-marker-start #}{%- if build_flavor != "serverless" -%}
{
"name": "add-retention-lease",
"operation": {
"operation-type": "raw-request",
"path": {{ "/%s/seq_no/add_retention_lease" | format(p_recovery_target) | tojson }},
"method": "PUT"
},
"warmup-iterations": 0,
"iterations": 1
},
{%- endif -%}{# serverless-shard-recovery-marker-end #}
{
"operation": "index",
"warmup-time-period": 240,
"clients": {{bulk_indexing_clients | default(8)}},
"ignore-response-error-level": "{{error_level | default('non-fatal')}}"
}
{# serverless-shard-recovery-marker-start #}{%- if build_flavor != "serverless" -%}
,
{
martijnvg marked this conversation as resolved.
Show resolved Hide resolved
"name": "shard-recovery",
"operation": {
"operation-type": "raw-request",
"path": {{ "/%s/ccr/shard_changes" | format(p_recovery_target) | tojson }},
"method": "GET",
"request-params": {
"from_seq_no": {{ p_recovery_from_seq_no | tojson }},
"max_batch_size": {{ p_recovery_max_batch_size | tojson }},
"poll_timeout": {{ p_recovery_poll_timeout | tojson }},
"max_operations_count": {{ p_recovery_max_operations_count | tojson }}
}
},
"warmup-iterations": 10,
"iterations": 20
}
{%- endif -%}{# serverless-shard-recovery-marker-end #}
]
}
22 changes: 6 additions & 16 deletions tsdb/track.json
Original file line number Diff line number Diff line change
@@ -1,30 +1,20 @@
{% import "rally.helpers" as rally with context %}

{% set p_recovery_target = ( recovery_target | default('tsdb') ) %}
{% set p_recovery_from_seq_no = (recovery_seq_no | default(0) ) %}
{% set p_recovery_max_batch_size = (recovery_max_batch_size | default ("32MB") ) %}
{% set p_recovery_poll_timeout = (recovery_poll_timeout | default("1m")) %}
{% set p_recovery_max_operations_count = (recovery_max_operations_count | default(1048576)) %}

{
"version": 2,
"description": "metricbeat information for elastic-app k8s cluster",
{%- if ingest_mode is defined and ingest_mode == "data_stream" %}
"composable-templates": [
{
"name": "tsdb-template",
"index-pattern": "k8s*",
"delete-matching-indices": true,
"template": "index-template.json"
}
],
"data-streams": [
{
"name": "k8s"
}
],
{%- else %}
"indices": [
{
"name": "tsdb",
"body": "index.json"
}
],
{%- endif %}
"corpora": [
{
"name": "tsdb",
Expand Down
Loading