Upload developer notes: understanding errors and triggers #724

constanca-m · 2024-05-28T15:23:47Z

What does this PR do?

This PR adds some notes to help the maintenance of ESF. Please read the file and the comments for more details.

Signed-off-by: constanca <[email protected]>

constanca-m · 2024-05-28T15:26:58Z

dev-corner/README.md

+- Ingestion errors cause the trigger `replay-sqs` to be activated
+- Configuration errors use the trigger `sqs`, set by the user
+- The `config.yaml` to consume messages affected by ingestion errors needs to have inputs for the replay queue and for the resource that produced those messages
+- The input for the replay queue to consume ingestion errors is completely discarded. The output used will not be the one from the replay queue, but the one from the input that produced the message


This is an issue. Why does the user specify outputs for the input replay queue if they are not used?

Good point.

By adding an input to fetch messages from the replay queue (SQS), we can route messages back to ESF. However, these messages will use the original input, including the output, that produced them.

It makes sense to try to honor the original path, but it's also confusing to have an output setting that has no effect.

Could having a replay-sqs input type help reduce ambiguity? Also, making output optional—at least for this type—with override could also help.

Agree! Do we want the users to have the ability to change output when using replay-sqs?

constanca-m · 2024-05-28T15:28:03Z

dev-corner/README.md

+- Configuration errors use the trigger `sqs`, set by the user
+- The `config.yaml` to consume messages affected by ingestion errors needs to have inputs for the replay queue and for the resource that produced those messages
+- The input for the replay queue to consume ingestion errors is completely discarded. The output used will not be the one from the replay queue, but the one from the input that produced the message
+- There is no way to consume messages in the replay queue put there for configuration errors


This is an issue. The current config.yaml should be used, not the one in the message attributes. This only means we get stuck in a loop.

Oh, that's why the config is also stored in the message, right?

We probably need to reevaluate this behavior.

So we are losing messages in this case?

So we are losing messages in this case?

We are not loosing them because they are in the replay queue, and later get sent to the DLQ. But we cannot forward them to a custom output.

constanca-m · 2024-05-28T15:29:15Z

Does it make sense to even set the replay queue as an input the same way we set the other inputs?

The input type of the replay queue should not be set by the user. If the outputs don't matter for the replay queue, then the user should not need to specify them either.

zmoog · 2024-05-28T21:44:41Z

Does it make sense to even set the replay queue as an input the same way we set the other inputs?

I guess the intent was probably to let users decide when and how to reprocess messages in the replay queue.

The input type of the replay queue should not be set by the user. If the outputs don't matter for the replay queue, then the user should not need to specify them either.

ESF creates the replay queue during the installation, right? To enable reprocessing the messages in the replay queue, users must manually create a trigger to pull the message to replay from the SQS queue to the lambda function, right?

constanca-m · 2024-05-29T07:15:04Z

ESF creates the replay queue during the installation, right?

Yes.

To enable reprocessing the messages in the replay queue, users must manually create a trigger to pull the message to replay from the SQS queue to the lambda function, right?

Yes.

bturquet

Approved to get the documentation update

constanca-m added 2 commits May 28, 2024 17:22

Update dev notes

4c9879c

Signed-off-by: constanca <[email protected]>

update toc

20bed4d

Signed-off-by: constanca <[email protected]>

constanca-m commented May 28, 2024

View reviewed changes

constanca-m self-assigned this May 28, 2024

constanca-m requested a review from a team May 28, 2024 15:29

bturquet approved these changes May 29, 2024

View reviewed changes

constanca-m merged commit befdd23 into elastic:main May 29, 2024
4 checks passed

constanca-m deleted the understand-replay-sqs-trigger branch May 29, 2024 15:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upload developer notes: understanding errors and triggers #724

Upload developer notes: understanding errors and triggers #724

constanca-m commented May 28, 2024

constanca-m May 28, 2024

zmoog May 28, 2024

kaiyan-sheng May 28, 2024

constanca-m May 28, 2024

zmoog May 28, 2024

kaiyan-sheng May 28, 2024

constanca-m May 29, 2024 •

edited

Loading

constanca-m commented May 28, 2024

zmoog commented May 28, 2024

constanca-m commented May 29, 2024

bturquet left a comment

Upload developer notes: understanding errors and triggers #724

Upload developer notes: understanding errors and triggers #724

Conversation

constanca-m commented May 28, 2024

What does this PR do?

constanca-m May 28, 2024

Choose a reason for hiding this comment

zmoog May 28, 2024

Choose a reason for hiding this comment

kaiyan-sheng May 28, 2024

Choose a reason for hiding this comment

constanca-m May 28, 2024

Choose a reason for hiding this comment

zmoog May 28, 2024

Choose a reason for hiding this comment

kaiyan-sheng May 28, 2024

Choose a reason for hiding this comment

constanca-m May 29, 2024 • edited Loading

Choose a reason for hiding this comment

constanca-m commented May 28, 2024

zmoog commented May 28, 2024

constanca-m commented May 29, 2024

bturquet left a comment

Choose a reason for hiding this comment

constanca-m May 29, 2024 •

edited

Loading