Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Monitoring, statistics and alerting system #937

Open
nour-massri opened this issue Jan 28, 2025 · 0 comments
Open

Monitoring, statistics and alerting system #937

nour-massri opened this issue Jan 28, 2025 · 0 comments
Labels
module: backend Related to the Siarnaq backend module module: devops Related to deployments and other operations module: saturn Related to the Saturn module priority: p2 medium triage New or unclassified issue type: feature New feature or request, or quick non-essential bugfix

Comments

@nour-massri
Copy link
Member

During battlecode 2025 we experience a variety of issues:

  • matches that last for a long time -> immediate solution: needed to cancel them
  • queue get so big and saturn isn't executing any -> immediate solution: purge messages in pub/sub and re-queue again
  • ranked matches rating not calculated -> recalculate all ratings from oldest to newest

these problem shouldn't have happened in the first place and we are working on making our system more robust to this kinda of problems, however this brings the importance of having a monitoring system that monitors the functionality of our system and reports any disruptions by emailing the devs

@nour-massri nour-massri added module: backend Related to the Siarnaq backend module module: devops Related to deployments and other operations module: saturn Related to the Saturn module triage New or unclassified issue type: feature New feature or request, or quick non-essential bugfix priority: p2 medium labels Jan 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: backend Related to the Siarnaq backend module module: devops Related to deployments and other operations module: saturn Related to the Saturn module priority: p2 medium triage New or unclassified issue type: feature New feature or request, or quick non-essential bugfix
Projects
None yet
Development

No branches or pull requests

1 participant