forum Healthcheck isn't reliable #2558

Open
Ardiea opened this issue Jul 18, 2024 · 2 comments
Labels
bug Something isn't working product:infrastructure Issues related to application and operations infrastructure project:open-edx

Comments

@Ardiea
Member

Ardiea commented Jul 18, 2024

Expected Behavior

If forum can't talk to its mongodb or opensearch backends, the app should crash / stop outright, not enter a funky state where the ASG / LB healthcheck passes but the app itself isn't working.
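
A rough sketch of that fail-fast behavior, written in Python only for illustration (this is not how forum is implemented). It assumes a plain TCP connect is an acceptable reachability signal; the `backend_reachable` and `watch` names, the thresholds, and the host names in the usage comment are all hypothetical.

```python
import socket
import time


def backend_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS resolution failures.
        return False


def watch(backends, max_failures=3, interval=10.0,
          check=backend_reachable, sleep=time.sleep):
    """Poll every backend; return exit code 1 after max_failures consecutive misses.

    Runs indefinitely while the backends stay reachable.
    """
    failures = 0
    while failures < max_failures:
        if all(check(host, port) for host, port in backends):
            failures = 0  # any fully successful round resets the counter
        else:
            failures += 1
        if failures < max_failures:
            sleep(interval)
    return 1


# Hypothetical usage inside the container entrypoint:
#   sys.exit(watch([("mongodb.example.internal", 27017),
#                   ("opensearch.example.internal", 9200)]))
```

Exiting non-zero lets the container runtime or ASG replace the instance instead of leaving it half-alive. A TCP connect only shows the port is open, so a real check would probably need an actual query against each backend.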

Current Behavior

If forum can't find its mongodb or opensearch instances for 10 minutes, it just stops looking for them and enters a catatonic state where it is still 'running' well enough for the LB healthchecks to pass but isn't really working: it won't answer any requests, and the container is possibly stopped / not listening.

Possible Solution

Put traefik in front of the container to create a healthcheck endpoint that works?
Figure out the behavior of forum and adjust the healthcheck status matcher appropriately.
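
For comparison, here is a minimal sketch of what a dependency-aware healthcheck endpoint could look like, as an alternative to the traefik idea rather than a description of it. It is plain standard-library Python; the `/healthz` path, the backend host names, and the `probe` / `health_status` helpers are assumptions made up for this example.

```python
import json
import socket
from http.server import BaseHTTPRequestHandler

# Hypothetical addresses; substitute the real MongoDB / OpenSearch endpoints.
BACKENDS = {
    "mongodb": ("mongodb.example.internal", 27017),
    "opensearch": ("opensearch.example.internal", 9200),
}


def probe(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def health_status(backends, probe_fn=probe):
    """Return (http_status, per-backend results); 503 if any backend is down."""
    results = {name: probe_fn(host, port) for name, (host, port) in backends.items()}
    return (200 if all(results.values()) else 503), results


class HealthHandler(BaseHTTPRequestHandler):
    """Serves GET /healthz with the aggregated backend status."""

    def do_GET(self):
        if self.path != "/healthz":
            self.send_error(404)
            return
        code, results = health_status(BACKENDS)
        body = json.dumps(results).encode()
        self.send_response(code)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


# Hypothetical usage:
#   from http.server import HTTPServer
#   HTTPServer(("0.0.0.0", 8080), HealthHandler).serve_forever()
```

With something like this, the LB status matcher would accept only 200, so a 503 (or no answer at all) marks the instance unhealthy. It still would not catch the case where forum itself has stopped answering requests, so the check would likely also need to hit the app.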

Additional Details

Discussion starting here and going to about 4pm that day. https://mitodl.slack.com/archives/C02QLTAE05S/p1721329113019089

@Ardiea Ardiea added the bug Something isn't working label Jul 18, 2024
@blarghmatey blarghmatey added product:infrastructure Issues related to application and operations infrastructure project:open-edx labels Aug 2, 2024
@pdpinch
Member

pdpinch commented Oct 25, 2024

Do you think we can work on this and have it deployed with the Sumac updates to xPRO and Residential MITx?

cc @blarghmatey and @feoh

@pdpinch
Member

pdpinch commented Dec 20, 2024

We missed our opportunity with Sumac. The parameters will change if/when we move to forums v2 and mysql instead of mongodb.


3 participants