Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

timeout issues with hdbnsutil -sr_stateConfiguration #53

Open
rscottwatson opened this issue Feb 12, 2019 · 2 comments
Open

timeout issues with hdbnsutil -sr_stateConfiguration #53

rscottwatson opened this issue Feb 12, 2019 · 2 comments

Comments

@rscottwatson
Copy link

rscottwatson commented Feb 12, 2019

Hi Guys,

We have noticed that some of our clusters are failing over because we have 4 consecutive time out on the command hdbnsutil -sr_stateConfiguration. Do you know why that would happen? Normally this runs in about 4 seconds but once it has failed 4 times all hell breaks loose in the cluster. I am running an older version of the RAs 152.17 however, the latest code (.22) only seem to add a check for a 124 timeout on the hana call which doesn't explain why this would happen.

Thanks
Scott

@fdanapfel
Copy link
Collaborator

Hi Scott,

since hdbnsutil is developed and maintained by SAP, and we (as in the upstream maintainers of the resource agents) do not have any insights in what it is actually doing, you will have to work with SAP to figure out what is the reason why the command sometimes times out.

Based on my experience such time outs are usually caused by performance issues on the HANA side, therefore I would recommend to start verifying that your HANA setup meets all the KPIs required by SAP.

If you are still using version 152.17 of the resource agents I would strongly recommend to update to the latest version provided by your distribution, While the newer versions most likely won' fix your issue with the hdbnsutil timouts (since they are out of our control as mentioned before) these newer versions contain many fixes and improvements that can help to get a more stable HANA Scale-Up SR HA setup.

Regards,
Frank

@fmherschel
Copy link
Owner

Hi Frank,

thanks for answering. And I also recommend to use always the newest RA available in the
enterprise products.

If this issue is about a productive cluster you should also open an SAP ticket and set it either to the
SUSE or Red Hat component.

This open source initiative is not intended to work directly on customer issues as support also needs contracts between the organization which needs the help and the organization which provides the help.

So only general statements could come from here.

We have heard that sometimes parts of the SAP HANA (like the master name server) could be non-
reactive for some time and then commands like ' hdbnsutil' are hanging. Your expectation, that the
runtime of the command should be less then 5 seconds is also my assumption.

Regards
Fabian

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants