You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We have noticed that some of our clusters are failing over because we have 4 consecutive time out on the command hdbnsutil -sr_stateConfiguration. Do you know why that would happen? Normally this runs in about 4 seconds but once it has failed 4 times all hell breaks loose in the cluster. I am running an older version of the RAs 152.17 however, the latest code (.22) only seem to add a check for a 124 timeout on the hana call which doesn't explain why this would happen.
Thanks
Scott
The text was updated successfully, but these errors were encountered:
since hdbnsutil is developed and maintained by SAP, and we (as in the upstream maintainers of the resource agents) do not have any insights in what it is actually doing, you will have to work with SAP to figure out what is the reason why the command sometimes times out.
Based on my experience such time outs are usually caused by performance issues on the HANA side, therefore I would recommend to start verifying that your HANA setup meets all the KPIs required by SAP.
If you are still using version 152.17 of the resource agents I would strongly recommend to update to the latest version provided by your distribution, While the newer versions most likely won' fix your issue with the hdbnsutil timouts (since they are out of our control as mentioned before) these newer versions contain many fixes and improvements that can help to get a more stable HANA Scale-Up SR HA setup.
thanks for answering. And I also recommend to use always the newest RA available in the
enterprise products.
If this issue is about a productive cluster you should also open an SAP ticket and set it either to the
SUSE or Red Hat component.
This open source initiative is not intended to work directly on customer issues as support also needs contracts between the organization which needs the help and the organization which provides the help.
So only general statements could come from here.
We have heard that sometimes parts of the SAP HANA (like the master name server) could be non-
reactive for some time and then commands like ' hdbnsutil' are hanging. Your expectation, that the
runtime of the command should be less then 5 seconds is also my assumption.
Hi Guys,
We have noticed that some of our clusters are failing over because we have 4 consecutive time out on the command hdbnsutil -sr_stateConfiguration. Do you know why that would happen? Normally this runs in about 4 seconds but once it has failed 4 times all hell breaks loose in the cluster. I am running an older version of the RAs 152.17 however, the latest code (.22) only seem to add a check for a 124 timeout on the hana call which doesn't explain why this would happen.
Thanks
Scott
The text was updated successfully, but these errors were encountered: