Hello, we use our VCO cluster in a production situation and I need to find some way to better monitor it for uptime.
We have the F5 and monitoring through Nagios setup to check https://VCOSERVER:8281/vco. The issue is we have had a couple instances where this was functioning and our monitoring tools didn't report VCO as being down because it wasn't completely down. Part of VCO was running but workflows were failing and killing deployments. At the same time the java client wasn't working ether. After a restart thing start running fine.
We are getting a lot of heat because we use VCO for all our VM deployments for internal and external users. I need a solution to better monitor VCO. Does anyone have anything?