To keep our Selenium grid with over 1500+ real browsers healthy, we use various tools and techniques to monitor and troubleshoot potential problems.
Since we're booting thousands of pristine VMs per day for you to run tests on, and since computers and networks sometimes act weird, this monitoring is essential for us to quickly fix issues and keep our grid of browsers healthy at all times.
TestingBot Status Updates
You can keep track of our status updates on this status page: https://status.testingbot.com/. We will post about issues, maintenance updates and other status updates on this page.
Monitoring
At TestingBot, we have several internal/external alerts and monitoring tools to help us with quickly troubleshooting any issues as they come up.
We're using tools like Zabbix
, Grafana + InfluxDB
and other custom built tools to monitor our Selenium Grid.
This way, we continue on improving our service availability for everyone using our service.