PROBLEM DESCRIPTION
At approximately 6:50 BST, Gradwell’s system administrators were alerted to some system issues affecting a multitude of Gradwell services.
Affected Services:
Web services
Control panels
VoIP services
Customer Impact:
Affected customers will be seeing errors when accessing hosted websites/control panels and will be seeing errors when attempting to make outbound calls.
Estimated Resolution Time:
Our system admin team are working on this now and will update again at or before 09:30.
***Update*** 9:35
VoIP services and control panels should now be working as expected, we are still working on the web clusters and expect to have these back online shortly. We will update again at or before 10:30
***Update*** 10:36
The web cluster is now back online and all services should be running as expected.
There may be some slowdown on control panels as systems are busy processing any backlogs.
We are continuing to monitor and will update again at 11:30
***Update*** 11:29
All systems are running correctly and remain stable. We will continue to monitor closely for the next few hours and update/close this status at 13:30
***Update*** 13:20
All systems are now running correctly and we are now closing this status update.
The problem has been identified as being one of our DNS cache servers. This cache server, 193.111.200.191, stopped responding and this in turn caused our master MySQL server to effectively lock up. This then failed to respond to queries correctly. The majority of our infrastructure relies on this database, hence parts of it became unstable.
We apologise for any problems this has caused you.