Recent Downtime

As many of you are probably aware, the Curse family of sites has been down since early AM (PST) the 22nd. This post is meant to clear up any questions that may arise about this event.


At approximently 7:30AM PST, on June 22nd, our primary SAN controller experienced a catastrophic failure, bringing it offline. This controller was the primary controller for most of the database nodes, and most of the web nodes. Redundant systems failed to come online, even though they had reported as 'ready' before the primary systems failed.

After replacing the failed controller, it begain booting and copying it's configuration from it's peer server. Unfortunately, as soon as the configuration was copied, the secondary controller also died.

After replacing the second failed controller, we began powering the servers that reiled on the SAN for their data - all of the database servers and the network-attached storage (NAS) file servers that store all the media, static content, and most of our web files as well.

This process only took a few hours. The main delay was an extended period (24 hours) of checking the continuty of the data on the disks. This was the majority of the downtime experienced.

As we started pulling our servers online after check was complete, we noticed an issue with the firmware's on the new SAN controllers. The newer firmware versions were conflicting with the storage array, and thus, the controllers couldn't talk to the disks.

The manufacturer told us that this was a known issue, and provided us with a method of repairing it. We then backed up all the data again, and proceeded to apply the firmware patch.

After the patch, we were able to restore the drives, and start booting up the critical systems, followed by the non-critical systems.

We can assure you that at no time during the hardware failure was any of your personal information compromised. We take the sacred trust you put in us with your information VERY seriously.

Thank You

We realize that you rely on the forums as part of your Minecraft experience. Once again, we sincerely apologize for the downtime and hope you'll continue to enjoy the Minecraft Forums.

The Minecraft Forums Team

Comments

  • To post a comment, please .
Posts Quoted:
Reply
Clear All Quotes