We’ve seen a lot of stories this year about big tech companies getting hacked while others were having server issues leading to internet outages which aren’t anything new in the tech industry at large.
The most recent one is from the world’s largest enterprise cloud hosting service provider, Amazon Web Service or AWS for short. The company which has made billions of dollars hosting services of other big tech companies recently had a server outage but the company provided some explanation as to what led to the outage.
AWS made it known that while its services have a fair share of the blame for what caused the outage, third-party websites that were hosted on the platform also had an impact.
According to AWS, there was an automated process from its own end that led to the outage which start at about 10:30 AM ET and the spike in congestion prevented the company’s operations team from using its real-time monitoring system.
“An automated activity to scale capacity of one of the AWS services hosted in the main AWS network triggered an unexpected behavior from a large number of clients inside the internal network,” Amazon’s report says.
“This resulted in a large surge of connection activity that overwhelmed the networking devices between the internal network and the main AWS network, resulting in delays for communication between these networks.”
The report further stated that the issue impacted Amazon’s ability to see what had happened with the system.
The company’s operations team was prevented from using the real-time monitoring system and internal controls that they rely on to get notified about such a situation – which is why the outage took very long to fix lasting for several hours and ending around 5:22 PM ET.
The worst-case was the fact that customers were unable to create support cases for several hours during the outage because Amazon’s support customer contact center also runs on the AWS network so all the systems were literally down.
Another important service hit by this outage was Amazon’s Service Health dashboard that provides status updates.
But now, Amazon is saying that it’s trying its possible best to combat any future occurrence by working on its response to outages as well as formulating plans to release a revamped version of the Service Health Dashboard that will help customers across receive timely updates in case of a future outage.
Services affected during the outage include Venmo, Tinder, Disney Plus, Roomba, and even some Amazon deliveries. And this wouldn’t be the first time as the company also had a lengthy outage last year causing a devastating effect on its customers’ sites and apps.