Today British police made available their new service – a complete street level map of crimes in England and Wales. The webpage made to all possible headlines in daily papers, tv and radio. The effect is that the service is virtually down and won’t be usable until the novelty wears out.
All web pages have a peak from time to time. It can by due to various reasons – PR publication, Facebook craze, advertisement campaign, algorithm change at google or change of cosmic radiation. You can predict some peaks, some will come as a total surprise. In both cases the effect is the same – the system overloads and you’re starting losing users. That’s bad, really bad.
What could site owners do to prevent this? There are couple of options:
- Scale IT infrastructure to handle the traffic during peaks. This sounds good until you start calculating cost. If you scale for a peak 10 times larger than normal traffic, you will spent 10 times more money. And how can you be sure the peak won’t be 11 times normal traffic? Or 101 times? Or 1001? And what would you do if when the peak is over? You can donate unused computing power to [email protected], but I doubt your CFO will appreciate this.
- Deploy to a cloud. When your application is running in a cloud and you foreseen incoming peak, because you’ve bought nation wide TV ad campaign for exmaple, you allocate more resources for some time (for example start some new instances on Amazon EC2). And when the peak is unexpected? You allocate more resources when it happens! Or if you set up the cloud to allocate additional resources automatically for you, you just sit and monitor if everything works smoothly. When the traffic calms down, you just switch off unwanted resources and stop paying for them.
“What’s the catch?”, you may ask. There is none. The whole process just requires some planning in advance. Designing application for scalability is slightly tougher, but it’s not a black magic. And it’s not even rocket science. There are many infrastructure scalability patterns detailed on web. You will have to make some technical decisions on load balancing, session sharing, fallback procedures. The testing is also bit more complicated, but only due to fact of additional features. The rest will be exactly the same as normal design and development job.
I have always known that police is a useful force ;)
[email protected]:~$ dig +short http://www.police.uk
policeuk-167782603.eu-west-1.elb.amazonaws.com.
46.137.113.103
[email protected]:~$ dig +short -x 46.137.113.103
ec2-46-137-113-103.eu-west-1.compute.amazonaws.com.
Don’t let reality spoil your “point”, will you?
Thanks Jon for that. What this proves is that the technology won’t do anything for you, the proper design and development are still crucial!
I wonder what was the bottleneck on their side. Database?
God knows. Just proves “cloud” doesn’t mean a damn thing. It’s just all servers, after all.