Update: Greg Linden points to a new Google article MapReduce: simplified data processing on large clusters. Some interesting stats: 100k MapReduce jobs are executed each day; more than 20 petabytes of data are processed per day; more than 10k MapReduce programs have been implemented; machines are dual processor with gigabit ethernet and 4-8 GB of memory.
Google is the King of scalability. Everyone knows Google for their large, sophisticated, and fast searching, but they don't just shine in search. Their platform approach to building scalable applications allows them to roll out internet scale applications at an alarmingly high competition crushing rate. Their goal is always to build a higher performing higher scaling infrastructure to support their products. How do they do that?
Recent comments
3 hours 16 min ago
13 hours 9 min ago
14 hours 34 min ago
19 hours 40 min ago
20 hours 26 min ago
22 hours 46 min ago
1 day 5 hours ago
1 day 21 hours ago
2 days 1 hour ago
2 days 2 hours ago