Thursday
Jun 10, 2010

The Four Meta Secrets of Scaling at Facebook

Aditya Agarwal, Director of Engineering at Facebook, gave an excellent Scale at Facebook talk that covers their architecture, but the talk is really more about how to scale an organization by preserving the best parts of its culture. The key takeaway of the talk is:

You can get the code right, you can get the products right, but you need to get the culture right first. If you don't get the culture right then your company won't scale.

This leads into the four meta secrets of scaling at Facebook:

  1. Scaling takes Iteration
  2. Don't Over Design
  3. Choose the right tool for the job, but realize that your choice comes with overhead.
  4. Get the culture right. Move Fast - break things. Huge Impact - small teams. Be bold - innovate.

Click to read more ...

Wednesday
Jun 9, 2010

Paper: Propagation Networks: A Flexible and Expressive Substrate for Computation 

Alexey Radul, in his fascinating 174-page dissertation Propagation Networks: A Flexible and Expressive Substrate for Computation, offers to help us break free of the tyranny of linear time by organizing computation as a network of autonomous but interconnected machines. Each machine is free to run when it pleases, propagating information around the network as proves possible. The consequence of this freedom is that the structure of the aggregate does not impose an order of time. The abstract from his thesis is:

Click to read more ...

Tuesday
Jun 8, 2010

Sponsored Post: Jobs: Digg, Huffington Post Events: Velocity Conference, Social Developer Summit

Monday
Jun 7, 2010

Six Ways Twitter May Reach its Big Hairy Audacious Goal of One Billion Users

Twitter has a big hairy audacious goal of reaching one billion users by 2013. Three forces stand against Twitter. First, the world will end in 2012. But let's be optimistic and assume we'll make it. Next is Facebook. Currently Facebook is the user leader with over 400 million users. Will Facebook stumble, or will they rocket to one billion users before Twitter? And lastly, there's Twitter's "low" starting point and "slow" growth rate. Twitter currently has 106 million registered users and adds about 300,000 new users a day. That doesn't add up to a billion in three years. Twitter needs to nearly triple the number of registered users they add per day. How will Twitter reach its goal of over one billion users served?
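The growth arithmetic above can be checked with a quick back-of-the-envelope sketch. It uses only the figures quoted in the post and assumes, for illustration, a three-year horizon of 3 × 365 days, a constant signup rate, and no churn; the function names are made up for this example.

```python
# Figures quoted in the post.
REGISTERED_USERS = 106_000_000
NEW_USERS_PER_DAY = 300_000
TARGET_USERS = 1_000_000_000

# Assumption: "three years" is treated as 3 * 365 days.
DAYS = 3 * 365


def projected_users(current, per_day, days):
    """Linear projection: constant daily signups, no churn."""
    return current + per_day * days


def required_daily_signups(current, target, days):
    """Constant daily signups needed to reach the target within `days`."""
    return (target - current) / days


projected = projected_users(REGISTERED_USERS, NEW_USERS_PER_DAY, DAYS)
required = required_daily_signups(REGISTERED_USERS, TARGET_USERS, DAYS)

print(f"projected users after three years: {projected:,}")
print(f"required signups per day: {required:,.0f}")
print(f"multiple of current rate: {required / NEW_USERS_PER_DAY:.1f}x")
```

Under these assumptions the current rate lands at roughly 434 million users, and the required rate is about 2.7 times today's, which is where the "triple" figure comes from.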

Click to read more ...

Friday
Jun 4, 2010

Strategy: Cache Larger Chunks - Cache Hit Rate is a Bad Indicator

Isn't the secret to fast, scalable websites to cache everything? Caching, if not the secret sauce of many a website, is at least a popular condiment. But not so fast, says Peter Zaitsev in Beyond great cache hit ratio. The point Peter makes is that we read about websites like Amazon and Facebook that can literally make hundreds of calls to satisfy a user request. Even if you have an awesome cache hit ratio, pages can still be slow because making and processing all those requests takes time. The solution is to remove requests altogether. You do this by caching larger blocks so you have to make fewer requests.

The post has a lot of good advice worth reading: 1) Make non-cacheable blocks as small as possible, 2) Maximize the number of uses of each cache item, 3) Control invalidation, 4) Multi-Get.
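To make the point concrete, here is a minimal sketch of why a perfect hit rate can still mean a slow page. `CountingCache` is a hypothetical in-memory stand-in for a remote cache, not any real client library; it only counts round trips. The key names and the 100-fragment page are likewise invented for illustration.

```python
class CountingCache:
    """Hypothetical stand-in for a remote cache that counts round trips."""

    def __init__(self):
        self._data = {}
        self.round_trips = 0

    def set(self, key, value):
        self._data[key] = value

    def get(self, key):
        self.round_trips += 1  # one network round trip per key
        return self._data.get(key)

    def get_multi(self, keys):
        self.round_trips += 1  # one round trip for the whole batch
        return {k: self._data[k] for k in keys if k in self._data}


def render_per_fragment(cache, keys):
    """Fetch every fragment with its own request."""
    return "".join(cache.get(k) or "" for k in keys)


def render_multi_get(cache, keys):
    """Fetch all fragments in a single batched request."""
    found = cache.get_multi(keys)
    return "".join(found.get(k, "") for k in keys)


def render_large_block(cache, block_key, keys):
    """Cache the assembled block so a later hit needs only one small request."""
    block = cache.get(block_key)
    if block is None:
        block = render_multi_get(cache, keys)
        cache.set(block_key, block)
    return block


cache = CountingCache()
fragment_keys = [f"frag:{i}" for i in range(100)]
for key in fragment_keys:
    cache.set(key, f"<div>{key}</div>")

page_a = render_per_fragment(cache, fragment_keys)
trips_per_fragment = cache.round_trips  # every lookup is a hit, yet 100 trips

cache.round_trips = 0
page_b = render_multi_get(cache, fragment_keys)
trips_multi = cache.round_trips

render_large_block(cache, "page:home", fragment_keys)  # first call fills the block
cache.round_trips = 0
page_c = render_large_block(cache, "page:home", fragment_keys)
trips_block = cache.round_trips

print(trips_per_fragment, trips_multi, trips_block)
```

All three approaches produce the same page with a 100% hit rate, but the per-fragment version pays for 100 round trips while the batched and large-block versions pay for one. In a real deployment the larger block also saves the assembly work, at the cost of coarser invalidation.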

 

Thursday
Jun 3, 2010

Hot Scalability Links for June 3, 2010

Tuesday
Jun 1, 2010

Web Speed Can Push You Off of Google Search Rankings! What Can You Do?

Google made the right move by adding web speed to its search engine ranking. With this change, site latency doesn't just impact the user experience; it will also determine where you are placed in Google search results.

This move could be a real game changer, as it brings site latency and scalability to center stage. Sites will compete not only on content but also on their performance. It is now clear that site performance has a much more direct impact on our business than ever before (as opposed to the indirect impact that results from user experience).

In this post I try to provide some architecture guidelines on how to control and improve site latency under scale, based on a discussion with a leading eCommerce site in the Netherlands.

See the detailed story here

 

Tuesday
Jun 1, 2010

Sponsored Post: Get Your High Scalability Fix at Digg

Interested in working on cutting-edge high-scale infrastructure at Digg? We're making a big investment in scaling and have committed to the NoSQL (Not only SQL) path with Cassandra. We're using other open-source infrastructure to help us scale including Hadoop, RabbitMQ, Zookeeper, Thrift, HDFS and Lucene. We're rewriting Digg from the ground up and we need amazing developers to join our world-class team. If you think you are up for the challenge, or you know someone who might be, take a look at our jobs page for more information.

Monday
May 31, 2010

Scalable federated security with Kerberos 

In my last post, I outlined considerations that need to be taken into account when choosing between a centralized and federated security model. So, how do we implement the chosen model? Based on a real-world case study, I will outline a Kerberos architecture that enables cutting-edge collaborative research through federated sharing of resources.

Read more on BigDataMatters.com

Wednesday
May 26, 2010

End-To-End Performance Study of Cloud Services

Cloud computing promises a number of advantages for the deployment of data-intensive applications. Most prominently, these include reducing cost with a pay-as-you-go business model and (virtually) unlimited throughput by adding servers if the workload increases. At the Systems Group, ETH Zurich, we did an extensive end-to-end performance study to compare the major cloud offerings regarding their ability to fulfill these promises and their implied cost.

Click to read more ...