<?xml version="1.0" encoding="UTF-8"?>
<!--Generated by Squarespace Site Server v5.9.1 (http://www.squarespace.com/) on Tue, 09 Feb 2010 15:21:58 GMT--><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><title>High Scalability</title><link>http://highscalability.com/blog/</link><description></description><lastBuildDate>Mon, 08 Feb 2010 16:20:21 +0000</lastBuildDate><copyright></copyright><language>en-US</language><generator>Squarespace Site Server v5.9.1 (http://www.squarespace.com/)</generator><item><title>How FarmVille Scales to Harvest 75 Million Players a Month</title><category>Example</category><category>facebook</category><dc:creator>Todd Hoff</dc:creator><pubDate>Mon, 08 Feb 2010 16:04:49 +0000</pubDate><link>http://highscalability.com/blog/2010/2/8/how-farmville-scales-to-harvest-75-million-players-a-month.html</link><guid isPermaLink="false">426227:4867632:6610374</guid><description><![CDATA[<p><img src="http://farm5.static.flickr.com/4049/4335031559_fa690c1f81_m.jpg" alt="" align="RIGHT" /> If real farming was as comforting as it is in <a href="http://www.farmville.com/">Zynga's mega-hit Farmville</a> then my family would have probably never left those harsh North Dakota winters. None of the scary bedtime stories my Grandma used to tell about farming are true in FarmVille. <a href="http://w.good.is/post/What-Does-Farmville-Mean-for-Farmers/">Farmers</a> make money, plants grow, and animals never visit the <a href="http://www.youtube.com/watch?v=eUJz3137EQ8">red barn</a>. I guess it's just that keep-your-shoes-clean back-to-the-land charm that has helped make FarmVille the "largest game in the world" in such an astonishingly short time.</p>
<p>How did FarmVille scale a web application to handle 75 million players a month? Fortunately FarmVille's Luke Rajlich has agreed to let us in on a few their challenges and secrets. Here's what Luke has to say...</p>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6610374.xml</wfw:commentRss></item><item><title>GEO-aware traffic load balancing and caching at CNBC.com</title><dc:creator>Rashid Karimov</dc:creator><pubDate>Sat, 06 Feb 2010 17:56:28 +0000</pubDate><link>http://highscalability.com/blog/2010/2/6/geo-aware-traffic-load-balancing-and-caching-at-cnbccom.html</link><guid isPermaLink="false">426227:4867632:6583132</guid><description><![CDATA[CNBC, like many large web sites, relied &nbsp;on a CDN for content delivery. &nbsp;Recently, we started looking &nbsp;to see if we could improve this model. &nbsp;Our criteria was:<br /> <br /> <strong>- improve response time<br /> - have better control over traffic (real time reporting, change management and alerting)<br /> - better utilize internal datacenters and their infrastructure<br /> - shield users from any troubles at the origin infrastructure<br /> - cost out</strong>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6583132.xml</wfw:commentRss></item><item><title>High Availability Principle : Concurrency Control</title><dc:creator>Ashish Soni</dc:creator><pubDate>Fri, 05 Feb 2010 16:55:25 +0000</pubDate><link>http://highscalability.com/blog/2010/2/5/high-availability-principle-concurrency-control.html</link><guid isPermaLink="false">426227:4867632:6572018</guid><description><![CDATA[<p>One important high availability principle is concurrency control. &nbsp;The idea is to allow only that much traffic through to your system which your system can handle successfully. &nbsp;For example: if your system is certified to handle a concurrency of 100 then the 101st request should either timeout, be asked to try later &nbsp;or wait until one of the previous 100 requests finish. &nbsp;The 101st request should not be allowed to negatively impact the experience of the other 100 users. &nbsp;Only the 101st request should be impacted. <a href="http://saasinterrupted.com/2010/02/05/high-availability-principle-concurrency-control/">Read more here...</a></p><p>Related: High Availability Principle : Concurrency Control (http://saasinterrupted.com/2010/02/05/high-availability-principle-concurrency-control/) by Ashish Soni</p>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6572018.xml</wfw:commentRss></item><item><title>Hot Scalability Links for February 4, 2010</title><category>hot links</category><dc:creator>Todd Hoff</dc:creator><pubDate>Thu, 04 Feb 2010 15:38:53 +0000</pubDate><link>http://highscalability.com/blog/2010/2/4/hot-scalability-links-for-february-4-2010.html</link><guid isPermaLink="false">426227:4867632:6559771</guid><description><![CDATA[<p>Lots of cool stuff happening this week...</p>
<ol>
<li><a href="http://wiki.github.com/voldemort/voldemort/voldemort-rebalancing#"><span><span>Voldemort</span></span> gets <span><span>rebalancing</span></span>.</a> It's one thing to shard data to scale, it's a completely different level of functionality to manage those shards intelligently. <span><span>Voldemort</span></span> has stepped up by adding advanced <span><span>rebalancing</span></span> functionality: Dynamic addition of new nodes to the cluster; Deletion of nodes from cluster; Load balancing of data inside a cluster.</li>
<li><a href="http://gigaom.com/2010/02/01/microsoft-finally-opens-azure-for-business/">Microsoft Finally Opens Azure for&nbsp;Business. </a> Out of the blue Microsoft opens up their platform as a service service. Good to have more competition and we'll keep an eye out for experience reports.</li>
<li><a href="http://glinden.blogspot.com/2010/02/new-details-on-linkedin-architecture.html">New details on <span><span>LinkedIn</span></span> architecture</a> by Greg Linden. <em><span><span>LinkedIn</span></span> appears to only use caching minimally, preferring to spend their efforts and machine resources on making sure they can <span><span>recompute</span></span> computations quickly than on hiding poor performance behind caching layers</em>.</li>
<li><a href="http://blogs.computerworld.com/15510/the_end_of_sql_and_relational_databases_part_1_of_3">The end of <span><span>SQL</span></span> and relational databases?</a>&nbsp; by <a class="blogger_name" href="http://blogs.computerworld.com/intersimone">David <span><span>Intersimone</span></span></a>. <em>For new projects, I believe, we have genuine non-relational alternatives on the table (pun intended)</em>.</li>
<li><a id="story_358" href="http://developers.facebook.com/news.php?blog=1&amp;story=358"><span><span>HipHop</span></span> for <span><span>PHP</span></span>: Move Fast</a>. When you make millions of widgets saving pennies per widget quickly adds up to real money. <span><span>Facebook</span></span> released <span><span>HipHop</span></span>, a <span><span>PHP</span></span> compiler, aimed at shaving off cycle of CPU and bytes of memory in production of their social widgets.&nbsp;</li>
</ol>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6559771.xml</wfw:commentRss></item><item><title>NoSQL Means Never Having to Store Blobs Again</title><category>nosql</category><dc:creator>Todd Hoff</dc:creator><pubDate>Wed, 03 Feb 2010 16:19:54 +0000</pubDate><link>http://highscalability.com/blog/2010/2/3/nosql-means-never-having-to-store-blobs-again.html</link><guid isPermaLink="false">426227:4867632:6548359</guid><description><![CDATA[<p><span>Morgan <span>Tocker</span> has an awesome article and comment thread in the </span><a rel="bookmark" href="http://www.mysqlperformanceblog.com/2010/01/21/when-should-you-store-serialized-objects-in-the-database/"><span><span>MySQL</span> Performance Blog</span></a> about<a rel="bookmark" href="http://www.mysqlperformanceblog.com/2010/01/21/when-should-you-store-serialized-objects-in-the-database/"> When should you store serialized objects in the database?</a><span> Before the <span>NoSQL</span> age is was very common to simulate <span>schemalessness</span> by storing blobs in <span>MySQL</span>. <span>Sharding</span> was implemented by running multiple <span>MySQL</span> instances and spreading writes across them. While not ideal for the purpose, developers felt comfortable with <span>MySQL</span>. They knew how to install it, back it up, replicate it, in short:&nbsp; they knew how to make it work. Yet they also needed to store objects without the penalty of joins. Searches and aggregate queries were handled by indexes kept in separate tables, this offloaded the fast path to objects.</span></p>
<p><span>This all made perfect sense. Usually we just want stuff to work and going with what you know is often the best path to that goal. And what we have known is <span>MySQL</span>. All the different pros and cons of this approach are covered wonderfully in the post. </span></p>
<p><span>But the world has changed.</span></p>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6548359.xml</wfw:commentRss></item><item><title>Scale out your identity management</title><dc:creator>Pawel Plaszczak</dc:creator><pubDate>Tue, 02 Feb 2010 13:34:34 +0000</pubDate><link>http://highscalability.com/blog/2010/2/2/scale-out-your-identity-management.html</link><guid isPermaLink="false">426227:4867632:6528857</guid><description><![CDATA[<p>BigDataMatters is focused on the issues faced when processing and managing large amounts of data. In light of this, it would be a crime not to blog about the security of this data. Over the next few weeks, I will write a series of posts focused on identity management in the enterprise. Before you read any more, how is your identity secured?﻿</p>
<p><a href="http://bigdatamatters.com/bigdatamatters/2010/02/scale-identity-management.html">Read more on BigDataMatters.com</a></p>
<p>&nbsp;</p>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6528857.xml</wfw:commentRss></item><item><title>What Will Kill the Cloud?</title><dc:creator>Todd Hoff</dc:creator><pubDate>Mon, 01 Feb 2010 15:57:00 +0000</pubDate><link>http://highscalability.com/blog/2010/2/1/what-will-kill-the-cloud.html</link><guid isPermaLink="false">426227:4867632:6518043</guid><description><![CDATA[<p><a href="http://whatisthematrix.warnerbros.com/"><img src="http://farm3.static.flickr.com/2661/4188985841_8a7dd5671e.jpg" alt="" width="250" align="RIGHT" /></a> <em>This is an excerpt from my article <a href="http://highscalability.com/blog/2009/12/16/building-super-scalable-systems-blade-runner-meets-autonomic.html">Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud.</a> </em></p>
<p>If datacenters are the new castles, then what will be the new gunpowder? As soon as <a href="http://www.scribd.com/doc/13764820/The-Impact-of-Gunpowder-on-Medieval-Warfare">gunpowder</a> came on the scene, castles, which are defensive structures, quickly became the future's cold, drafty hotels. Gunpowder fueled cannon balls make short work of castle walls.</p>
<p>There's a long history of "gunpowder" type inventions in the tech industry.  PCs took out the timeshare model. The cloud is taking out the PC model. There must be something that will take out the cloud.</p>
<p>Right now it's hard to believe the cloud will one day be no more. They seem so much the future, but something will transcend the cloud.]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6518043.xml</wfw:commentRss></item><item><title>Hot Scalability Links for January 28 2010</title><category>hot links</category><dc:creator>Todd Hoff</dc:creator><pubDate>Thu, 28 Jan 2010 05:52:51 +0000</pubDate><link>http://highscalability.com/blog/2010/1/27/hot-scalability-links-for-january-28-2010.html</link><guid isPermaLink="false">426227:4867632:6450749</guid><description><![CDATA[<ol>
<li><a href="http://googleresearch.blogspot.com/2010/01/research-areas-of-interest-building.html">Google's Research Areas of Interest:  Building scalable, robust cluster applications</a>. <em>At Google we see distributed systems as a technology in its infancy, with huge gaps in the supporting research&nbsp; that represent some of the most important problems in the space. Here are some examples:</em> <em>Resource sharing, </em><em>Balancing cost, performance, and reliability, </em><em>Self-maintaining systems.&nbsp;</em></li>
<li><a href="http://www.ddj.com/database/222400291">Amazon SimpleDB: A Simple Way to Store Complex Data</a> by <em>Paul Tremblett. </em><em>The most effective way I have found to understand SimpleDB is to think about it in terms of something else we all use and understand -- a spreadsheet.</em></li>
<li><a href="http://www.thebitsource.com/2010/01/11/rackspace-cloud-servers-versus-amazon-ec2-performance-analysis/">Rackspace Cloud Servers versus Amazon EC2: Performance Analysis</a>. <em>The Bitsource conducted a review of the two cloud computing platforms, Rackspace Cloud Servers and Amazon Elastic Compute Cloud (EC2), to get a general idea of overall system performance.</em></li>
<li>
<div class="itemHeadStyle">
<div class="itemTitleStyle"><a class="TitleLinkStyle" rel="bookmark" href="http://perspectives.mvdirona.com/2010/01/17/PrivateCloudsAreNotTheFuture.aspx">Private Clouds Are Not The Future</a> by Jame Hamilton. <em><span style="font-size: small;"><span style="color: #000000;"><span style="font-family: Calibri;">Private clouds are better than nothing but an investment in a private cloud is an investment in a temporary fix that will only slow the path to the final destination: shared clouds.</span></span></span></em></div>
</div>
</li>
<li>
<div class="itemHeadStyle">
<div class="itemTitleStyle"><a href="http://dbmsmusings.blogspot.com/2009/06/what-is-right-way-to-measure-scale.html">What is the right way to measure scale?</a> by Daniel Abadi. <em>So which scales better? Is using the number of nodes a better proxy than size of data? Hadoop can &ldquo;scale&rdquo; to 3800 nodes. So far, all we know is that Greenplum can &ldquo;scale&rdquo; to 96 nodes. Can it handle more nodes?</em></div>
</div>
</li>
</ol>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6450749.xml</wfw:commentRss></item><item><title>Product: HyperGraphDB - A Graph Database</title><category>Product</category><category>graph</category><dc:creator>Todd Hoff</dc:creator><pubDate>Tue, 26 Jan 2010 19:30:57 +0000</pubDate><link>http://highscalability.com/blog/2010/1/26/product-hypergraphdb-a-graph-database.html</link><guid isPermaLink="false">426227:4867632:6436196</guid><description><![CDATA[<p>With the success of <a href="http://neo4j.org/">Neo4j</a><span> as a graph database in the <span>NoSQL</span> revolution, it's interesting to see another graph database,</span><span> <a href="http://www.kobrix.com/hgdb.jsp"><span><span>HyperGraphDB</span></span></a><span>, in the mix. Their quick blurb on <span>HyperGraphDB</span> says it is a: </span></span><em>general purpose, extensible, portable, distributed, embeddable, open-source data storage mechanism. It is a graph database designed specifically for artificial intelligence and semantic web projects, it can also be used as an embedded object-oriented database for projects of all sizes.</em></p>
<p>From the <a href="http://nosql-database.org/"><span><span>NoSQL</span> Archive</span></a> the summary<span> on <span>HyperGraphDB</span> is: </span><span><span>API</span>: </span><span style="font-weight: bold;">Java</span><span> (and Java <span>Langs</span>), Written in:</span><span style="font-weight: bold;">Java</span>,&nbsp; Query Method: <span style="font-weight: bold;">Java</span> or P2P, Replication: <span style="font-weight: bold;">P2P</span>, Concurrency: <span style="font-weight: bold;"><span><span>STM</span></span></span><span>, <span>Misc</span>: Open-Source, Especially for AI and Semantic Web.</span></p>
<p><span>So it has some interesting features, like software <span>transactional</span> memory and </span><span id="main" style="visibility: visible;"><span id="search" style="visibility: visible;"> <em>P2P</em>&nbsp; for data distribution</span></span>, but I found that my first and most obvious question was not answered: what the heck is a <a href="http://en.wikipedia.org/wiki/Hypergraph"><span><span>hypergraph</span></span></a><span> and why do I care? Buried in the tutorial was:</span></p>
<blockquote>
<p>A HyperGraphDB database is a generalized graph of entities. The generalization is two-fold:</p>
<ol>
<li> Links/edges "point to" an arbitrary number of elements instead of just two as in regular graphs&nbsp;</li>
<li>Links can be pointed to by other links as well. </li>
</ol></blockquote>
<p>OK, but I wish there was some explanation of why this is valuable. What can I do with it that I can't do with normal graphs?<span> Given that there have been concerns over the complexity of the <span>API</span> this would seem a natural topic to cover. I assume it's cool, it sounds cool, but I would like to know why :-)</span></p>
<ol> </ol>
<p>In any case it looks like an interesting product to take a look at. Database options are expanding fast.</p>]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6436196.xml</wfw:commentRss></item><item><title>Let's Welcome our Neo-Feudal Overlords</title><dc:creator>Todd Hoff</dc:creator><pubDate>Mon, 25 Jan 2010 16:23:31 +0000</pubDate><link>http://highscalability.com/blog/2010/1/25/lets-welcome-our-neo-feudal-overlords.html</link><guid isPermaLink="false">426227:4867632:6425082</guid><description><![CDATA[<a href="http://whatisthematrix.warnerbros.com/"><img src="http://farm3.static.flickr.com/2661/4188985841_8a7dd5671e.jpg" alt="" width="250" align="RIGHT" /></a> <em>This is an excerpt from my article <a href="http://highscalability.com/blog/2009/12/16/building-super-scalable-systems-blade-runner-meets-autonomic.html">Building Super Scalable Systems: Blade Runner Meets Autonomic Computing in the Ambient Cloud.</a></em></p>
<p>There's a pattern, already begun, that has accelerated by the need for applications to scale and increase complexity, the end result of which will be that applications give up their independence and enter a kind of <a href="http://en.wikipedia.org/wiki/Feudalism">feudal relationship</a> with their platform provider.</p>
<p>To understand how this process works, like a glacier slowly and inevitably carving out a deep river valley, here's the type of question I get quite a lot:</p>
<blockquote>I've learned PHP and MySQL and I've built a web app that I HOPE will receive traffic comparable to eBay's with a similar database structure. I see all these different opinions and different techniques and languages being recommended and it's so confusing. All I want is perhaps one book or one website that focuses on PHP and MySQL and building a large database web app like eBay's. Does something like this exist?</blockquote>
<p>I'm always at a loss for words with these questions. What can I possibly say?]]></description><wfw:commentRss>http://highscalability.com/blog/rss-comments-entry-6425082.xml</wfw:commentRss></item></channel></rss>