« How Gravatar scales on WordPress.com hardware | Main | Should JSPs be avoided for high scalability? »

Paper: Wikipedia's Site Internals, Configuration, Code Examples and Management Issues

Wikipedia and Wikimedia have some of the best, most complete real-world documentation on how to build highly scalable systems. This paper by Domas Mituzas covers a lot of details about how Wikipedia works, including: an overview of the different packages used (Linux, PowerDNS, LVS, Squid, lighttpd, Apache, PHP5, Lucene, Mono, Memcached), how they use their CDN, how caching works, how they profile their code, how they store their media, how they structure their database access, how they handle search, how they handle load balancing and administration. All with real code examples and examples of configuration files. This is a really useful resource.

Related Articles

  • Wikimedia Architecture
  • Domas Mituzas' Blog
  • References (1)

    References allow you to track sources for this article, as well as articles that were written in response to this article.

    Reader Comments (1)

    Very detailed document really covering most (or all?) topics mentioned in the post.
    I haven't yet finished reading it, still in progress, but that's already absolutely clear that it's worth reading, thanks for the link!

    November 29, 1990 | Unregistered CommenterBlinkov Ivan

    PostPost a New Comment

    Enter your information below to add a new comment.
    Author Email (optional):
    Author URL (optional):
    Some HTML allowed: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <code> <em> <i> <strike> <strong>