The recent Data-Intensive Computing Symposium brought together experts in system design, programming, parallel algorithms, data management, scientific applications, and information-based applications to better understand existing capabilities in the development and application of large-scale computing systems, and to explore future opportunities.
Google Fellow Jeff Dean had a very interesting presentation on Handling Large Datasets at Google: Current Systems and Future Directions. He discussed:
• Hardware infrastructure
• Distributed systems infrastructure:
–Scheduling system
–GFS
–BigTable
–MapReduce
• Challenges and Future Directions
–Infrastructure that spans all datacenters
–More automation
It is really like a "How does Google work" presentation in ~60 slides?
Recent comments
58 min 42 sec ago
1 hour 9 sec ago
1 hour 2 min ago
1 hour 3 min ago
1 hour 5 min ago
1 hour 6 min ago
1 hour 7 min ago
1 hour 9 min ago
1 hour 10 min ago
1 hour 11 min ago