Map-Reduce With Ruby Using Hadoop
Tuesday, January 4, 2011 at 9:03AM
Phil Whelan in Hadoop, Map Reduce, aamazon ec2, amazon, cloudera, hdfs, jclouds, map-reduce, mapreduce, ruby, whirr

Map-Reduce With Hadoop Using Ruby A demonstration, with repeatable steps, of how to quickly fire-up a Hadoop cluster on Amazon EC2, load data onto the HDFS (Hadoop Distributed File-System), write map-reduce scripts in Ruby and use them to run a map-reduce job on your Hadoop cluster. You will not need to ssh into the cluster, as all tasks are run from your local machine. Below I am using my MacBook Pro as my local machine, but the steps I have provided should be reproducible on other platforms running bash and Java.

Article originally appeared on (
See website for complete article licensing information.