October 7, 2024, Monday, 280

Hadoop on Mac OS X

From NeoWiki

(Difference between revisions)
Jump to: navigation, search
(New page: This guide is written to help Cornell students using Mac OS X 10.5 with setting up a development environment for working with [http://www.hadoop.org/ Hadoop] and running Hadoop jobs on the...)
 
m
Line 1: Line 1:
 +
''From http://www.infosci.cornell.edu/hadoop/mac.html''
 +
 
This guide is written to help Cornell students using Mac OS X 10.5 with setting up a development environment for working with [http://www.hadoop.org/ Hadoop] and running Hadoop jobs on the [http://7thgen.info/images/intro_icon_blog.jpg Cornell Center for Advanced Computing (CAC)] Hadoop cluster. This guide will walk you through compiling and running a simple example Hadoop job. More information is available at the official [http://hadoop.apache.org/core/docs/current/mapred_tutorial.html#Example%3A+WordCount+v1.0 Hadoop Map-Reduce Tutorial].
 
This guide is written to help Cornell students using Mac OS X 10.5 with setting up a development environment for working with [http://www.hadoop.org/ Hadoop] and running Hadoop jobs on the [http://7thgen.info/images/intro_icon_blog.jpg Cornell Center for Advanced Computing (CAC)] Hadoop cluster. This guide will walk you through compiling and running a simple example Hadoop job. More information is available at the official [http://hadoop.apache.org/core/docs/current/mapred_tutorial.html#Example%3A+WordCount+v1.0 Hadoop Map-Reduce Tutorial].
  

Revision as of 02:37, 17 November 2009

From http://www.infosci.cornell.edu/hadoop/mac.html

This guide is written to help Cornell students using Mac OS X 10.5 with setting up a development environment for working with Hadoop and running Hadoop jobs on the Cornell Center for Advanced Computing (CAC) Hadoop cluster. This guide will walk you through compiling and running a simple example Hadoop job. More information is available at the official Hadoop Map-Reduce Tutorial.

The overall process of developing a Hadoop job is as follows:

  1. Install Hadoop on your development machine (personal or lab computer)
  2. Compile the Hadoop job, create a JAR file
  3. Run the Hadoop job JAR file on your development machine, for testing and debugging
  4. Run the Hadoop job JAR file on the CAC Hadoop cluster, for production