[StarCluster] New Grid Engine Hadoop Integration HOWTO

Paul McDonagh mcdonaghpd at gmail.com
Fri Jun 1 15:31:53 EDT 2012


Thanks Rayson,

This and the previous email are a couple of really good suggestions. I'll try 'em out and see what happens.

Best,
Paul.

On Jun 1, 2012, at 14:52, Rayson Ho wrote:

> If you are running Hadoop on StarCluster, you may also be interested
> in this new method contributed by Prakashan Korambath of UCLA.
> 
> http://gridscheduler.sourceforge.net/howto/GridEngineHadoop.html
> 
> The difference between the original SGE 6.2u5 method vs the new one is
> that with Prakashan's approach, Grid Engine is used for resource
> allocation, and the Hadoop job scheduler/Job Tracker is used to handle
> all the MapReduce operations. A Hadoop cluster is created on demand
> with Prakashan's approach, but in the original SGE 6.2u5 method Grid
> Engine replaces the Hadoop job scheduler.
> 
> As standard Grid Engine PEs are used in this new approach, one can
> call "qrsh -inherit" and use Grid Engine's method to start Hadoop
> services on remote nodes, and thus get full job control, job
> accounting, and cleanup at terminate benefits like any other tight PE
> jobs!
> 
> Rayson
> 
> ================================
> Open Grid Scheduler / Grid Engine
> http://gridscheduler.sourceforge.net/
> 
> Scalable Grid Engine Support Program
> http://www.scalablelogic.com/
> _______________________________________________
> StarCluster mailing list
> StarCluster at mit.edu
> http://mailman.mit.edu/mailman/listinfo/starcluster




More information about the StarCluster mailing list