Transcript of "Automated Hadoop Cluster Construction on EC2"
Automated Hadoop Clusters on EC2
Mark Kerzner, SHMsoft
What is Hadoop? :) :) :)
Everybody knows that... What is your definition?
What is a cloud?
Everybody knows that, but
1. Elastic resources
2. Internet delivery
3. SaaS
4. Virtualization
5. Device-enabled
6. Only (1) or all of the above
You are the Hadoop programmer... and you need tools
What are your alternatives?
● IDE - compile and run the code
● Local "cluster" - local file system
● Pseudo-distributed cluster - test outside
● EC2 - test on the cluster, test for scale
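As a rough illustration of how these alternatives differ in practice, the sketch below lists the two Hadoop settings that usually change between them and turns them into `-D` generic options for a job. It is not from the talk: the property names assume Hadoop 2.x or later, the `localhost:9000` address follows the common single-node setup, and the EC2 NameNode host is a hypothetical placeholder.

```python
# Sketch only: settings that typically distinguish the test environments.
# Property names assume Hadoop 2.x or later (an assumption, not from the talk).
# The EC2 NameNode host below is a hypothetical placeholder.

MODES = {
    # Local "cluster": local file system, job runs in a single JVM.
    "local": {
        "fs.defaultFS": "file:///",
        "mapreduce.framework.name": "local",
    },
    # Pseudo-distributed: all daemons on one machine, HDFS on localhost.
    "pseudo-distributed": {
        "fs.defaultFS": "hdfs://localhost:9000",
        "mapreduce.framework.name": "yarn",
    },
    # EC2: a real multi-node cluster (host name is hypothetical).
    "ec2": {
        "fs.defaultFS": "hdfs://namenode.example.internal:8020",
        "mapreduce.framework.name": "yarn",
    },
}


def generic_options(mode):
    """Return the settings for `mode` as a flat list of -D generic options."""
    args = []
    for key, value in sorted(MODES[mode].items()):
        args += ["-D", f"{key}={value}"]
    return args


if __name__ == "__main__":
    for mode in MODES:
        print(mode, " ".join(generic_options(mode)))
```

The same job can then be pointed at each environment by changing only these options, provided its driver parses generic options (for example through `ToolRunner`). That is what makes the progression from local runs to EC2 practical.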
What are your resources?
● Tom White, "Hadoop: The Definitive Guide"
● www.hadoopilluminated.com