Your SlideShare is downloading. ×
An Introduction to Apache Hama
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

An Introduction to Apache Hama

571

Published on

An Introduction to Apache Hama, what is it ? …

An Introduction to Apache Hama, what is it ?
Why is it useful for scientific computations
on big data ?

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
571
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
40
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. Apache Hama ● What is it ? ● What is it used for ? ● How does it work ? ● Architecture www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 2. Hama – What is it ? ● A Distributed computing framework ● For massive scientific calculations ● For distributed computing ● Uses Bulk Synchronous Parallel Computing ( BSP ) ● Written in Java ● Licensed via Apache ● Available for Hadoop greater than 1.0.x ● Requires Java 1.6.x or higher www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 3. Hama – What is it used for ? ● large scale computations that – exceed the abilities of Map reduce ● Machine learning ● Matrix ● Graph ● Network Algorithms www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 4. Hama – How does it work ? ● A series of super steps ● Each of which is – Concurent computation on local data – Point to point communication – Synchronise all processors ● Uses Hadoop RPC for communication www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 5. Hama – Architecture Hama consists of BSP Master, GroomServers, ZooKeeper www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 6. Hama – Architecture ● Role of BSPMaster – Maintain groom server statuses – Controlling cluster super steps – Maintaining job progress information – Scheduling Jobs – Assign tasks to groom servers – Fault control – Provide cluster control to users www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 7. Hama – Architecture ● Role of GroomServer – Performs BSP tasks – Reports status to master – Runs on a single HDFS data node – Starts with a BSP Peer instance www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 8. Hama – Architecture ● Role of ZooKeeper – Manages barrier synchronisation of BSP Peers www.semtech-solutions.co.nz info@semtech-solutions.co.nz
  • 9. Contact Us ● Feel free to contact us at – www.semtech-solutions.co.nz – info@semtech-solutions.co.nz ● We offer IT project consultancy ● We are happy to hear about your problems ● You can just pay for those hours that you need ● To solve your problems
  • 10. Contact Us ● Feel free to contact us at – www.semtech-solutions.co.nz – info@semtech-solutions.co.nz ● We offer IT project consultancy ● We are happy to hear about your problems ● You can just pay for those hours that you need ● To solve your problems

×