PTL
Sergey Lukjanov
Data Processing Update
OpenStack Sahara (ex. Savanna)
To provide a scalable data processing stack
and associated management interfaces
● provision & operate Hadoop clusters
● s...
Elastic Data Processing (EDP) is Sahara’s take
on data processing workflow management.
Icehouse release overview
● 700+ code commits from 50 people
● 57 blueprints implemented
● ~5000 code reviews
● ~140 bugs ...
Heat-based cluster provisioning
Hadoop 2 support
● in both Vanilla and HDP plugins
● EDP supports Hadoop 2
EDP improvements
● HBase and Sqoop via the HDP plugin
● Streaming & Java MapReduce
● External HDFS
CLI @ python-saharaclient
Sahara @ integrated gate
Juno plans
● Spark plugin supported by EDP
● Merge dashboard to Horizon
● Sahara as resources in Heat
● Heat engine by def...
Juno plans
● Spark plugin supported by EDP
● Merge dashboard to Horizon
● Sahara as resources in Heat
● Heat engine by def...
Data Processing Updates - Juno Edition
Upcoming SlideShare
Loading in...5
×

Data Processing Updates - Juno Edition

512

Published on

Sergey Lukjanov, Data Processing PTL, outlines the changes made in the Icehouse release as well as upcoming updates for Juno.

Learn more about Data Processing (Sahara) here: https://wiki.openstack.org/wiki/Sahara

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
512
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
35
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Data Processing Updates - Juno Edition

  1. 1. PTL Sergey Lukjanov Data Processing Update OpenStack Sahara (ex. Savanna)
  2. 2. To provide a scalable data processing stack and associated management interfaces ● provision & operate Hadoop clusters ● schedule & operate Hadoop jobs / workloads
  3. 3. Elastic Data Processing (EDP) is Sahara’s take on data processing workflow management.
  4. 4. Icehouse release overview ● 700+ code commits from 50 people ● 57 blueprints implemented ● ~5000 code reviews ● ~140 bugs fixed details: https://launchpad.net/sahara/icehouse
  5. 5. Heat-based cluster provisioning
  6. 6. Hadoop 2 support ● in both Vanilla and HDP plugins ● EDP supports Hadoop 2
  7. 7. EDP improvements ● HBase and Sqoop via the HDP plugin ● Streaming & Java MapReduce ● External HDFS
  8. 8. CLI @ python-saharaclient
  9. 9. Sahara @ integrated gate
  10. 10. Juno plans ● Spark plugin supported by EDP ● Merge dashboard to Horizon ● Sahara as resources in Heat ● Heat engine by default ● Testing improvements
  11. 11. Juno plans ● Spark plugin supported by EDP ● Merge dashboard to Horizon ● Sahara as resources in Heat ● Heat engine by default ● Testing improvements
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×