INTERACTIVE SPARK IN YOUR BROWSER
Spark Lightning Night - Hadoop World
September 2015
GOAL OF HUE
WEB INTERFACE FOR ANALYZING DATA WITH APACHE HADOOP
SIMPLIFY AND INTEGRATE
FREE AND OPEN SOURCE
-> “BIG NOTEBOOK” FOR HADOOP
SPARK ARCHITECTURE
[Diagram, shown on two slides. Labels: Livy; Spark (three boxes); YARN. The first slide marks a "Backend part" and a "Web part"; the second adds the labels "Spark Server" and "UI".]
HADOOP NOTEBOOK
UI
HADOOP WITH SPARK NOTEBOOK
• Married with full ecosystem
• File, Job browsers
• Create table wizards
• Any language (Hive, Spark...)
• Graphing
• Export/Import/Sharing
• Multi users
• Impersonation
LIVY SPARK SERVER
Backend
• REST web server in Scala
• Interactive Spark sessions and batch jobs
• Type introspection for visualization
• Running sessions in YARN or local
• Backends: Scala, Python, R
• Open source: https://github.com/cloudera/hue/tree/master/apps/spark/java
• Play with curl: http://gethue.com/how-to-use-the-
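The "Backends: Scala, Python, R" bullet can be pictured as a choice of session kind when a session is created. The sketch below is only an illustration: the slides show just the kind "spark", so the strings "pyspark" and "sparkr" and the helper name `session_payload` are assumptions, not something this deck confirms.

```python
import json

# Assumption: only "spark" appears on the slides. "pyspark" and "sparkr"
# are kind strings recalled from Livy's documentation, not from this deck.
KIND_BY_LANGUAGE = {
    "scala": "spark",
    "python": "pyspark",
    "r": "sparkr",
}


def session_payload(language: str) -> str:
    """Return the JSON body for a session-creation request in `language`.

    `session_payload` is a hypothetical helper, not part of Livy.
    """
    try:
        kind = KIND_BY_LANGUAGE[language.lower()]
    except KeyError:
        raise ValueError(f"unsupported backend: {language!r}") from None
    return json.dumps({"kind": kind})


if __name__ == "__main__":
    # Print one request body per backend listed on the slide.
    for lang in ("Scala", "Python", "R"):
        print(lang, "->", session_payload(lang))
```

The returned string is what would be passed to curl's `-d` option when posting to `/sessions`.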
LIVY WEB SERVER ARCHITECTURE
[Diagram. Components:
  Livy Server: Scalatra, Session Manager, Session
  YARN Master: Spark Client
  YARN Node: Spark Interpreter, Spark Context
  YARN Node: Spark Worker (two nodes)
The diagram repeats across eight slides, adding step markers 1 through 7 one at a time.]
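To make the Session Manager and Session boxes in the diagram more concrete, here is a toy in-memory model. It is a sketch under stated assumptions, not Livy's Scala implementation: the class names, the `interpreter` callable standing in for the remote Spark interpreter, and the id numbering from 0 are all hypothetical, chosen only to mirror the response shapes shown in the curl example.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Session:
    """One interactive session (hypothetical model).

    `interpreter` stands in for the Spark interpreter running on a YARN node.
    """
    id: int
    kind: str
    interpreter: Callable[[str], str]
    state: str = "idle"
    statements: List[dict] = field(default_factory=list)

    def run(self, code: str) -> dict:
        """Evaluate one statement and record a response-shaped dict."""
        statement_id = len(self.statements)
        text = self.interpreter(code)
        statement = {
            "id": statement_id,
            "output": {
                "data": {"text/plain": text},
                "execution_count": statement_id,
                "status": "ok",
            },
            "state": "available",
        }
        self.statements.append(statement)
        return statement


class SessionManager:
    """Tracks sessions by id (hypothetical; ids start at 0 for illustration)."""

    def __init__(self) -> None:
        self._sessions: Dict[int, Session] = {}
        self._next_id = 0

    def create(self, kind: str, interpreter: Callable[[str], str]) -> Session:
        session = Session(self._next_id, kind, interpreter)
        self._sessions[session.id] = session
        self._next_id += 1
        return session

    def get(self, session_id: int) -> Session:
        return self._sessions[session_id]

    def delete(self, session_id: int) -> None:
        self._sessions.pop(session_id)


if __name__ == "__main__":
    manager = SessionManager()
    # The lambda is a stand-in interpreter that just echoes the code back.
    session = manager.create("spark", interpreter=lambda code: f"echo: {code}")
    print(session.id, session.state)
    print(manager.get(session.id).run("1 + 1"))
```

In the real server the web layer would route each REST call to such a manager, and the session would forward code to a remote interpreter rather than a local callable.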
SESSION CREATION AND EXECUTION
% curl -XPOST localhost:8998/sessions -d '{"kind": "spark"}'
{
  "id": 0,
  "kind": "spark",
  "log": [...],
  "state": "idle"
}
% curl -XPOST localhost:8998/sessions/0/statements -d '{"code": "
{
  "id": 0,
  "output": {
    "data": { "text/plain": "res0: Int = 2" },
    "execution_count": 0,
    "status": "ok"
  },
  "state": "available"
}
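The same two calls can be sketched from Python using only the standard library. This is a minimal sketch, not an official client: the helper names are hypothetical, the `Content-Type` header is an assumption (the curl commands above do not set one), and because the statement text is cut off on the slide, `1 + 1` is used purely as a stand-in. The request builders do not touch the network, so the example runs without a Livy server.

```python
import json
import urllib.request

LIVY_URL = "http://localhost:8998"  # host and port from the curl examples


def post_request(path: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for the Livy server."""
    return urllib.request.Request(
        url=LIVY_URL + path,
        data=json.dumps(body).encode("utf-8"),
        # Assumption: the slide's curl calls set no header; JSON is declared here.
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def create_session_request(kind: str = "spark") -> urllib.request.Request:
    """Request equivalent to: curl -XPOST .../sessions -d '{"kind": ...}'"""
    return post_request("/sessions", {"kind": kind})


def statement_request(session_id: int, code: str) -> urllib.request.Request:
    """Request that submits `code` to an existing session."""
    return post_request(f"/sessions/{session_id}/statements", {"code": code})


def plain_text_result(statement: dict) -> str:
    """Pull the text/plain result out of a statement response."""
    output = statement["output"]
    if output["status"] != "ok":
        raise RuntimeError(f"statement failed: {output}")
    return output["data"]["text/plain"]


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON reply (needs a running server)."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # "1 + 1" is a stand-in; the actual statement is truncated on the slide.
    request = statement_request(0, "1 + 1")
    print(request.get_method(), request.full_url, request.data.decode("utf-8"))

    # Response copied from the slide, parsed offline.
    sample = {
        "id": 0,
        "output": {
            "data": {"text/plain": "res0: Int = 2"},
            "execution_count": 0,
            "status": "ok",
        },
        "state": "available",
    }
    print(plain_text_result(sample))
```

With a server running, `send(create_session_request())` would return the session JSON, and its `id` would be passed to `statement_request`.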
  
DEMO
TIME

COMING SOON
• Stability and Scaling
• Security
• iPython/Jupyter interpreter and file format
TWITTER
@gethue
USER GROUP
hue-user@
WEBSITE
http://gethue.com
LEARN
http://learn.gethue.com
THANKS!


Hadoop World Spark Meetup: Interactive Spark in your Browser
