Toulouse Data Science meetup - Apache zeppelin

Gérard Dupont
Gérard DupontSenior Data Scientist
Apache Zeppelin
The (very) short field trip
by G.Alléon & G.DupontTDS meetup - 2016.06.30
Who are we?
Guillaume Alleon - AIRBUS Group Innovation (corporate research center)
Research leader for more than 30 people from UK to China, tackling problems in massive data processing
and information extraction.
Was already in “big data” when it was still called HPC…
Gerard Dupont - AIRBUS Defence & Space (space systems)
Technical coordinator for R&T studies on distributed processing systems.
Spend way too much time processing web data for intelligence, now looking to the sky (satellite data ;-)
Zeppelin moto
“A web-based notebook that enables interactive data analytics.”
Origins & history
Missing piece in HADOOP landscape: a modern analytic playground.
2012.12 - Data analytics
solution (NFLabs)
2013.10 - Opensourced
2014.12 - ASF incubation
2015 - 3 stable releases
2016.05 - Maturing to Apache
top level project
3000 feet view
What’s cool about Zeppelin
⊕interactive
⊕out-of-the-box spark integration
⊕out-of-the-box visualization options
⊕direct access to DOM for customized visualization
⊕nice UI (bootstrap & angular)
⊕notebook run scheduler
⊕easy to configure
⊕extensibility, extensibility and extensibility...
What’s cool about Zeppelin
⊕interactive
⊕out-of-the-box spark integration
⊕out-of-the-box visualization options
⊕direct access to DOM for customized visualization
⊕nice UI (bootstrap & angular)
⊕notebook run scheduler
⊕easy to configure
⊕extensibility, extensibility and extensibility...
… the dark side
⊝hard to install
⊝need to build from the source
(for customized version)
⊝not (yet) multi-users
Overview/look & feel
Interpreter text
(aka your code)
Interpreter config
Interactive results
DEMO time
credits: https://www.weasyl.com/~uszatyarbuz
Under the hood
○ Interpreter isolation with their
own JVM
○ Dynamic dependencies loading
○ REST & websocket on front
○ Thrift in back
(or whatever you add)
○ Process scheduler (cron-like)
Roadmap
Enterprise Ready
○ Multi-tenancy
○ Job scheduler
○ HA
Usability Improvement
○ UX improvement
○ Table data support
○ Dynamic interpreter integration
○ Reusable analytic application catalog
Thx
Offical website: https://zeppelin.apache.org/
Notebook sample: https://www.zeppelinhub.com/viewer
Source code: https://github.com/apache/incubator-zeppelin
Mailing lists: http://zeppelin.apache.org/community.html
This TDS notebook: http://tinyurl.com/zeppelin-tds
Sources for this presentation:
○ http://www.slideshare.net/FlinkForward/moon-soo-lee-data-science-lifecycle-with-apache-flink-and-apache-zeppelin/23
○ http://www.slideshare.net/HadoopSummit/apache-zeppelin-helium-and-beyond
○ http://www.slideshare.net/felixcss/interactive-data-science-from-scratch-with-apache-zeppelin-and-apache-spark
○ http://www.slideshare.net/BrunoBonnin/explorez-vos-donnes-avec-apache-zeppelin
credits: https://www.weasyl.com/~uszatyarbuz
BACKUP
Origins & history
Active core teams
Descent number of
external contributors
Plenty of interpreters
(official and external)
0.6.0-SNAPSHOT
(pending stabilization)
3000 feet view
1 of 15

Recommended

Tds — big science dec 2021 by
Tds — big science dec 2021Tds — big science dec 2021
Tds — big science dec 2021Gérard Dupont
38 views19 slides
Introduction to SARA's Hadoop Hackathon - dec 7th 2010 by
Introduction to SARA's Hadoop Hackathon - dec 7th 2010Introduction to SARA's Hadoop Hackathon - dec 7th 2010
Introduction to SARA's Hadoop Hackathon - dec 7th 2010Evert Lammerts
709 views10 slides
Intro to hadoop ecosystem by
Intro to hadoop ecosystemIntro to hadoop ecosystem
Intro to hadoop ecosystemGrzegorz Kolpuc
144 views54 slides
First NL-HUG: Large-scale data processing at SARA with Apache Hadoop by
First NL-HUG: Large-scale data processing at SARA with Apache HadoopFirst NL-HUG: Large-scale data processing at SARA with Apache Hadoop
First NL-HUG: Large-scale data processing at SARA with Apache HadoopEvert Lammerts
1.1K views38 slides
Hadoop @ Sara & BiG Grid by
Hadoop @ Sara & BiG GridHadoop @ Sara & BiG Grid
Hadoop @ Sara & BiG GridEvert Lammerts
688 views51 slides
Deep learning with Tensorflow in R by
Deep learning with Tensorflow in RDeep learning with Tensorflow in R
Deep learning with Tensorflow in Rmikaelhuss
6.8K views21 slides

More Related Content

What's hot

Introduction NL-HUG (April) by
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)Evert Lammerts
1.1K views12 slides
Session 09 learning relationships.pptx by
Session 09 learning relationships.pptxSession 09 learning relationships.pptx
Session 09 learning relationships.pptxbodaceacat
345 views40 slides
Notes on data-intensive processing with Hadoop Mapreduce by
Notes on data-intensive processing with Hadoop MapreduceNotes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop MapreduceEvert Lammerts
1.3K views48 slides
Introduction to TensorFlow by
Introduction to TensorFlowIntroduction to TensorFlow
Introduction to TensorFlowMatthias Feys
18.6K views29 slides
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University by
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon UniversityText Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon UniversityNodejsFoundation
2.7K views21 slides
Sociopath presentation by
Sociopath presentationSociopath presentation
Sociopath presentationGalina Alperovich
173 views21 slides

What's hot(20)

Introduction NL-HUG (April) by Evert Lammerts
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)
Evert Lammerts1.1K views
Session 09 learning relationships.pptx by bodaceacat
Session 09 learning relationships.pptxSession 09 learning relationships.pptx
Session 09 learning relationships.pptx
bodaceacat345 views
Notes on data-intensive processing with Hadoop Mapreduce by Evert Lammerts
Notes on data-intensive processing with Hadoop MapreduceNotes on data-intensive processing with Hadoop Mapreduce
Notes on data-intensive processing with Hadoop Mapreduce
Evert Lammerts1.3K views
Introduction to TensorFlow by Matthias Feys
Introduction to TensorFlowIntroduction to TensorFlow
Introduction to TensorFlow
Matthias Feys18.6K views
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University by NodejsFoundation
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon UniversityText Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University
Text Mining with Node.js - Philipp Burckhardt, Carnegie Mellon University
NodejsFoundation2.7K views
Your data isn't that big @ Big Things Meetup 2016-05-16 by Boaz Menuhin
Your data isn't that big @ Big Things Meetup 2016-05-16Your data isn't that big @ Big Things Meetup 2016-05-16
Your data isn't that big @ Big Things Meetup 2016-05-16
Boaz Menuhin181 views
Large-Scale Data Storage and Processing for Scientists with Hadoop by Evert Lammerts
Large-Scale Data Storage and Processing for Scientists with HadoopLarge-Scale Data Storage and Processing for Scientists with Hadoop
Large-Scale Data Storage and Processing for Scientists with Hadoop
Evert Lammerts6.4K views
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data" by Paco Nathan
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
Paco Nathan12.7K views
Hadoop Jute Record Python by Paul Tarjan
Hadoop Jute Record PythonHadoop Jute Record Python
Hadoop Jute Record Python
Paul Tarjan1.6K views
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par... by Big Data Spain
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
TENSORFLOW: ARCHITECTURE AND USE CASE - NASA SPACE APPS CHALLENGE by Gema Par...
Big Data Spain3.7K views
Tech Talk - Underutilized Resources in Distributed System by Rishabh Dugar
Tech Talk - Underutilized Resources in Distributed SystemTech Talk - Underutilized Resources in Distributed System
Tech Talk - Underutilized Resources in Distributed System
Rishabh Dugar151 views
simple introduction to hadoop by vishnu rao
simple introduction to hadoopsimple introduction to hadoop
simple introduction to hadoop
vishnu rao1.1K views
Machine learning in python course contents by MRUNALINI
Machine learning in python course contentsMachine learning in python course contents
Machine learning in python course contents
MRUNALINI34 views
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros... by Deltares
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
DSD-NL 2017 Digishape project: "Heel AHN2 is inmiddels ingeladen in de Micros...
Deltares515 views

Viewers also liked

Ashcraft.edu103.module3 by
Ashcraft.edu103.module3Ashcraft.edu103.module3
Ashcraft.edu103.module3KaitySue8
135 views6 slides
Jennyresumen by
JennyresumenJennyresumen
Jennyresumenamandaaltamirano
70 views4 slides
Unit 8 - Textbook Lesson 1 by
Unit 8 - Textbook Lesson 1Unit 8 - Textbook Lesson 1
Unit 8 - Textbook Lesson 1Mirna Deakle
1.4K views24 slides
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T... by
The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T...SAP Ariba
1.1K views23 slides
Pedal to the Metal: Accelerating Spark with Silicon Innovation by
Pedal to the Metal: Accelerating Spark with Silicon InnovationPedal to the Metal: Accelerating Spark with Silicon Innovation
Pedal to the Metal: Accelerating Spark with Silicon InnovationJen Aman
837 views10 slides
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers by
Voorstelling 4Betterresults_voor_KMOs_en_ondernemersVoorstelling 4Betterresults_voor_KMOs_en_ondernemers
Voorstelling 4Betterresults_voor_KMOs_en_ondernemersKurt Vandewalle
21K views14 slides

Viewers also liked(18)

Ashcraft.edu103.module3 by KaitySue8
Ashcraft.edu103.module3Ashcraft.edu103.module3
Ashcraft.edu103.module3
KaitySue8135 views
Unit 8 - Textbook Lesson 1 by Mirna Deakle
Unit 8 - Textbook Lesson 1Unit 8 - Textbook Lesson 1
Unit 8 - Textbook Lesson 1
Mirna Deakle1.4K views
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T... by SAP Ariba
The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...The Networked Supply Chain -  Gary Philbin, Chief Operating Officer, Dollar T...
The Networked Supply Chain - Gary Philbin, Chief Operating Officer, Dollar T...
SAP Ariba1.1K views
Pedal to the Metal: Accelerating Spark with Silicon Innovation by Jen Aman
Pedal to the Metal: Accelerating Spark with Silicon InnovationPedal to the Metal: Accelerating Spark with Silicon Innovation
Pedal to the Metal: Accelerating Spark with Silicon Innovation
Jen Aman837 views
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers by Kurt Vandewalle
Voorstelling 4Betterresults_voor_KMOs_en_ondernemersVoorstelling 4Betterresults_voor_KMOs_en_ondernemers
Voorstelling 4Betterresults_voor_KMOs_en_ondernemers
Kurt Vandewalle21K views
Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015 by Möbius Business Redesign
Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015
Seminarie 'Sturen op effecten door slimme dashboards' 3 december 2015
Postgres & Redis Sitting in a Tree- Rimas Silkaitis, Heroku by Redis Labs
Postgres & Redis Sitting in a Tree- Rimas Silkaitis, HerokuPostgres & Redis Sitting in a Tree- Rimas Silkaitis, Heroku
Postgres & Redis Sitting in a Tree- Rimas Silkaitis, Heroku
Redis Labs2.4K views
我想請你吃飯 (繁体) by Na Li
我想請你吃飯 (繁体)我想請你吃飯 (繁体)
我想請你吃飯 (繁体)
Na Li667 views
HIgh Performance Redis- Tague Griffith, GoPro by Redis Labs
HIgh Performance Redis- Tague Griffith, GoProHIgh Performance Redis- Tague Griffith, GoPro
HIgh Performance Redis- Tague Griffith, GoPro
Redis Labs1.4K views
[biurowi 5 - en] basic principles of fire protection by AktywBHP
[biurowi 5 - en] basic principles of fire protection[biurowi 5 - en] basic principles of fire protection
[biurowi 5 - en] basic principles of fire protection
AktywBHP5.3K views
Lista de verbos Irregulares - Inglés by Jakeline Cuenca
Lista de verbos Irregulares - InglésLista de verbos Irregulares - Inglés
Lista de verbos Irregulares - Inglés
Jakeline Cuenca62.3K views
Chinese Link Lesson 20 worksheet 2016 by Joanne Chen
Chinese Link Lesson 20  worksheet 2016Chinese Link Lesson 20  worksheet 2016
Chinese Link Lesson 20 worksheet 2016
Joanne Chen680 views
Chinese link textbook Lesson 6 vocabulary by Joanne Chen
Chinese link textbook Lesson 6 vocabulary Chinese link textbook Lesson 6 vocabulary
Chinese link textbook Lesson 6 vocabulary
Joanne Chen638 views
A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak... by CDRN
A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...
A Presentation on "NGO's Role in Disaster Management" Presented by Mr. Deepak...
CDRN22.1K views

Similar to Toulouse Data Science meetup - Apache zeppelin

Unified Data API for Distributed Cloud Analytics and AI by
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AIAlluxio, Inc.
31 views33 slides
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio by
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & AlluxioAlluxio, Inc.
118 views23 slides
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds by
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and CloudsArchitecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and CloudsAlluxio, Inc.
194 views46 slides
Working together with SURF Raymond Oonk Annette Langedijk SURF by
Working together with SURF Raymond Oonk Annette Langedijk SURFWorking together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURFCommunicatieSURF
188 views37 slides
Google Cloud Platform for Data Science teams by
Google Cloud Platform for Data Science teamsGoogle Cloud Platform for Data Science teams
Google Cloud Platform for Data Science teamsBarton Rhodes
385 views20 slides
Hopsworks in the cloud Berlin Buzzwords 2019 by
Hopsworks in the cloud Berlin Buzzwords 2019 Hopsworks in the cloud Berlin Buzzwords 2019
Hopsworks in the cloud Berlin Buzzwords 2019 Jim Dowling
565 views36 slides

Similar to Toulouse Data Science meetup - Apache zeppelin(20)

Unified Data API for Distributed Cloud Analytics and AI by Alluxio, Inc.
Unified Data API for Distributed Cloud Analytics and AIUnified Data API for Distributed Cloud Analytics and AI
Unified Data API for Distributed Cloud Analytics and AI
Alluxio, Inc.31 views
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio by Alluxio, Inc.
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & AlluxioUltra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Alluxio, Inc.118 views
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds by Alluxio, Inc.
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and CloudsArchitecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Architecting a Heterogeneous Data Platform Across Clusters, Regions, and Clouds
Alluxio, Inc.194 views
Working together with SURF Raymond Oonk Annette Langedijk SURF by CommunicatieSURF
Working together with SURF Raymond Oonk Annette Langedijk SURFWorking together with SURF Raymond Oonk Annette Langedijk SURF
Working together with SURF Raymond Oonk Annette Langedijk SURF
CommunicatieSURF188 views
Google Cloud Platform for Data Science teams by Barton Rhodes
Google Cloud Platform for Data Science teamsGoogle Cloud Platform for Data Science teams
Google Cloud Platform for Data Science teams
Barton Rhodes385 views
Hopsworks in the cloud Berlin Buzzwords 2019 by Jim Dowling
Hopsworks in the cloud Berlin Buzzwords 2019 Hopsworks in the cloud Berlin Buzzwords 2019
Hopsworks in the cloud Berlin Buzzwords 2019
Jim Dowling565 views
Run your code serverlessly on Google's open cloud by wesley chun
Run your code serverlessly on Google's open cloudRun your code serverlessly on Google's open cloud
Run your code serverlessly on Google's open cloud
wesley chun172 views
Large scale, interactive ad-hoc queries over different datastores with Apache... by jaxLondonConference
Large scale, interactive ad-hoc queries over different datastores with Apache...Large scale, interactive ad-hoc queries over different datastores with Apache...
Large scale, interactive ad-hoc queries over different datastores with Apache...
jaxLondonConference6.1K views
Containers for sensor web services, applications and research @ Sensor Web Co... by Daniel Nüst
Containers for sensor web services, applications and research @ Sensor Web Co...Containers for sensor web services, applications and research @ Sensor Web Co...
Containers for sensor web services, applications and research @ Sensor Web Co...
Daniel Nüst413 views
Hambug R Meetup - Intro to H2O by Sri Ambati
Hambug R Meetup - Intro to H2OHambug R Meetup - Intro to H2O
Hambug R Meetup - Intro to H2O
Sri Ambati272 views
Netflix Data Engineering @ Uber Engineering Meetup by Blake Irvine
Netflix Data Engineering @ Uber Engineering MeetupNetflix Data Engineering @ Uber Engineering Meetup
Netflix Data Engineering @ Uber Engineering Meetup
Blake Irvine4.7K views
Berlin Hadoop Get Together Apache Drill by MapR Technologies
Berlin Hadoop Get Together Apache Drill Berlin Hadoop Get Together Apache Drill
Berlin Hadoop Get Together Apache Drill
MapR Technologies818 views
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015 by Andrey Vykhodtsev
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Andrey Vykhodtsev526 views
Large Scale Deep Learning with TensorFlow by Jen Aman
Large Scale Deep Learning with TensorFlow Large Scale Deep Learning with TensorFlow
Large Scale Deep Learning with TensorFlow
Jen Aman24.8K views
Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016 by Alluxio, Inc.
Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016
Rise of Intermediate APIs - Beam and Alluxio at Alluxio Meetup 2016
Alluxio, Inc.2.7K views
Data pipelines from zero to solid by Lars Albertsson
Data pipelines from zero to solidData pipelines from zero to solid
Data pipelines from zero to solid
Lars Albertsson10.7K views
Accelerating Spark with Kubernetes by Alluxio, Inc.
Accelerating Spark with KubernetesAccelerating Spark with Kubernetes
Accelerating Spark with Kubernetes
Alluxio, Inc.477 views

Recently uploaded

Future of AR - Facebook Presentation by
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook PresentationRob McCarty
54 views27 slides
State of the Union - Rohit Yadav - Apache CloudStack by
State of the Union - Rohit Yadav - Apache CloudStackState of the Union - Rohit Yadav - Apache CloudStack
State of the Union - Rohit Yadav - Apache CloudStackShapeBlue
218 views53 slides
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...ShapeBlue
128 views20 slides
The Power of Heat Decarbonisation Plans in the Built Environment by
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built EnvironmentIES VE
67 views20 slides
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueShapeBlue
63 views15 slides
"Surviving highload with Node.js", Andrii Shumada by
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada Fwdays
49 views29 slides

Recently uploaded(20)

Future of AR - Facebook Presentation by Rob McCarty
Future of AR - Facebook PresentationFuture of AR - Facebook Presentation
Future of AR - Facebook Presentation
Rob McCarty54 views
State of the Union - Rohit Yadav - Apache CloudStack by ShapeBlue
State of the Union - Rohit Yadav - Apache CloudStackState of the Union - Rohit Yadav - Apache CloudStack
State of the Union - Rohit Yadav - Apache CloudStack
ShapeBlue218 views
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or... by ShapeBlue
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
Zero to Cloud Hero: Crafting a Private Cloud from Scratch with XCP-ng, Xen Or...
ShapeBlue128 views
The Power of Heat Decarbonisation Plans in the Built Environment by IES VE
The Power of Heat Decarbonisation Plans in the Built EnvironmentThe Power of Heat Decarbonisation Plans in the Built Environment
The Power of Heat Decarbonisation Plans in the Built Environment
IES VE67 views
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue by ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlueCloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
CloudStack Object Storage - An Introduction - Vladimir Petrov - ShapeBlue
ShapeBlue63 views
"Surviving highload with Node.js", Andrii Shumada by Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays49 views
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue by ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
2FA and OAuth2 in CloudStack - Andrija Panić - ShapeBlue
ShapeBlue75 views
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue by ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlueElevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
Elevating Privacy and Security in CloudStack - Boris Stoyanov - ShapeBlue
ShapeBlue149 views
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda... by ShapeBlue
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
Hypervisor Agnostic DRS in CloudStack - Brief overview & demo - Vishesh Jinda...
ShapeBlue93 views
Why and How CloudStack at weSystems - Stephan Bienek - weSystems by ShapeBlue
Why and How CloudStack at weSystems - Stephan Bienek - weSystemsWhy and How CloudStack at weSystems - Stephan Bienek - weSystems
Why and How CloudStack at weSystems - Stephan Bienek - weSystems
ShapeBlue172 views
Extending KVM Host HA for Non-NFS Storage - Alex Ivanov - StorPool by ShapeBlue
Extending KVM Host HA for Non-NFS Storage -  Alex Ivanov - StorPoolExtending KVM Host HA for Non-NFS Storage -  Alex Ivanov - StorPool
Extending KVM Host HA for Non-NFS Storage - Alex Ivanov - StorPool
ShapeBlue56 views
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue by ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlueMigrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
Migrating VMware Infra to KVM Using CloudStack - Nicolas Vazquez - ShapeBlue
ShapeBlue147 views
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... by James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson142 views
DRBD Deep Dive - Philipp Reisner - LINBIT by ShapeBlue
DRBD Deep Dive - Philipp Reisner - LINBITDRBD Deep Dive - Philipp Reisner - LINBIT
DRBD Deep Dive - Philipp Reisner - LINBIT
ShapeBlue110 views
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online by ShapeBlue
KVM Security Groups Under the Hood - Wido den Hollander - Your.OnlineKVM Security Groups Under the Hood - Wido den Hollander - Your.Online
KVM Security Groups Under the Hood - Wido den Hollander - Your.Online
ShapeBlue154 views
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ... by ShapeBlue
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
Live Demo Showcase: Unveiling Dell PowerFlex’s IaaS Capabilities with Apache ...
ShapeBlue52 views
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha... by ShapeBlue
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
Mitigating Common CloudStack Instance Deployment Failures - Jithin Raju - Sha...
ShapeBlue113 views
Igniting Next Level Productivity with AI-Infused Data Integration Workflows by Safe Software
Igniting Next Level Productivity with AI-Infused Data Integration Workflows Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Igniting Next Level Productivity with AI-Infused Data Integration Workflows
Safe Software373 views
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates by ShapeBlue
Keynote Talk: Open Source is Not Dead - Charles Schulz - VatesKeynote Talk: Open Source is Not Dead - Charles Schulz - Vates
Keynote Talk: Open Source is Not Dead - Charles Schulz - Vates
ShapeBlue178 views

Toulouse Data Science meetup - Apache zeppelin

  • 1. Apache Zeppelin The (very) short field trip by G.Alléon & G.DupontTDS meetup - 2016.06.30
  • 2. Who are we? Guillaume Alleon - AIRBUS Group Innovation (corporate research center) Research leader for more than 30 people from UK to China, tackling problems in massive data processing and information extraction. Was already in “big data” when it was still called HPC… Gerard Dupont - AIRBUS Defence & Space (space systems) Technical coordinator for R&T studies on distributed processing systems. Spend way too much time processing web data for intelligence, now looking to the sky (satellite data ;-)
  • 3. Zeppelin moto “A web-based notebook that enables interactive data analytics.”
  • 4. Origins & history Missing piece in HADOOP landscape: a modern analytic playground. 2012.12 - Data analytics solution (NFLabs) 2013.10 - Opensourced 2014.12 - ASF incubation 2015 - 3 stable releases 2016.05 - Maturing to Apache top level project
  • 6. What’s cool about Zeppelin ⊕interactive ⊕out-of-the-box spark integration ⊕out-of-the-box visualization options ⊕direct access to DOM for customized visualization ⊕nice UI (bootstrap & angular) ⊕notebook run scheduler ⊕easy to configure ⊕extensibility, extensibility and extensibility...
  • 7. What’s cool about Zeppelin ⊕interactive ⊕out-of-the-box spark integration ⊕out-of-the-box visualization options ⊕direct access to DOM for customized visualization ⊕nice UI (bootstrap & angular) ⊕notebook run scheduler ⊕easy to configure ⊕extensibility, extensibility and extensibility... … the dark side ⊝hard to install ⊝need to build from the source (for customized version) ⊝not (yet) multi-users
  • 8. Overview/look & feel Interpreter text (aka your code) Interpreter config Interactive results
  • 10. Under the hood ○ Interpreter isolation with their own JVM ○ Dynamic dependencies loading ○ REST & websocket on front ○ Thrift in back (or whatever you add) ○ Process scheduler (cron-like)
  • 11. Roadmap Enterprise Ready ○ Multi-tenancy ○ Job scheduler ○ HA Usability Improvement ○ UX improvement ○ Table data support ○ Dynamic interpreter integration ○ Reusable analytic application catalog
  • 12. Thx Offical website: https://zeppelin.apache.org/ Notebook sample: https://www.zeppelinhub.com/viewer Source code: https://github.com/apache/incubator-zeppelin Mailing lists: http://zeppelin.apache.org/community.html This TDS notebook: http://tinyurl.com/zeppelin-tds Sources for this presentation: ○ http://www.slideshare.net/FlinkForward/moon-soo-lee-data-science-lifecycle-with-apache-flink-and-apache-zeppelin/23 ○ http://www.slideshare.net/HadoopSummit/apache-zeppelin-helium-and-beyond ○ http://www.slideshare.net/felixcss/interactive-data-science-from-scratch-with-apache-zeppelin-and-apache-spark ○ http://www.slideshare.net/BrunoBonnin/explorez-vos-donnes-avec-apache-zeppelin credits: https://www.weasyl.com/~uszatyarbuz
  • 14. Origins & history Active core teams Descent number of external contributors Plenty of interpreters (official and external) 0.6.0-SNAPSHOT (pending stabilization)

Editor's Notes

  1. Interactive & extensible Ingestion, Discovery, Analytics, Visualization, Collaboration, Data product Toward better capitalization of analytical application (helium)
  2. ~4 years top level apache project after less than 18 months of incubation
  3. Scala & spark integration Direct DOM for super cool visualization