Spark Summit 2017 - A feedback for TASM

Jean-Georges Perrin
Jean-Georges PerrinSenior Enterprise Architect | Lifetime IBM Champion at The NPD Group
Zaloni Confidential and Proprietary - Provided under NDA
Spark Summit 2017
TASM Feedback
Jean Georges Perrin / jgperrin@zaloni.com
2017-06-22
Zaloni Confidential and Proprietary - Provided under NDA
is hiring!
Check out https://www.zaloni.com/about/careers/
Forbes:
Best Big Data Companies And CEOs To
Work For In 2017
Zaloni Confidential and Proprietary - Provided under NDA
• June 5-7 2017
• San Francisco's Moscone Center
• Just under 3000 attendees
• 11 tracks: Data Science , Data Science 2, Developer, Enterprise, Machine
Learning, Research, Spark Ecosystem, Use Cases, Sponsored Sessions,
Streaming, Technical Deep Dives
• About 30 exhibitors
• About 50 sponsors
• At least four French speakers
• One Zaloni Speaker
Logistics
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
Significant Growth in the Community
Zaloni Confidential and Proprietary - Provided under NDA
• Spark 2.2 is coming:
▪ Cost-based optimizer (IBM contribution).
▪ Structured streaming.
▪ Easier Python Experience (pip support).
• New Databricks Open Source contribution:
▪ Deep Learning.
▪ Streaming Performance.
Announces
Zaloni Confidential and Proprietary - Provided under NDA
Deep & Machine Learning
Zaloni Confidential and Proprietary - Provided under NDA
• Initiative from Databricks
▪ https://databricks.com/blog/2017/06/06/databricks-vision-simplify-large-sc
ale-deep-learning.html
▪ https://github.com/databricks/spark-deep-learning
• Easier integration of TensorFlow and other frameworks
• Partnership with Stanford U
Deep Learning - Making it Easier
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
Yes, it is!
Christopher Ré, Stanford U
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
Streaming
Zaloni Confidential and Proprietary - Provided under NDA
Clearly after Kafka Streams
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
Some Sessions
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
• Building a Mica-like tool internally.
• Looking at Open-Sourcing it.
• Video: https://www.youtube.com/watch?v=-hDIkTUPhZY&feature=youtu.be
• Slides:
https://www.slideshare.net/databricks/using-sparkml-to-power-a-dsaas-data-sc
ience-as-a-service-with-kiran-muglurmath-and-sridhar-alla
Comcast
Zaloni Confidential and Proprietary - Provided under NDA
Sunning Too...
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
Sunning's Extensions to Spark ML
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
• Giving ML capabilities to Business Users, mainly in fraud detection.
• Slides:
https://www.slideshare.net/databricks/machine-learning-as-a-service-apache-s
park-mllib-enrichment-and-webbased-codeless-modeling-with-zhengyi-le
• Video: https://www.youtube.com/watch?v=R4VEHoCvHy4&feature=youtu.be
Sunning - ML as a Service
Zaloni Confidential and Proprietary - Provided under NDA
A Religion War about to Start?
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
The Cloud is Too (Damn) Hard!
Zaloni Confidential and Proprietary - Provided under NDA
More and more of NLP and Spark
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
More Keynotes
Zaloni Confidential and Proprietary - Provided under NDA
Serverless is the Future of Cloud
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
• Dynamic allocation of resources.
• More flexibility for the customers.
• Lower TCO.
• Non-blocking jobs.
• Faster.
• Matching Amazon offers?
Serverless
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Up to 12x Faster
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Intermission with Ben, Ion, and Matei
Ben Lorica (O’Reilly Media)
Ion Stoica (UC Berkeley AMP/RISELab & Databricks)
Matei Zaharia
(Databricks)
Zaloni Confidential and Proprietary - Provided under NDA
Various Takeaways
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
DRY & DRO
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Smarter Notebooks
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Microsoft Fully Embracing the Apache Stack
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
Finally Some Common Sense!
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
2.2 rocks!
• Simply Faster.
▪ Autoboxing kills performance!
▪ Scala sucks (yeah!)
▪ Better Catalyst, including cost-based optimizer (donated by IBM).
Zaloni Confidential and Proprietary - Provided under NDA
GPU Analytics is a Trend
Zaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
• IBM mentioned it.
• 4 sessions on the subject.
▪ 3 sessions on GPU
▪ 2 sessions on FPGA
• Vendors: MapD, Intel, Nvidia.
Analytics on GPU? FPGA?
Zaloni Confidential and Proprietary - Provided under NDA
Vendors
Zaloni Confidential and Proprietary - Provided under NDA
Classics
• Databricks
• Intel
• IBM
• Cloudera
• Pepperdata
• Cask
• Mesosphere
• Google Cloud
• Amazon
• Mapr
• Netapp
• BlueTalon
• DataIku
• Talend
• MemSQL
• Redis
• Microsoft
• Confluent
• VMware
• ...
• Not Hortonworks
Zaloni Confidential and Proprietary - Provided under NDA
• Gridgain - in memory DB
• SnappyData - in memory DB
• Target - looking to hire people
• Yelp! - looking to hire people
Others
Zaloni Confidential and Proprietary - Provided under NDA
Freebie
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
And the best session of all times...
Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
Zaloni Confidential and Proprietary - Provided under NDA
• Video:
https://www.youtube.com/watch?v=ka8xhQAoj-E&feature=youtu.be
(go like it!)
• Slides:
▪ On Databricks' channel:
https://www.slideshare.net/databricks/the-key-to-machine-learning-is-prep
ping-the-right-data-with-jean-georges-perrin
(go like it!)
▪ On my channel:
https://www.slideshare.net/jgperrin/the-key-to-machine-learning-is-preppin
g-the-right-data
(go like it!)
The Key to ML is Prepping the Right Data
Zaloni Confidential and Proprietary - Provided under NDA
Thank you
1 of 55

Recommended

台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式 by
台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式
台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式台灣紫牛創業協會(Taiwan Techmakers Association)
630 views90 slides
台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式 by
台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式
台灣新創團隊中國經驗談:人工智能視頻廣告為快速變現的創新模式台灣紫牛創業協會(Taiwan Techmakers Association)
224 views90 slides
Fast Delivery DevOps Israel by
Fast Delivery DevOps IsraelFast Delivery DevOps Israel
Fast Delivery DevOps IsraelAdrian Cockcroft
15.6K views98 slides
Distributed systems in practice, in theory by
Distributed systems in practice, in theoryDistributed systems in practice, in theory
Distributed systems in practice, in theoryAysylu Greenberg
2.1K views111 slides
[CB19] I KNOW WHAT YOU DID LAST NIGHT : Pwning The State-Of-The-Art the IoT H... by
[CB19] I KNOW WHAT YOU DID LAST NIGHT : Pwning The State-Of-The-Art the IoT H...[CB19] I KNOW WHAT YOU DID LAST NIGHT : Pwning The State-Of-The-Art the IoT H...
[CB19] I KNOW WHAT YOU DID LAST NIGHT : Pwning The State-Of-The-Art the IoT H...CODE BLUE
338 views59 slides
Distributed Systems in Practice, in Theory by
Distributed Systems in Practice, in TheoryDistributed Systems in Practice, in Theory
Distributed Systems in Practice, in TheoryC4Media
506 views114 slides

More Related Content

Similar to Spark Summit 2017 - A feedback for TASM

Symbian: collaboration, open, closed, dead? by
Symbian: collaboration, open, closed, dead?Symbian: collaboration, open, closed, dead?
Symbian: collaboration, open, closed, dead?Stephen Walli
524 views16 slides
Beyond the Google Search Appliance with Lucidworks Fusion by
Beyond the Google Search Appliance with Lucidworks Fusion Beyond the Google Search Appliance with Lucidworks Fusion
Beyond the Google Search Appliance with Lucidworks Fusion MC+A
560 views71 slides
How to Backdoor Diffie-Hellman by
How to Backdoor Diffie-HellmanHow to Backdoor Diffie-Hellman
How to Backdoor Diffie-HellmanDavid Wong
733 views94 slides
JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE... by
JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE...JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE...
JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE...CloudBees
293 views47 slides
btNOG 4: DNSSEC Key Rollover by
btNOG 4: DNSSEC Key RolloverbtNOG 4: DNSSEC Key Rollover
btNOG 4: DNSSEC Key RolloverAPNIC
466 views33 slides
Zenko @Cloud Native Foundation London Meetup March 6th 2018 by
Zenko @Cloud Native Foundation London Meetup March 6th 2018Zenko @Cloud Native Foundation London Meetup March 6th 2018
Zenko @Cloud Native Foundation London Meetup March 6th 2018Laure Vergeron
450 views29 slides

Similar to Spark Summit 2017 - A feedback for TASM(20)

Symbian: collaboration, open, closed, dead? by Stephen Walli
Symbian: collaboration, open, closed, dead?Symbian: collaboration, open, closed, dead?
Symbian: collaboration, open, closed, dead?
Stephen Walli524 views
Beyond the Google Search Appliance with Lucidworks Fusion by MC+A
Beyond the Google Search Appliance with Lucidworks Fusion Beyond the Google Search Appliance with Lucidworks Fusion
Beyond the Google Search Appliance with Lucidworks Fusion
MC+A560 views
How to Backdoor Diffie-Hellman by David Wong
How to Backdoor Diffie-HellmanHow to Backdoor Diffie-Hellman
How to Backdoor Diffie-Hellman
David Wong733 views
JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE... by CloudBees
JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE...JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE...
JUC Europe 2015: Continuous Integration and Distribution in the Cloud with DE...
CloudBees293 views
btNOG 4: DNSSEC Key Rollover by APNIC
btNOG 4: DNSSEC Key RolloverbtNOG 4: DNSSEC Key Rollover
btNOG 4: DNSSEC Key Rollover
APNIC466 views
Zenko @Cloud Native Foundation London Meetup March 6th 2018 by Laure Vergeron
Zenko @Cloud Native Foundation London Meetup March 6th 2018Zenko @Cloud Native Foundation London Meetup March 6th 2018
Zenko @Cloud Native Foundation London Meetup March 6th 2018
Laure Vergeron450 views
Puppet Camp Charlotte 2015: Introduction to SIMP: An Open Source Infrastructu... by Puppet
Puppet Camp Charlotte 2015: Introduction to SIMP: An Open Source Infrastructu...Puppet Camp Charlotte 2015: Introduction to SIMP: An Open Source Infrastructu...
Puppet Camp Charlotte 2015: Introduction to SIMP: An Open Source Infrastructu...
Puppet2.6K views
Intro to OSGi – the Microservices kernel - P Kriens & T Ward by mfrancis
Intro to OSGi – the Microservices kernel - P Kriens & T WardIntro to OSGi – the Microservices kernel - P Kriens & T Ward
Intro to OSGi – the Microservices kernel - P Kriens & T Ward
mfrancis3.7K views
通信の秘密とブロッキング by 751c74dc
通信の秘密とブロッキング通信の秘密とブロッキング
通信の秘密とブロッキング
751c74dc73 views
SharePoint Saturday Munich 2015 - Office 365 Next-gen portals driving your b... by Jasper Oosterveld
SharePoint Saturday Munich 2015 -  Office 365 Next-gen portals driving your b...SharePoint Saturday Munich 2015 -  Office 365 Next-gen portals driving your b...
SharePoint Saturday Munich 2015 - Office 365 Next-gen portals driving your b...
Jasper Oosterveld3.9K views
Kali Linux - Falconer - ISS 2014 by TGodfrey
Kali Linux - Falconer - ISS 2014Kali Linux - Falconer - ISS 2014
Kali Linux - Falconer - ISS 2014
TGodfrey2.6K views
Creative venturing creative funding v2 12 06-2013 for distribution by Fas (Feisal) Mosleh
Creative venturing creative funding v2 12 06-2013 for distributionCreative venturing creative funding v2 12 06-2013 for distribution
Creative venturing creative funding v2 12 06-2013 for distribution
Schema.org Structured data the What, Why, & How by Richard Wallis
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & How
Richard Wallis2.8K views
Goto Berlin - Migrating to Microservices (Fast Delivery) by Adrian Cockcroft
Goto Berlin - Migrating to Microservices (Fast Delivery)Goto Berlin - Migrating to Microservices (Fast Delivery)
Goto Berlin - Migrating to Microservices (Fast Delivery)
Adrian Cockcroft24.1K views
Kali Linux - CleveSec 2015 by TGodfrey
Kali Linux - CleveSec 2015Kali Linux - CleveSec 2015
Kali Linux - CleveSec 2015
TGodfrey5.5K views

More from Jean-Georges Perrin

It's painful how much data rules the world by
It's painful how much data rules the worldIt's painful how much data rules the world
It's painful how much data rules the worldJean-Georges Perrin
305 views53 slides
Apache Spark v3.0.0 by
Apache Spark v3.0.0Apache Spark v3.0.0
Apache Spark v3.0.0Jean-Georges Perrin
262 views34 slides
Big data made easy with a Spark by
Big data made easy with a SparkBig data made easy with a Spark
Big data made easy with a SparkJean-Georges Perrin
376 views79 slides
Why i love Apache Spark? by
Why i love Apache Spark?Why i love Apache Spark?
Why i love Apache Spark?Jean-Georges Perrin
320 views24 slides
Big Data made easy with a Spark by
Big Data made easy with a SparkBig Data made easy with a Spark
Big Data made easy with a SparkJean-Georges Perrin
689 views76 slides
The road to AI is paved with pragmatic intentions by
The road to AI is paved with pragmatic intentionsThe road to AI is paved with pragmatic intentions
The road to AI is paved with pragmatic intentionsJean-Georges Perrin
352 views59 slides

More from Jean-Georges Perrin(20)

The road to AI is paved with pragmatic intentions by Jean-Georges Perrin
The road to AI is paved with pragmatic intentionsThe road to AI is paved with pragmatic intentions
The road to AI is paved with pragmatic intentions
Spark Summit Europe Wrap Up and TASM State of the Community by Jean-Georges Perrin
Spark Summit Europe Wrap Up and TASM State of the CommunitySpark Summit Europe Wrap Up and TASM State of the Community
Spark Summit Europe Wrap Up and TASM State of the Community
2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe... by Jean-Georges Perrin
2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...
2CRSI presentation for ISC-HPC: When High-Performance Computing meets High-Pe...
Jean-Georges Perrin1.2K views
Vision stratégique de l'utilisation de l'(Open)Data dans l'entreprise by Jean-Georges Perrin
Vision stratégique de l'utilisation de l'(Open)Data dans l'entrepriseVision stratégique de l'utilisation de l'(Open)Data dans l'entreprise
Vision stratégique de l'utilisation de l'(Open)Data dans l'entreprise
Jean-Georges Perrin1.8K views
A la découverte des nouvelles tendances du web (Mulhouse Edition) by Jean-Georges Perrin
A la découverte des nouvelles tendances du web (Mulhouse Edition)A la découverte des nouvelles tendances du web (Mulhouse Edition)
A la découverte des nouvelles tendances du web (Mulhouse Edition)
MashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvory by Jean-Georges Perrin
MashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvoryMashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvory
MashupXFeed et la stratégie éditoriale - Workshop Activis - GreenIvory
MashupXFeed et le référencement - Workshop Activis - Greenivory by Jean-Georges Perrin
MashupXFeed et le référencement - Workshop Activis - GreenivoryMashupXFeed et le référencement - Workshop Activis - Greenivory
MashupXFeed et le référencement - Workshop Activis - Greenivory

Recently uploaded

Dapr Unleashed: Accelerating Microservice Development by
Dapr Unleashed: Accelerating Microservice DevelopmentDapr Unleashed: Accelerating Microservice Development
Dapr Unleashed: Accelerating Microservice DevelopmentMiroslav Janeski
10 views29 slides
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P... by
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...NimaTorabi2
8 views17 slides
Agile 101 by
Agile 101Agile 101
Agile 101John Valentino
7 views20 slides
tecnologia18.docx by
tecnologia18.docxtecnologia18.docx
tecnologia18.docxnosi6702
5 views5 slides
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra... by
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra....NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...Marc Müller
38 views62 slides
Unleash The Monkeys by
Unleash The MonkeysUnleash The Monkeys
Unleash The MonkeysJacob Duijzer
7 views28 slides

Recently uploaded(20)

Dapr Unleashed: Accelerating Microservice Development by Miroslav Janeski
Dapr Unleashed: Accelerating Microservice DevelopmentDapr Unleashed: Accelerating Microservice Development
Dapr Unleashed: Accelerating Microservice Development
Miroslav Janeski10 views
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P... by NimaTorabi2
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...
Unlocking the Power of AI in Product Management - A Comprehensive Guide for P...
NimaTorabi28 views
tecnologia18.docx by nosi6702
tecnologia18.docxtecnologia18.docx
tecnologia18.docx
nosi67025 views
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra... by Marc Müller
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra....NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
.NET Developer Conference 2023 - .NET Microservices mit Dapr – zu viel Abstra...
Marc Müller38 views
Sprint 226 by ManageIQ
Sprint 226Sprint 226
Sprint 226
ManageIQ5 views
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx by animuscrm
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx
2023-November-Schneider Electric-Meetup-BCN Admin Group.pptx
animuscrm14 views
SUGCON ANZ Presentation V2.1 Final.pptx by Jack Spektor
SUGCON ANZ Presentation V2.1 Final.pptxSUGCON ANZ Presentation V2.1 Final.pptx
SUGCON ANZ Presentation V2.1 Final.pptx
Jack Spektor22 views
AI and Ml presentation .pptx by FayazAli87
AI and Ml presentation .pptxAI and Ml presentation .pptx
AI and Ml presentation .pptx
FayazAli8711 views
Copilot Prompting Toolkit_All Resources.pdf by Riccardo Zamana
Copilot Prompting Toolkit_All Resources.pdfCopilot Prompting Toolkit_All Resources.pdf
Copilot Prompting Toolkit_All Resources.pdf
Riccardo Zamana8 views
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium... by Lisi Hocke
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...
Team Transformation Tactics for Holistic Testing and Quality (Japan Symposium...
Lisi Hocke30 views
Quality Engineer: A Day in the Life by John Valentino
Quality Engineer: A Day in the LifeQuality Engineer: A Day in the Life
Quality Engineer: A Day in the Life
John Valentino6 views
Myths and Facts About Hospice Care: Busting Common Misconceptions by Care Coordinations
Myths and Facts About Hospice Care: Busting Common MisconceptionsMyths and Facts About Hospice Care: Busting Common Misconceptions
Myths and Facts About Hospice Care: Busting Common Misconceptions
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with... by sparkfabrik
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...
20231129 - Platform @ localhost 2023 - Application-driven infrastructure with...
sparkfabrik5 views
DSD-INT 2023 The Danube Hazardous Substances Model - Kovacs by Deltares
DSD-INT 2023 The Danube Hazardous Substances Model - KovacsDSD-INT 2023 The Danube Hazardous Substances Model - Kovacs
DSD-INT 2023 The Danube Hazardous Substances Model - Kovacs
Deltares8 views

Spark Summit 2017 - A feedback for TASM

  • 1. Zaloni Confidential and Proprietary - Provided under NDA Spark Summit 2017 TASM Feedback Jean Georges Perrin / jgperrin@zaloni.com 2017-06-22
  • 2. Zaloni Confidential and Proprietary - Provided under NDA is hiring! Check out https://www.zaloni.com/about/careers/ Forbes: Best Big Data Companies And CEOs To Work For In 2017
  • 3. Zaloni Confidential and Proprietary - Provided under NDA • June 5-7 2017 • San Francisco's Moscone Center • Just under 3000 attendees • 11 tracks: Data Science , Data Science 2, Developer, Enterprise, Machine Learning, Research, Spark Ecosystem, Use Cases, Sponsored Sessions, Streaming, Technical Deep Dives • About 30 exhibitors • About 50 sponsors • At least four French speakers • One Zaloni Speaker Logistics
  • 4. Zaloni Confidential and Proprietary - Provided under NDA
  • 5. Zaloni Confidential and Proprietary - Provided under NDA Significant Growth in the Community
  • 6. Zaloni Confidential and Proprietary - Provided under NDA • Spark 2.2 is coming: ▪ Cost-based optimizer (IBM contribution). ▪ Structured streaming. ▪ Easier Python Experience (pip support). • New Databricks Open Source contribution: ▪ Deep Learning. ▪ Streaming Performance. Announces
  • 7. Zaloni Confidential and Proprietary - Provided under NDA Deep & Machine Learning
  • 8. Zaloni Confidential and Proprietary - Provided under NDA • Initiative from Databricks ▪ https://databricks.com/blog/2017/06/06/databricks-vision-simplify-large-sc ale-deep-learning.html ▪ https://github.com/databricks/spark-deep-learning • Easier integration of TensorFlow and other frameworks • Partnership with Stanford U Deep Learning - Making it Easier
  • 9. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 10. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 11. Zaloni Confidential and Proprietary - Provided under NDA Yes, it is! Christopher Ré, Stanford U Zaloni Confidential and Proprietary - Provided under NDA
  • 12. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 13. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 14. Zaloni Confidential and Proprietary - Provided under NDA Streaming
  • 15. Zaloni Confidential and Proprietary - Provided under NDA Clearly after Kafka Streams Zaloni Confidential and Proprietary - Provided under NDA
  • 16. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 17. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 18. Zaloni Confidential and Proprietary - Provided under NDA Some Sessions
  • 19. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 20. Zaloni Confidential and Proprietary - Provided under NDA • Building a Mica-like tool internally. • Looking at Open-Sourcing it. • Video: https://www.youtube.com/watch?v=-hDIkTUPhZY&feature=youtu.be • Slides: https://www.slideshare.net/databricks/using-sparkml-to-power-a-dsaas-data-sc ience-as-a-service-with-kiran-muglurmath-and-sridhar-alla Comcast
  • 21. Zaloni Confidential and Proprietary - Provided under NDA Sunning Too... Zaloni Confidential and Proprietary - Provided under NDA
  • 22. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 23. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 24. Zaloni Confidential and Proprietary - Provided under NDA Sunning's Extensions to Spark ML Zaloni Confidential and Proprietary - Provided under NDA
  • 25. Zaloni Confidential and Proprietary - Provided under NDA • Giving ML capabilities to Business Users, mainly in fraud detection. • Slides: https://www.slideshare.net/databricks/machine-learning-as-a-service-apache-s park-mllib-enrichment-and-webbased-codeless-modeling-with-zhengyi-le • Video: https://www.youtube.com/watch?v=R4VEHoCvHy4&feature=youtu.be Sunning - ML as a Service
  • 26. Zaloni Confidential and Proprietary - Provided under NDA A Religion War about to Start? Zaloni Confidential and Proprietary - Provided under NDA
  • 27. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA The Cloud is Too (Damn) Hard!
  • 28. Zaloni Confidential and Proprietary - Provided under NDA More and more of NLP and Spark Zaloni Confidential and Proprietary - Provided under NDA
  • 29. Zaloni Confidential and Proprietary - Provided under NDA More Keynotes
  • 30. Zaloni Confidential and Proprietary - Provided under NDA Serverless is the Future of Cloud Zaloni Confidential and Proprietary - Provided under NDA
  • 31. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 32. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 33. Zaloni Confidential and Proprietary - Provided under NDA • Dynamic allocation of resources. • More flexibility for the customers. • Lower TCO. • Non-blocking jobs. • Faster. • Matching Amazon offers? Serverless
  • 34. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA Up to 12x Faster
  • 35. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA Intermission with Ben, Ion, and Matei Ben Lorica (O’Reilly Media) Ion Stoica (UC Berkeley AMP/RISELab & Databricks) Matei Zaharia (Databricks)
  • 36. Zaloni Confidential and Proprietary - Provided under NDA Various Takeaways
  • 37. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA DRY & DRO
  • 38. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA Smarter Notebooks
  • 39. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA Microsoft Fully Embracing the Apache Stack
  • 40. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 41. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 42. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 43. Zaloni Confidential and Proprietary - Provided under NDA Finally Some Common Sense! Zaloni Confidential and Proprietary - Provided under NDA
  • 44. Zaloni Confidential and Proprietary - Provided under NDA 2.2 rocks! • Simply Faster. ▪ Autoboxing kills performance! ▪ Scala sucks (yeah!) ▪ Better Catalyst, including cost-based optimizer (donated by IBM).
  • 45. Zaloni Confidential and Proprietary - Provided under NDA GPU Analytics is a Trend Zaloni Confidential and Proprietary - Provided under NDA
  • 46. Zaloni Confidential and Proprietary - Provided under NDA • IBM mentioned it. • 4 sessions on the subject. ▪ 3 sessions on GPU ▪ 2 sessions on FPGA • Vendors: MapD, Intel, Nvidia. Analytics on GPU? FPGA?
  • 47. Zaloni Confidential and Proprietary - Provided under NDA Vendors
  • 48. Zaloni Confidential and Proprietary - Provided under NDA Classics • Databricks • Intel • IBM • Cloudera • Pepperdata • Cask • Mesosphere • Google Cloud • Amazon • Mapr • Netapp • BlueTalon • DataIku • Talend • MemSQL • Redis • Microsoft • Confluent • VMware • ... • Not Hortonworks
  • 49. Zaloni Confidential and Proprietary - Provided under NDA • Gridgain - in memory DB • SnappyData - in memory DB • Target - looking to hire people • Yelp! - looking to hire people Others
  • 50. Zaloni Confidential and Proprietary - Provided under NDA Freebie
  • 51. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 52. Zaloni Confidential and Proprietary - Provided under NDA And the best session of all times...
  • 53. Zaloni Confidential and Proprietary - Provided under NDAZaloni Confidential and Proprietary - Provided under NDA
  • 54. Zaloni Confidential and Proprietary - Provided under NDA • Video: https://www.youtube.com/watch?v=ka8xhQAoj-E&feature=youtu.be (go like it!) • Slides: ▪ On Databricks' channel: https://www.slideshare.net/databricks/the-key-to-machine-learning-is-prep ping-the-right-data-with-jean-georges-perrin (go like it!) ▪ On my channel: https://www.slideshare.net/jgperrin/the-key-to-machine-learning-is-preppin g-the-right-data (go like it!) The Key to ML is Prepping the Right Data
  • 55. Zaloni Confidential and Proprietary - Provided under NDA Thank you