SlideShare a Scribd company logo
1 of 9
Download to read offline
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
Yes 28 27.7%
No 73 72.3%
Core Spark 70 69.3%
Spark SQL + DataFrames 78 77.2%
Spark Streaming 66 65.3%
MLlib (machine learning) 72 71.3%
GraphX 30 29.7%
Zero Knowledge 44 43.6%
Beginner 47 46.5%
Medium 9 8.9%
Expert 1 1%
101 responses
Summary
Have you edited Wikipedia articles before?
Which of the following Spark components are you mostly interested in using after class?
Scala [Which programming language API of Spark are you most comfortable in?]
Java [Which programming language API of Spark are you most comfortable in?]
72.3%
27.7%
0 15 30 45 60 75
Core Spark
Spark SQL +…
Spark Stream…
MLlib (machin…
GraphX
0 10 20 30 40
Zero Knowled…
Beginner
Medium
Expert
SIGN IN
The version of the browser you are using is no longer supported. Please upgrade to a supported browser. Dismiss
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
Zero Knowledge 18 17.8%
Beginner 21 20.8%
Medium 41 40.6%
Expert 21 20.8%
Zero Knowledge 30 29.7%
Beginner 31 30.7%
Medium 27 26.7%
Expert 13 12.9%
Zero Knowledge 6 5.9%
Beginner 16 15.8%
Medium 54 53.5%
Expert 25 24.8%
Zero Knowledge 33 32.7%
Beginner 43 42.6%
Medium 19 18.8%
Expert 6 5.9%
Development (how to write Spark apps, API coverage, debugging) 86 85.1%
Python [Which programming language API of Spark are you most comfortable in?]
SQL [Which programming language API of Spark are you most comfortable in?]
R [Which programming language API of Spark are you most comfortable in?]
I would like the focus of the class to be:
0 10 20 30 40
Zero Knowled…
Beginner
Medium
Expert
0.0 7.5 15.0 22.5 30.0
Zero Knowled…
Beginner
Medium
Expert
0 10 20 30 40 50
Zero Knowled…
Beginner
Medium
Expert
0 10 20 30 40
Zero Knowled…
Beginner
Medium
Expert
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
Administration / Ops (how Spark scales, configuration parameters, tuning) 39 38.6%
Architecture (how the JVMs interact with each other, Spark Standalone, YARN integration, etc) 62 61.4%
Use Cases (non-technical section on how companies are using Spark) 55 54.5%
Level 0: I am a totally new to Spark 50 49.5%
Level 1: I have launched the Spark shell and executed a few transformations & actions and looked at the Spark UIs 32 31.7%
Level 2: I have either written 100+ lines of code for a Spark application or I understand the following: what narrow vs wide dependencies are, how to figure out which transformations cause a shuffle 15 14.9%
Level 3: I have been using Spark greater than 50% of the time in my job for over 2 months in either a development or administration role 4 4%
Level 4: I have contributed at least 20 lines of code to the Apache Spark project 0 0%
Class day will be my first hands-on exposure to programming in Spark 52 51.5%
I have been playing with the Spark shells for less than a week 26 25.7%
I have under 1 month of experience with Spark 7 6.9%
I have 1 - 6 months of experience with Spark 13 12.9%
6 - 12 months 2 2%
1+ year 1 1%
How experienced are you with Spark?
For how long have you been doing hands-on Development or Operations work with Apache Spark?
Where are you in the Spark usage lifecycle?
0 20 40 60 80
Development…
Administratio…
Architecture (…
Use Cases (no…
14.9%
31.7%
49.5%
25.7%
51.5%
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
Just starting to learn about Spark, reading about it... 61 60.4%
I have a small 1-node Spark cluster or VM that I'm playing around with 15 14.9%
I am currently building a Proof of Concept or Prototype to demonstrate a use case 21 20.8%
We are in production! 4 4%
Please select which of the following Big Data technologies you have at least "medium" level technical proficiency in:
20.8%
14.9%
60.4%
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
HDFS 64 63.4%
MapReduce 50 49.5%
YARN 31 30.7%
Mesos 2 2%
Cascading 7 6.9%
Kafka 19 18.8%
Storm 9 8.9%
Flume 12 11.9%
HBase 20 19.8%
Cassandra 11 10.9%
Hive 42 41.6%
Impala 13 12.9%
Pig 24 23.8%
Parquet 16 15.8%
ZooKeeper 20 19.8%
MongoDB 26 25.7%
Couchbase 4 4%
Neo4j 5 5%
Titan 0 0%
Oozie 16 15.8%
Sqoop 17 16.8%
Giraph or Graphlab 2 2%
Accumulo 0 0%
Phoenix 3 3%
Tez 7 6.9%
ElasticSearch 15 14.9%
Lucene / Solr 19 18.8%
Math: Statistics, Linear Algebra, Calculus, Matrix math, etc 46 45.5%
0 15 30 45 60
HDFS
MapReduce
YARN
Mesos
Cascading
Kafka
Storm
Flume
HBase
Cassandra
Hive
Impala
Pig
Parquet
ZooKeeper
MongoDB
Couchbase
Neo4j
Titan
Oozie
Sqoop
Giraph or…
Accumulo
Phoenix
Tez
ElasticSe…
Lucene /…
Math: Sta…
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
with Databricks Cloud 6 5.9%
with Hadoop (YARN/HDFS) 55 54.5%
with Cassandra (Standalone mode) 10 9.9%
with pure Apache Spark (Standalone mode) 21 20.8%
with Mesos 6 5.9%
I don't know yet 37 36.6%
Within Amazon Cloud 31 30.7%
On-premise within our private data center 73 72.3%
A different cloud provider 17 16.8%
AmpCamp training at UC Berkeley (Academic) 0 0%
SparkCamp training from Databricks (Industry) 3 3%
Cloudera Spark training 2 2%
Another vendor's Spark training 2 2%
Spark Summit conference 2 2%
None of the above 94 93.1%
How are you planning on deploying Spark within your organization?
Where do you plan on deploying Spark clusters for your organization?
Which of the following Spark training sessions, if any, have you attended before?
Which industry do you work in?
0 10 20 30 40 50
w ith Databric…
w ith Hadoop…
w ith Cassan…
w ith pure Ap…
w ith Mesos
I don't know yet
Within Amazo…
On-premise w…
A different cl…
0 20 40 60 80
AmpCamp tra…
SparkCamp tr…
Cloudera Spa…
Another ven…
Spark Summit…
None of the a…
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
IT / Systems / Solution Provider / IT Consultancy 53 52.5%
Banking / Finance 17 16.8%
Science & Technology 8 7.9%
Academia / University / Education 2 2%
Advertising / Marketing / PR 3 3%
Telecommunications 8 7.9%
Healthcare / Medical / Pharmaceuticals 5 5%
Publishing / Media 4 4%
Retailer / Distributor / Wholesale 2 2%
Government 6 5.9%
Insurance 0 0%
Legal 2 2%
Manufacturing / Design 6 5.9%
Nonprofit 2 2%
Business Services Consulting (Non-IT) 3 3%
Other 6 5.9%
Developer / Software Engineer / Software Architect 56 55.4%
Administrator / Operations / DevOps 8 7.9%
Data Scientist / Statistics / Machine Learning 40 39.6%
Management / Executive 8 7.9%
Sales / Marketing 3 3%
Other 5 5%
Which of the following job categories best describes your role at your company?
How far did you travel from to attend this class?
0 10 20 30 40 50
IT / System…
Banking / Fi…
Science &…
Academia /…
Advertising…
Telecomm…
Healthcare…
Publishing…
Retailer / Di…
Government
Insurance
Legal
Manufacturi…
Nonprofit
Business S…
Other
0 10 20 30 40 50
Developer / S…
Administrator…
Data Scientis…
Management…
Sales / Marke…
Other
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API
Singapore: I live in Singapore already 54 53.5%
USA: I live in the Western half of the United States (like San Francisco, Seattle, Denver, Portland) 0 0%
USA: I live in the Eastern half of United States (NYC, D.C., Atlanta, etc) 1 1%
INTERNATIONAL: I flew in from an Asian country like Japan, China, India, South Korea, etc 35 34.7%
INTERNATIONAL: I am coming from a European country 4 4%
INTERNATIONAL: Other 7 6.9%
OPTIONAL: Finally, please freely describe your experience with Spark so far.
RDD vs DataFrames; which one to focus on
I am beginer. We are exploring Apache spark to implement some of the use cases in our organization.
Fast & simpler than MR
Interesting
huge amount of data
class loader problems. :(
Australia
we are going to implement Spark in our current project
I have been exploring spark mostly from Hadoop Data Processing
OPTIONAL: Is there anything you want to communicate to the instructor?
Want to hear more on Real-Time Architecture
I have heard about the limitations of dataframes of 22 columns due to the tupes limitations. How do you overcome this?
Thank you
Slow if it is possible as I am new to Spark
Could you talk about the trade-offs between developing in RDDs vs Dataframes? Data frames are great and reduce development time, but are RDDs significantly faster? Also, could you also
talk about the trade-offs between developing in Scala vs Python? Python is more easily maintainable but lags behind Scala in terms of Spark release.
Nothing , as of now
Number of daily responses
34.7%
53.5%
0
30
60
90
120
pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API

More Related Content

Similar to Strata singapore survey

Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Tiny Batches, in the wine: Shiny New Bits in Spark StreamingTiny Batches, in the wine: Shiny New Bits in Spark Streaming
Tiny Batches, in the wine: Shiny New Bits in Spark StreamingPaco Nathan
 
Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)
Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)
Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)VMware Tanzu
 
SemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformSemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformClark & Parsia LLC
 
30 Skills to Master to Become a Senior Software Engineer
30 Skills to Master to Become a Senior Software Engineer30 Skills to Master to Become a Senior Software Engineer
30 Skills to Master to Become a Senior Software EngineerSean Coates
 
BDM26: Spark Summit 2014 Debriefing
BDM26: Spark Summit 2014 DebriefingBDM26: Spark Summit 2014 Debriefing
BDM26: Spark Summit 2014 DebriefingDavid Lauzon
 
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and MoreStrata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and MorePaco Nathan
 
2013 - Dustin whittle - Escalando PHP en la vida real
2013 - Dustin whittle - Escalando PHP en la vida real2013 - Dustin whittle - Escalando PHP en la vida real
2013 - Dustin whittle - Escalando PHP en la vida realPHP Conference Argentina
 
Where do you want to go today 2007
Where do you want to go today   2007Where do you want to go today   2007
Where do you want to go today 2007Mike Feltman
 
Java web and application development services
Java web and application development servicesJava web and application development services
Java web and application development servicesNexSoftsys
 
JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...
JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...
JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...David Taieb
 
8_reasons_php_developers_love_using_laravel.pptx
8_reasons_php_developers_love_using_laravel.pptx8_reasons_php_developers_love_using_laravel.pptx
8_reasons_php_developers_love_using_laravel.pptxsarah david
 
Как да станем софтуерни инженери и да стартираме ИТ бизнес?
Как да станем софтуерни инженери и да стартираме ИТ бизнес?Как да станем софтуерни инженери и да стартираме ИТ бизнес?
Как да станем софтуерни инженери и да стартираме ИТ бизнес?Svetlin Nakov
 
How To be a Backend developer
How To be a Backend developer    How To be a Backend developer
How To be a Backend developer Ramy Hakam
 
STLDODN - Get Rid of CRUD faster!
STLDODN - Get Rid of CRUD faster!STLDODN - Get Rid of CRUD faster!
STLDODN - Get Rid of CRUD faster!kshaffar
 
How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...
How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...
How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...Aspire Techsoft Academy
 
A Technical Driven Seminar
A Technical Driven SeminarA Technical Driven Seminar
A Technical Driven SeminarDeepak Chawla
 
Where do you want to go today
Where do you want to go todayWhere do you want to go today
Where do you want to go todayMike Feltman
 

Similar to Strata singapore survey (20)

WoMakersCode 2016 - Shit Happens
WoMakersCode 2016 -  Shit HappensWoMakersCode 2016 -  Shit Happens
WoMakersCode 2016 - Shit Happens
 
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
Tiny Batches, in the wine: Shiny New Bits in Spark StreamingTiny Batches, in the wine: Shiny New Bits in Spark Streaming
Tiny Batches, in the wine: Shiny New Bits in Spark Streaming
 
Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)
Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)
Spring Boot & Spring Cloud on PAS- Nate Schutta (1/2)
 
General Learning.pptx
General Learning.pptxGeneral Learning.pptx
General Learning.pptx
 
SemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformSemTech 2010: Pelorus Platform
SemTech 2010: Pelorus Platform
 
30 Skills to Master to Become a Senior Software Engineer
30 Skills to Master to Become a Senior Software Engineer30 Skills to Master to Become a Senior Software Engineer
30 Skills to Master to Become a Senior Software Engineer
 
BDM26: Spark Summit 2014 Debriefing
BDM26: Spark Summit 2014 DebriefingBDM26: Spark Summit 2014 Debriefing
BDM26: Spark Summit 2014 Debriefing
 
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and MoreStrata 2015 Data Preview: Spark, Data Visualization, YARN, and More
Strata 2015 Data Preview: Spark, Data Visualization, YARN, and More
 
2013 - Dustin whittle - Escalando PHP en la vida real
2013 - Dustin whittle - Escalando PHP en la vida real2013 - Dustin whittle - Escalando PHP en la vida real
2013 - Dustin whittle - Escalando PHP en la vida real
 
Where do you want to go today 2007
Where do you want to go today   2007Where do you want to go today   2007
Where do you want to go today 2007
 
Java web and application development services
Java web and application development servicesJava web and application development services
Java web and application development services
 
JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...
JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...
JavaOne 2016: Getting Started with Apache Spark: Use Scala, Java, Python, or ...
 
8_reasons_php_developers_love_using_laravel.pptx
8_reasons_php_developers_love_using_laravel.pptx8_reasons_php_developers_love_using_laravel.pptx
8_reasons_php_developers_love_using_laravel.pptx
 
Как да станем софтуерни инженери и да стартираме ИТ бизнес?
Как да станем софтуерни инженери и да стартираме ИТ бизнес?Как да станем софтуерни инженери и да стартираме ИТ бизнес?
Как да станем софтуерни инженери и да стартираме ИТ бизнес?
 
UDG - PHP osnove
UDG - PHP osnoveUDG - PHP osnove
UDG - PHP osnove
 
How To be a Backend developer
How To be a Backend developer    How To be a Backend developer
How To be a Backend developer
 
STLDODN - Get Rid of CRUD faster!
STLDODN - Get Rid of CRUD faster!STLDODN - Get Rid of CRUD faster!
STLDODN - Get Rid of CRUD faster!
 
How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...
How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...
How to Become an SAP ABAP Developer? Career Scope, Salary, Skills, Future Tre...
 
A Technical Driven Seminar
A Technical Driven SeminarA Technical Driven Seminar
A Technical Driven Seminar
 
Where do you want to go today
Where do you want to go todayWhere do you want to go today
Where do you want to go today
 

More from Cheng Feng

Sparkcamp stratasingapore
Sparkcamp stratasingaporeSparkcamp stratasingapore
Sparkcamp stratasingaporeCheng Feng
 
运营商去O浅析 公开版-王晓征
运营商去O浅析 公开版-王晓征运营商去O浅析 公开版-王晓征
运营商去O浅析 公开版-王晓征Cheng Feng
 
数据库架构师做什么 58同城数据库架构设计思路-沈剑
数据库架构师做什么 58同城数据库架构设计思路-沈剑数据库架构师做什么 58同城数据库架构设计思路-沈剑
数据库架构师做什么 58同城数据库架构设计思路-沈剑Cheng Feng
 
Tdsql在微众银行核心交易系统中的实践 雷海林
Tdsql在微众银行核心交易系统中的实践 雷海林Tdsql在微众银行核心交易系统中的实践 雷海林
Tdsql在微众银行核心交易系统中的实践 雷海林Cheng Feng
 
Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏Cheng Feng
 
Inception自动审核系统设计与实现 王竹峰
Inception自动审核系统设计与实现 王竹峰Inception自动审核系统设计与实现 王竹峰
Inception自动审核系统设计与实现 王竹峰Cheng Feng
 
Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏Cheng Feng
 
Epsrcws08 campbell kbm_01
Epsrcws08 campbell kbm_01Epsrcws08 campbell kbm_01
Epsrcws08 campbell kbm_01Cheng Feng
 
Epsrcws08 campbell isvm_01
Epsrcws08 campbell isvm_01Epsrcws08 campbell isvm_01
Epsrcws08 campbell isvm_01Cheng Feng
 

More from Cheng Feng (9)

Sparkcamp stratasingapore
Sparkcamp stratasingaporeSparkcamp stratasingapore
Sparkcamp stratasingapore
 
运营商去O浅析 公开版-王晓征
运营商去O浅析 公开版-王晓征运营商去O浅析 公开版-王晓征
运营商去O浅析 公开版-王晓征
 
数据库架构师做什么 58同城数据库架构设计思路-沈剑
数据库架构师做什么 58同城数据库架构设计思路-沈剑数据库架构师做什么 58同城数据库架构设计思路-沈剑
数据库架构师做什么 58同城数据库架构设计思路-沈剑
 
Tdsql在微众银行核心交易系统中的实践 雷海林
Tdsql在微众银行核心交易系统中的实践 雷海林Tdsql在微众银行核心交易系统中的实践 雷海林
Tdsql在微众银行核心交易系统中的实践 雷海林
 
Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏
 
Inception自动审核系统设计与实现 王竹峰
Inception自动审核系统设计与实现 王竹峰Inception自动审核系统设计与实现 王竹峰
Inception自动审核系统设计与实现 王竹峰
 
Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏Maria db新特性剖析 京东张金鹏
Maria db新特性剖析 京东张金鹏
 
Epsrcws08 campbell kbm_01
Epsrcws08 campbell kbm_01Epsrcws08 campbell kbm_01
Epsrcws08 campbell kbm_01
 
Epsrcws08 campbell isvm_01
Epsrcws08 campbell isvm_01Epsrcws08 campbell isvm_01
Epsrcws08 campbell isvm_01
 

Recently uploaded

Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts servicesonalikaur4
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)Damian Radcliffe
 
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxellan12
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Delhi Call girls
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...APNIC
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607dollysharma2066
 
VIP Call Girls Pune Madhuri 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Madhuri 8617697112 Independent Escort Service PuneVIP Call Girls Pune Madhuri 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Madhuri 8617697112 Independent Escort Service PuneCall girls in Ahmedabad High profile
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsThierry TROUIN ☁
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012rehmti665
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...aditipandeya
 
Gram Darshan PPT cyber rural in villages of india
Gram Darshan PPT cyber rural  in villages of indiaGram Darshan PPT cyber rural  in villages of india
Gram Darshan PPT cyber rural in villages of indiaimessage0108
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Sheetaleventcompany
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Servicesexy call girls service in goa
 
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkataanamikaraghav4
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsstephieert
 

Recently uploaded (20)

Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts serviceChennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
Chennai Call Girls Porur Phone 🍆 8250192130 👅 celebrity escorts service
 
How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)How is AI changing journalism? (v. April 2024)
How is AI changing journalism? (v. April 2024)
 
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Ishita 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Ishita 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptxAWS Community DAY Albertini-Ellan Cloud Security (1).pptx
AWS Community DAY Albertini-Ellan Cloud Security (1).pptx
 
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
Best VIP Call Girls Noida Sector 75 Call Me: 8448380779
 
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 6 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
'Future Evolution of the Internet' delivered by Geoff Huston at Everything Op...
 
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Rohini 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
FULL ENJOY Call Girls In Mayur Vihar Delhi Contact Us 8377087607
 
VIP Call Girls Pune Madhuri 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Madhuri 8617697112 Independent Escort Service PuneVIP Call Girls Pune Madhuri 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Madhuri 8617697112 Independent Escort Service Pune
 
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls KolkataVIP Call Girls Kolkata Ananya 🤌  8250192130 🚀 Vip Call Girls Kolkata
VIP Call Girls Kolkata Ananya 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
AlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with FlowsAlbaniaDreamin24 - How to easily use an API with Flows
AlbaniaDreamin24 - How to easily use an API with Flows
 
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
Call Girls South Delhi Delhi reach out to us at ☎ 9711199012
 
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
VIP 7001035870 Find & Meet Hyderabad Call Girls Dilsukhnagar high-profile Cal...
 
Gram Darshan PPT cyber rural in villages of india
Gram Darshan PPT cyber rural  in villages of indiaGram Darshan PPT cyber rural  in villages of india
Gram Darshan PPT cyber rural in villages of india
 
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
Call Girls Service Chandigarh Lucky ❤️ 7710465962 Independent Call Girls In C...
 
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine ServiceHot Service (+9316020077 ) Goa  Call Girls Real Photos and Genuine Service
Hot Service (+9316020077 ) Goa Call Girls Real Photos and Genuine Service
 
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls KolkataRussian Call Girls in Kolkata Samaira 🤌  8250192130 🚀 Vip Call Girls Kolkata
Russian Call Girls in Kolkata Samaira 🤌 8250192130 🚀 Vip Call Girls Kolkata
 
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No AdvanceRohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
Rohini Sector 22 Call Girls Delhi 9999965857 @Sabina Saikh No Advance
 
Radiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girlsRadiant Call girls in Dubai O56338O268 Dubai Call girls
Radiant Call girls in Dubai O56338O268 Dubai Call girls
 

Strata singapore survey

  • 1. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API Yes 28 27.7% No 73 72.3% Core Spark 70 69.3% Spark SQL + DataFrames 78 77.2% Spark Streaming 66 65.3% MLlib (machine learning) 72 71.3% GraphX 30 29.7% Zero Knowledge 44 43.6% Beginner 47 46.5% Medium 9 8.9% Expert 1 1% 101 responses Summary Have you edited Wikipedia articles before? Which of the following Spark components are you mostly interested in using after class? Scala [Which programming language API of Spark are you most comfortable in?] Java [Which programming language API of Spark are you most comfortable in?] 72.3% 27.7% 0 15 30 45 60 75 Core Spark Spark SQL +… Spark Stream… MLlib (machin… GraphX 0 10 20 30 40 Zero Knowled… Beginner Medium Expert SIGN IN The version of the browser you are using is no longer supported. Please upgrade to a supported browser. Dismiss
  • 2. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API Zero Knowledge 18 17.8% Beginner 21 20.8% Medium 41 40.6% Expert 21 20.8% Zero Knowledge 30 29.7% Beginner 31 30.7% Medium 27 26.7% Expert 13 12.9% Zero Knowledge 6 5.9% Beginner 16 15.8% Medium 54 53.5% Expert 25 24.8% Zero Knowledge 33 32.7% Beginner 43 42.6% Medium 19 18.8% Expert 6 5.9% Development (how to write Spark apps, API coverage, debugging) 86 85.1% Python [Which programming language API of Spark are you most comfortable in?] SQL [Which programming language API of Spark are you most comfortable in?] R [Which programming language API of Spark are you most comfortable in?] I would like the focus of the class to be: 0 10 20 30 40 Zero Knowled… Beginner Medium Expert 0.0 7.5 15.0 22.5 30.0 Zero Knowled… Beginner Medium Expert 0 10 20 30 40 50 Zero Knowled… Beginner Medium Expert 0 10 20 30 40 Zero Knowled… Beginner Medium Expert
  • 3. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API Administration / Ops (how Spark scales, configuration parameters, tuning) 39 38.6% Architecture (how the JVMs interact with each other, Spark Standalone, YARN integration, etc) 62 61.4% Use Cases (non-technical section on how companies are using Spark) 55 54.5% Level 0: I am a totally new to Spark 50 49.5% Level 1: I have launched the Spark shell and executed a few transformations & actions and looked at the Spark UIs 32 31.7% Level 2: I have either written 100+ lines of code for a Spark application or I understand the following: what narrow vs wide dependencies are, how to figure out which transformations cause a shuffle 15 14.9% Level 3: I have been using Spark greater than 50% of the time in my job for over 2 months in either a development or administration role 4 4% Level 4: I have contributed at least 20 lines of code to the Apache Spark project 0 0% Class day will be my first hands-on exposure to programming in Spark 52 51.5% I have been playing with the Spark shells for less than a week 26 25.7% I have under 1 month of experience with Spark 7 6.9% I have 1 - 6 months of experience with Spark 13 12.9% 6 - 12 months 2 2% 1+ year 1 1% How experienced are you with Spark? For how long have you been doing hands-on Development or Operations work with Apache Spark? Where are you in the Spark usage lifecycle? 0 20 40 60 80 Development… Administratio… Architecture (… Use Cases (no… 14.9% 31.7% 49.5% 25.7% 51.5%
  • 4. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API Just starting to learn about Spark, reading about it... 61 60.4% I have a small 1-node Spark cluster or VM that I'm playing around with 15 14.9% I am currently building a Proof of Concept or Prototype to demonstrate a use case 21 20.8% We are in production! 4 4% Please select which of the following Big Data technologies you have at least "medium" level technical proficiency in: 20.8% 14.9% 60.4%
  • 5. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API HDFS 64 63.4% MapReduce 50 49.5% YARN 31 30.7% Mesos 2 2% Cascading 7 6.9% Kafka 19 18.8% Storm 9 8.9% Flume 12 11.9% HBase 20 19.8% Cassandra 11 10.9% Hive 42 41.6% Impala 13 12.9% Pig 24 23.8% Parquet 16 15.8% ZooKeeper 20 19.8% MongoDB 26 25.7% Couchbase 4 4% Neo4j 5 5% Titan 0 0% Oozie 16 15.8% Sqoop 17 16.8% Giraph or Graphlab 2 2% Accumulo 0 0% Phoenix 3 3% Tez 7 6.9% ElasticSearch 15 14.9% Lucene / Solr 19 18.8% Math: Statistics, Linear Algebra, Calculus, Matrix math, etc 46 45.5% 0 15 30 45 60 HDFS MapReduce YARN Mesos Cascading Kafka Storm Flume HBase Cassandra Hive Impala Pig Parquet ZooKeeper MongoDB Couchbase Neo4j Titan Oozie Sqoop Giraph or… Accumulo Phoenix Tez ElasticSe… Lucene /… Math: Sta…
  • 6. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API with Databricks Cloud 6 5.9% with Hadoop (YARN/HDFS) 55 54.5% with Cassandra (Standalone mode) 10 9.9% with pure Apache Spark (Standalone mode) 21 20.8% with Mesos 6 5.9% I don't know yet 37 36.6% Within Amazon Cloud 31 30.7% On-premise within our private data center 73 72.3% A different cloud provider 17 16.8% AmpCamp training at UC Berkeley (Academic) 0 0% SparkCamp training from Databricks (Industry) 3 3% Cloudera Spark training 2 2% Another vendor's Spark training 2 2% Spark Summit conference 2 2% None of the above 94 93.1% How are you planning on deploying Spark within your organization? Where do you plan on deploying Spark clusters for your organization? Which of the following Spark training sessions, if any, have you attended before? Which industry do you work in? 0 10 20 30 40 50 w ith Databric… w ith Hadoop… w ith Cassan… w ith pure Ap… w ith Mesos I don't know yet Within Amazo… On-premise w… A different cl… 0 20 40 60 80 AmpCamp tra… SparkCamp tr… Cloudera Spa… Another ven… Spark Summit… None of the a…
  • 7. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API IT / Systems / Solution Provider / IT Consultancy 53 52.5% Banking / Finance 17 16.8% Science & Technology 8 7.9% Academia / University / Education 2 2% Advertising / Marketing / PR 3 3% Telecommunications 8 7.9% Healthcare / Medical / Pharmaceuticals 5 5% Publishing / Media 4 4% Retailer / Distributor / Wholesale 2 2% Government 6 5.9% Insurance 0 0% Legal 2 2% Manufacturing / Design 6 5.9% Nonprofit 2 2% Business Services Consulting (Non-IT) 3 3% Other 6 5.9% Developer / Software Engineer / Software Architect 56 55.4% Administrator / Operations / DevOps 8 7.9% Data Scientist / Statistics / Machine Learning 40 39.6% Management / Executive 8 7.9% Sales / Marketing 3 3% Other 5 5% Which of the following job categories best describes your role at your company? How far did you travel from to attend this class? 0 10 20 30 40 50 IT / System… Banking / Fi… Science &… Academia /… Advertising… Telecomm… Healthcare… Publishing… Retailer / Di… Government Insurance Legal Manufacturi… Nonprofit Business S… Other 0 10 20 30 40 50 Developer / S… Administrator… Data Scientis… Management… Sales / Marke… Other
  • 8. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API Singapore: I live in Singapore already 54 53.5% USA: I live in the Western half of the United States (like San Francisco, Seattle, Denver, Portland) 0 0% USA: I live in the Eastern half of United States (NYC, D.C., Atlanta, etc) 1 1% INTERNATIONAL: I flew in from an Asian country like Japan, China, India, South Korea, etc 35 34.7% INTERNATIONAL: I am coming from a European country 4 4% INTERNATIONAL: Other 7 6.9% OPTIONAL: Finally, please freely describe your experience with Spark so far. RDD vs DataFrames; which one to focus on I am beginer. We are exploring Apache spark to implement some of the use cases in our organization. Fast & simpler than MR Interesting huge amount of data class loader problems. :( Australia we are going to implement Spark in our current project I have been exploring spark mostly from Hadoop Data Processing OPTIONAL: Is there anything you want to communicate to the instructor? Want to hear more on Real-Time Architecture I have heard about the limitations of dataframes of 22 columns due to the tupes limitations. How do you overcome this? Thank you Slow if it is possible as I am new to Spark Could you talk about the trade-offs between developing in RDDs vs Dataframes? Data frames are great and reduce development time, but are RDDs significantly faster? Also, could you also talk about the trade-offs between developing in Scala vs Python? Python is more easily maintainable but lags behind Scala in terms of Spark release. Nothing , as of now Number of daily responses 34.7% 53.5% 0 30 60 90 120
  • 9. pdfcrowd.comopen in browser PRO version Are you a developer? Try out the HTML to PDF API