Fire in the Sky: An Introduction to Monitoring Apache Spark in the Cloud with Michael McCune

Spark Summit
Spark SummitSpark Summit
FIRE IN THE SKY
AN INTRODUCTION TO MONITORING APACHE SPARK
IN THE CLOUD
Michael McCune - msm@redhat.com
1
WHY MONITOR SPARK?
2
WHAT ABOUT LOGS?
spark-submit --master=spark://cluster:7077 app.py
17/10/13 13:33:03 INFO BlockManager: BlockManager stopped
17/10/13 13:33:03 INFO BlockManagerMaster: BlockManagerMaster stopped
17/10/13 13:33:03 INFO OutputCommitCoordinator$OutputCommitCoordinatorEndpoint: OutputCommitCoordinator stopped!
17/10/13 13:33:03 INFO SparkContext: Successfully stopped SparkContext
10.0.1.113 - - [13/Oct/2017 13:33:04] "GET /sparkpi?scale=1000 HTTP/1.1" 200 -
Traceback (most recent call last):
File "/home/mike/workspace/github/radanalyticsio/tutorial-sparkpi-python-flask/app.py", line 41, in
app.run(host='0.0.0.0', port=port)
File "/home/mike/.venvs/sparkpi/lib/python2.7/site-packages/flask/app.py", line 841, in run
run_simple(host, port, self, **options)
File "/home/mike/.venvs/sparkpi/lib/python2.7/site-packages/werkzeug/serving.py", line 739, in run_simple
inner()
File "/home/mike/.venvs/sparkpi/lib/python2.7/site-packages/werkzeug/serving.py", line 702, in inner
srv.serve_forever()
File "/home/mike/.venvs/sparkpi/lib/python2.7/site-packages/werkzeug/serving.py", line 539, in serve_forever
HTTPServer.serve_forever(self)
File "/usr/lib64/python2.7/SocketServer.py", line 231, in serve_forever
poll_interval)
File "/usr/lib64/python2.7/SocketServer.py", line 150, in _eintr_retry
return func(*args)
File "/home/mike/opt/other-spark/python/lib/pyspark.zip/pyspark/context.py", line 236, in signal_handler
File "/home/mike/opt/other-spark/python/lib/pyspark.zip/pyspark/context.py", line 962, in cancelAllJobs
AttributeError: 'NoneType' object has no attribute 'sc'
17/10/13 13:37:30 INFO ShutdownHookManager: Shutdown hook called
17/10/13 13:37:30 INFO ShutdownHookManager: Deleting directory /tmp/spark-c9989888-5190-4d70-9d20-5960ede510e4/pysp
17/10/13 13:37:30 INFO ShutdownHookManager: Deleting directory /tmp/spark-c9989888-5190-4d70-9d20-5960ede510e4
3
LET'S TALK METRICS
4
BASIC CONFIGURATION
metrics.properties
master.source.jvm.class=org.apache.spark.metrics.source.JvmSource
worker.source.jvm.class=org.apache.spark.metrics.source.JvmSource
driver.source.jvm.class=org.apache.spark.metrics.source.JvmSource
executor.source.jvm.class=org.apache.spark.metrics.source.JvmSource
5
METRICS SERVLET
GET /metrics/master/json/
{
"gauges": {
"jvm.PS-MarkSweep.count": {
"value": 1
},
"jvm.PS-MarkSweep.time": {
"value": 21
},
"jvm.PS-Scavenge.count": {
"value": 3
},
"jvm.PS-Scavenge.time": {
"value": 18
},
"jvm.heap.committed": {
"value": 235405312
6
METRICS SERVLET
GET /metrics/applications/json/
GET /metrics/json/
7
CONSOLE SINK
metrics.properties
*.sink.console.class=org.apache.spark.metrics.sink.ConsoleSink
*.sink.console.period=5
*.sink.console.unit=seconds
8
CONSOLE SINK
10/12/17 4:59:54 PM ======================================================
-- Gauges ----------------------------------------------------------------
jvm.PS-MarkSweep.count
value = 1
jvm.PS-MarkSweep.time
value = 23
jvm.PS-Scavenge.count
value = 4
jvm.PS-Scavenge.time
value = 31
jvm.heap.committed
value = 198705152
jvm.heap.init
value = 253755392
jvm.heap.max
value = 954728448
jvm.heap.usage
9
SLF4J SINK
metrics.properties
*.sink.slf4j.class=org.apache.spark.metrics.sink.Slf4jSink
*.sink.slf4j.period=5
*.sink.slf4j.unit=seconds
10
SLF4J SINK
17/10/14 11:47:22 INFO Master: I have been elected leader! New state: ALIVE
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.PS-MarkSweep.count, value=1
17/10/14 11:47:25 INFO metrics: type=COUNTER, name=HiveExternalCatalog.fileCacheHits, count=0
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.PS-MarkSweep.time, value=20
17/10/14 11:47:25 INFO metrics: type=COUNTER, name=HiveExternalCatalog.filesDiscovered, count=0
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.PS-Scavenge.count, value=3
17/10/14 11:47:25 INFO metrics: type=COUNTER, name=HiveExternalCatalog.hiveClientCalls, count=0
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.PS-Scavenge.time, value=17
17/10/14 11:47:25 INFO metrics: type=COUNTER, name=HiveExternalCatalog.parallelListingJobCount, count=0
17/10/14 11:47:25 INFO metrics: type=COUNTER, name=HiveExternalCatalog.partitionsFetched, count=0
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.heap.committed, value=171442176
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.heap.init, value=253755392
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.heap.max, value=954728448
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.heap.usage, value=0.04117022393366454
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.heap.used, value=39306384
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.non-heap.committed, value=34996224
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.non-heap.init, value=2555904
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.non-heap.max, value=-1
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.non-heap.usage, value=-3.4380496E7
17/10/14 11:47:25 INFO metrics: type=HISTOGRAM, name=CodeGenerator.compilationTime, count=0, min=0, max
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.non-heap.used, value=34381136
17/10/14 11:47:25 INFO metrics: type=HISTOGRAM, name=CodeGenerator.generatedClassSize, count=0, min=0,
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.pools.Code-Cache.committed, value=5111808
17/10/14 11:47:25 INFO metrics: type=HISTOGRAM, name=CodeGenerator.generatedMethodSize, count=0, min=0,
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.pools.Code-Cache.init, value=2555904
17/10/14 11:47:25 INFO metrics: type=HISTOGRAM, name=CodeGenerator.sourceCodeSize, count=0, min=0, max=
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.pools.Code-Cache.max, value=251658240
17/10/14 11:47:25 INFO metrics: type=GAUGE, name=jvm.pools.Code-Cache.usage, value=0.019977823893229166 11
CSV SINK
metrics.properties
*.sink.csv.class=org.apache.spark.metrics.sink.CsvSink
*.sink.csv.period=1
*.sink.csv.unit=seconds
*.sink.csv.directory=/tmp/spark/csv/master/
12
CSV SINK
[mike@opb-ultra] /tmp/spark/csv/master
$ ls
application.PythonPi.1507910958999.cores.csv
application.PythonPi.1507910958999.runtime_ms.csv
application.PythonPi.1507910958999.status.csv
CodeGenerator.compilationTime.csv
CodeGenerator.generatedClassSize.csv
CodeGenerator.generatedMethodSize.csv
CodeGenerator.sourceCodeSize.csv
HiveExternalCatalog.fileCacheHits.csv
HiveExternalCatalog.filesDiscovered.csv
HiveExternalCatalog.hiveClientCalls.csv
HiveExternalCatalog.parallelListingJobCount.csv
HiveExternalCatalog.partitionsFetched.csv
jvm.heap.committed.csv
jvm.heap.init.csv
jvm.heap.max.csv
jvm.heap.usage.csv
jvm.heap.used.csv
jvm.non-heap.committed.csv
13
CSV SINK
jvm.PS-MarkSweep.count.csv
t,value
1507910949,1
1507910950,1
1507910951,1
1507910952,1
1507910953,1
1507910954,1
1507910955,1
1507910956,1
1507910957,1
1507910958,1
1507910959,1
1507910960,1
1507910961,1
1507910962,1
1507910963,1
1507910964,1
14
JMX SINK
metrics.properties
spark-defaults.conf
*.sink.jmx.class=org.apache.spark.metrics.sink.JmxSink
spark.driver.extraJavaOptions 
-Dcom.sun.management.jmxremote
-Dcom.sun.management.jmxremote.authenticate=false
-Dcom.sun.management.jmxremote.ssl=false
-Dcom.sun.management.jmxremote.port=19150
-Dcom.sun.management.jmxremote.local.only=false
-Djava.rmi.server.hostname=127.0.0.1
-Dcom.sun.management.jmxremote.rmi.port=19151
15
JMX SINK
jconsole
16
JMX SINK
Jolokia Java Agent
spark-class -javaagent:jolokia-jvm-agent.jar=port=19150,host=127.0.0.1
17
JMX SINK
GET /jolokia/
{
"request": {
"type": "version"
},
"status": 200,
"timestamp": 1507946690,
"value": {
"agent": "1.3.7",
"config": {
"agentContext": "/jolokia",
"agentId": "10.0.1.113-7868-4926097b-jvm",
"agentType": "jvm",
"debug": "false",
"debugMaxEntries": "100",
"discoveryEnabled": "true",
"historyMaxEntries": "10",
"maxCollectionSize": "0",
"maxDepth": "15",
"maxObjects": "0"
},
"info": {
"product": "jetty",
"vendor": "Mortbay",
"version": "6.1.26"
},
"protocol": "7.2"
}
} 18
JMX SINK
GET /jolokia/read/metrics:name=jvm.PS-MarkSweep.count
{
"request": {
"mbean": "metrics:name=jvm.PS-MarkSweep.count",
"type": "read"
},
"status": 200,
"timestamp": 1507946213,
"value": {
"Value": 1
}
}
19
GRAPHITE SINK
metrics.properties
*.sink.graphite.class=org.apache.spark.metrics.sink.GraphiteSink
*.sink.graphite.host=127.0.0.1
*.sink.graphite.port=2003
*.sink.graphite.period=1
*.sink.graphite.unit=seconds
20
GRAPHITE SINK
21
GRAPHITE SINK
/render?target=jvm.heap.usage&from=-3minutes
22
GRAPHITE SINK
/metrics/index.json
[
"jvm.PS-MarkSweep.count",
"jvm.PS-MarkSweep.time",
"jvm.PS-Scavenge.count",
"jvm.PS-Scavenge.time",
"jvm.heap.committed",
"jvm.heap.init",
"jvm.heap.max",
"jvm.heap.usage",
"jvm.heap.used",
"jvm.non-heap.committed",
"jvm.non-heap.init",
"jvm.non-heap.max",
"jvm.non-heap.usage",
"jvm.non-heap.used",
"jvm.pools.Code-Cache.committed",
23
GRAPHITE SINK
/render?target=jvm.PS-MarkSweep.count&format=json
[
{
"datapoints": [
[
null,
1507947840
],
[
1.0,
1507947900
],
[
1.0,
1507947960
]
],
"target": "jvm.PS-MarkSweep.count"
24
MUCH MORE TO DISCOVER
app-*-*.driver.BlockManager.memory.*
app-*-*.driver.CodeGenerator.compilationTime.*
app-*-*.driver.CodeGenerator.generatedClassSize.*
app-*-*.driver.CodeGenerator.generatedMethodSize.*
app-*-*.driver.CodeGenerator.sourceCodeSize.*
app-*-*.driver.DAGScheduler.job.*
app-*-*.driver.DAGScheduler.messageProcessingTime.*
app-*-*.driver.DAGScheduler.stage.*
application.PythonPi.*.cores
application.PythonPi.*.runtime_ms
jvm.total.committed
jvm.total.init
jvm.total.max
jvm.total.used
worker.coresFree
worker.coresUsed
worker.executors
worker.memFree_MB
25
26
WHAT IS CLOUD NATIVE?
Containerized
Dynamically orchestrated
Microservice oriented
cncf.io/about/faq
27
TRADITIONAL ORCHESTRATION
28 . 1
TRADITIONAL ORCHESTRATION
28 . 2
CLOUD NATIVE ORCHESTRATION
29 . 1
CLOUD NATIVE ORCHESTRATION
29 . 2
CLOUD MONITORING
30
SCRAPING
31 . 1
SCRAPING
rhuss/jolokia
prometheus/jmx_exporter
fabric8io/agent-bond
31 . 2
BROADCAST
32 . 1
BROADCAST
erikerlandson/spark-kafka-sink
32 . 2
HYBRID
33 . 1
HYBRID
33 . 2
HYBRID
radanalyticsio/scorpion-stare
33 . 3
CHALLENGES
34 . 1
CHALLENGES
34 . 2
CHALLENGES
34 . 3
CHALLENGES
34 . 4
WHAT WILL YOU BUILD?
35
THANK YOU
STAY IN TOUCH
@FOSSjunkie
CHECK OUT OUR PROJECTS
https://radanalytics.io
36
1 of 45

Recommended

Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia by
Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital KediaTuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia
Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital KediaDatabricks
11.7K views32 slides
How to Automate Performance Tuning for Apache Spark by
How to Automate Performance Tuning for Apache SparkHow to Automate Performance Tuning for Apache Spark
How to Automate Performance Tuning for Apache SparkDatabricks
1.8K views36 slides
Monitor Apache Spark 3 on Kubernetes using Metrics and Plugins by
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsMonitor Apache Spark 3 on Kubernetes using Metrics and Plugins
Monitor Apache Spark 3 on Kubernetes using Metrics and PluginsDatabricks
685 views28 slides
A Deep Dive into Query Execution Engine of Spark SQL by
A Deep Dive into Query Execution Engine of Spark SQLA Deep Dive into Query Execution Engine of Spark SQL
A Deep Dive into Query Execution Engine of Spark SQLDatabricks
6.6K views88 slides
Apache Spark At Scale in the Cloud by
Apache Spark At Scale in the CloudApache Spark At Scale in the Cloud
Apache Spark At Scale in the CloudDatabricks
3.2K views76 slides
LSFMM 2019 BPF Observability by
LSFMM 2019 BPF ObservabilityLSFMM 2019 BPF Observability
LSFMM 2019 BPF ObservabilityBrendan Gregg
8.3K views67 slides

More Related Content

What's hot

Building a SIMD Supported Vectorized Native Engine for Spark SQL by
Building a SIMD Supported Vectorized Native Engine for Spark SQLBuilding a SIMD Supported Vectorized Native Engine for Spark SQL
Building a SIMD Supported Vectorized Native Engine for Spark SQLDatabricks
725 views28 slides
Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom) by
Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom)Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom)
Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom)Flink Forward
627 views82 slides
Photon Technical Deep Dive: How to Think Vectorized by
Photon Technical Deep Dive: How to Think VectorizedPhoton Technical Deep Dive: How to Think Vectorized
Photon Technical Deep Dive: How to Think VectorizedDatabricks
1.4K views38 slides
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b... by
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...Spark Summit
3.8K views20 slides
Top 5 Mistakes When Writing Spark Applications by
Top 5 Mistakes When Writing Spark ApplicationsTop 5 Mistakes When Writing Spark Applications
Top 5 Mistakes When Writing Spark ApplicationsSpark Summit
26.4K views75 slides
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service by
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a ServiceZeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a ServiceDatabricks
1.7K views43 slides

What's hot(20)

Building a SIMD Supported Vectorized Native Engine for Spark SQL by Databricks
Building a SIMD Supported Vectorized Native Engine for Spark SQLBuilding a SIMD Supported Vectorized Native Engine for Spark SQL
Building a SIMD Supported Vectorized Native Engine for Spark SQL
Databricks725 views
Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom) by Flink Forward
Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom)Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom)
Do flink on web with flow - Dongwon Kim & Haemee park, SK Telecom)
Flink Forward627 views
Photon Technical Deep Dive: How to Think Vectorized by Databricks
Photon Technical Deep Dive: How to Think VectorizedPhoton Technical Deep Dive: How to Think Vectorized
Photon Technical Deep Dive: How to Think Vectorized
Databricks1.4K views
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b... by Spark Summit
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...
ModelDB: A System to Manage Machine Learning Models: Spark Summit East talk b...
Spark Summit3.8K views
Top 5 Mistakes When Writing Spark Applications by Spark Summit
Top 5 Mistakes When Writing Spark ApplicationsTop 5 Mistakes When Writing Spark Applications
Top 5 Mistakes When Writing Spark Applications
Spark Summit26.4K views
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service by Databricks
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a ServiceZeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Zeus: Uber’s Highly Scalable and Distributed Shuffle as a Service
Databricks1.7K views
Why Spark Is the Next Top (Compute) Model by Dean Wampler
Why Spark Is the Next Top (Compute) ModelWhy Spark Is the Next Top (Compute) Model
Why Spark Is the Next Top (Compute) Model
Dean Wampler24.5K views
Understanding Memory Management In Spark For Fun And Profit by Spark Summit
Understanding Memory Management In Spark For Fun And ProfitUnderstanding Memory Management In Spark For Fun And Profit
Understanding Memory Management In Spark For Fun And Profit
Spark Summit23K views
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17 by spark-project
Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17Deep Dive with Spark Streaming - Tathagata  Das - Spark Meetup 2013-06-17
Deep Dive with Spark Streaming - Tathagata Das - Spark Meetup 2013-06-17
spark-project53.4K views
Kafka Summit NYC 2017 - Singe Message Transforms are not the Transformations ... by confluent
Kafka Summit NYC 2017 - Singe Message Transforms are not the Transformations ...Kafka Summit NYC 2017 - Singe Message Transforms are not the Transformations ...
Kafka Summit NYC 2017 - Singe Message Transforms are not the Transformations ...
confluent9K views
SparkSQL: A Compiler from Queries to RDDs by Databricks
SparkSQL: A Compiler from Queries to RDDsSparkSQL: A Compiler from Queries to RDDs
SparkSQL: A Compiler from Queries to RDDs
Databricks6.1K views
Autoscaling Flink with Reactive Mode by Flink Forward
Autoscaling Flink with Reactive ModeAutoscaling Flink with Reactive Mode
Autoscaling Flink with Reactive Mode
Flink Forward921 views
Kafka replication apachecon_2013 by Jun Rao
Kafka replication apachecon_2013Kafka replication apachecon_2013
Kafka replication apachecon_2013
Jun Rao21.3K views
Evening out the uneven: dealing with skew in Flink by Flink Forward
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
Flink Forward2.5K views
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas by Flink Forward
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy FarkasVirtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Virtual Flink Forward 2020: Autoscaling Flink at Netflix - Timothy Farkas
Flink Forward1.1K views
What is New with Apache Spark Performance Monitoring in Spark 3.0 by Databricks
What is New with Apache Spark Performance Monitoring in Spark 3.0What is New with Apache Spark Performance Monitoring in Spark 3.0
What is New with Apache Spark Performance Monitoring in Spark 3.0
Databricks441 views
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli... by Flink Forward
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Dynamically Scaling Data Streams across Multiple Kafka Clusters with Zero Fli...
Flink Forward264 views
Physical Plans in Spark SQL by Databricks
Physical Plans in Spark SQLPhysical Plans in Spark SQL
Physical Plans in Spark SQL
Databricks7K views
Kernel Recipes 2017: Using Linux perf at Netflix by Brendan Gregg
Kernel Recipes 2017: Using Linux perf at NetflixKernel Recipes 2017: Using Linux perf at Netflix
Kernel Recipes 2017: Using Linux perf at Netflix
Brendan Gregg1.5M views
Performance Troubleshooting Using Apache Spark Metrics by Databricks
Performance Troubleshooting Using Apache Spark MetricsPerformance Troubleshooting Using Apache Spark Metrics
Performance Troubleshooting Using Apache Spark Metrics
Databricks3.5K views

Similar to Fire in the Sky: An Introduction to Monitoring Apache Spark in the Cloud with Michael McCune

Keynote 1 - Engineering Software Analytics Studies by
Keynote 1 - Engineering Software Analytics StudiesKeynote 1 - Engineering Software Analytics Studies
Keynote 1 - Engineering Software Analytics StudiesESEM 2014
413 views70 slides
How to see the event and audit logs through ( gui and cli) in cluster ontap n... by
How to see the event and audit logs through ( gui and cli) in cluster ontap n...How to see the event and audit logs through ( gui and cli) in cluster ontap n...
How to see the event and audit logs through ( gui and cli) in cluster ontap n...Saroj Sahu
2K views6 slides
Linux Tracing Superpowers by Eugene Pirogov by
Linux Tracing Superpowers by Eugene PirogovLinux Tracing Superpowers by Eugene Pirogov
Linux Tracing Superpowers by Eugene PirogovPivorak MeetUp
612 views161 slides
Grâce aux tags Varnish, j'ai switché ma prod sur Raspberry Pi by
Grâce aux tags Varnish, j'ai switché ma prod sur Raspberry PiGrâce aux tags Varnish, j'ai switché ma prod sur Raspberry Pi
Grâce aux tags Varnish, j'ai switché ma prod sur Raspberry PiJérémy Derussé
2.6K views29 slides
Parrot Drones Hijacking by
Parrot Drones HijackingParrot Drones Hijacking
Parrot Drones HijackingPriyanka Aash
1.3K views43 slides
Bringing choas to order in your node.js app by
Bringing choas to order in your node.js appBringing choas to order in your node.js app
Bringing choas to order in your node.js appDan Jenkins
2.9K views54 slides

Similar to Fire in the Sky: An Introduction to Monitoring Apache Spark in the Cloud with Michael McCune(20)

Keynote 1 - Engineering Software Analytics Studies by ESEM 2014
Keynote 1 - Engineering Software Analytics StudiesKeynote 1 - Engineering Software Analytics Studies
Keynote 1 - Engineering Software Analytics Studies
ESEM 2014413 views
How to see the event and audit logs through ( gui and cli) in cluster ontap n... by Saroj Sahu
How to see the event and audit logs through ( gui and cli) in cluster ontap n...How to see the event and audit logs through ( gui and cli) in cluster ontap n...
How to see the event and audit logs through ( gui and cli) in cluster ontap n...
Saroj Sahu2K views
Linux Tracing Superpowers by Eugene Pirogov by Pivorak MeetUp
Linux Tracing Superpowers by Eugene PirogovLinux Tracing Superpowers by Eugene Pirogov
Linux Tracing Superpowers by Eugene Pirogov
Pivorak MeetUp612 views
Grâce aux tags Varnish, j'ai switché ma prod sur Raspberry Pi by Jérémy Derussé
Grâce aux tags Varnish, j'ai switché ma prod sur Raspberry PiGrâce aux tags Varnish, j'ai switché ma prod sur Raspberry Pi
Grâce aux tags Varnish, j'ai switché ma prod sur Raspberry Pi
Jérémy Derussé2.6K views
Parrot Drones Hijacking by Priyanka Aash
Parrot Drones HijackingParrot Drones Hijacking
Parrot Drones Hijacking
Priyanka Aash1.3K views
Bringing choas to order in your node.js app by Dan Jenkins
Bringing choas to order in your node.js appBringing choas to order in your node.js app
Bringing choas to order in your node.js app
Dan Jenkins2.9K views
Alexander Reelsen - Seccomp for Developers by DevDay Dresden
Alexander Reelsen - Seccomp for DevelopersAlexander Reelsen - Seccomp for Developers
Alexander Reelsen - Seccomp for Developers
DevDay Dresden116 views
Container: is it safe enough to run you application? by Aleksey Zalesov
Container: is it safe enough to run you application?Container: is it safe enough to run you application?
Container: is it safe enough to run you application?
Aleksey Zalesov742 views
Nginx Scripting - Extending Nginx Functionalities with Lua by Tony Fabeen
Nginx Scripting - Extending Nginx Functionalities with LuaNginx Scripting - Extending Nginx Functionalities with Lua
Nginx Scripting - Extending Nginx Functionalities with Lua
Tony Fabeen6.4K views
Devinsampa nginx-scripting by Tony Fabeen
Devinsampa nginx-scriptingDevinsampa nginx-scripting
Devinsampa nginx-scripting
Tony Fabeen1.8K views
도커 없이 컨테이너 만들기 4편 네트워크네임스페이스 (2) by Sam Kim
도커 없이 컨테이너 만들기 4편 네트워크네임스페이스 (2)도커 없이 컨테이너 만들기 4편 네트워크네임스페이스 (2)
도커 없이 컨테이너 만들기 4편 네트워크네임스페이스 (2)
Sam Kim265 views
Frontend Performance: Beginner to Expert to Crazy Person by Philip Tellis
Frontend Performance: Beginner to Expert to Crazy PersonFrontend Performance: Beginner to Expert to Crazy Person
Frontend Performance: Beginner to Expert to Crazy Person
Philip Tellis1.9K views
Bare metal performance in Elixir by Aaron Seigo
Bare metal performance in ElixirBare metal performance in Elixir
Bare metal performance in Elixir
Aaron Seigo141 views
Kubernetes Basic Operation by Simon Su
Kubernetes Basic OperationKubernetes Basic Operation
Kubernetes Basic Operation
Simon Su514 views
Riot Games Scalable Data Warehouse Lecture at UCSB / UCLA by sean_seannery
Riot Games Scalable Data Warehouse Lecture at UCSB / UCLARiot Games Scalable Data Warehouse Lecture at UCSB / UCLA
Riot Games Scalable Data Warehouse Lecture at UCSB / UCLA
sean_seannery7.7K views
Efficient System Monitoring in Cloud Native Environments by Gergely Szabó
Efficient System Monitoring in Cloud Native EnvironmentsEfficient System Monitoring in Cloud Native Environments
Efficient System Monitoring in Cloud Native Environments
Gergely Szabó123 views
Visual Security Event Analysis - DefCon 13 - 2005 by Raffael Marty
Visual Security Event Analysis - DefCon 13 - 2005Visual Security Event Analysis - DefCon 13 - 2005
Visual Security Event Analysis - DefCon 13 - 2005
Raffael Marty1.2K views
Leveraging APIs without Programming in FME Server Provisioning by Safe Software
Leveraging APIs without Programming in FME Server ProvisioningLeveraging APIs without Programming in FME Server Provisioning
Leveraging APIs without Programming in FME Server Provisioning
Safe Software410 views

More from Spark Summit

FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang by
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang Spark Summit
6.8K views32 slides
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M... by
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...Spark Summit
4.3K views34 slides
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu by
Apache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang WuApache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang WuSpark Summit
3.1K views21 slides
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra by
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya RaghavendraSpark Summit
3.1K views21 slides
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem... by
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...Spark Summit
3.4K views56 slides
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ... by
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...Spark Summit
1.7K views90 slides

More from Spark Summit(20)

FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang by Spark Summit
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
FPGA-Based Acceleration Architecture for Spark SQL Qi Xie and Quanfu Wang
Spark Summit6.8K views
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M... by Spark Summit
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
VEGAS: The Missing Matplotlib for Scala/Apache Spark with DB Tsai and Roger M...
Spark Summit4.3K views
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu by Spark Summit
Apache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang WuApache Spark Structured Streaming Helps Smart Manufacturing with  Xiaochang Wu
Apache Spark Structured Streaming Helps Smart Manufacturing with Xiaochang Wu
Spark Summit3.1K views
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra by Spark Summit
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Spark Summit3.1K views
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem... by Spark Summit
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
A Tale of Two Graph Frameworks on Spark: GraphFrames and Tinkerpop OLAP Artem...
Spark Summit3.4K views
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ... by Spark Summit
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark Marcin ...
Spark Summit1.7K views
Apache Spark and Tensorflow as a Service with Jim Dowling by Spark Summit
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim Dowling
Spark Summit2.1K views
Apache Spark and Tensorflow as a Service with Jim Dowling by Spark Summit
Apache Spark and Tensorflow as a Service with Jim DowlingApache Spark and Tensorflow as a Service with Jim Dowling
Apache Spark and Tensorflow as a Service with Jim Dowling
Spark Summit1.1K views
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library... by Spark Summit
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
MMLSpark: Lessons from Building a SparkML-Compatible Machine Learning Library...
Spark Summit2.8K views
Next CERN Accelerator Logging Service with Jakub Wozniak by Spark Summit
Next CERN Accelerator Logging Service with Jakub WozniakNext CERN Accelerator Logging Service with Jakub Wozniak
Next CERN Accelerator Logging Service with Jakub Wozniak
Spark Summit2.3K views
Powering a Startup with Apache Spark with Kevin Kim by Spark Summit
Powering a Startup with Apache Spark with Kevin KimPowering a Startup with Apache Spark with Kevin Kim
Powering a Startup with Apache Spark with Kevin Kim
Spark Summit787 views
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra by Spark Summit
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Spark Summit730 views
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—... by Spark Summit
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Hiding Apache Spark Complexity for Fast Prototyping of Big Data Applications—...
Spark Summit1.1K views
How Nielsen Utilized Databricks for Large-Scale Research and Development with... by Spark Summit
How Nielsen Utilized Databricks for Large-Scale Research and Development with...How Nielsen Utilized Databricks for Large-Scale Research and Development with...
How Nielsen Utilized Databricks for Large-Scale Research and Development with...
Spark Summit999 views
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov... by Spark Summit
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spline: Apache Spark Lineage not Only for the Banking Industry with Marek Nov...
Spark Summit2.3K views
Goal Based Data Production with Sim Simeonov by Spark Summit
Goal Based Data Production with Sim SimeonovGoal Based Data Production with Sim Simeonov
Goal Based Data Production with Sim Simeonov
Spark Summit746 views
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le... by Spark Summit
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Preventing Revenue Leakage and Monitoring Distributed Systems with Machine Le...
Spark Summit1.5K views
Getting Ready to Use Redis with Apache Spark with Dvir Volk by Spark Summit
Getting Ready to Use Redis with Apache Spark with Dvir VolkGetting Ready to Use Redis with Apache Spark with Dvir Volk
Getting Ready to Use Redis with Apache Spark with Dvir Volk
Spark Summit3K views
Deduplication and Author-Disambiguation of Streaming Records via Supervised M... by Spark Summit
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Deduplication and Author-Disambiguation of Streaming Records via Supervised M...
Spark Summit915 views
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization... by Spark Summit
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
MatFast: In-Memory Distributed Matrix Computation Processing and Optimization...
Spark Summit1.7K views

Recently uploaded

Advanced_Recommendation_Systems_Presentation.pptx by
Advanced_Recommendation_Systems_Presentation.pptxAdvanced_Recommendation_Systems_Presentation.pptx
Advanced_Recommendation_Systems_Presentation.pptxneeharikasingh29
5 views9 slides
CRIJ4385_Death Penalty_F23.pptx by
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptxyvettemm100
6 views24 slides
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docx by
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docxRIO GRANDE SUPPLY COMPANY INC, JAYSON.docx
RIO GRANDE SUPPLY COMPANY INC, JAYSON.docxJaysonGarabilesEspej
6 views3 slides
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptxayeshabaig2004
5 views30 slides
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf by
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfVikas 500 BIG DATA TECHNOLOGIES LAB.pdf
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfvikas12611618
8 views30 slides
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx by
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptxDataScienceConferenc1
5 views12 slides

Recently uploaded(20)

Advanced_Recommendation_Systems_Presentation.pptx by neeharikasingh29
Advanced_Recommendation_Systems_Presentation.pptxAdvanced_Recommendation_Systems_Presentation.pptx
Advanced_Recommendation_Systems_Presentation.pptx
CRIJ4385_Death Penalty_F23.pptx by yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 views
Chapter 3b- Process Communication (1) (1)(1) (1).pptx by ayeshabaig2004
Chapter 3b- Process Communication (1) (1)(1) (1).pptxChapter 3b- Process Communication (1) (1)(1) (1).pptx
Chapter 3b- Process Communication (1) (1)(1) (1).pptx
ayeshabaig20045 views
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf by vikas12611618
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfVikas 500 BIG DATA TECHNOLOGIES LAB.pdf
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf
vikas126116188 views
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx by DataScienceConferenc1
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
UNEP FI CRS Climate Risk Results.pptx by pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 views
SUPER STORE SQL PROJECT.pptx by khan888620
SUPER STORE SQL PROJECT.pptxSUPER STORE SQL PROJECT.pptx
SUPER STORE SQL PROJECT.pptx
khan88862012 views
Survey on Factuality in LLM's.pptx by NeethaSherra1
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptx
NeethaSherra15 views
Short Story Assignment by Kelly Nguyen by kellynguyen01
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyen
kellynguyen0119 views
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation by DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
Ukraine Infographic_22NOV2023_v2.pdf by AnastosiyaGurin
Ukraine Infographic_22NOV2023_v2.pdfUkraine Infographic_22NOV2023_v2.pdf
Ukraine Infographic_22NOV2023_v2.pdf
AnastosiyaGurin1.4K views
Cross-network in Google Analytics 4.pdf by GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 views
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M... by DataScienceConferenc1
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...

Fire in the Sky: An Introduction to Monitoring Apache Spark in the Cloud with Michael McCune