SlideShare a Scribd company logo
1 of 93
Download to read offline
SPARK CORE
SHIJIE ZHANG
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
INTRO - LEGACY NUMBERS
INTRO
▸ To get a better intuition about differences of these
numbers, humanize these durations
▸ Then, we can map each latency number to a human activity
INTRO
INTRO
Humanized numbers
INTRO
INTRO
Distributed data parallel programming, two more worries
▸ Partial failure: a subset of machine crash
▸ Latency: certain operations have higher latency
RUN ON: JVM
IMPLEMENTED BY: SCALA
SDKS: PYTHON / SCALA / JAVA
OTHER LIBRARIES: R
INTRO
Spark components
Spark
▸ A distributed data parallel programming framework
▸ Implements a data parallel model called RDD
RDD Core
INTRO
WHAT’S RDD
▸ Just looks like Scala collections (but distributed across machines)
INTRO
WHAT’S RDD
▸ While signatures differ a bit, their semantics are the same
INTRO
WHAT’S RDD
▸ Scientific Def: An interface containing five properties
▸ Set of partitions
▸ List of dependencies
▸ Function to compute a partition given its parents
▸ (Optional) Partition method
▸ (Optional) Preferred locations
INTRO
Spark Core components - code base (2012)
RDD
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Partition/Shuffle
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Block manager
Networking
Broadcast
Accumulator
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Partition/Shuffle
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Block manager
Networking
Broadcast
Accumulator
INTRO
Execution of Spark program
▸ Execution mode: Standalone / Cluster
Yarn / Mesos
Your program
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Partition/Shuffle
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Block manager
Networking
Broadcast
Accumulator
STANDALONE-VIEW
RDD
transformations another
RDD
actions
result values
creation
Note: actions/transformations will not modify input RDD, but will always create a new one
STANDALONE-VIEW
RDD CREATION
▸ Loading an external dataset
▸ Parallelizing a collection
STANDALONE-VIEW
TRANSFORMATIONS
STANDALONE-VIEW
TRANSFORMATIONS
STANDALONE-VIEW
ACTIONS
transformations another
RDD
actions
single values
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
https://www.udemy.com/taming-big-data-with-apache-spark-hands-on/learn/v4/overview
For more detailed step, see:
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
PYTHON EXAMPLE - COUNT RATING NUM FOR EACH STAR
KEY-PAIR RDD
▸ Def: RDDs containing key/value pairs
▸ Motivation: allows to act on each key
KEY-PAIR RDD CREATION
▸ Create from RDDs by transforming to tuples
▸ Load external data
KEY-PAIR RDD TRANSFORMATIONS
▸ Key-Pair RDDs have all transformations available to standard RDDs
KEY-PAIR RDD TRANSFORMATIONS
▸ Key-Pair RDDs have all transformations available to standard RDDs
KEY-PAIR RDD ACTIONS
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Partition/Shuffle
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Partition/Shuffle
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
LAZY-EVALUATION
▸ Def:
▸ Transformations are lazy. Their result RDD is not immediately computed.
▸ Actions are eager. Their result is immediately computed.
▸ Example:
LAZY-EVALUATION
▸ Def:
▸ Transformations are lazy. Their result RDD is not immediately computed.
▸ Actions are eager. Their result is immediately computed.
▸ Example:
LAZY-EVALUATION
▸ Def:
▸ Transformations are lazy. Their result RDD is not immediately computed.
▸ Actions are eager. Their result is immediately computed.
▸ Example:
LAZY-EVALUATION
▸ Why:
▸ Enable computation optimization
LAZY-EVALUATION
▸ Why:
▸ Limit network communication
▸ If you perform an action, its result returns back to the driver program
RDD results return back
RDD transformations and actions
LAZY-EVALUATION
▸ What does it do when loading/transformation are performed:
▸ Record RDD dependencies with a directed acyclic graph (DAG)
▸ For failure recovery
▸ Pipeline operations execution order
▸ Optimize by utilizing network traffic based on partition and shuffling
LAZY EVALUATION
Life time of a job in Spark
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Partition/Shuffle
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
CACHE
▸ Def
▸ By default, RDDs are recomputed each time you run an action on them
CACHE
▸ Cache options
CACHE
▸ Why
▸ Avoid network IO and recomputation
▸ How
▸ Implemented with Least Recently Used (LRU) cache policy
▸ Users do not have to worry about caching too much data
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
SHUFFLE
SHUFFLE
SHUFFLE
SHUFFLE
SHUFFLE
SHUFFLE
SHUFFLE
REMINDER: LATENCY MATTERS (HUMANIZED)
SHUFFLE
▸ Can we do a better job
SHUFFLE
SHUFFLE
SHUFFLE
SHUFFLE
SHUFFLE
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
PARTITION
HASH PARTITION
HASH PARTITION
RANGE PARTITION
RANGE PARTITION
RANGE PARTITION
RANGE PARTITION
RANGE PARTITION
RANGE PARTITION
RANGE PARTITION
PARTITIONING DATA:
PARTITIONING DATA:
PARTITIONING DATA:
PARTITIONING DATA:
OPTIMIZING PREVIOUS EXAMPLE USING RANGE PARTITIONING
HOW DO I KNOW A SHUFFLE WILL OCCUR?
HOW DO I KNOW A SHUFFLE WILL OCCUR?
HOW DO I KNOW A SHUFFLE WILL OCCUR?
▸ Operations that might cause a shuffle
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
SHARED VARIABLES
▸ Start with an example
BROADCAST VARIABLES
▸ Motivation:
▸ Normally, Spark closures, including variables they use, are sent separately within each task.
▸ But in same cases, the same variable will be used in multiple parallel operations
▸ By default, Spark task launching mechanism is optimized for small task sizes.
▸ But in some cases, a large read-only variable needs to be shared across tasks, or across operations
▸ Examples: large lookup tables
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
BROADCAST VARIABLES
ACCUMULATORS
▸ Motivation:
▸ Normally, Spark closures, including variables they use, are sent separately within each task.
▸ But values inside worker nodes are not propagated back to the driver
▸ Normally, we could get the entire RDD back with reduce(),
▸ But sometimes, we just need a simpler way
▸ Examples: counters
OUTLINE
▸ Intro
▸ Standalone-view RDD concepts
▸ Basic RDD
▸ Key-Pair RDD
▸ Distributed-view RDD concepts
▸ Lazy evaluation
▸ Cache
▸ Shuffle/Partition
▸ Broadcast
▸ Accumulator
▸ Conclusion
Operators
Scheduler
Networking
Broadcast
Accumulator
Block manager
CONCLUSION
▸ Spark core
▸ basic Spark RDDs
▸ Things to take care when programming in clusters
▸ What’s next?
▸ Spark SQL/Hive
REFERENCES
▸ Udemy hands-one course
▸ https://www.udemy.com/taming-big-data-with-apache-spark-hands-on/
learn/v4/overview
▸ Several slides:
▸ http://heather.miller.am/teaching/cs212/slides
▸ http://ampcamp.berkeley.edu/wp-content/uploads/2012/06/matei-zaharia-
amp-camp-2012-advanced-spark.pdf
▸ https://databricks-training.s3.amazonaws.com/slides/advanced-spark-
training.pdf
▸ <<Learning Spark>>

More Related Content

What's hot

Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...Simplilearn
 
Spark shuffle introduction
Spark shuffle introductionSpark shuffle introduction
Spark shuffle introductioncolorant
 
Apache Spark Introduction
Apache Spark IntroductionApache Spark Introduction
Apache Spark Introductionsudhakara st
 
Introduction to Spark Internals
Introduction to Spark InternalsIntroduction to Spark Internals
Introduction to Spark InternalsPietro Michiardi
 
Introduction to Apache Spark
Introduction to Apache SparkIntroduction to Apache Spark
Introduction to Apache SparkRahul Jain
 
Apache Spark Internals
Apache Spark InternalsApache Spark Internals
Apache Spark InternalsKnoldus Inc.
 
OpenStack Tutorial
OpenStack TutorialOpenStack Tutorial
OpenStack TutorialBret Piatt
 
Apache spark - Architecture , Overview & libraries
Apache spark - Architecture , Overview & librariesApache spark - Architecture , Overview & libraries
Apache spark - Architecture , Overview & librariesWalaa Hamdy Assy
 
Spark architecture
Spark architectureSpark architecture
Spark architecturedatamantra
 
Introduction to apache spark
Introduction to apache spark Introduction to apache spark
Introduction to apache spark Aakashdata
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & InternalsApache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & InternalsAnton Kirillov
 
Spark - Alexis Seigneurin (Français)
Spark - Alexis Seigneurin (Français)Spark - Alexis Seigneurin (Français)
Spark - Alexis Seigneurin (Français)Alexis Seigneurin
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...Simplilearn
 

What's hot (20)

Apache Spark 101
Apache Spark 101Apache Spark 101
Apache Spark 101
 
Spark SQL
Spark SQLSpark SQL
Spark SQL
 
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
Spark SQL Tutorial | Spark SQL Using Scala | Apache Spark Tutorial For Beginn...
 
Apache Spark Architecture
Apache Spark ArchitectureApache Spark Architecture
Apache Spark Architecture
 
Spark architecture
Spark architectureSpark architecture
Spark architecture
 
Spark shuffle introduction
Spark shuffle introductionSpark shuffle introduction
Spark shuffle introduction
 
Apache Spark Introduction
Apache Spark IntroductionApache Spark Introduction
Apache Spark Introduction
 
Introduction to Spark Internals
Introduction to Spark InternalsIntroduction to Spark Internals
Introduction to Spark Internals
 
Introduction to Apache Spark
Introduction to Apache SparkIntroduction to Apache Spark
Introduction to Apache Spark
 
Apache Spark Internals
Apache Spark InternalsApache Spark Internals
Apache Spark Internals
 
OpenStack Tutorial
OpenStack TutorialOpenStack Tutorial
OpenStack Tutorial
 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
Apache spark - Architecture , Overview & libraries
Apache spark - Architecture , Overview & librariesApache spark - Architecture , Overview & libraries
Apache spark - Architecture , Overview & libraries
 
Spark architecture
Spark architectureSpark architecture
Spark architecture
 
spark_intro_1208
spark_intro_1208spark_intro_1208
spark_intro_1208
 
Introduction to apache spark
Introduction to apache spark Introduction to apache spark
Introduction to apache spark
 
Apache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & InternalsApache Spark in Depth: Core Concepts, Architecture & Internals
Apache Spark in Depth: Core Concepts, Architecture & Internals
 
Spark - Alexis Seigneurin (Français)
Spark - Alexis Seigneurin (Français)Spark - Alexis Seigneurin (Français)
Spark - Alexis Seigneurin (Français)
 
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
What Is Apache Spark? | Introduction To Apache Spark | Apache Spark Tutorial ...
 
Apache spark
Apache sparkApache spark
Apache spark
 

Viewers also liked

Role-based Access Control on AWS
Role-based Access Control on AWSRole-based Access Control on AWS
Role-based Access Control on AWSFreeman Zhang
 
Keyboard covert channels
Keyboard covert channelsKeyboard covert channels
Keyboard covert channelsFreeman Zhang
 
Partner Cloud Solutions Event August 2016-Forlac
Partner Cloud Solutions Event August 2016-ForlacPartner Cloud Solutions Event August 2016-Forlac
Partner Cloud Solutions Event August 2016-ForlacDave Rendón
 
CAP теорема Брюера и ее применения на практике
CAP теорема Брюера и ее применения на практикеCAP теорема Брюера и ее применения на практике
CAP теорема Брюера и ее применения на практикеAlexey Bokov
 
Case-for-Support-Evas-Phoenix-Rising-Capital-Campaign
Case-for-Support-Evas-Phoenix-Rising-Capital-CampaignCase-for-Support-Evas-Phoenix-Rising-Capital-Campaign
Case-for-Support-Evas-Phoenix-Rising-Capital-CampaignHelen Choi (최효경), CFRE
 
Azure web apps - designing and debugging
Azure web apps  - designing and debuggingAzure web apps  - designing and debugging
Azure web apps - designing and debuggingAlexey Bokov
 
Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...
Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...
Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...AFAS Software
 
Functional Scala II (in practice)
Functional Scala II (in practice)Functional Scala II (in practice)
Functional Scala II (in practice)Mario Gleichmann
 
Software Engineering Thailand: Programming with Scala
Software Engineering Thailand: Programming with ScalaSoftware Engineering Thailand: Programming with Scala
Software Engineering Thailand: Programming with ScalaBrian Topping
 
Functional Programming in Scala Reading #2
Functional Programming in Scala Reading #2Functional Programming in Scala Reading #2
Functional Programming in Scala Reading #2Naoyuki Yamada
 
Scala 2.10.0 (english version)
Scala 2.10.0 (english version)Scala 2.10.0 (english version)
Scala 2.10.0 (english version)Daniel Sobral
 
Basic Wicket and Scala
Basic Wicket and ScalaBasic Wicket and Scala
Basic Wicket and Scalastuq
 
Scala: Object-Oriented Meets Functional, by Iulian Dragos
Scala: Object-Oriented Meets Functional, by Iulian DragosScala: Object-Oriented Meets Functional, by Iulian Dragos
Scala: Object-Oriented Meets Functional, by Iulian Dragos3Pillar Global
 
Scala For Java Programmers
Scala For Java ProgrammersScala For Java Programmers
Scala For Java ProgrammersEnno Runne
 

Viewers also liked (20)

Path oram
Path oramPath oram
Path oram
 
Role-based Access Control on AWS
Role-based Access Control on AWSRole-based Access Control on AWS
Role-based Access Control on AWS
 
Java8
Java8Java8
Java8
 
Keyboard covert channels
Keyboard covert channelsKeyboard covert channels
Keyboard covert channels
 
Spark sql
Spark sqlSpark sql
Spark sql
 
Spark rdd part 2
Spark rdd part 2Spark rdd part 2
Spark rdd part 2
 
Spark infrastructure
Spark infrastructureSpark infrastructure
Spark infrastructure
 
Partner Cloud Solutions Event August 2016-Forlac
Partner Cloud Solutions Event August 2016-ForlacPartner Cloud Solutions Event August 2016-Forlac
Partner Cloud Solutions Event August 2016-Forlac
 
CAP теорема Брюера и ее применения на практике
CAP теорема Брюера и ее применения на практикеCAP теорема Брюера и ее применения на практике
CAP теорема Брюера и ее применения на практике
 
Case-for-Support-Evas-Phoenix-Rising-Capital-Campaign
Case-for-Support-Evas-Phoenix-Rising-Capital-CampaignCase-for-Support-Evas-Phoenix-Rising-Capital-Campaign
Case-for-Support-Evas-Phoenix-Rising-Capital-Campaign
 
Azure web apps - designing and debugging
Azure web apps  - designing and debuggingAzure web apps  - designing and debugging
Azure web apps - designing and debugging
 
Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...
Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...
Pieter de Bruin (Microsoft) - Welke technologie gebruiken bij implementatie M...
 
Functional Scala I
Functional Scala IFunctional Scala I
Functional Scala I
 
Functional Scala II (in practice)
Functional Scala II (in practice)Functional Scala II (in practice)
Functional Scala II (in practice)
 
Software Engineering Thailand: Programming with Scala
Software Engineering Thailand: Programming with ScalaSoftware Engineering Thailand: Programming with Scala
Software Engineering Thailand: Programming with Scala
 
Functional Programming in Scala Reading #2
Functional Programming in Scala Reading #2Functional Programming in Scala Reading #2
Functional Programming in Scala Reading #2
 
Scala 2.10.0 (english version)
Scala 2.10.0 (english version)Scala 2.10.0 (english version)
Scala 2.10.0 (english version)
 
Basic Wicket and Scala
Basic Wicket and ScalaBasic Wicket and Scala
Basic Wicket and Scala
 
Scala: Object-Oriented Meets Functional, by Iulian Dragos
Scala: Object-Oriented Meets Functional, by Iulian DragosScala: Object-Oriented Meets Functional, by Iulian Dragos
Scala: Object-Oriented Meets Functional, by Iulian Dragos
 
Scala For Java Programmers
Scala For Java ProgrammersScala For Java Programmers
Scala For Java Programmers
 

Similar to Spark core

How Spark Does It Internally?
How Spark Does It Internally?How Spark Does It Internally?
How Spark Does It Internally?Knoldus Inc.
 
Apache Spark Fundamentals Meetup Talk
Apache Spark Fundamentals Meetup TalkApache Spark Fundamentals Meetup Talk
Apache Spark Fundamentals Meetup TalkEren Avşaroğulları
 
Building the Right Platform Architecture for Hadoop
Building the Right Platform Architecture for HadoopBuilding the Right Platform Architecture for Hadoop
Building the Right Platform Architecture for HadoopAll Things Open
 
A Deep Dive Into Spark
A Deep Dive Into SparkA Deep Dive Into Spark
A Deep Dive Into SparkAshish kumar
 
How to build your query engine in spark
How to build your query engine in sparkHow to build your query engine in spark
How to build your query engine in sparkPeng Cheng
 
GraphQL And Relay Modern
GraphQL And Relay ModernGraphQL And Relay Modern
GraphQL And Relay ModernBrad Pillow
 
GraphQL And Relay Modern
GraphQL And Relay ModernGraphQL And Relay Modern
GraphQL And Relay ModernBrad Pillow
 
Oreilly Webcast 01 19 10
Oreilly Webcast 01 19 10Oreilly Webcast 01 19 10
Oreilly Webcast 01 19 10Sean Hull
 
Spark 计算模型
Spark 计算模型Spark 计算模型
Spark 计算模型wang xing
 
From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...
From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...
From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...Alexander Dean
 
Developing with-devstack
Developing with-devstackDeveloping with-devstack
Developing with-devstackDeepak Garg
 
PaddlePaddle: A Complete Enterprise Solution
PaddlePaddle: A Complete Enterprise SolutionPaddlePaddle: A Complete Enterprise Solution
PaddlePaddle: A Complete Enterprise SolutionYi Wang
 
GraphQL IndyJS April 2016
GraphQL IndyJS April 2016GraphQL IndyJS April 2016
GraphQL IndyJS April 2016Brad Pillow
 
Comparison of RDBMS, MongoDB, and Cassandra
Comparison of RDBMS, MongoDB, and CassandraComparison of RDBMS, MongoDB, and Cassandra
Comparison of RDBMS, MongoDB, and CassandraSunghyun Lee
 

Similar to Spark core (20)

How Spark Does It Internally?
How Spark Does It Internally?How Spark Does It Internally?
How Spark Does It Internally?
 
Overview of Spark for HPC
Overview of Spark for HPCOverview of Spark for HPC
Overview of Spark for HPC
 
Apache Spark Fundamentals Meetup Talk
Apache Spark Fundamentals Meetup TalkApache Spark Fundamentals Meetup Talk
Apache Spark Fundamentals Meetup Talk
 
Building the Right Platform Architecture for Hadoop
Building the Right Platform Architecture for HadoopBuilding the Right Platform Architecture for Hadoop
Building the Right Platform Architecture for Hadoop
 
ivanova-samba_backend.pdf
ivanova-samba_backend.pdfivanova-samba_backend.pdf
ivanova-samba_backend.pdf
 
A Deep Dive Into Spark
A Deep Dive Into SparkA Deep Dive Into Spark
A Deep Dive Into Spark
 
How to build your query engine in spark
How to build your query engine in sparkHow to build your query engine in spark
How to build your query engine in spark
 
GraphQL And Relay Modern
GraphQL And Relay ModernGraphQL And Relay Modern
GraphQL And Relay Modern
 
GraphQL and Relay Modern
GraphQL and Relay ModernGraphQL and Relay Modern
GraphQL and Relay Modern
 
GraphQL And Relay Modern
GraphQL And Relay ModernGraphQL And Relay Modern
GraphQL And Relay Modern
 
Oreilly Webcast 01 19 10
Oreilly Webcast 01 19 10Oreilly Webcast 01 19 10
Oreilly Webcast 01 19 10
 
Intro to Apache Spark
Intro to Apache SparkIntro to Apache Spark
Intro to Apache Spark
 
Spark 计算模型
Spark 计算模型Spark 计算模型
Spark 计算模型
 
From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...
From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...
From Zero to Hadoop: a tutorial for getting started writing Hadoop jobs on Am...
 
Developing with-devstack
Developing with-devstackDeveloping with-devstack
Developing with-devstack
 
SparkNotes
SparkNotesSparkNotes
SparkNotes
 
PaddlePaddle: A Complete Enterprise Solution
PaddlePaddle: A Complete Enterprise SolutionPaddlePaddle: A Complete Enterprise Solution
PaddlePaddle: A Complete Enterprise Solution
 
GraphQL IndyJS April 2016
GraphQL IndyJS April 2016GraphQL IndyJS April 2016
GraphQL IndyJS April 2016
 
Spark Deep Dive
Spark Deep DiveSpark Deep Dive
Spark Deep Dive
 
Comparison of RDBMS, MongoDB, and Cassandra
Comparison of RDBMS, MongoDB, and CassandraComparison of RDBMS, MongoDB, and Cassandra
Comparison of RDBMS, MongoDB, and Cassandra
 

Recently uploaded

W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Clustering techniques data mining book ....
Clustering techniques data mining book ....Clustering techniques data mining book ....
Clustering techniques data mining book ....ShaimaaMohamedGalal
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionSolGuruz
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 

Recently uploaded (20)

W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Clustering techniques data mining book ....
Clustering techniques data mining book ....Clustering techniques data mining book ....
Clustering techniques data mining book ....
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Diamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with PrecisionDiamond Application Development Crafting Solutions with Precision
Diamond Application Development Crafting Solutions with Precision
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 

Spark core