SlideShare a Scribd company logo
1 of 20
Download to read offline
Large-scale Neural ModelingLarge-scale Neural Modeling
in MapReduce and Giraphin MapReduce and Giraph
Co-authors
Nicholas D. Spielman
Neuroscience Program
University of St. Thomas
Presenter
Shuo Yang
Graduate Programs in Software
University of St. Thomas
Special thanks
Bhabani Misra, PhD
Graduate Programs in Software
University of St. Thomas
Jadin C. Jackson PhD
Department of Biology
University of St. Thomas
Bradley S. Rubin, PhD
Graduate Programs in Software
University of St. Thomas
Why Hadoop & What is Hadoop
Why not supercomputers?
Expensive
Limited access
Scalability
Why Hadoop?
Runs on commodity hardware
Scalable
Full-fledged eco-system & community
Open-source implementation of MapReduce
based on Java
MapReduce Model
Client
Map
Reduce
HDFS
Split Data
Output
Map
Map
Reduce Output
…....
…....
∑ I
input currents
from neighbors
∆vI1
I2
In
currents to all neighbors
Synaptic weight matrix
0 1000Time Step
Neuron ID
Simulation results
0
2500
Neural Model (Izhikevich model)
…....
…....
∑ I
input currents
from neighbors
∆vI1
I2
In
currents to all neighbors
Synaptic weight matrix
0 1000Time Step
Neuron ID
Simulation results
0
2500
Neural Model (Izhikevich model)
This is a graph structure
Mapper
N1 I2
I3
N1 I2
I3
I2
I3
Reducer
Mapper
N2 I1
I3
N2 I1
I3
I1
I3
Mapper
N3 I2
I1
N3 I2
I1
I2
I1
Reducer
Reducer
N1 I2
I3
N2 I1
I3
N3 I2
I1
sum currents to N1
sum currents to N2
sum currents to N3
update N1
update N2
update N3
HDFS
initial input
write back to HDFS
N1 and its
local structure
N2 and its
local structure
N3 and its
local structure
Map
Sort &
Shuffle Reduce
Basic MapReduce Implementation
input from previous job
Mapper
N1 I2
I3
N1 I2
I3
I2
I3
Reducer
Mapper
N2 I1
I3
N2 I1
I3
I1
I3
Mapper
N3 I2
I1
N3 I2
I1
I2
I1
Reducer
Reducer
N1 I2
I3
N2 I1
I3
N3 I2
I1
sum currents to N1
sum currents to N2
sum currents to N3
update N1
update N2
update N3
HDFS
initial input
write back to HDFS
N1 and its
local structure
N2 and its
local structure
N3 and its
local structure
Map
Sort &
Shuffle Reduce
Basic MapReduce Implementation
input from previous job
Problems:
synaptic currents are sent
directly to the reducers without
local aggregation
The graph structure is shuffled in
each iteration
N1 I2
I3
Mapper
N2 I1
I3
N3 I2
I1
HDFS
initial input
Map
Sort &
Shuffle
Reduce
In-Mapper Combining (IMC, introduced by Lin & Schatz)
N1 I2
I3
N2 I1
I3
N3 I2
I1
I1
I1
I2
I2
I3
I3
∑
∑
∑
Reducer
Reducer
Reducer
I3
N2 I1
I3
N3
I1
update N1
update N2
update N3
I2
I2
N1 I2
I3
Mapper
N2 I1
I3
N3 I2
I1
HDFS
initial input
Map
Sort &
Shuffle
Reduce
In-Mapper Combining (IMC, introduced by Lin & Schatz)
N1 I2
I3
N2 I1
I3
N3 I2
I1
I1
I1
I2
I2
I3
I3
∑
∑
∑
Reducer
Reducer
Reducer
I3
N2 I1
I3
N3
I1
update N1
update N2
update N3
I2
I2
The graph structure is still shuffled!
Mapper
N1 I2
I3
I2
I3
Reducer
Mapper
N2 I1
I3
I1
I3
Mapper
N3 I2
I1
I2
I1
Reducer
Reducer
N1 I2
I3
N2 I1
I3
N3 I2
I1
sum currents to N1
sum currents to N2
sum currents to N3
update N1
update N2
update N3
HDFS
initial input
write back to HDFS
N1 and its
local structure
N2 and its
local structure
N3 and its
local structure
Schimmy (introduced by Lin & Schatz)
N1 I2
I3
N2 I1
I3
N3 I2
I1
Map
remotely read graph structure
sort &
shuffle Reduce
Mapper
N1 I2
I3
I2
I3
Reducer
Mapper
N2 I1
I3
I1
I3
Mapper
N3 I2
I1
I2
I1
Reducer
Reducer
N1 I2
I3
N2 I1
I3
N3 I2
I1
sum currents to N1
sum currents to N2
sum currents to N3
update N1
update N2
update N3
HDFS
initial input
write back to HDFS
N1 and its
local structure
N2 and its
local structure
N3 and its
local structure
Schimmy (introduced by Lin & Schatz)
N1 I2
I3
N2 I1
I3
N3 I2
I1
Map
remotely read graph structure
sort &
shuffle Reduce
Problems:
Remote reading from HDFS
The graph structure is read and
written in each iteration
Mapper
N1 I2
I3
I2
I3
Reducer
Mapper
N2 I1
I3
I1
I3
Mapper
N3 I2
I1
I2
I1
Reducer
Reducer
N1 I2
I3
N2 I1
I3
N3 I2
I1
sum currents to N1
sum currents to N2
sum currents to N3
update N1
update N2
update N3
HDFS
initial input
write back to HDFS
N1 and its
local structure
N2 and its
local structure
N3 and its
local structure
Schimmy (introduced by Lin & Schatz)
N1 I2
I3
N2 I1
I3
N3 I2
I1
Map
remotely read graph structure
sort &
shuffle Reduce
Observation:
The graph structure is read-only!
Mapper
N1
I2
I3
Reducer
Mapper
I1
I3
Mapper
N3 I2
I1
Reducer
Reducer
N1
N2
N3
sum currents to N1
sum currents to N2
sum currents to N3
update N1
update N2
update N3
HDFS
initial input
write back to HDFS
Mapper-side Schimmy
N1 I2
I3
N2 I1
I3
N3 I2
I1
N2
Map
sort &
shuffle Reduce
Drawbacks of Graph algorithm in MapReduce
Non-intuitive and hard to implement
Not efficiently expressed as iterative algorithms
Not optimized for large numbers of iterations
input from
HDFS
output to
HDFS
input from
HDFS
output to
HDFS
Mapper Intermediate files Reducer
Iterate
Startup Penalty Disk Penalty Disk Penalty
Not optimized for large numbers of iterations
Giraph
N1 I2
I3
N2 I1
I3
N3 I2
I1
N1 I2
I3
N2 I1
I3
N3 I2
I1
H
D
F
S
Load input Synchronous barrier Synchronous barrier
N1 I2
I3
N2 I1
I3
N3 I2
I1
H
D
F
S
…...
Write results back
Iterative graph processing system
Powers Facebook graph search
Highly scalable
Based on BSP model
Mapper-only job on Hadoop
In-memory computation
“Think like a vertex”
More intuitive APIs
Giraph
N1 I2
I3
N2 I1
I3
N3 I2
I1
N1 I2
I3
N2 I1
I3
N3 I2
I1
H
D
F
S
Load input Synchronous barrier Synchronous barrier
N1 I2
I3
N2 I1
I3
N3 I2
I1
H
D
F
S
…...
Write results back
Iterative graph processing system
Powers Facebook graph search
Highly scalable
Based on BSP model
Mapper-only job on Hadoop
In-memory computation
“Think like a NEURON”
More intuitive APIs
Comparison of running time of each iteration
Comparison of speeds – 40 ms simulation
6% 0% -11% -48% -64% -91%
Conclusion
Hadoop is capable of modeling large-scale neural
networks.
Based on IMC and Schimmy, our Mapper-side Schimmy
improves MapReduce graph algorithms
Where graph structure is read-only.
Vertex-centric approaches, such as, Giraph showed
superior performance. However,
# of iterations specified as a global variable
Limited by memory per node
Not widely adopted by industry
Large-scale Neural ModelingLarge-scale Neural Modeling
in MapReduce and Giraphin MapReduce and Giraph
Co-authors
Nicholas D. Spielman
Neuroscience Program
University of St. Thomas
Presenter
Shuo Yang
Graduate Programs in Software
University of St. Thomas
Special thanks
Bhabani Misra, PhD
Graduate Programs in Software
University of St. Thomas
Jadin C. Jackson PhD
Department of Biology
University of St. Thomas
Bradley S. Rubin, PhD
Graduate Programs in Software
University of St. Thomas
Comparison of speeds – 40 ms simulation
Comparison of speeds – 20 ms to 40 ms simulation

More Related Content

Similar to Ieee eit-talk-large-scale-neural-modeling-in-map reduce-giraph

Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduceDavid Gleich
 
Scalable Hadoop with succinct Python: the best of both worlds
Scalable Hadoop with succinct Python: the best of both worldsScalable Hadoop with succinct Python: the best of both worlds
Scalable Hadoop with succinct Python: the best of both worldsDataWorks Summit
 
MAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptxMAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptxHARIKRISHNANU13
 
Introduction to map reduce
Introduction to map reduceIntroduction to map reduce
Introduction to map reduceBhupesh Chawda
 
Introduction to Map Reduce
Introduction to Map ReduceIntroduction to Map Reduce
Introduction to Map ReduceApache Apex
 
Behm Shah Pagerank
Behm Shah PagerankBehm Shah Pagerank
Behm Shah Pagerankgothicane
 
Ling liu part 02:big graph processing
Ling liu part 02:big graph processingLing liu part 02:big graph processing
Ling liu part 02:big graph processingjins0618
 
Hadoop 101 for bioinformaticians
Hadoop 101 for bioinformaticiansHadoop 101 for bioinformaticians
Hadoop 101 for bioinformaticiansattilacsordas
 
Amazon-style shopping cart analysis using MapReduce on a Hadoop cluster
Amazon-style shopping cart analysis using MapReduce on a Hadoop clusterAmazon-style shopping cart analysis using MapReduce on a Hadoop cluster
Amazon-style shopping cart analysis using MapReduce on a Hadoop clusterAsociatia ProLinux
 
Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...
Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...
Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...Cloudera, Inc.
 
Hadoop trainting-in-hyderabad@kelly technologies
Hadoop trainting-in-hyderabad@kelly technologiesHadoop trainting-in-hyderabad@kelly technologies
Hadoop trainting-in-hyderabad@kelly technologiesKelly Technologies
 

Similar to Ieee eit-talk-large-scale-neural-modeling-in-map reduce-giraph (20)

MapReduce and NoSQL
MapReduce and NoSQLMapReduce and NoSQL
MapReduce and NoSQL
 
Data Science
Data ScienceData Science
Data Science
 
Apache Hadoop: DFS and Map Reduce
Apache Hadoop: DFS and Map ReduceApache Hadoop: DFS and Map Reduce
Apache Hadoop: DFS and Map Reduce
 
Sparse matrix computations in MapReduce
Sparse matrix computations in MapReduceSparse matrix computations in MapReduce
Sparse matrix computations in MapReduce
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
Scalable Hadoop with succinct Python: the best of both worlds
Scalable Hadoop with succinct Python: the best of both worldsScalable Hadoop with succinct Python: the best of both worlds
Scalable Hadoop with succinct Python: the best of both worlds
 
MAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptxMAP REDUCE IN DATA SCIENCE.pptx
MAP REDUCE IN DATA SCIENCE.pptx
 
Introduction to map reduce
Introduction to map reduceIntroduction to map reduce
Introduction to map reduce
 
Hadoop MapReduce
Hadoop MapReduceHadoop MapReduce
Hadoop MapReduce
 
Introduction to Map Reduce
Introduction to Map ReduceIntroduction to Map Reduce
Introduction to Map Reduce
 
Using MapReduce for Large–scale Medical Image Analysis
Using MapReduce for Large–scale Medical Image AnalysisUsing MapReduce for Large–scale Medical Image Analysis
Using MapReduce for Large–scale Medical Image Analysis
 
Behm Shah Pagerank
Behm Shah PagerankBehm Shah Pagerank
Behm Shah Pagerank
 
Ling liu part 02:big graph processing
Ling liu part 02:big graph processingLing liu part 02:big graph processing
Ling liu part 02:big graph processing
 
Hadoop 101 for bioinformaticians
Hadoop 101 for bioinformaticiansHadoop 101 for bioinformaticians
Hadoop 101 for bioinformaticians
 
Big Data & Hadoop. Simone Leo (CRS4)
Big Data & Hadoop. Simone Leo (CRS4)Big Data & Hadoop. Simone Leo (CRS4)
Big Data & Hadoop. Simone Leo (CRS4)
 
Amazon-style shopping cart analysis using MapReduce on a Hadoop cluster
Amazon-style shopping cart analysis using MapReduce on a Hadoop clusterAmazon-style shopping cart analysis using MapReduce on a Hadoop cluster
Amazon-style shopping cart analysis using MapReduce on a Hadoop cluster
 
Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...
Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...
Hadoop World 2011: The Powerful Marriage of R and Hadoop - David Champagne, R...
 
Hadoop Internals
Hadoop InternalsHadoop Internals
Hadoop Internals
 
Hadoop Internals
Hadoop InternalsHadoop Internals
Hadoop Internals
 
Hadoop trainting-in-hyderabad@kelly technologies
Hadoop trainting-in-hyderabad@kelly technologiesHadoop trainting-in-hyderabad@kelly technologies
Hadoop trainting-in-hyderabad@kelly technologies
 

Recently uploaded

Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptkotipi9215
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsMehedi Hasan Shohan
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...aditisharan08
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyFrank van der Linden
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - InfographicHr365.us smith
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 

Recently uploaded (20)

Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
chapter--4-software-project-planning.ppt
chapter--4-software-project-planning.pptchapter--4-software-project-planning.ppt
chapter--4-software-project-planning.ppt
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
XpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software SolutionsXpertSolvers: Your Partner in Building Innovative Software Solutions
XpertSolvers: Your Partner in Building Innovative Software Solutions
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...Unit 1.1 Excite Part 1, class 9, cbse...
Unit 1.1 Excite Part 1, class 9, cbse...
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Engage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The UglyEngage Usergroup 2024 - The Good The Bad_The Ugly
Engage Usergroup 2024 - The Good The Bad_The Ugly
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Asset Management Software - Infographic
Asset Management Software - InfographicAsset Management Software - Infographic
Asset Management Software - Infographic
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 

Ieee eit-talk-large-scale-neural-modeling-in-map reduce-giraph

  • 1. Large-scale Neural ModelingLarge-scale Neural Modeling in MapReduce and Giraphin MapReduce and Giraph Co-authors Nicholas D. Spielman Neuroscience Program University of St. Thomas Presenter Shuo Yang Graduate Programs in Software University of St. Thomas Special thanks Bhabani Misra, PhD Graduate Programs in Software University of St. Thomas Jadin C. Jackson PhD Department of Biology University of St. Thomas Bradley S. Rubin, PhD Graduate Programs in Software University of St. Thomas
  • 2. Why Hadoop & What is Hadoop Why not supercomputers? Expensive Limited access Scalability Why Hadoop? Runs on commodity hardware Scalable Full-fledged eco-system & community Open-source implementation of MapReduce based on Java MapReduce Model Client Map Reduce HDFS Split Data Output Map Map Reduce Output
  • 3. ….... ….... ∑ I input currents from neighbors ∆vI1 I2 In currents to all neighbors Synaptic weight matrix 0 1000Time Step Neuron ID Simulation results 0 2500 Neural Model (Izhikevich model)
  • 4. ….... ….... ∑ I input currents from neighbors ∆vI1 I2 In currents to all neighbors Synaptic weight matrix 0 1000Time Step Neuron ID Simulation results 0 2500 Neural Model (Izhikevich model) This is a graph structure
  • 5. Mapper N1 I2 I3 N1 I2 I3 I2 I3 Reducer Mapper N2 I1 I3 N2 I1 I3 I1 I3 Mapper N3 I2 I1 N3 I2 I1 I2 I1 Reducer Reducer N1 I2 I3 N2 I1 I3 N3 I2 I1 sum currents to N1 sum currents to N2 sum currents to N3 update N1 update N2 update N3 HDFS initial input write back to HDFS N1 and its local structure N2 and its local structure N3 and its local structure Map Sort & Shuffle Reduce Basic MapReduce Implementation input from previous job
  • 6. Mapper N1 I2 I3 N1 I2 I3 I2 I3 Reducer Mapper N2 I1 I3 N2 I1 I3 I1 I3 Mapper N3 I2 I1 N3 I2 I1 I2 I1 Reducer Reducer N1 I2 I3 N2 I1 I3 N3 I2 I1 sum currents to N1 sum currents to N2 sum currents to N3 update N1 update N2 update N3 HDFS initial input write back to HDFS N1 and its local structure N2 and its local structure N3 and its local structure Map Sort & Shuffle Reduce Basic MapReduce Implementation input from previous job Problems: synaptic currents are sent directly to the reducers without local aggregation The graph structure is shuffled in each iteration
  • 7. N1 I2 I3 Mapper N2 I1 I3 N3 I2 I1 HDFS initial input Map Sort & Shuffle Reduce In-Mapper Combining (IMC, introduced by Lin & Schatz) N1 I2 I3 N2 I1 I3 N3 I2 I1 I1 I1 I2 I2 I3 I3 ∑ ∑ ∑ Reducer Reducer Reducer I3 N2 I1 I3 N3 I1 update N1 update N2 update N3 I2 I2
  • 8. N1 I2 I3 Mapper N2 I1 I3 N3 I2 I1 HDFS initial input Map Sort & Shuffle Reduce In-Mapper Combining (IMC, introduced by Lin & Schatz) N1 I2 I3 N2 I1 I3 N3 I2 I1 I1 I1 I2 I2 I3 I3 ∑ ∑ ∑ Reducer Reducer Reducer I3 N2 I1 I3 N3 I1 update N1 update N2 update N3 I2 I2 The graph structure is still shuffled!
  • 9. Mapper N1 I2 I3 I2 I3 Reducer Mapper N2 I1 I3 I1 I3 Mapper N3 I2 I1 I2 I1 Reducer Reducer N1 I2 I3 N2 I1 I3 N3 I2 I1 sum currents to N1 sum currents to N2 sum currents to N3 update N1 update N2 update N3 HDFS initial input write back to HDFS N1 and its local structure N2 and its local structure N3 and its local structure Schimmy (introduced by Lin & Schatz) N1 I2 I3 N2 I1 I3 N3 I2 I1 Map remotely read graph structure sort & shuffle Reduce
  • 10. Mapper N1 I2 I3 I2 I3 Reducer Mapper N2 I1 I3 I1 I3 Mapper N3 I2 I1 I2 I1 Reducer Reducer N1 I2 I3 N2 I1 I3 N3 I2 I1 sum currents to N1 sum currents to N2 sum currents to N3 update N1 update N2 update N3 HDFS initial input write back to HDFS N1 and its local structure N2 and its local structure N3 and its local structure Schimmy (introduced by Lin & Schatz) N1 I2 I3 N2 I1 I3 N3 I2 I1 Map remotely read graph structure sort & shuffle Reduce Problems: Remote reading from HDFS The graph structure is read and written in each iteration
  • 11. Mapper N1 I2 I3 I2 I3 Reducer Mapper N2 I1 I3 I1 I3 Mapper N3 I2 I1 I2 I1 Reducer Reducer N1 I2 I3 N2 I1 I3 N3 I2 I1 sum currents to N1 sum currents to N2 sum currents to N3 update N1 update N2 update N3 HDFS initial input write back to HDFS N1 and its local structure N2 and its local structure N3 and its local structure Schimmy (introduced by Lin & Schatz) N1 I2 I3 N2 I1 I3 N3 I2 I1 Map remotely read graph structure sort & shuffle Reduce Observation: The graph structure is read-only!
  • 12. Mapper N1 I2 I3 Reducer Mapper I1 I3 Mapper N3 I2 I1 Reducer Reducer N1 N2 N3 sum currents to N1 sum currents to N2 sum currents to N3 update N1 update N2 update N3 HDFS initial input write back to HDFS Mapper-side Schimmy N1 I2 I3 N2 I1 I3 N3 I2 I1 N2 Map sort & shuffle Reduce
  • 13. Drawbacks of Graph algorithm in MapReduce Non-intuitive and hard to implement Not efficiently expressed as iterative algorithms Not optimized for large numbers of iterations input from HDFS output to HDFS input from HDFS output to HDFS Mapper Intermediate files Reducer Iterate Startup Penalty Disk Penalty Disk Penalty Not optimized for large numbers of iterations
  • 14. Giraph N1 I2 I3 N2 I1 I3 N3 I2 I1 N1 I2 I3 N2 I1 I3 N3 I2 I1 H D F S Load input Synchronous barrier Synchronous barrier N1 I2 I3 N2 I1 I3 N3 I2 I1 H D F S …... Write results back Iterative graph processing system Powers Facebook graph search Highly scalable Based on BSP model Mapper-only job on Hadoop In-memory computation “Think like a vertex” More intuitive APIs
  • 15. Giraph N1 I2 I3 N2 I1 I3 N3 I2 I1 N1 I2 I3 N2 I1 I3 N3 I2 I1 H D F S Load input Synchronous barrier Synchronous barrier N1 I2 I3 N2 I1 I3 N3 I2 I1 H D F S …... Write results back Iterative graph processing system Powers Facebook graph search Highly scalable Based on BSP model Mapper-only job on Hadoop In-memory computation “Think like a NEURON” More intuitive APIs
  • 16. Comparison of running time of each iteration
  • 17. Comparison of speeds – 40 ms simulation 6% 0% -11% -48% -64% -91%
  • 18. Conclusion Hadoop is capable of modeling large-scale neural networks. Based on IMC and Schimmy, our Mapper-side Schimmy improves MapReduce graph algorithms Where graph structure is read-only. Vertex-centric approaches, such as, Giraph showed superior performance. However, # of iterations specified as a global variable Limited by memory per node Not widely adopted by industry
  • 19. Large-scale Neural ModelingLarge-scale Neural Modeling in MapReduce and Giraphin MapReduce and Giraph Co-authors Nicholas D. Spielman Neuroscience Program University of St. Thomas Presenter Shuo Yang Graduate Programs in Software University of St. Thomas Special thanks Bhabani Misra, PhD Graduate Programs in Software University of St. Thomas Jadin C. Jackson PhD Department of Biology University of St. Thomas Bradley S. Rubin, PhD Graduate Programs in Software University of St. Thomas
  • 20. Comparison of speeds – 40 ms simulation Comparison of speeds – 20 ms to 40 ms simulation