SlideShare a Scribd company logo
1 of 1
Download to read offline
Data-Centric, Extreme Scale
Computing using Legion
Richard Haney, Song Park,
Dale Shires ARL/CISD
Alex Aiken, et al Stanford University
S&T Campaign: Computational Sciences
Tier 2 Computing Sciences
Research Objective
• Task-Oriented/Data-Centric Approach For Parallel,
Distributed, Heterogeneous Computing
• Programming system that can address future Exascale
computing
Challenges
• How to leverage computational power of different
architectures such as multi-core CPU, GPU, and FPGA
• Dynamic partitioning for non-uniform distributions of
data (e.g. n-body simulations) generate load-balancing
issues
• Maintain correctness and optimize performance
Breaking million core with Stanford CTR
ARL Facilities and Capabilities Available
to Support Collaborative Research
• Computing Environment(s):
• 64 node heterogeneous cluster consisting of 48 IBM dx360M4
nodes, each with one Intel Phi 5110P and 16 dx360M4 nodes
each with one NVIDIA Kepler K20M GPU.
• Each node contained dual Intel Xeon E5-2670 (Sandy
Bridge) CPUs
• Publications:
• R. Haney, S. Park, D. Shires, “Building Task-Oriented Applications
– An Introduction to the Legion Programming System,” ARL
Technical Reports, 2014 (submitted for publication)
• M. Bauer, S. Treichler, E. Slaughter and A. Aiken, "Legion:
expressing locality and independence with logical regions," in
SC '12 Proceedings of the International Conference on High
Performance Computing, Networking, Storage and Analysis, Salt
Lake City, UT, 2012.
• S. Treichler, M. Bauer and A. Aiken, "Language Support for
Dynamic, Hierarchical Data Partitioning," in Proceedings of the
2013 ACM SIGPLAN international conference on Object oriented
programming systems languages & applications, Portland,
2013.
• ARL has years experience with HPC theory and application
Complementary Expertise/ Facilities/
Capabilities Sought in Collaboration
• Create optimal parallel algorithms with fault tolerance for
large, heterogeneous distributed systems
• Establish new paradigm for cross-architecture parallel
programming defining both portability and robustness
• Define programming system able to produce valid
computational results at extreme scales (e.g. Exascale)
Legion Logical Regions and Field/Index Structures
Task Distribution (2 processors)
APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED
APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED

More Related Content

What's hot

Application-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentApplication-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud Environment
Safayet Hossain
 
Annu Sharma Resume
Annu Sharma ResumeAnnu Sharma Resume
Annu Sharma Resume
Annu Sharma
 

What's hot (13)

Satwik mishra resume
Satwik mishra resumeSatwik mishra resume
Satwik mishra resume
 
Satwik resume
Satwik resumeSatwik resume
Satwik resume
 
Satwik mishra resume
Satwik mishra resumeSatwik mishra resume
Satwik mishra resume
 
Gray-Box Models for Performance Assessment of Spark Applications
Gray-Box Models for Performance Assessment of Spark ApplicationsGray-Box Models for Performance Assessment of Spark Applications
Gray-Box Models for Performance Assessment of Spark Applications
 
Application-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud EnvironmentApplication-Aware Big Data Deduplication in Cloud Environment
Application-Aware Big Data Deduplication in Cloud Environment
 
Claremont Report on Database Research: Research Directions (Anastasia Ailamaki)
Claremont Report on Database Research: Research Directions (Anastasia Ailamaki)Claremont Report on Database Research: Research Directions (Anastasia Ailamaki)
Claremont Report on Database Research: Research Directions (Anastasia Ailamaki)
 
Distributed system
Distributed systemDistributed system
Distributed system
 
Deep Hybrid DataCloud
Deep Hybrid DataCloudDeep Hybrid DataCloud
Deep Hybrid DataCloud
 
DGterzo
DGterzoDGterzo
DGterzo
 
Quantifying Uncertainty in Agent-Based Models for Smart City Forecasts
Quantifying Uncertainty in Agent-Based Models for Smart City ForecastsQuantifying Uncertainty in Agent-Based Models for Smart City Forecasts
Quantifying Uncertainty in Agent-Based Models for Smart City Forecasts
 
Cross cloud map reduce for big data
Cross cloud map reduce for big dataCross cloud map reduce for big data
Cross cloud map reduce for big data
 
Twister4Azure - Iterative MapReduce for Azure Cloud
Twister4Azure - Iterative MapReduce for Azure CloudTwister4Azure - Iterative MapReduce for Azure Cloud
Twister4Azure - Iterative MapReduce for Azure Cloud
 
Annu Sharma Resume
Annu Sharma ResumeAnnu Sharma Resume
Annu Sharma Resume
 

Similar to CS7_HANEY_DataCentricExtremeScaleComputLegion PAO edit

NASA Advanced Computing Environment for Science & Engineering
NASA Advanced Computing Environment for Science & EngineeringNASA Advanced Computing Environment for Science & Engineering
NASA Advanced Computing Environment for Science & Engineering
inside-BigData.com
 
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
inside-BigData.com
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design space
jsvetter
 
Belak_ICME_June02015
Belak_ICME_June02015Belak_ICME_June02015
Belak_ICME_June02015
Jim Belak
 

Similar to CS7_HANEY_DataCentricExtremeScaleComputLegion PAO edit (20)

Scientific
Scientific Scientific
Scientific
 
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習 Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
Azure 機器學習 - 使用Python, R, Spark, CNTK 深度學習
 
High Performance Computing and Big Data
High Performance Computing and Big Data High Performance Computing and Big Data
High Performance Computing and Big Data
 
Exascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing WorldExascale Computing Project - Driving a HUGE Change in a Changing World
Exascale Computing Project - Driving a HUGE Change in a Changing World
 
Future of hpc
Future of hpcFuture of hpc
Future of hpc
 
Scientific Application Development and Early results on Summit
Scientific Application Development and Early results on SummitScientific Application Development and Early results on Summit
Scientific Application Development and Early results on Summit
 
AI Super computer update
AI Super computer update AI Super computer update
AI Super computer update
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
NASA Advanced Computing Environment for Science & Engineering
NASA Advanced Computing Environment for Science & EngineeringNASA Advanced Computing Environment for Science & Engineering
NASA Advanced Computing Environment for Science & Engineering
 
grid computing
grid computinggrid computing
grid computing
 
09 The Extreme-scale Scientific Software Stack for Collaborative Open Source
09 The Extreme-scale Scientific Software Stack for Collaborative Open Source09 The Extreme-scale Scientific Software Stack for Collaborative Open Source
09 The Extreme-scale Scientific Software Stack for Collaborative Open Source
 
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
How to Design Scalable HPC, Deep Learning, and Cloud Middleware for Exascale ...
 
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
Evolving Cyberinfrastructure, Democratizing Data, and Scaling AI to Catalyze ...
 
Overview of the Exascale Additive Manufacturing Project
Overview of the Exascale Additive Manufacturing ProjectOverview of the Exascale Additive Manufacturing Project
Overview of the Exascale Additive Manufacturing Project
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in Python
 
Exploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design spaceExploring emerging technologies in the HPC co-design space
Exploring emerging technologies in the HPC co-design space
 
AI for Science
AI for ScienceAI for Science
AI for Science
 
Designing HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale SystemsDesigning HPC & Deep Learning Middleware for Exascale Systems
Designing HPC & Deep Learning Middleware for Exascale Systems
 
Belak_ICME_June02015
Belak_ICME_June02015Belak_ICME_June02015
Belak_ICME_June02015
 

More from Richard Haney

More from Richard Haney (6)

Visualizing STEP Files
Visualizing STEP FilesVisualizing STEP Files
Visualizing STEP Files
 
JAVA_STEP_V7
JAVA_STEP_V7JAVA_STEP_V7
JAVA_STEP_V7
 
ADA613693
ADA613693ADA613693
ADA613693
 
CMES201308262603_16563
CMES201308262603_16563CMES201308262603_16563
CMES201308262603_16563
 
ARL-TR-7315
ARL-TR-7315ARL-TR-7315
ARL-TR-7315
 
V3I8-0460
V3I8-0460V3I8-0460
V3I8-0460
 

CS7_HANEY_DataCentricExtremeScaleComputLegion PAO edit

  • 1. Data-Centric, Extreme Scale Computing using Legion Richard Haney, Song Park, Dale Shires ARL/CISD Alex Aiken, et al Stanford University S&T Campaign: Computational Sciences Tier 2 Computing Sciences Research Objective • Task-Oriented/Data-Centric Approach For Parallel, Distributed, Heterogeneous Computing • Programming system that can address future Exascale computing Challenges • How to leverage computational power of different architectures such as multi-core CPU, GPU, and FPGA • Dynamic partitioning for non-uniform distributions of data (e.g. n-body simulations) generate load-balancing issues • Maintain correctness and optimize performance Breaking million core with Stanford CTR ARL Facilities and Capabilities Available to Support Collaborative Research • Computing Environment(s): • 64 node heterogeneous cluster consisting of 48 IBM dx360M4 nodes, each with one Intel Phi 5110P and 16 dx360M4 nodes each with one NVIDIA Kepler K20M GPU. • Each node contained dual Intel Xeon E5-2670 (Sandy Bridge) CPUs • Publications: • R. Haney, S. Park, D. Shires, “Building Task-Oriented Applications – An Introduction to the Legion Programming System,” ARL Technical Reports, 2014 (submitted for publication) • M. Bauer, S. Treichler, E. Slaughter and A. Aiken, "Legion: expressing locality and independence with logical regions," in SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Salt Lake City, UT, 2012. • S. Treichler, M. Bauer and A. Aiken, "Language Support for Dynamic, Hierarchical Data Partitioning," in Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications, Portland, 2013. • ARL has years experience with HPC theory and application Complementary Expertise/ Facilities/ Capabilities Sought in Collaboration • Create optimal parallel algorithms with fault tolerance for large, heterogeneous distributed systems • Establish new paradigm for cross-architecture parallel programming defining both portability and robustness • Define programming system able to produce valid computational results at extreme scales (e.g. Exascale) Legion Logical Regions and Field/Index Structures Task Distribution (2 processors) APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED APPROVED FOR PUBLIC RELEASE; DISTRIBUTION IS UNLIMITED