This document discusses advances in scalable adaptive mesh refinement (AMR) for extreme scale simulations using Enzo-P/Cello. Enzo-P uses numerical solvers from the Enzo cosmology code within the Cello AMR framework. Cello implements a forest-of-octrees AMR approach and uses Charm++ for parallelization. Weak scaling tests on Blue Waters showed good scaling to 1.7 trillion cells on 262k cores. Future work includes improving gravity solvers and local timestepping to enable trillion particle hydrodynamic cosmology simulations.
AV digitization projects in Mediateka at RTV Slovenija - NoTimeToWait symposi...Bojan Kosi
Avdio, video and film digitization projects and workflows in Mediateka at RTV Slovenija.
Presentation from NoTimeToWait symposium, Austrian Film Museum, Vienna, November 10, 2017
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014SCAPE Project
Hadoop has been used at the State and University Library, Denmark, in connection with an experiment on the migration of a large collection of audio files from mp3 to wav. This experiment was presented by Bolette Ammitzbøll Jurik at ‘SCAPE Information Day at the State and University Library, Denmark’, on 25 June 2014. The information day introduced the EU-funded project SCAPE (Scalable Preservation Environments) and its tools and services to the participants.
The experiment used Hadoop and Taverna but also xcorrSound waveform-compare which is a small tool developed within SCAPE to compare the content of audio files.
Read more about the event in this blog post, http://bit.ly/SCAPE_SB_Demo.
This ppt gives an overview of the recent MIT paper on Sparse Fourier transform using which data can be processed 10 or 100 times faster than the traditional Fast Fourier Transform. This is possible as the FFT has a complexity of O(nlogn), where the sparse FT has a potentially lower complexity of O(klogn) in the sparse spectrum case.
Looking into the past - feature extraction from historic maps using Python, O...James Crone
Tutorial presentation providing an overview of extracting geospatial features from scanned historic maps in an automated fashion using Python, OpenCV and PostGIS.
AV digitization projects in Mediateka at RTV Slovenija - NoTimeToWait symposi...Bojan Kosi
Avdio, video and film digitization projects and workflows in Mediateka at RTV Slovenija.
Presentation from NoTimeToWait symposium, Austrian Film Museum, Vienna, November 10, 2017
Migration of audio files using Hadoop, SCAPE Information Day, 25 June 2014SCAPE Project
Hadoop has been used at the State and University Library, Denmark, in connection with an experiment on the migration of a large collection of audio files from mp3 to wav. This experiment was presented by Bolette Ammitzbøll Jurik at ‘SCAPE Information Day at the State and University Library, Denmark’, on 25 June 2014. The information day introduced the EU-funded project SCAPE (Scalable Preservation Environments) and its tools and services to the participants.
The experiment used Hadoop and Taverna but also xcorrSound waveform-compare which is a small tool developed within SCAPE to compare the content of audio files.
Read more about the event in this blog post, http://bit.ly/SCAPE_SB_Demo.
This ppt gives an overview of the recent MIT paper on Sparse Fourier transform using which data can be processed 10 or 100 times faster than the traditional Fast Fourier Transform. This is possible as the FFT has a complexity of O(nlogn), where the sparse FT has a potentially lower complexity of O(klogn) in the sparse spectrum case.
Looking into the past - feature extraction from historic maps using Python, O...James Crone
Tutorial presentation providing an overview of extracting geospatial features from scanned historic maps in an automated fashion using Python, OpenCV and PostGIS.
Conference: 43rd Annual Conference of the IEEE
Industrial Electronics Society (IECON2017)
29 October – 1 November, 2017
China National Convention Center, Beijing, China
Title of the paper: An Architecture for Implementing
Private Local Automation Clouds Built by CPS
Authors: Borja Ramis Ferrer, Jose L. Martinez Lastra
Programming Languages & Tools for Higher Performance & ProductivityLinaro
By Hitoshi Murai, RIKEN AICS
For higher performance and productivity of HPC systems, it is important to provide users with good programming environment including languages, compilers, and tools. In this talk, the programming model of the post-K supercomputer will be shown.
Hitoshi Murai Bio
Hitoshi Murai received a master's degree in information science from Kyoto University in 1996. He worked as a software developer in NEC from 1996 to 2010. He received a Ph.D degree in computer science from University of Tsukuba in 2010. He is currently a research scientist of the programming environment research team and the Flagship 2020 project in Advanced Institute for Computational Science, RIKEN. His research interests include compilers and parallel programming languages.
Email
h-murai@riken.jp
For more info on The Linaro High Performance Computing (HPC) visit https://www.linaro.org/sig/hpc/
Yuki Oyama - Markov assignment for a pedestrian activity-based network design...Yuki Oyama
Oyama, Y., Hato, E., Scarinci, R., Bierlaire, M. (2017) Markov assignment for a pedestrian activity-based network design problem. The 6th symposium arranged by European Association for Research in Transportation (hEART), Haifa, Israel.
In this deck from the GoingARM workshop at SC17, Filippo Mantovani describes the contributions of the Barcelona Supercomputing Center to the European Mont-Blanc project.
"Since 2011, Mont-Blanc has pushed the adoption of Arm technology in High Performance Computing, deploying Arm-based prototypes, enhancing system software ecosystem and projecting performance of current systems for developing new, more powerful and less power hungry HPC computing platforms based on Arm SoC. In this talk, Filippo introduces the last Mont-Blanc system, called Dibona, designed and integrated by the coordinator and industrial partner of the project, Bull/ATOS. He also talks about tests performed at BSC of the Arm software tools (HPC compiler and mathematical libraries) as well as the Dynamic Load Balancing (DLB) technique and the Multiscale Simulator Architecture (MUSA)."
Watch the video: https://wp.me/p3RLHQ-i6o
Learn more: http://www.goingarm.com/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
Presentation of few recent papers on Deep Learning ... in particular Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding, Song Han, Huizi Mao, William J. Dally International Conference on Learning Representations ICLR2016
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facilityinside-BigData.com
In this deck from the Swiss HPC Conference, Mark Wilkinson presents: 40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility.
"DiRAC is the integrated supercomputing facility for theoretical modeling and HPC-based research in particle physics, and astrophysics, cosmology, and nuclear physics, all areas in which the UK is world-leading. DiRAC provides a variety of compute resources, matching machine architecture to the algorithm design and requirements of the research problems to be solved. As a single federated Facility, DiRAC allows more effective and efficient use of computing resources, supporting the delivery of the science programs across the STFC research communities. It provides a common training and consultation framework and, crucially, provides critical mass and a coordinating structure for both small- and large-scale cross-discipline science projects, the technical support needed to run and develop a distributed HPC service, and a pool of expertise to support knowledge transfer and industrial partnership projects. The on-going development and sharing of best-practice for the delivery of productive, national HPC services with DiRAC enables STFC researchers to produce world-leading science across the entire STFC science theory program."
Watch the video: https://wp.me/p3RLHQ-k94
Learn more: https://dirac.ac.uk/
and
http://hpcadvisorycouncil.com/events/2019/swiss-workshop/agenda.php
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
A brief introduction to deep learning, providing rough interpretation to deep neural networks and simple implementations with Keras for deep learning beginners.
Kernel Recipes 2017 - EBPF and XDP - Eric LeblondAnne Nicolas
Berkeley Packet Filter is an old friend for most people that deal with network under Linux. But its extended version eBPF is completely redefining the scope of usage and interaction with the kernel. It can indeed be used to instrument most parts of the kernel. This goes from network tracing to process or I/O monitoring.
This talk will provide an overview of eBPF, from concept to tools like BCC. It will then focus on XDP for eXtreme Data Path and the possible applications in term of networking provided by this new framework.
Eric Leblond, Stamus Network
External Aerodynamic Optimization Using ANSYS Mesh MorphingMarco E. Biancolini
Aero development requires the exploration of many shape variations, so a highly reliable and fully automated workflow using HPC is needed. This presentation will show how mesh morphing can be used to make the ANSYS CFD model parametric, and how the DesignXplorer optimizer cann be used to automate the calculation.
Code reviews are vital for ensuring good code quality. They serve as one of our last lines of defense against bugs and subpar code reaching production.
Yet, they often turn into annoying tasks riddled with frustration, hostility, unclear feedback and lack of standards. How can we improve this crucial process?
In this session we will cover:
- The Art of Effective Code Reviews
- Streamlining the Review Process
- Elevating Reviews with Automated Tools
By the end of this presentation, you'll have the knowledge on how to organize and improve your code review proces
Navigating the Metaverse: A Journey into Virtual Evolution"Donna Lenk
Join us for an exploration of the Metaverse's evolution, where innovation meets imagination. Discover new dimensions of virtual events, engage with thought-provoking discussions, and witness the transformative power of digital realms."
More Related Content
Similar to Scalable AMR - HPC China 2017, Hefei China
Conference: 43rd Annual Conference of the IEEE
Industrial Electronics Society (IECON2017)
29 October – 1 November, 2017
China National Convention Center, Beijing, China
Title of the paper: An Architecture for Implementing
Private Local Automation Clouds Built by CPS
Authors: Borja Ramis Ferrer, Jose L. Martinez Lastra
Programming Languages & Tools for Higher Performance & ProductivityLinaro
By Hitoshi Murai, RIKEN AICS
For higher performance and productivity of HPC systems, it is important to provide users with good programming environment including languages, compilers, and tools. In this talk, the programming model of the post-K supercomputer will be shown.
Hitoshi Murai Bio
Hitoshi Murai received a master's degree in information science from Kyoto University in 1996. He worked as a software developer in NEC from 1996 to 2010. He received a Ph.D degree in computer science from University of Tsukuba in 2010. He is currently a research scientist of the programming environment research team and the Flagship 2020 project in Advanced Institute for Computational Science, RIKEN. His research interests include compilers and parallel programming languages.
Email
h-murai@riken.jp
For more info on The Linaro High Performance Computing (HPC) visit https://www.linaro.org/sig/hpc/
Yuki Oyama - Markov assignment for a pedestrian activity-based network design...Yuki Oyama
Oyama, Y., Hato, E., Scarinci, R., Bierlaire, M. (2017) Markov assignment for a pedestrian activity-based network design problem. The 6th symposium arranged by European Association for Research in Transportation (hEART), Haifa, Israel.
In this deck from the GoingARM workshop at SC17, Filippo Mantovani describes the contributions of the Barcelona Supercomputing Center to the European Mont-Blanc project.
"Since 2011, Mont-Blanc has pushed the adoption of Arm technology in High Performance Computing, deploying Arm-based prototypes, enhancing system software ecosystem and projecting performance of current systems for developing new, more powerful and less power hungry HPC computing platforms based on Arm SoC. In this talk, Filippo introduces the last Mont-Blanc system, called Dibona, designed and integrated by the coordinator and industrial partner of the project, Bull/ATOS. He also talks about tests performed at BSC of the Arm software tools (HPC compiler and mathematical libraries) as well as the Dynamic Load Balancing (DLB) technique and the Multiscale Simulator Architecture (MUSA)."
Watch the video: https://wp.me/p3RLHQ-i6o
Learn more: http://www.goingarm.com/
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
Presentation of few recent papers on Deep Learning ... in particular Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding, Song Han, Huizi Mao, William J. Dally International Conference on Learning Representations ICLR2016
40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facilityinside-BigData.com
In this deck from the Swiss HPC Conference, Mark Wilkinson presents: 40 Powers of 10 - Simulating the Universe with the DiRAC HPC Facility.
"DiRAC is the integrated supercomputing facility for theoretical modeling and HPC-based research in particle physics, and astrophysics, cosmology, and nuclear physics, all areas in which the UK is world-leading. DiRAC provides a variety of compute resources, matching machine architecture to the algorithm design and requirements of the research problems to be solved. As a single federated Facility, DiRAC allows more effective and efficient use of computing resources, supporting the delivery of the science programs across the STFC research communities. It provides a common training and consultation framework and, crucially, provides critical mass and a coordinating structure for both small- and large-scale cross-discipline science projects, the technical support needed to run and develop a distributed HPC service, and a pool of expertise to support knowledge transfer and industrial partnership projects. The on-going development and sharing of best-practice for the delivery of productive, national HPC services with DiRAC enables STFC researchers to produce world-leading science across the entire STFC science theory program."
Watch the video: https://wp.me/p3RLHQ-k94
Learn more: https://dirac.ac.uk/
and
http://hpcadvisorycouncil.com/events/2019/swiss-workshop/agenda.php
Sign up for our insideHPC Newsletter: http://insidehpc.com/newsletter
A brief introduction to deep learning, providing rough interpretation to deep neural networks and simple implementations with Keras for deep learning beginners.
Kernel Recipes 2017 - EBPF and XDP - Eric LeblondAnne Nicolas
Berkeley Packet Filter is an old friend for most people that deal with network under Linux. But its extended version eBPF is completely redefining the scope of usage and interaction with the kernel. It can indeed be used to instrument most parts of the kernel. This goes from network tracing to process or I/O monitoring.
This talk will provide an overview of eBPF, from concept to tools like BCC. It will then focus on XDP for eXtreme Data Path and the possible applications in term of networking provided by this new framework.
Eric Leblond, Stamus Network
External Aerodynamic Optimization Using ANSYS Mesh MorphingMarco E. Biancolini
Aero development requires the exploration of many shape variations, so a highly reliable and fully automated workflow using HPC is needed. This presentation will show how mesh morphing can be used to make the ANSYS CFD model parametric, and how the DesignXplorer optimizer cann be used to automate the calculation.
Code reviews are vital for ensuring good code quality. They serve as one of our last lines of defense against bugs and subpar code reaching production.
Yet, they often turn into annoying tasks riddled with frustration, hostility, unclear feedback and lack of standards. How can we improve this crucial process?
In this session we will cover:
- The Art of Effective Code Reviews
- Streamlining the Review Process
- Elevating Reviews with Automated Tools
By the end of this presentation, you'll have the knowledge on how to organize and improve your code review proces
Navigating the Metaverse: A Journey into Virtual Evolution"Donna Lenk
Join us for an exploration of the Metaverse's evolution, where innovation meets imagination. Discover new dimensions of virtual events, engage with thought-provoking discussions, and witness the transformative power of digital realms."
Enhancing Research Orchestration Capabilities at ORNL.pdfGlobus
Cross-facility research orchestration comes with ever-changing constraints regarding the availability and suitability of various compute and data resources. In short, a flexible data and processing fabric is needed to enable the dynamic redirection of data and compute tasks throughout the lifecycle of an experiment. In this talk, we illustrate how we easily leveraged Globus services to instrument the ACE research testbed at the Oak Ridge Leadership Computing Facility with flexible data and task orchestration capabilities.
In the ever-evolving landscape of technology, enterprise software development is undergoing a significant transformation. Traditional coding methods are being challenged by innovative no-code solutions, which promise to streamline and democratize the software development process.
This shift is particularly impactful for enterprises, which require robust, scalable, and efficient software to manage their operations. In this article, we will explore the various facets of enterprise software development with no-code solutions, examining their benefits, challenges, and the future potential they hold.
Understanding Nidhi Software Pricing: A Quick Guide 🌟
Choosing the right software is vital for Nidhi companies to streamline operations. Our latest presentation covers Nidhi software pricing, key factors, costs, and negotiation tips.
📊 What You’ll Learn:
Key factors influencing Nidhi software price
Understanding the true cost beyond the initial price
Tips for negotiating the best deal
Affordable and customizable pricing options with Vector Nidhi Software
🔗 Learn more at: www.vectornidhisoftware.com/software-for-nidhi-company/
#NidhiSoftwarePrice #NidhiSoftware #VectorNidhi
Unleash Unlimited Potential with One-Time Purchase
BoxLang is more than just a language; it's a community. By choosing a Visionary License, you're not just investing in your success, you're actively contributing to the ongoing development and support of BoxLang.
Large Language Models and the End of ProgrammingMatt Welsh
Talk by Matt Welsh at Craft Conference 2024 on the impact that Large Language Models will have on the future of software development. In this talk, I discuss the ways in which LLMs will impact the software industry, from replacing human software developers with AI, to replacing conventional software with models that perform reasoning, computation, and problem-solving.
We describe the deployment and use of Globus Compute for remote computation. This content is aimed at researchers who wish to compute on remote resources using a unified programming interface, as well as system administrators who will deploy and operate Globus Compute services on their research computing infrastructure.
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Mind IT Systems
Healthcare providers often struggle with the complexities of chronic conditions and remote patient monitoring, as each patient requires personalized care and ongoing monitoring. Off-the-shelf solutions may not meet these diverse needs, leading to inefficiencies and gaps in care. It’s here, custom healthcare software offers a tailored solution, ensuring improved care and effectiveness.
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeAftab Hussain
Understanding variable roles in code has been found to be helpful by students
in learning programming -- could variable roles help deep neural models in
performing coding tasks? We do an exploratory study.
- These are slides of the talk given at InteNSE'23: The 1st International Workshop on Interpretability and Robustness in Neural Software Engineering, co-located with the 45th International Conference on Software Engineering, ICSE 2023, Melbourne Australia
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Globus
Large Language Models (LLMs) are currently the center of attention in the tech world, particularly for their potential to advance research. In this presentation, we'll explore a straightforward and effective method for quickly initiating inference runs on supercomputers using the vLLM tool with Globus Compute, specifically on the Polaris system at ALCF. We'll begin by briefly discussing the popularity and applications of LLMs in various fields. Following this, we will introduce the vLLM tool, and explain how it integrates with Globus Compute to efficiently manage LLM operations on Polaris. Attendees will learn the practical aspects of setting up and remotely triggering LLMs from local machines, focusing on ease of use and efficiency. This talk is ideal for researchers and practitioners looking to leverage the power of LLMs in their work, offering a clear guide to harnessing supercomputing resources for quick and effective LLM inference.
Check out the webinar slides to learn more about how XfilesPro transforms Salesforce document management by leveraging its world-class applications. For more details, please connect with sales@xfilespro.com
If you want to watch the on-demand webinar, please click here: https://www.xfilespro.com/webinars/salesforce-document-management-2-0-smarter-faster-better/
OpenMetadata Community Meeting - 5th June 2024OpenMetadata
The OpenMetadata Community Meeting was held on June 5th, 2024. In this meeting, we discussed about the data quality capabilities that are integrated with the Incident Manager, providing a complete solution to handle your data observability needs. Watch the end-to-end demo of the data quality features.
* How to run your own data quality framework
* What is the performance impact of running data quality frameworks
* How to run the test cases in your own ETL pipelines
* How the Incident Manager is integrated
* Get notified with alerts when test cases fail
Watch the meeting recording here - https://www.youtube.com/watch?v=UbNOje0kf6E
E-commerce Application Development Company.pdfHornet Dynamics
Your business can reach new heights with our assistance as we design solutions that are specifically appropriate for your goals and vision. Our eCommerce application solutions can digitally coordinate all retail operations processes to meet the demands of the marketplace while maintaining business continuity.
Need for Speed: Removing speed bumps from your Symfony projects ⚡️Łukasz Chruściel
No one wants their application to drag like a car stuck in the slow lane! Yet it’s all too common to encounter bumpy, pothole-filled solutions that slow the speed of any application. Symfony apps are not an exception.
In this talk, I will take you for a spin around the performance racetrack. We’ll explore common pitfalls - those hidden potholes on your application that can cause unexpected slowdowns. Learn how to spot these performance bumps early, and more importantly, how to navigate around them to keep your application running at top speed.
We will focus in particular on tuning your engine at the application level, making the right adjustments to ensure that your system responds like a well-oiled, high-performance race car.
Need for Speed: Removing speed bumps from your Symfony projects ⚡️
Scalable AMR - HPC China 2017, Hefei China
1. Advances in Scalable Adaptive
Mesh Refinement (AMR) at
Extreme Scales
Prof. Michael L Norman
Director, San Diego Supercomputer Center
University of California, San Diego
Developer: Dr. James Bordner
Supported by NSF grants SI2-SSE-1440709, PHY-1104819 and AST-0808184.
10/19/2017 M. L. Norman - HPC China 2017, Hefei 1
2. My intrepid partner in this journey
• James Bordner
• PhD CS UIUC, 1999
• C++ programmer
extraordinaire
• Enzo-P/Cello is entirely his
design and implementation
10/19/2017 M. L. Norman - Charm++ Workshop 2017 2
3. Motivation : Extreme Scale Numerical Cosmology
• Dark matter only N-body
simulations have crossed the
1012 particle threshold on the
world’s largest
supercomputers
• Hydrodynamic cosmology
applications are lagging behind
N-body simulations
• This is due to the lack of
extreme scale AMR
frameworks
1 trillion particle dark matter simulation on
IBM BG/Q, Habib et al. (2013)
10/19/2017 M. L. Norman - HPC China 2017, Hefei 3
5. Enzo: AMR Hydrodynamic Cosmology Code
http://enzo-project.org
• Enzo code under
continuous development
since 1994
– First hydrodynamic
cosmological AMR code
– Hundreds of users
• Rich set of physics solvers
(hydro, N-body, radiation
transport, chemistry,…)
• Have done simulations with
1012 dynamic range and 42
levels
First Stars
First Galaxies Reionization
10/19/2017 M. L. Norman - HPC China 2017, Hefei 5
6. Extreme AMR: 1012 dynamic range from 42
levels of refinement
• Gordon Bell prize
finalist 2001
• Formation of the first
star in the universe
10/19/2017 M. L. Norman - HPC China 2017, Hefei 6
Bryan, Norman & Abel 2001
7. Enzo uses structured AMR (Berger & Colella 1989)
10/19/2017 M. L. Norman - HPC China 2arrier017, Hefei 7
• Unequal patch sizes
• Arbitrary patch locations
• Complex parent-child-sibling relationships
barriers to scalability
Structured AMR produces
Norman & Bryan 1998
8. time
∆t
∆t/2 ∆t/2
∆t/4 ∆t/4 ∆t/4 ∆t/4
“W cycle”
Serialization over level
updates also limits
scalability and performance
10/19/2017 M. L. Norman - HPC China 2017, Hefei 8
9. Adopted Strategy
• Keep the best part of Enzo (numerical solvers) and
replace the AMR infrastructure
• Implement using modern OOP best practices for
modularity and extensibility
• Use the best available scalable AMR algorithm
• Move from bulk synchronous to data-driven
asynchronous execution model to support patch
adaptive timestepping
• Leverage parallel runtimes that support this execution
model, and have a path to exascale
• Make AMR software library application-independent so
others can use it
10/19/2017 M. L. Norman - HPC China 2017, Hefei 10
10. Software Architecture
10/19/2017 M. L. Norman - HPC China 2017, Hefei 11
Enzo numerical solvers (Enzo-P)
Forest-of-octrees AMR (Cello)
Charm++
Hardware (heterogeneous, hierarchical)
11. (C++)
10/19/2017 M. L. Norman - HPC China 2017, Hefei 13
Object-oriented design implements “separation of concerns”, enhancing extensibility,
maintainability, understandability
12. Forest (=Array) of Octrees
Burstedde, Wilcox, Gattas 2011
10/19/2017 M. L. Norman - HPC China 2017, Hefei 14
2 x 2 x 2 trees 6 x 2 x 2 trees
unrefined
tree
refined
tree
13. p4est weak scaling: mantle convection
Burstedde et al. (2010), Gordon Bell prize finalist paper
10/19/2017 M. L. Norman - HPC China 2017, Hefei 15
14. What makes it so scalable?
Fully distributed data structure; no parent-child
10/19/2017 M. L. Norman - HPC China 2017, Hefei 16
Burstedde, Wilcox, Gattas 2011
17. How does Cello use Charm++ to
implement FOT?
• A forest is array of octrees
of arbitrary size K x L x M
• An octree has leaf nodes
which are blocks (N x N x N)
• Each block is a chare (unit
of sequential work)
• The entire FOT is stored as a
chare array using a bit index
scheme
• Chare arrays are fully
distributed data structures
in Charm++
10/19/2017 M. L. Norman - HPC China 2017, Hefei 19
2 x 2 x 2 tree
N x N x N block
18. 10/19/2017 M. L. Norman - HPC China 2017, Hefei
20
• Each leaf node of the tree is a block
• Each block is a chare
• The forest of trees is represented as a chare array
20. Demonstration of Enzo-P/Cello
10/19/2017 M. L. Norman - HPC China 2017, Hefei 24
Quadtree AMR mesh – 5 levels
Effective resolution: 4096 x 2048 mesh
Every square is a 162 block, communicating and executing asynchronously with neighbors
21. WEAK SCALING TEST –
HOW BIG AN AMR MESH CAN WE DO?
10/19/2017 M. L. Norman - HPC China 2017, Hefei 29
22. A 3x5 array of blast waves
10/19/2017 M. L. Norman - HPC China 2017, Hefei 30
Total energy
23. Unit cell: 1 tree per core
201 blocks/tree, 323 cells/block
10/19/2017 M. L. Norman - Charm++ Workshop 2017 32
28. Toward Extreme Scale AMR
Hydrodynamic Cosmology
10/19/2017 M. L. Norman - HPC China 2017, Hefei 37
Scalable
hydrodynamics
Scalable N-body
dynamics
Scalable
gravity
36. Hydrodynamic Cosmology Scaling Tests
• Uniform grid only
• Weak and strong scaling
• 323 blocks/chare
• 1, 8, 64 chares/core
• 643 to 20483 meshes
• p=8 to 128k cores
10/19/2017 M. L. Norman - HPC China 2017, Hefei 45
Density projection in 5123 simulation
39. Toward Extreme Scale AMR
Hydrodynamic Cosmology
10/19/2017 M. L. Norman - HPC China 2017, Hefei 48
Scalable
hydrodynamics
Scalable N-body
dynamics
Scalable
gravity
uniform
grid only
40. Path Forward
• Finish debugging scalable AMR gravity solver
• Implement block-local timestepping
• Experiment with Charm’s built-in DLB schemes
and dynamic execution
• Do a 1 trillion cell/particle hydro cosmology
simulation to demonstrate capability
• Publish!
10/19/2017 M. L. Norman - HPC China 2017, Hefei 57
41. Resources
• Project site: http://www.cello-project.org
• Source code: https://bitbucket.org/cello-project
• Tutorials: on project site
10/19/2017 M. L. Norman - HPC China 2017, Hefei 58