SlideShare a Scribd company logo
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
The Upcoming HPC Evolution
Los Alamos National Lab, Scientist/HPC Engineer
Joseph ‘Joshi’ Fullop
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Joseph ‘Joshi’ Fullop
2
Joshi a high-performance computing (HPC)
scientist/engineer in scheduling, monitoring, and
analytics at Los Alamos National Lab.
Formerly with the National Center for
Supercomputing Applications (NCSA) for 19 years.
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
▪ Department of Energy (DoE) weapons laboratory with priorities set by National Nuclear
Security Administration (NNSA)
▪ Established in 1943 as site Y of the Manhattan Project
▪ In 1992, the U.S. stopped full-scale testing of nuclear weapons, which means using other
methods to assure the safety of the U.S. stockpile and requiring a tremendous amount of
science, technology, computational horsepower, and analytical tools.
▪ We perform work for the Department of Defense (DoD), Intelligence Community (IC), and
Department of Homeland Security (DHS), among others. As a result, our strategy reflects
U.S. priorities spanning nuclear security, intelligence, defense, emergency response,
nonproliferation, counterterrorism, energy security, emerging threats, and environmental
management.
3LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
SECTION 1
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
What is HPC?
▪ High Performance Computing
o Delivering extreme scale, parallel, floating-point operations for scientific, engineering
and business endeavors using supercomputers.
o 10s of thousands servers, 300k-1M+ cores, ~20PetaFlops, ~80PB storage
o Many different combinations and use of technologies to serve the machine’s purpose,
which is driven by the applications that is serves. (e.g. GPUs, FPGAs)
o Application architectures mostly based on Message Passing Interface (MPI)
▪ 20 PetaFlops: If you did one 14-digit calculation per second without
stopping to eat or sleep, it would take 634 million years to do what a
supercomputer can do in 1 second.
5LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Two Examples of HPC Supercomputers
6
▪ Trinity @ LANL
~20k nodes
http://www.lanl.gov/projects/trinity/
▪ Blue Waters @ NCSA
~27k nodes
http://www.ncsa.illinois.edu/enabling/bluewaters
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
HPV vs Cloud Computing
▪ Some vast over generalizations
o Network
• HPC: Extremely low latency AND High Bandwidth. Multi-dimensional architectures.
• Cloud: Tends to be more commodity based.
o Parallel File System
• HPC: Focal point of contention. Major factor of scalability.
• Cloud: May not even be required.
o Workload
• HPC: GB in / TB out. Not always embarrassingly parallel.
• Cloud: GB or TB in / KB out. Tends to be more parallel or asynchronous by nature.
o Infrastructure
• HPC: Generally persistent
• Cloud: Can be highly dynamic.
7LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
New HPC Application Architecture
8
▪ Traditional HPC applications
oSimulation based
• Create a model.
• Feed it a data set.
• Turn the crank.
• Analyze what comes out.
oMeasured by number of
calculations and model size.
▪ Data Analytics applications
oQuery based
• Pose a question.
• Traverse dataset.
• Do statistics.
• Make a plausible
determination.
oMeasured by the amount of
data assessed.
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Supercomputer Job Cycle
9
Job
Launch
Compute
Store
Results
Visualize
Job Launch
• Massive number & bandwidth of reads
• Node communication & synchronization
Compute
• Requires very low latency network
• Intolerant to interruptions (jitter)
• Potential check-pointing (burst writes)
Store Results
• Massive file system write(s)
Visualize
• Significantly different I/O patterns
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Opportunity in HPC
1. Data engines become part of the infrastructure
- Applications will need to be re-written
- Separate budget line item – requires proven need
2. Data engines provide a file system interface
Immediate adoption
Prove in place
Potential game changer – handle burst usage
Will need a Consistency sync function
10LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Benefits of Containers in HPC
• Custom Environment for Users
• Not all applications benefit from changes
• No Conflicting or evolving default packages
• Consistent, Reproducible Workflow
• Fewer things changing under the application
• More defendable performance comparisons
• Fewer required application rebuilds
• Some applications take over a day to rebuild
• Unknown when/if a rebuild may be required
11LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Hurdles for Containers in HPC
• Security
• Enforcement of required patches
• Restriction of certain software
• Localized node security
• Perceived performance penalty
• Environment management at scale
12LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Dilemma for Many HPC Sites
• Provide cycles for legacy/mission codes
• These pay the bills
• Codes do not change much
• Adverse to change in environment
• Facilitate new applications & investigations
• Minimize ‘Time to Science’ by making the systems easy to learn and use
• Embrace new architectures
13LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Virtual Cluster Architecture
14
Scalable
Database
Virtual HPC Cluster Visualization
Cluster
V I R T U A L M A C H I N E R E S O U R C E S
B A R E M E T A L M A C H I N E S
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Custom Container Cluster Architecture
15
Job 123 Job 456
Scalable Database Compute Nodes Visualization Compute Nodes
B A R E M E T A L M A C H I N E S
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Relief for Parallel File System
• Burst Buffers / Data Warp
• Regional SSD file system for staging data into/out of parallel file system
• Provide large data sets to applications
• Local OS buffering
• Add local storage to each node
• Use out of band service network to pre-stage container directly onto node
• Job launch now from local storage
• Normal launch from network storage is a scalability limiter
• Greatly reduced launch latency
16LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
17LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
SECTION 2
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
As clusters continue to scale…
• Management of these resources also needs to scale
• “If you don’t monitor it, you can’t manage it.”
• Monitoring efforts need to scale at:
• Collection
• Aggregation
• Storage
• Query time
• Visualization
19LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Types of Monitoring Data
• Events
• Something that occurred at a time and location
• Syslog … and application logs … and external events
• Metrics
• A measurable value that can be assessed over time
• Context/Job
• What was running on what nodes at what time
20LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Audiences for Monitoring Data
• Admins
• Track the current health of the system
• Users
• See how their jobs used the allocated resources
• Code Developers
• Locate bottlenecks and inefficiencies in their codes
• System Designers
• Determine where to improve performance
• Researchers
• To find the location of the Holy Grail?
21LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Feeds and Speeds
• Logs
• Average in the 10s of GB/day, can peak at 200GB+/day.
• Asynchronous: Can have huge spikes.
• Metrics
• 20k nodes * 200 metrics/node/second = 4M metrics/sec (4TB/day)
• Can easily step up to 10Hz
• Fairly regular flow rate.
• Multiply that by 12-15 systems (*not all quite that size)
22LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
What to do with the data?
23
Visualization Understanding Automation
LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
24LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Workload Throughput
25LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
HELO & Baler
• Machine Learning log classification systems
• Builds a central template library and uses that to pattern match incoming logs messages.
• Tags each log message with the template integer ID (much easier to use later).
• If a log message does not match a pattern, it is analyzed against the template library to see if
there is a very similar one that could be modified to include the new one.
- If a close candidate is found, the template is modified and the message is tagged.
- If there is no close match, a new template with ID is created and used to tag the message.
• Makes possible various types of analyses
• Event macro-event detection (fingerprints).
• Correlation and Cascades
• Event Prediction
26LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Log Comparator
27LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Log Comparator
28LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
29LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Event Correlation & Prediction
30LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Bringing it all together
• Management tools and methods need to scale.
• Provisioning & Job Launch
• Monitoring Data Storage & Access
• Container Management
• Scalable visualizations are needed.
• We need to leverage each others successes.
31LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
Parting Thought
High Performance Computing and Cloud Computing are like teen siblings that
share a lot of clothes but play different sports. Yet, at the end of the day, it is
good to come home for dinner together.
32LA-UR-17-29616
PRESENTATION TITLE ON ONE LINE
AND ON TWO LINES
First and last name
Position, company
THANK YOU
fullop@lanl.gov
Please stay in touch
Any questions?
Joseph ‘Joshi’ Fullop
http://hpc.lanl.gov
LA-UR-17-29616

More Related Content

What's hot

Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...
Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...
Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...
ScyllaDB
 
Scylla Summit 2017: A Deep Dive on Heat Weighted Load Balancing
Scylla Summit 2017: A Deep Dive on Heat Weighted Load BalancingScylla Summit 2017: A Deep Dive on Heat Weighted Load Balancing
Scylla Summit 2017: A Deep Dive on Heat Weighted Load Balancing
ScyllaDB
 
Scylla Summit 2017: Scylla on Samsung NVMe Z-SSDs
Scylla Summit 2017: Scylla on Samsung NVMe Z-SSDsScylla Summit 2017: Scylla on Samsung NVMe Z-SSDs
Scylla Summit 2017: Scylla on Samsung NVMe Z-SSDs
ScyllaDB
 
Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...
Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...
Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...
ScyllaDB
 
Scylla Summit 2017: Scylla on Kubernetes
Scylla Summit 2017: Scylla on KubernetesScylla Summit 2017: Scylla on Kubernetes
Scylla Summit 2017: Scylla on Kubernetes
ScyllaDB
 
Scylla Summit 2017: Snapfish's Journey Towards Scylla
Scylla Summit 2017: Snapfish's Journey Towards ScyllaScylla Summit 2017: Snapfish's Journey Towards Scylla
Scylla Summit 2017: Snapfish's Journey Towards Scylla
ScyllaDB
 
If You Care About Performance, Use User Defined Types
If You Care About Performance, Use User Defined TypesIf You Care About Performance, Use User Defined Types
If You Care About Performance, Use User Defined Types
ScyllaDB
 
Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...
Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...
Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...
ScyllaDB
 
Scylla Summit 2017: Scylla's Open Source Monitoring Solution
Scylla Summit 2017: Scylla's Open Source Monitoring SolutionScylla Summit 2017: Scylla's Open Source Monitoring Solution
Scylla Summit 2017: Scylla's Open Source Monitoring Solution
ScyllaDB
 
Scylla Summit 2017: Distributed Materialized Views
Scylla Summit 2017: Distributed Materialized ViewsScylla Summit 2017: Distributed Materialized Views
Scylla Summit 2017: Distributed Materialized Views
ScyllaDB
 
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
ScyllaDB
 
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
ScyllaDB
 
Scylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot Instances
Scylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot InstancesScylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot Instances
Scylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot Instances
ScyllaDB
 
Replicate Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
Replicate Oracle to Oracle, Oracle to MySQL, and Oracle to AnalyticsReplicate Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
Replicate Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
Linas Virbalas
 
Oracle acfs in oracle 11
Oracle acfs in oracle 11Oracle acfs in oracle 11
Oracle acfs in oracle 11
Guenadi JILEVSKI
 
Amazon RDS for PostgreSQL - PGConf 2016
Amazon RDS for PostgreSQL - PGConf 2016 Amazon RDS for PostgreSQL - PGConf 2016
Amazon RDS for PostgreSQL - PGConf 2016
Grant McAlister
 
Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...
Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...
Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...
Continuent
 
Setup & Operate Tungsten Replicator
Setup & Operate Tungsten ReplicatorSetup & Operate Tungsten Replicator
Setup & Operate Tungsten Replicator
Continuent
 
Data Storage Formats in Hadoop
Data Storage Formats in HadoopData Storage Formats in Hadoop
Data Storage Formats in Hadoop
Botond Balázs
 
Eventually, Scylla Chooses Consistency
Eventually, Scylla Chooses ConsistencyEventually, Scylla Chooses Consistency
Eventually, Scylla Chooses Consistency
ScyllaDB
 

What's hot (20)

Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...
Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...
Scylla Summit 2017: How to Use Gocql to Execute Queries and What the Driver D...
 
Scylla Summit 2017: A Deep Dive on Heat Weighted Load Balancing
Scylla Summit 2017: A Deep Dive on Heat Weighted Load BalancingScylla Summit 2017: A Deep Dive on Heat Weighted Load Balancing
Scylla Summit 2017: A Deep Dive on Heat Weighted Load Balancing
 
Scylla Summit 2017: Scylla on Samsung NVMe Z-SSDs
Scylla Summit 2017: Scylla on Samsung NVMe Z-SSDsScylla Summit 2017: Scylla on Samsung NVMe Z-SSDs
Scylla Summit 2017: Scylla on Samsung NVMe Z-SSDs
 
Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...
Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...
Scylla Summit 2017: Migrating to Scylla From Cassandra and Others With No Dow...
 
Scylla Summit 2017: Scylla on Kubernetes
Scylla Summit 2017: Scylla on KubernetesScylla Summit 2017: Scylla on Kubernetes
Scylla Summit 2017: Scylla on Kubernetes
 
Scylla Summit 2017: Snapfish's Journey Towards Scylla
Scylla Summit 2017: Snapfish's Journey Towards ScyllaScylla Summit 2017: Snapfish's Journey Towards Scylla
Scylla Summit 2017: Snapfish's Journey Towards Scylla
 
If You Care About Performance, Use User Defined Types
If You Care About Performance, Use User Defined TypesIf You Care About Performance, Use User Defined Types
If You Care About Performance, Use User Defined Types
 
Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...
Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...
Scylla Summit 2017: How to Optimize and Reduce Inter-DC Network Traffic and S...
 
Scylla Summit 2017: Scylla's Open Source Monitoring Solution
Scylla Summit 2017: Scylla's Open Source Monitoring SolutionScylla Summit 2017: Scylla's Open Source Monitoring Solution
Scylla Summit 2017: Scylla's Open Source Monitoring Solution
 
Scylla Summit 2017: Distributed Materialized Views
Scylla Summit 2017: Distributed Materialized ViewsScylla Summit 2017: Distributed Materialized Views
Scylla Summit 2017: Distributed Materialized Views
 
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
Scylla Summit 2017: Cry in the Dojo, Laugh in the Battlefield: How We Constan...
 
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
Scylla Summit 2017: Repair, Backup, Restore: Last Thing Before You Go to Prod...
 
Scylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot Instances
Scylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot InstancesScylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot Instances
Scylla Summit 2017: Saving Thousands by Running Scylla on EC2 Spot Instances
 
Replicate Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
Replicate Oracle to Oracle, Oracle to MySQL, and Oracle to AnalyticsReplicate Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
Replicate Oracle to Oracle, Oracle to MySQL, and Oracle to Analytics
 
Oracle acfs in oracle 11
Oracle acfs in oracle 11Oracle acfs in oracle 11
Oracle acfs in oracle 11
 
Amazon RDS for PostgreSQL - PGConf 2016
Amazon RDS for PostgreSQL - PGConf 2016 Amazon RDS for PostgreSQL - PGConf 2016
Amazon RDS for PostgreSQL - PGConf 2016
 
Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...
Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...
Tungsten University: MySQL Multi-Master Operations Made Simple With Tungsten ...
 
Setup & Operate Tungsten Replicator
Setup & Operate Tungsten ReplicatorSetup & Operate Tungsten Replicator
Setup & Operate Tungsten Replicator
 
Data Storage Formats in Hadoop
Data Storage Formats in HadoopData Storage Formats in Hadoop
Data Storage Formats in Hadoop
 
Eventually, Scylla Chooses Consistency
Eventually, Scylla Chooses ConsistencyEventually, Scylla Chooses Consistency
Eventually, Scylla Chooses Consistency
 

Viewers also liked

Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...
Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...
Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...
ScyllaDB
 
Scylla Summit 2017: A Toolbox for Understanding Scylla in the Field
Scylla Summit 2017: A Toolbox for Understanding Scylla in the FieldScylla Summit 2017: A Toolbox for Understanding Scylla in the Field
Scylla Summit 2017: A Toolbox for Understanding Scylla in the Field
ScyllaDB
 
Scylla Summit 2017: Managing 10,000 Node Storage Clusters at Twitter
Scylla Summit 2017: Managing 10,000 Node Storage Clusters at TwitterScylla Summit 2017: Managing 10,000 Node Storage Clusters at Twitter
Scylla Summit 2017: Managing 10,000 Node Storage Clusters at Twitter
ScyllaDB
 
Scylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQL
Scylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQLScylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQL
Scylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQL
ScyllaDB
 
Scylla Summit 2017: Keynote, Looking back, looking ahead
Scylla Summit 2017: Keynote, Looking back, looking aheadScylla Summit 2017: Keynote, Looking back, looking ahead
Scylla Summit 2017: Keynote, Looking back, looking ahead
ScyllaDB
 
Scylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data Platform
Scylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data PlatformScylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data Platform
Scylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data Platform
ScyllaDB
 
mParticle's Journey to Scylla from Cassandra
mParticle's Journey to Scylla from CassandramParticle's Journey to Scylla from Cassandra
mParticle's Journey to Scylla from Cassandra
ScyllaDB
 
Scylla Summit 2016: Keynote - Big Data Goes Native
Scylla Summit 2016: Keynote - Big Data Goes NativeScylla Summit 2016: Keynote - Big Data Goes Native
Scylla Summit 2016: Keynote - Big Data Goes Native
ScyllaDB
 
Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...
Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...
Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...
ScyllaDB
 
Scylla Summit 2017: Welcome and Keynote - Nextgen NoSQL
Scylla Summit 2017: Welcome and Keynote - Nextgen NoSQLScylla Summit 2017: Welcome and Keynote - Nextgen NoSQL
Scylla Summit 2017: Welcome and Keynote - Nextgen NoSQL
ScyllaDB
 

Viewers also liked (10)

Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...
Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...
Scylla Summit 2017: How to Ruin Your Workload's Performance by Choosing the W...
 
Scylla Summit 2017: A Toolbox for Understanding Scylla in the Field
Scylla Summit 2017: A Toolbox for Understanding Scylla in the FieldScylla Summit 2017: A Toolbox for Understanding Scylla in the Field
Scylla Summit 2017: A Toolbox for Understanding Scylla in the Field
 
Scylla Summit 2017: Managing 10,000 Node Storage Clusters at Twitter
Scylla Summit 2017: Managing 10,000 Node Storage Clusters at TwitterScylla Summit 2017: Managing 10,000 Node Storage Clusters at Twitter
Scylla Summit 2017: Managing 10,000 Node Storage Clusters at Twitter
 
Scylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQL
Scylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQLScylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQL
Scylla Summit 2017: Streaming ETL in Kafka for Everyone with KSQL
 
Scylla Summit 2017: Keynote, Looking back, looking ahead
Scylla Summit 2017: Keynote, Looking back, looking aheadScylla Summit 2017: Keynote, Looking back, looking ahead
Scylla Summit 2017: Keynote, Looking back, looking ahead
 
Scylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data Platform
Scylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data PlatformScylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data Platform
Scylla Summit 2017: How Baidu Runs Scylla on a Petabyte-Level Big Data Platform
 
mParticle's Journey to Scylla from Cassandra
mParticle's Journey to Scylla from CassandramParticle's Journey to Scylla from Cassandra
mParticle's Journey to Scylla from Cassandra
 
Scylla Summit 2016: Keynote - Big Data Goes Native
Scylla Summit 2016: Keynote - Big Data Goes NativeScylla Summit 2016: Keynote - Big Data Goes Native
Scylla Summit 2016: Keynote - Big Data Goes Native
 
Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...
Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...
Scylla Summit 2017: Stretching Scylla Silly: The Datastore of a Graph Databas...
 
Scylla Summit 2017: Welcome and Keynote - Nextgen NoSQL
Scylla Summit 2017: Welcome and Keynote - Nextgen NoSQLScylla Summit 2017: Welcome and Keynote - Nextgen NoSQL
Scylla Summit 2017: Welcome and Keynote - Nextgen NoSQL
 

Similar to Scylla Summit 2017: The Upcoming HPC Evolution

The Sierra Supercomputer: Science and Technology on a Mission
The Sierra Supercomputer: Science and Technology on a MissionThe Sierra Supercomputer: Science and Technology on a Mission
The Sierra Supercomputer: Science and Technology on a Mission
inside-BigData.com
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
Mathieu Dumoulin
 
Update on Trinity System Procurement and Plans
Update on Trinity System Procurement and PlansUpdate on Trinity System Procurement and Plans
Update on Trinity System Procurement and Plans
inside-BigData.com
 
Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j
Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4jWebinar: Large Scale Graph Processing with IBM Power Systems & Neo4j
Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j
Neo4j
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and Rain
MapR Technologies
 
Porting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen II
Porting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen IIPorting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen II
Porting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen II
George Markomanolis
 
Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...
Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...
Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...
ScyllaDB
 
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
DataWorks Summit/Hadoop Summit
 
Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016
Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016
Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016
Gyula Fóra
 
3.2 Streaming and Messaging
3.2 Streaming and Messaging3.2 Streaming and Messaging
3.2 Streaming and Messaging
振东 刘
 
Open Programmable Architecture for Java-enabled Network Devices
Open Programmable Architecture for Java-enabled Network DevicesOpen Programmable Architecture for Java-enabled Network Devices
Open Programmable Architecture for Java-enabled Network Devices
Tal Lavian Ph.D.
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
BigDataEverywhere
 
Geospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning DataGeospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning Data
AlexMiowski
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
eswcsummerschool
 
Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop
DataWorks Summit/Hadoop Summit
 
Big Data Processing Beyond MapReduce by Dr. Flavio Villanustre
Big Data Processing Beyond MapReduce by Dr. Flavio VillanustreBig Data Processing Beyond MapReduce by Dr. Flavio Villanustre
Big Data Processing Beyond MapReduce by Dr. Flavio Villanustre
HPCC Systems
 
Big Data, Fast Data @ PayPal (YOW 2018)
Big Data, Fast Data @ PayPal (YOW 2018)Big Data, Fast Data @ PayPal (YOW 2018)
Big Data, Fast Data @ PayPal (YOW 2018)
Sid Anand
 
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
GlobalLogic Ukraine
 
Huawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingHuawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark Streaming
Jen Aman
 
Blue Waters and Resource Management - Now and in the Future
 Blue Waters and Resource Management - Now and in the Future Blue Waters and Resource Management - Now and in the Future
Blue Waters and Resource Management - Now and in the Future
inside-BigData.com
 

Similar to Scylla Summit 2017: The Upcoming HPC Evolution (20)

The Sierra Supercomputer: Science and Technology on a Mission
The Sierra Supercomputer: Science and Technology on a MissionThe Sierra Supercomputer: Science and Technology on a Mission
The Sierra Supercomputer: Science and Technology on a Mission
 
CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016CEP - simplified streaming architecture - Strata Singapore 2016
CEP - simplified streaming architecture - Strata Singapore 2016
 
Update on Trinity System Procurement and Plans
Update on Trinity System Procurement and PlansUpdate on Trinity System Procurement and Plans
Update on Trinity System Procurement and Plans
 
Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j
Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4jWebinar: Large Scale Graph Processing with IBM Power Systems & Neo4j
Webinar: Large Scale Graph Processing with IBM Power Systems & Neo4j
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and Rain
 
Porting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen II
Porting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen IIPorting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen II
Porting an MPI application to hybrid MPI+OpenMP with Reveal tool on Shaheen II
 
Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...
Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...
Scylla Summit 2017: Performance Evaluation of Scylla as a Database Backend fo...
 
Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem Large-Scale Stream Processing in the Hadoop Ecosystem
Large-Scale Stream Processing in the Hadoop Ecosystem
 
Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016
Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016
Large-Scale Stream Processing in the Hadoop Ecosystem - Hadoop Summit 2016
 
3.2 Streaming and Messaging
3.2 Streaming and Messaging3.2 Streaming and Messaging
3.2 Streaming and Messaging
 
Open Programmable Architecture for Java-enabled Network Devices
Open Programmable Architecture for Java-enabled Network DevicesOpen Programmable Architecture for Java-enabled Network Devices
Open Programmable Architecture for Java-enabled Network Devices
 
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
Big Data Everywhere Chicago: High Performance Computing - Contributions Towar...
 
Geospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning DataGeospatial Sensor Networks and Partitioning Data
Geospatial Sensor Networks and Partitioning Data
 
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
ESWC SS 2012 - Wednesday Tutorial Barry Norton: Building (Production) Semanti...
 
Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop Real-time Hadoop: The Ideal Messaging System for Hadoop
Real-time Hadoop: The Ideal Messaging System for Hadoop
 
Big Data Processing Beyond MapReduce by Dr. Flavio Villanustre
Big Data Processing Beyond MapReduce by Dr. Flavio VillanustreBig Data Processing Beyond MapReduce by Dr. Flavio Villanustre
Big Data Processing Beyond MapReduce by Dr. Flavio Villanustre
 
Big Data, Fast Data @ PayPal (YOW 2018)
Big Data, Fast Data @ PayPal (YOW 2018)Big Data, Fast Data @ PayPal (YOW 2018)
Big Data, Fast Data @ PayPal (YOW 2018)
 
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
Stream Data Processing at Big Data Landscape by Oleksandr Fedirko
 
Huawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark StreamingHuawei Advanced Data Science With Spark Streaming
Huawei Advanced Data Science With Spark Streaming
 
Blue Waters and Resource Management - Now and in the Future
 Blue Waters and Resource Management - Now and in the Future Blue Waters and Resource Management - Now and in the Future
Blue Waters and Resource Management - Now and in the Future
 

More from ScyllaDB

ScyllaDB Real-Time Event Processing with CDC
ScyllaDB Real-Time Event Processing with CDCScyllaDB Real-Time Event Processing with CDC
ScyllaDB Real-Time Event Processing with CDC
ScyllaDB
 
MongoDB to ScyllaDB: Technical Comparison and the Path to Success
MongoDB to ScyllaDB: Technical Comparison and the Path to SuccessMongoDB to ScyllaDB: Technical Comparison and the Path to Success
MongoDB to ScyllaDB: Technical Comparison and the Path to Success
ScyllaDB
 
Real-Time or Analytics Workloads... Why Not Both?
Real-Time or Analytics Workloads... Why Not Both?Real-Time or Analytics Workloads... Why Not Both?
Real-Time or Analytics Workloads... Why Not Both?
ScyllaDB
 
Real-Time Persisted Events at Supercell
Real-Time Persisted Events at  SupercellReal-Time Persisted Events at  Supercell
Real-Time Persisted Events at Supercell
ScyllaDB
 
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDBScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB
 
An All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS MarketAn All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS Market
ScyllaDB
 
Discover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched ContentDiscover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched Content
ScyllaDB
 
So You've Lost Quorum: Lessons From Accidental Downtime
So You've Lost Quorum: Lessons From Accidental DowntimeSo You've Lost Quorum: Lessons From Accidental Downtime
So You've Lost Quorum: Lessons From Accidental Downtime
ScyllaDB
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
ScyllaDB
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
ScyllaDB
 
ShareChat's Path to High-Performance NoSQL with ScyllaDB
ShareChat's Path to High-Performance NoSQL with ScyllaDBShareChat's Path to High-Performance NoSQL with ScyllaDB
ShareChat's Path to High-Performance NoSQL with ScyllaDB
ScyllaDB
 
Tracking Millions of Heartbeats on Zee's OTT Platform
Tracking Millions of Heartbeats on Zee's OTT PlatformTracking Millions of Heartbeats on Zee's OTT Platform
Tracking Millions of Heartbeats on Zee's OTT Platform
ScyllaDB
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
ScyllaDB
 
Real-Time Persisted Events at Supercell
Real-Time Persisted Events at  SupercellReal-Time Persisted Events at  Supercell
Real-Time Persisted Events at Supercell
ScyllaDB
 
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
ScyllaDB
 
ScyllaDB Kubernetes Operator Goes Global
ScyllaDB Kubernetes Operator Goes GlobalScyllaDB Kubernetes Operator Goes Global
ScyllaDB Kubernetes Operator Goes Global
ScyllaDB
 
ScyllaDB Drivers: Optimizing Performance
ScyllaDB Drivers: Optimizing PerformanceScyllaDB Drivers: Optimizing Performance
ScyllaDB Drivers: Optimizing Performance
ScyllaDB
 
Cost-Efficient Stream Processing with RisingWave and ScyllaDB
Cost-Efficient Stream Processing with RisingWave and ScyllaDBCost-Efficient Stream Processing with RisingWave and ScyllaDB
Cost-Efficient Stream Processing with RisingWave and ScyllaDB
ScyllaDB
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
ScyllaDB
 

More from ScyllaDB (20)

ScyllaDB Real-Time Event Processing with CDC
ScyllaDB Real-Time Event Processing with CDCScyllaDB Real-Time Event Processing with CDC
ScyllaDB Real-Time Event Processing with CDC
 
MongoDB to ScyllaDB: Technical Comparison and the Path to Success
MongoDB to ScyllaDB: Technical Comparison and the Path to SuccessMongoDB to ScyllaDB: Technical Comparison and the Path to Success
MongoDB to ScyllaDB: Technical Comparison and the Path to Success
 
Real-Time or Analytics Workloads... Why Not Both?
Real-Time or Analytics Workloads... Why Not Both?Real-Time or Analytics Workloads... Why Not Both?
Real-Time or Analytics Workloads... Why Not Both?
 
Real-Time Persisted Events at Supercell
Real-Time Persisted Events at  SupercellReal-Time Persisted Events at  Supercell
Real-Time Persisted Events at Supercell
 
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDBScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
ScyllaDB Leaps Forward with Dor Laor, CEO of ScyllaDB
 
An All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS MarketAn All-Around Benchmark of the DBaaS Market
An All-Around Benchmark of the DBaaS Market
 
Discover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched ContentDiscover the Unseen: Tailored Recommendation of Unwatched Content
Discover the Unseen: Tailored Recommendation of Unwatched Content
 
So You've Lost Quorum: Lessons From Accidental Downtime
So You've Lost Quorum: Lessons From Accidental DowntimeSo You've Lost Quorum: Lessons From Accidental Downtime
So You've Lost Quorum: Lessons From Accidental Downtime
 
ScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking ReplicationScyllaDB Tablets: Rethinking Replication
ScyllaDB Tablets: Rethinking Replication
 
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's TipsGetting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
 
ShareChat's Path to High-Performance NoSQL with ScyllaDB
ShareChat's Path to High-Performance NoSQL with ScyllaDBShareChat's Path to High-Performance NoSQL with ScyllaDB
ShareChat's Path to High-Performance NoSQL with ScyllaDB
 
Tracking Millions of Heartbeats on Zee's OTT Platform
Tracking Millions of Heartbeats on Zee's OTT PlatformTracking Millions of Heartbeats on Zee's OTT Platform
Tracking Millions of Heartbeats on Zee's OTT Platform
 
A Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's ArchitectureA Deep Dive into ScyllaDB's Architecture
A Deep Dive into ScyllaDB's Architecture
 
Real-Time Persisted Events at Supercell
Real-Time Persisted Events at  SupercellReal-Time Persisted Events at  Supercell
Real-Time Persisted Events at Supercell
 
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
 
ScyllaDB Kubernetes Operator Goes Global
ScyllaDB Kubernetes Operator Goes GlobalScyllaDB Kubernetes Operator Goes Global
ScyllaDB Kubernetes Operator Goes Global
 
ScyllaDB Drivers: Optimizing Performance
ScyllaDB Drivers: Optimizing PerformanceScyllaDB Drivers: Optimizing Performance
ScyllaDB Drivers: Optimizing Performance
 
Cost-Efficient Stream Processing with RisingWave and ScyllaDB
Cost-Efficient Stream Processing with RisingWave and ScyllaDBCost-Efficient Stream Processing with RisingWave and ScyllaDB
Cost-Efficient Stream Processing with RisingWave and ScyllaDB
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 

Recently uploaded

Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
Fwdays
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
Alex Pruden
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
operationspcvita
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 

Recently uploaded (20)

Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk"Frontline Battles with DDoS: Best practices and Lessons Learned",  Igor Ivaniuk
"Frontline Battles with DDoS: Best practices and Lessons Learned", Igor Ivaniuk
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Artificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic WarfareArtificial Intelligence and Electronic Warfare
Artificial Intelligence and Electronic Warfare
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
zkStudyClub - LatticeFold: A Lattice-based Folding Scheme and its Application...
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
The Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptxThe Microsoft 365 Migration Tutorial For Beginner.pptx
The Microsoft 365 Migration Tutorial For Beginner.pptx
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 

Scylla Summit 2017: The Upcoming HPC Evolution

  • 1. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company The Upcoming HPC Evolution Los Alamos National Lab, Scientist/HPC Engineer Joseph ‘Joshi’ Fullop LA-UR-17-29616
  • 2. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Joseph ‘Joshi’ Fullop 2 Joshi a high-performance computing (HPC) scientist/engineer in scheduling, monitoring, and analytics at Los Alamos National Lab. Formerly with the National Center for Supercomputing Applications (NCSA) for 19 years. LA-UR-17-29616
  • 3. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company ▪ Department of Energy (DoE) weapons laboratory with priorities set by National Nuclear Security Administration (NNSA) ▪ Established in 1943 as site Y of the Manhattan Project ▪ In 1992, the U.S. stopped full-scale testing of nuclear weapons, which means using other methods to assure the safety of the U.S. stockpile and requiring a tremendous amount of science, technology, computational horsepower, and analytical tools. ▪ We perform work for the Department of Defense (DoD), Intelligence Community (IC), and Department of Homeland Security (DHS), among others. As a result, our strategy reflects U.S. priorities spanning nuclear security, intelligence, defense, emergency response, nonproliferation, counterterrorism, energy security, emerging threats, and environmental management. 3LA-UR-17-29616
  • 4. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company SECTION 1 LA-UR-17-29616
  • 5. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company What is HPC? ▪ High Performance Computing o Delivering extreme scale, parallel, floating-point operations for scientific, engineering and business endeavors using supercomputers. o 10s of thousands servers, 300k-1M+ cores, ~20PetaFlops, ~80PB storage o Many different combinations and use of technologies to serve the machine’s purpose, which is driven by the applications that is serves. (e.g. GPUs, FPGAs) o Application architectures mostly based on Message Passing Interface (MPI) ▪ 20 PetaFlops: If you did one 14-digit calculation per second without stopping to eat or sleep, it would take 634 million years to do what a supercomputer can do in 1 second. 5LA-UR-17-29616
  • 6. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Two Examples of HPC Supercomputers 6 ▪ Trinity @ LANL ~20k nodes http://www.lanl.gov/projects/trinity/ ▪ Blue Waters @ NCSA ~27k nodes http://www.ncsa.illinois.edu/enabling/bluewaters LA-UR-17-29616
  • 7. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company HPV vs Cloud Computing ▪ Some vast over generalizations o Network • HPC: Extremely low latency AND High Bandwidth. Multi-dimensional architectures. • Cloud: Tends to be more commodity based. o Parallel File System • HPC: Focal point of contention. Major factor of scalability. • Cloud: May not even be required. o Workload • HPC: GB in / TB out. Not always embarrassingly parallel. • Cloud: GB or TB in / KB out. Tends to be more parallel or asynchronous by nature. o Infrastructure • HPC: Generally persistent • Cloud: Can be highly dynamic. 7LA-UR-17-29616
  • 8. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company New HPC Application Architecture 8 ▪ Traditional HPC applications oSimulation based • Create a model. • Feed it a data set. • Turn the crank. • Analyze what comes out. oMeasured by number of calculations and model size. ▪ Data Analytics applications oQuery based • Pose a question. • Traverse dataset. • Do statistics. • Make a plausible determination. oMeasured by the amount of data assessed. LA-UR-17-29616
  • 9. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Supercomputer Job Cycle 9 Job Launch Compute Store Results Visualize Job Launch • Massive number & bandwidth of reads • Node communication & synchronization Compute • Requires very low latency network • Intolerant to interruptions (jitter) • Potential check-pointing (burst writes) Store Results • Massive file system write(s) Visualize • Significantly different I/O patterns LA-UR-17-29616
  • 10. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Opportunity in HPC 1. Data engines become part of the infrastructure - Applications will need to be re-written - Separate budget line item – requires proven need 2. Data engines provide a file system interface Immediate adoption Prove in place Potential game changer – handle burst usage Will need a Consistency sync function 10LA-UR-17-29616
  • 11. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Benefits of Containers in HPC • Custom Environment for Users • Not all applications benefit from changes • No Conflicting or evolving default packages • Consistent, Reproducible Workflow • Fewer things changing under the application • More defendable performance comparisons • Fewer required application rebuilds • Some applications take over a day to rebuild • Unknown when/if a rebuild may be required 11LA-UR-17-29616
  • 12. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Hurdles for Containers in HPC • Security • Enforcement of required patches • Restriction of certain software • Localized node security • Perceived performance penalty • Environment management at scale 12LA-UR-17-29616
  • 13. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Dilemma for Many HPC Sites • Provide cycles for legacy/mission codes • These pay the bills • Codes do not change much • Adverse to change in environment • Facilitate new applications & investigations • Minimize ‘Time to Science’ by making the systems easy to learn and use • Embrace new architectures 13LA-UR-17-29616
  • 14. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Virtual Cluster Architecture 14 Scalable Database Virtual HPC Cluster Visualization Cluster V I R T U A L M A C H I N E R E S O U R C E S B A R E M E T A L M A C H I N E S LA-UR-17-29616
  • 15. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Custom Container Cluster Architecture 15 Job 123 Job 456 Scalable Database Compute Nodes Visualization Compute Nodes B A R E M E T A L M A C H I N E S LA-UR-17-29616
  • 16. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Relief for Parallel File System • Burst Buffers / Data Warp • Regional SSD file system for staging data into/out of parallel file system • Provide large data sets to applications • Local OS buffering • Add local storage to each node • Use out of band service network to pre-stage container directly onto node • Job launch now from local storage • Normal launch from network storage is a scalability limiter • Greatly reduced launch latency 16LA-UR-17-29616
  • 17. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company 17LA-UR-17-29616
  • 18. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company SECTION 2 LA-UR-17-29616
  • 19. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company As clusters continue to scale… • Management of these resources also needs to scale • “If you don’t monitor it, you can’t manage it.” • Monitoring efforts need to scale at: • Collection • Aggregation • Storage • Query time • Visualization 19LA-UR-17-29616
  • 20. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Types of Monitoring Data • Events • Something that occurred at a time and location • Syslog … and application logs … and external events • Metrics • A measurable value that can be assessed over time • Context/Job • What was running on what nodes at what time 20LA-UR-17-29616
  • 21. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Audiences for Monitoring Data • Admins • Track the current health of the system • Users • See how their jobs used the allocated resources • Code Developers • Locate bottlenecks and inefficiencies in their codes • System Designers • Determine where to improve performance • Researchers • To find the location of the Holy Grail? 21LA-UR-17-29616
  • 22. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Feeds and Speeds • Logs • Average in the 10s of GB/day, can peak at 200GB+/day. • Asynchronous: Can have huge spikes. • Metrics • 20k nodes * 200 metrics/node/second = 4M metrics/sec (4TB/day) • Can easily step up to 10Hz • Fairly regular flow rate. • Multiply that by 12-15 systems (*not all quite that size) 22LA-UR-17-29616
  • 23. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company What to do with the data? 23 Visualization Understanding Automation LA-UR-17-29616
  • 24. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company 24LA-UR-17-29616
  • 25. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Workload Throughput 25LA-UR-17-29616
  • 26. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company HELO & Baler • Machine Learning log classification systems • Builds a central template library and uses that to pattern match incoming logs messages. • Tags each log message with the template integer ID (much easier to use later). • If a log message does not match a pattern, it is analyzed against the template library to see if there is a very similar one that could be modified to include the new one. - If a close candidate is found, the template is modified and the message is tagged. - If there is no close match, a new template with ID is created and used to tag the message. • Makes possible various types of analyses • Event macro-event detection (fingerprints). • Correlation and Cascades • Event Prediction 26LA-UR-17-29616
  • 27. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Log Comparator 27LA-UR-17-29616
  • 28. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Log Comparator 28LA-UR-17-29616
  • 29. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company 29LA-UR-17-29616
  • 30. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Event Correlation & Prediction 30LA-UR-17-29616
  • 31. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Bringing it all together • Management tools and methods need to scale. • Provisioning & Job Launch • Monitoring Data Storage & Access • Container Management • Scalable visualizations are needed. • We need to leverage each others successes. 31LA-UR-17-29616
  • 32. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company Parting Thought High Performance Computing and Cloud Computing are like teen siblings that share a lot of clothes but play different sports. Yet, at the end of the day, it is good to come home for dinner together. 32LA-UR-17-29616
  • 33. PRESENTATION TITLE ON ONE LINE AND ON TWO LINES First and last name Position, company THANK YOU fullop@lanl.gov Please stay in touch Any questions? Joseph ‘Joshi’ Fullop http://hpc.lanl.gov LA-UR-17-29616