SlideShare a Scribd company logo
1 of 25
GE Digital on Scylla
Venkatesh Sivasubramanian
Senior Director, Data and Analytics Services, GE Digital
Arvind Singh
Engineer, Data Services, GE Digital
Presenter Bios
Venkatesh Sivasubramanian
Senior Director, Data and Analytics Services, GE Digital
Venkat drives the architecture and development of Data
and Analytics Services for Predix, an Industrial IoT platform.
Arvind Singh
Senior Staff Engineer, Data Services at GE Digital
Solving performance and scalability problems gives him a
high! He’s done just that at Cisco, Argus and now GE.
The GE That’s Not Obvious
GE RenewablesLargest and the most powerful offshore wind turbines
GE Renewables
Better models to avoid unplanned downtimes
GE Renewables
Alpine Battery
Big rotating things
GE Power
Atlanta Monitoring and Diagnostics
GE Power
Every 2 seconds a GE Powered aircraft takes off
GE Aviation
Creating impossibly complex parts at scale
GE Aviation
GE Predix
▪ The world’s largest
Industrial IoT Platform
▪ Cross-industries data
collection, analytics and
alerts generation
GE Predix - Data Fabric
Requirements for Building the Largest
IIoT Platform in the World
▪ Sustain more than 1TB/day of data
▪ Absorb 10’s of billions of records per day
▪ Support millions of parallel analytics transactions
▪ Must keep sub-second response time, end-to-end
What Wasn’t Working
▪ Node storage limitation cause node sprawl
▪ Increasing and decreasing clusters became a challenge
▪ Application latency numbers are on the edge of the SLA
envelope
▪ Managing large number of nodes required increasing
our talent pool
Evaluation
▪ Started with a fairly small cluster of 5 nodes
▪ AWS based deployment, using Scylla AMIs
▪ Directed our application to Scylla, same API, no code change
▪ Node consolidation works: same performance, fewer nodes
▪ Testing takes time, prepare a team and test plan ahead of time
▪ Use Scylla Slack channel for help and questions
Our Data Model
▪ Time-series data model
▪ Sensor information inserted on a per tenant record
▪ Resulting in → A heavy single table
• Tenant info
• Tag info
• Metrics info
▪ We bucket data on time intervals
• Helps with partition size
▪ Rollups based on time-intervals are kept in database
▪ Different data attrition policies
How it Looks from the Application Side
▪ Our performance cluster setup
Component Quantity Resource Type
ScyllaDB nodes 9 I3.8Xlarge
TSQS nodes 5 I3.2Xlarge
Kafka and ZooKeeper Nodes 7 C4.8Xlarge
Go Pipeline 32 CloudFoundry instances
Ingestion Performance Results
▪ ~2.5B data points per hour → ~700K data points per second
Query Performance Results
▪ ~325M Queries per hour → ~90K queries per second
Migrating to Scylla
▪ Online operations must continue
▪ Used Spark to migrate existing data from Cassandra to Scylla
▪ Collaborate with Scylla team to optimize data model
▪ Reducing cluster footprints helps us reduce operational
overhead
Migrating Using Spark
▪ Operations continue as normal during migration
▪ Data model optimization is possible during migration process
Shrink the Cluster
▪ Same or higher workload with fewer nodes
Cassandra
75 i3.2xl
Node reduction
Lower cost
Reduced admin
overhead
Scylla
15 i3.8xl
Scylla Deployments
▪ Replacement for Cassandra clusters
▪ Various clusters, at different size
• Build a tool for the task
▪ Reduced clusters footprint by >60%
▪ Stable, predictable performance over time
Result: We can serve more users, at lower operational cost
Conclusions, or What We Care About
▪ Scalability is paramount
▪ Database stability
▪ Performance predictability
Thank You
Any Questions ?
Please stay in touch
VenkateshS@ge.com & arvind.singh1@ge.com

More Related Content

More from ScyllaDB

Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaDatabase Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaScyllaDB
 
Replacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBReplacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBScyllaDB
 
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityPowering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityScyllaDB
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptxScyllaDB
 
Getting the most out of ScyllaDB
Getting the most out of ScyllaDBGetting the most out of ScyllaDB
Getting the most out of ScyllaDBScyllaDB
 
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationNoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationScyllaDB
 
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsNoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsScyllaDB
 
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesNoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesScyllaDB
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB
 
DBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsDBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsScyllaDB
 
Build Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBBuild Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBScyllaDB
 
NoSQL Data Modeling 101
NoSQL Data Modeling 101NoSQL Data Modeling 101
NoSQL Data Modeling 101ScyllaDB
 
Top NoSQL Data Modeling Mistakes
Top NoSQL Data Modeling MistakesTop NoSQL Data Modeling Mistakes
Top NoSQL Data Modeling MistakesScyllaDB
 
NoSQL Data Modeling Foundations — Introducing Concepts & Principles
NoSQL Data Modeling Foundations — Introducing Concepts & PrinciplesNoSQL Data Modeling Foundations — Introducing Concepts & Principles
NoSQL Data Modeling Foundations — Introducing Concepts & PrinciplesScyllaDB
 
Optimizing Performance in Rust for Low-Latency Database Drivers
Optimizing Performance in Rust for Low-Latency Database DriversOptimizing Performance in Rust for Low-Latency Database Drivers
Optimizing Performance in Rust for Low-Latency Database DriversScyllaDB
 
Overcoming Media Streaming Challenges with NoSQL
Overcoming Media Streaming Challenges with NoSQLOvercoming Media Streaming Challenges with NoSQL
Overcoming Media Streaming Challenges with NoSQLScyllaDB
 
How Optimizely (Safely) Maximizes Database Concurrency.pdf
How Optimizely (Safely) Maximizes Database Concurrency.pdfHow Optimizely (Safely) Maximizes Database Concurrency.pdf
How Optimizely (Safely) Maximizes Database Concurrency.pdfScyllaDB
 
How Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdfHow Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdfScyllaDB
 
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
Learning Rust the Hard Way for a Production Kafka + ScyllaDB PipelineLearning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
Learning Rust the Hard Way for a Production Kafka + ScyllaDB PipelineScyllaDB
 
NoSQL at Scale: Proven Practices & Pitfalls
NoSQL at Scale: Proven Practices & PitfallsNoSQL at Scale: Proven Practices & Pitfalls
NoSQL at Scale: Proven Practices & PitfallsScyllaDB
 

More from ScyllaDB (20)

Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr SarnaDatabase Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
Database Performance at Scale Masterclass: Driver Strategies by Piotr Sarna
 
Replacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDBReplacing Your Cache with ScyllaDB
Replacing Your Cache with ScyllaDB
 
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear ScalabilityPowering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
Powering Real-Time Apps with ScyllaDB_ Low Latency & Linear Scalability
 
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx7 Reasons Not to Put an External Cache in Front of Your Database.pptx
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
 
Getting the most out of ScyllaDB
Getting the most out of ScyllaDBGetting the most out of ScyllaDB
Getting the most out of ScyllaDB
 
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a MigrationNoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
NoSQL Database Migration Masterclass - Session 2: The Anatomy of a Migration
 
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration LogisticsNoSQL Database Migration Masterclass - Session 3: Migration Logistics
NoSQL Database Migration Masterclass - Session 3: Migration Logistics
 
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and ChallengesNoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
NoSQL Data Migration Masterclass - Session 1 Migration Strategies and Challenges
 
ScyllaDB Virtual Workshop
ScyllaDB Virtual WorkshopScyllaDB Virtual Workshop
ScyllaDB Virtual Workshop
 
DBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & TradeoffsDBaaS in the Real World: Risks, Rewards & Tradeoffs
DBaaS in the Real World: Risks, Rewards & Tradeoffs
 
Build Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDBBuild Low-Latency Applications in Rust on ScyllaDB
Build Low-Latency Applications in Rust on ScyllaDB
 
NoSQL Data Modeling 101
NoSQL Data Modeling 101NoSQL Data Modeling 101
NoSQL Data Modeling 101
 
Top NoSQL Data Modeling Mistakes
Top NoSQL Data Modeling MistakesTop NoSQL Data Modeling Mistakes
Top NoSQL Data Modeling Mistakes
 
NoSQL Data Modeling Foundations — Introducing Concepts & Principles
NoSQL Data Modeling Foundations — Introducing Concepts & PrinciplesNoSQL Data Modeling Foundations — Introducing Concepts & Principles
NoSQL Data Modeling Foundations — Introducing Concepts & Principles
 
Optimizing Performance in Rust for Low-Latency Database Drivers
Optimizing Performance in Rust for Low-Latency Database DriversOptimizing Performance in Rust for Low-Latency Database Drivers
Optimizing Performance in Rust for Low-Latency Database Drivers
 
Overcoming Media Streaming Challenges with NoSQL
Overcoming Media Streaming Challenges with NoSQLOvercoming Media Streaming Challenges with NoSQL
Overcoming Media Streaming Challenges with NoSQL
 
How Optimizely (Safely) Maximizes Database Concurrency.pdf
How Optimizely (Safely) Maximizes Database Concurrency.pdfHow Optimizely (Safely) Maximizes Database Concurrency.pdf
How Optimizely (Safely) Maximizes Database Concurrency.pdf
 
How Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdfHow Development Teams Cut Costs with ScyllaDB.pdf
How Development Teams Cut Costs with ScyllaDB.pdf
 
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
Learning Rust the Hard Way for a Production Kafka + ScyllaDB PipelineLearning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
Learning Rust the Hard Way for a Production Kafka + ScyllaDB Pipeline
 
NoSQL at Scale: Proven Practices & Pitfalls
NoSQL at Scale: Proven Practices & PitfallsNoSQL at Scale: Proven Practices & Pitfalls
NoSQL at Scale: Proven Practices & Pitfalls
 

Recently uploaded

Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendArshad QA
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...gurkirankumar98700
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxbodapatigopi8531
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfCionsystems
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataBradBedford3
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about usDynamic Netsoft
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsAndolasoft Inc
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...MyIntelliSource, Inc.
 

Recently uploaded (20)

Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Test Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and BackendTest Automation Strategy for Frontend and Backend
Test Automation Strategy for Frontend and Backend
 
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
(Genuine) Escort Service Lucknow | Starting ₹,5K To @25k with A/C 🧑🏽‍❤️‍🧑🏻 89...
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 
Hand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptxHand gesture recognition PROJECT PPT.pptx
Hand gesture recognition PROJECT PPT.pptx
 
Active Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdfActive Directory Penetration Testing, cionsystems.com.pdf
Active Directory Penetration Testing, cionsystems.com.pdf
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer DataAdobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
Adobe Marketo Engage Deep Dives: Using Webhooks to Transfer Data
 
DNT_Corporate presentation know about us
DNT_Corporate presentation know about usDNT_Corporate presentation know about us
DNT_Corporate presentation know about us
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
How To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.jsHow To Use Server-Side Rendering with Nuxt.js
How To Use Server-Side Rendering with Nuxt.js
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...Call Girls In Mukherjee Nagar 📱  9999965857  🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
Call Girls In Mukherjee Nagar 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SE...
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
Try MyIntelliAccount Cloud Accounting Software As A Service Solution Risk Fre...
 

Scylla Summit 2018: GE Predix - Industrial-Strength IoT at Scale

  • 1. GE Digital on Scylla Venkatesh Sivasubramanian Senior Director, Data and Analytics Services, GE Digital Arvind Singh Engineer, Data Services, GE Digital
  • 2. Presenter Bios Venkatesh Sivasubramanian Senior Director, Data and Analytics Services, GE Digital Venkat drives the architecture and development of Data and Analytics Services for Predix, an Industrial IoT platform. Arvind Singh Senior Staff Engineer, Data Services at GE Digital Solving performance and scalability problems gives him a high! He’s done just that at Cisco, Argus and now GE.
  • 3. The GE That’s Not Obvious
  • 4. GE RenewablesLargest and the most powerful offshore wind turbines
  • 5. GE Renewables Better models to avoid unplanned downtimes
  • 8. Atlanta Monitoring and Diagnostics GE Power
  • 9. Every 2 seconds a GE Powered aircraft takes off GE Aviation
  • 10. Creating impossibly complex parts at scale GE Aviation
  • 11. GE Predix ▪ The world’s largest Industrial IoT Platform ▪ Cross-industries data collection, analytics and alerts generation
  • 12. GE Predix - Data Fabric
  • 13. Requirements for Building the Largest IIoT Platform in the World ▪ Sustain more than 1TB/day of data ▪ Absorb 10’s of billions of records per day ▪ Support millions of parallel analytics transactions ▪ Must keep sub-second response time, end-to-end
  • 14. What Wasn’t Working ▪ Node storage limitation cause node sprawl ▪ Increasing and decreasing clusters became a challenge ▪ Application latency numbers are on the edge of the SLA envelope ▪ Managing large number of nodes required increasing our talent pool
  • 15. Evaluation ▪ Started with a fairly small cluster of 5 nodes ▪ AWS based deployment, using Scylla AMIs ▪ Directed our application to Scylla, same API, no code change ▪ Node consolidation works: same performance, fewer nodes ▪ Testing takes time, prepare a team and test plan ahead of time ▪ Use Scylla Slack channel for help and questions
  • 16. Our Data Model ▪ Time-series data model ▪ Sensor information inserted on a per tenant record ▪ Resulting in → A heavy single table • Tenant info • Tag info • Metrics info ▪ We bucket data on time intervals • Helps with partition size ▪ Rollups based on time-intervals are kept in database ▪ Different data attrition policies
  • 17. How it Looks from the Application Side ▪ Our performance cluster setup Component Quantity Resource Type ScyllaDB nodes 9 I3.8Xlarge TSQS nodes 5 I3.2Xlarge Kafka and ZooKeeper Nodes 7 C4.8Xlarge Go Pipeline 32 CloudFoundry instances
  • 18. Ingestion Performance Results ▪ ~2.5B data points per hour → ~700K data points per second
  • 19. Query Performance Results ▪ ~325M Queries per hour → ~90K queries per second
  • 20. Migrating to Scylla ▪ Online operations must continue ▪ Used Spark to migrate existing data from Cassandra to Scylla ▪ Collaborate with Scylla team to optimize data model ▪ Reducing cluster footprints helps us reduce operational overhead
  • 21. Migrating Using Spark ▪ Operations continue as normal during migration ▪ Data model optimization is possible during migration process
  • 22. Shrink the Cluster ▪ Same or higher workload with fewer nodes Cassandra 75 i3.2xl Node reduction Lower cost Reduced admin overhead Scylla 15 i3.8xl
  • 23. Scylla Deployments ▪ Replacement for Cassandra clusters ▪ Various clusters, at different size • Build a tool for the task ▪ Reduced clusters footprint by >60% ▪ Stable, predictable performance over time Result: We can serve more users, at lower operational cost
  • 24. Conclusions, or What We Care About ▪ Scalability is paramount ▪ Database stability ▪ Performance predictability
  • 25. Thank You Any Questions ? Please stay in touch VenkateshS@ge.com & arvind.singh1@ge.com

Editor's Notes

  1. Introduction - Venkat and Arvind How we deal with Time Series in the IIoT world Kinds of problems we face Exploration of ScyllaDB And the migration Strategy we employed
  2. GE isn’t just an appliance or lighting company.
  3. GE Renewables manufacture and service some of the largest and powerful offshore wind turbines. The picture show Hallide-X: Standing 260 meters tall from its heel to blade tips — more than half the height of the Empire State Building — with blades that each extend 107 meters, the turbine will generate 12 megawatts (16K European homes) Wind is depending
  4. To successfully ​Better models of the entire site leads to better operations and avoid unplanned downtime ​https://www.youtube.com/watch?v=IGDASBgrIRc In Switzerland over 60% of the country’s electricity is generated by water.
  5. ​Using drones to map the area around a hydro dam in Linthal, Switzerland the team was able to develop a way to store potential energy in an elevated basin ​And visualize their entire operations ​finding creative ways to store excess solar and wind energy in the form of pumped water ​can be reprocessed by hydro to provide power during low periods of solar/wind ​https://www.youtube.com/watch?v=IGDASBgrIRc ​
  6. ​GE manufactures massive turbines used to generate electricity, Gas or Steam powered ​Example produces enough electricity to power 500k homes ​GE equipment generates 30% the worlds power 80% of power moved throughout north america is controlled by GE systems A power plant in France using GE’s HA gas turbine won the Guinness World Record for net efficiency at 62.22 percent in 2016.
  7. ​Atlanta Monitoring & Diagnostic Center >200GW capacity, >1500 GE turbines 1 million sensors attached to machines send 200 billion data points per day Today and every day we will run over 900,000 analytics on the world’s largest fleet of gas turbines under management across 60 countries, covering 350 million people. https://www.ge.com/reports/knowledge-power-takes-software/
  8. ​state of the art jet engines ​every 2 seconds there is a commercial airplane with a GE engine taking off ​about 2200 aircraft are concurrently in the air, each carrying approximately 300 passengers, or a city of 660k in the air ​the most efficient jet engine GE-NX long haul flight ​ceramic matrix composite fan blades ​https://www.ge.com/reports/the-art-of-engineering-the-worlds-largest-jet-engine-shows-off-composite-curves/ ​additive metal manufacturing ​https://www.ge.com/reports/treat-avgeeks-inside-look-ges-3d-printed-aircraft-engine/ ​The 3D printed nozzle combined all 20 parts into a single unit, but it also weighed 25 percent less ​https://www.ge.com/reports/epiphany-disruption-ge-additive-chief-explains-3d-printing-will-upend-manufacturing/ ​Ultra-long-haul flights to become more fuel efficient: perth to London 17h20m ​https://www.ge.com/reports/flight-fancy-qantas-jet-flies-non-stop-australia-london-first-time-powered-ge/ ​20% less fuel vs planes of similar size. ~$1.6M per plane per year
  9. 30,000 fuel nozzles have been 3D printed for its LEAP engines. Under the additive manufacturing method, the number of parts in a single fuel nozzle tip was reduced from about 20 pieces previously welded together to one whole piece. https://www.3dprintingmedia.network/ge-aviation-already-3d-printed-30000-fuel-nozzles-for-its-leap-engine/ --- The ATP’s cubist-looking fuel heater is honeycombed with tiny complex passages. GE will 3D print it. Image credit: Tomas Kellner/GE Reports https://www.ge.com/reports/treat-avgeeks-inside-look-ges-3d-printed-aircraft-engine/
  10. GE Predix: The world’s leading IOT platform, helps monitor oil rigs, gas/wind turbines, nuclear reactors, escalators/elevators & flying airplanes, issue timely alerts and recommend actions. Predix not only saves its customers dollars, it helps save lives! One of those times, when you say every second counts, it literally means that! For Predix, the scalability and performance is mission critical.
  11. Data Fabric offers a way to ingest at scale, enable data and analytical processing without knowing the The world’s leading IOT platform, helps monitor oil rigs, gas/wind turbines, nuclear reactors, escalators/elevators & flying airplanes, issue timely alerts and recommend actions. Predix not only saves its customers dollars, it helps save lives! One of those times, when you say every second counts, it literally means that! For Predix, the scalability and performance is mission critical.
  12. We build a system to last. Our IoT system have to