SlideShare a Scribd company logo
1 of 26
Download to read offline
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Denis	Magda
GridGainProduct	Manager
Apache	Ignite	PMC
Apache	Ignite and	Apache	Spark
Where	Fast	Data	Meets	the	IoT
http://ignite.apache.org #apacheignite #denismagda
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Agenda
• IoTDemands	to	Software
• IoTSoftware	Stack
• Device	OS/RTOS
• Data	Collection	and	Enrichment
• HTAP	Platform
• Application	APIs
• Demo
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
IoT Demands	to	Software
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
IoT Demands	to	Software
• Real-time	ingestion
• Real-time	processing
• Time-series	support
• Flexible	Querying	APIs
– SQL
– Full-text	search
– Geo-spatial
• Analytics
– BI
– Machine	Learning
• High-availability
• Simple	scalability
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
IoT Software	Stack
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
HTAP Platform
Data Collectionand Enrichment
Device OS / Real-Time OS
Application APIs
IoT Software	Stack
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
HTAP Platform
Data Collectionand Enrichment
Device OS / Real-Time OS
Application APIs
Apache	IoT Software	Stack
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Device	OS/RTOS
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Apache	MyNewt
• Open	Source	RTOS
– Cortex	M0-M4
– MIPS	&	RISC-V
• Networking
– Bluetooth	Low	Energy
– Wi-Fi
– TCP/IP	&	UPD
• Build	&	Package	Management
• Secure	bootloader	and	signed	
images
• Remote	Firmware	Upgrade
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Data	Collection	and	Enrichment
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Data	Collection	and	Enrichment
• Spark	Streaming
– Fault-Tolerant	Streams	
Processing
• Data	Collection
– Sockets
– Kafka
– Flume
• Data	Enrichment
– Spark	API
• Data	Storage
– Apache	Ignite
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
HTAP	Platform
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Use	Case:	 Smart Metering and Utilities – delivers a comprehensive IOT platform
50+ Million
Meters
• SilverSpring	Requirements
– Migrate	to	in-memory	processing
– Add	scalability	&	elasticity
– Use	open	source	technologies
• SilverSpring Solution
– SaaS	Architecture
• Multi-Tenancy
• Advanced	Security
– Strong	compute	capabilities
• Co-located	 in-memory	processing
– Demonstrated	best
• On-demand	elasticity	&	scalability
• ANSI-99	SQL	Support
• Transactional	consistency
GridGain
Security
SilverSpring IoT
Platform
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Apache	Ignite	HTAP	Platfrom
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Key-Value	Data	Grid
• Distributed	Key-Value	Data	Store
• Data	Reliability
• High-Availability	
– Active	replicas,	automatic	
failover
• Data	Consistency	
– ACID	distributed	transactions
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
In-Memory	Data	Grid:	Partitioning
Partitioned	Cache Replicated	Cache
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Streaming	to	Ignite
• Ignite	Data	Streamer
– Fastest	Data	Injection
– Automatic	data	partitioning
• Stream	Receivers
– Custom	logic	execution
– Additional	data	transformation
• Stream	Adapter
– Socket	
– Kafka
– Flink
– RocketMQ
– etc.
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Application	APIs
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Apache	Ignite	SQL	APIs
• ANSI-99	SQL
• Geo-spatial	Queries
• Full-text	Search
• Always	Consistent
• Fault	Tolerant
• Cross-Platform
– JDBC	and	ODBC	drivers	
– DML	(INSERT,	UPDATE,	etc.)
– DDL	(CREATE,	DROP,	etc.)
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Apache	Ignite	SQL	Grid:	Queries
• Distributed	Joins
• Automatic	Group	By,	
Aggregations,	Sorting
• Cross-Cache	Joins,	Unions,	etc.
• Ad-Hoc	SQL	Support
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Apache	Ignite	Compute	Grid
• Direct	API	for	
MapReduce
• Direct	API	for	ForkJoin
• Zero	Deployment
• State	Checkpoints
• Load	Balancing
• Automatic	Failover
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Spark	Shared	RDDs
• IgniteRDD
– Share	RDD	across	jobs	on	
the	host
– Share	RDD	across	jobs	in	
the	application
– Share	RDD	globally
• Faster	SQL
– In-Memory	Indexes
– SQL	on	top	of	Shared	RDD
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Machine	Learning	Grid
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Demo
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
Resources
• Documentation:
– Apache	Ignite:	https://apacheignite.readme.io/docs
– Apache	Spark	Streaming:	http://spark.apache.org/streaming/
– Ignite	and	Spark	Integration:	https://apacheignite-
fs.readme.io/docs/ignite-for-spark
– Apache	MyNewt:	http://mynewt.apache.org
• Demo	Source	Code:
– https://github.com/dmagda/IgniteSparkIoT
Apache®,	 Apache	 Ignite,	 Ignite®,	 and	the	 Apache	 Ignite	 logo	are	 either	 registered	 trademarks	 or	trademarks	 of	the	Apache	 Software	 Foundation	 in	the	United	 States	 and/or	other	countries.
ANY	QUESTIONS?
Thank	you	for	joining	us.	Follow	 the	conversation.
http://ignite.apache.org
#apacheignite #denismagda

More Related Content

What's hot

What's hot (20)

Data Lake Architecture
Data Lake ArchitectureData Lake Architecture
Data Lake Architecture
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Kafka Tutorial - basics of the Kafka streaming platform
Kafka Tutorial - basics of the Kafka streaming platformKafka Tutorial - basics of the Kafka streaming platform
Kafka Tutorial - basics of the Kafka streaming platform
 
Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive

Apache Kudu: Technical Deep Dive


Apache Kudu: Technical Deep Dive


 
Apache Spark Overview
Apache Spark OverviewApache Spark Overview
Apache Spark Overview
 
Data Migration with Spark to Hive
Data Migration with Spark to HiveData Migration with Spark to Hive
Data Migration with Spark to Hive
 
Cassandra Introduction & Features
Cassandra Introduction & FeaturesCassandra Introduction & Features
Cassandra Introduction & Features
 
Introduction to Apache Spark
Introduction to Apache SparkIntroduction to Apache Spark
Introduction to Apache Spark
 
File Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & ParquetFile Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & Parquet
 
Optimizing Delta/Parquet Data Lakes for Apache Spark
Optimizing Delta/Parquet Data Lakes for Apache SparkOptimizing Delta/Parquet Data Lakes for Apache Spark
Optimizing Delta/Parquet Data Lakes for Apache Spark
 
Delta from a Data Engineer's Perspective
Delta from a Data Engineer's PerspectiveDelta from a Data Engineer's Perspective
Delta from a Data Engineer's Perspective
 
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
 
What is Apache Spark | Apache Spark Tutorial For Beginners | Apache Spark Tra...
What is Apache Spark | Apache Spark Tutorial For Beginners | Apache Spark Tra...What is Apache Spark | Apache Spark Tutorial For Beginners | Apache Spark Tra...
What is Apache Spark | Apache Spark Tutorial For Beginners | Apache Spark Tra...
 
Optimizing Apache Spark SQL Joins
Optimizing Apache Spark SQL JoinsOptimizing Apache Spark SQL Joins
Optimizing Apache Spark SQL Joins
 
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapRHadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
Hadoop benchmark: Evaluating Cloudera, Hortonworks, and MapR
 
Modern Data Warehousing with Amazon Redshift
Modern Data Warehousing with Amazon RedshiftModern Data Warehousing with Amazon Redshift
Modern Data Warehousing with Amazon Redshift
 
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdfSpring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
Spring Cloud Gateway - SpringOne Tour 2023 Charles Schwab.pdf
 
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
Part 2: Apache Kudu: Extending the Capabilities of Operational and Analytic D...
 
Apache Spark Architecture
Apache Spark ArchitectureApache Spark Architecture
Apache Spark Architecture
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format Ecosystem
 

Similar to Apache Spark and Apache Ignite: Where Fast Data Meets the IoT with Denis Magda

August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
Yahoo Developer Network
 

Similar to Apache Spark and Apache Ignite: Where Fast Data Meets the IoT with Denis Magda (20)

IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
IMCSummit 2015 - Day 2 Developer Track - Anatomy of an In-Memory Data Fabric:...
 
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
IMCSummite 2016 Breakout - Nikita Ivanov - Apache Ignite 2.0 Towards a Conver...
 
Microservices Architectures With Apache Ignite
Microservices Architectures With Apache IgniteMicroservices Architectures With Apache Ignite
Microservices Architectures With Apache Ignite
 
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
August 2016 HUG: Better together: Fast Data with Apache Spark™ and Apache Ign...
 
Detailed guide to the Apache Spark Framework
Detailed guide to the Apache Spark FrameworkDetailed guide to the Apache Spark Framework
Detailed guide to the Apache Spark Framework
 
Apache Deep Learning 101 - DWS Berlin 2018
Apache Deep Learning 101 - DWS Berlin 2018Apache Deep Learning 101 - DWS Berlin 2018
Apache Deep Learning 101 - DWS Berlin 2018
 
Introduction to Apache Flink, Vienna 07.11.2018
Introduction to Apache Flink, Vienna 07.11.2018Introduction to Apache Flink, Vienna 07.11.2018
Introduction to Apache Flink, Vienna 07.11.2018
 
Performance tuning your Hadoop/Spark clusters to use cloud storage
Performance tuning your Hadoop/Spark clusters to use cloud storagePerformance tuning your Hadoop/Spark clusters to use cloud storage
Performance tuning your Hadoop/Spark clusters to use cloud storage
 
Introduction ciot workshop premeetup
Introduction ciot workshop premeetupIntroduction ciot workshop premeetup
Introduction ciot workshop premeetup
 
SAP & Open Souce - Give & Take
SAP & Open Souce - Give & TakeSAP & Open Souce - Give & Take
SAP & Open Souce - Give & Take
 
Big Data Processing with Hadoop-MapReduce in Cloud Systems
Big Data Processing with Hadoop-MapReduce in Cloud SystemsBig Data Processing with Hadoop-MapReduce in Cloud Systems
Big Data Processing with Hadoop-MapReduce in Cloud Systems
 
Tweet4Beer (atualizada): Torneira de Chopp Controlada por Java, JavaFX, IoT ...
Tweet4Beer (atualizada): Torneira de Chopp Controlada por Java, JavaFX, IoT ...Tweet4Beer (atualizada): Torneira de Chopp Controlada por Java, JavaFX, IoT ...
Tweet4Beer (atualizada): Torneira de Chopp Controlada por Java, JavaFX, IoT ...
 
[Rakuten TechConf2014] [C-6] Leveraging Spark for Cluster Computing
[Rakuten TechConf2014] [C-6] Leveraging Spark for Cluster Computing[Rakuten TechConf2014] [C-6] Leveraging Spark for Cluster Computing
[Rakuten TechConf2014] [C-6] Leveraging Spark for Cluster Computing
 
Varun-CV-J
Varun-CV-JVarun-CV-J
Varun-CV-J
 
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a... The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
The Analytic Platform behind IBM’s Watson Data Platform by Luciano Resende a...
 
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
20180921_DOAG_BigDataDays_OracleSpatialandPython_kpatenge
 
20180417 hivemall meetup#4
20180417 hivemall meetup#420180417 hivemall meetup#4
20180417 hivemall meetup#4
 
CCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 Exam
CCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 ExamCCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 Exam
CCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 Exam
 
CCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 Exam
CCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 ExamCCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 Exam
CCA 175 - Hadoop & Spark Developer Certification | Cloudera CCA 175 Exam
 
Konsolidace Oracle DB na systémech s procesory M7
Konsolidace Oracle DB na systémech s procesory M7Konsolidace Oracle DB na systémech s procesory M7
Konsolidace Oracle DB na systémech s procesory M7
 

More from Databricks

Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
Databricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
Databricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Databricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
Databricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
Databricks
 

More from Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Recently uploaded

Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
RafigAliyev2
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
pyhepag
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
pyhepag
 

Recently uploaded (20)

Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
Fuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertaintyFuzzy Sets decision making under information of uncertainty
Fuzzy Sets decision making under information of uncertainty
 
Data analytics courses in Nepal Presentation
Data analytics courses in Nepal PresentationData analytics courses in Nepal Presentation
Data analytics courses in Nepal Presentation
 
AI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdfAI Imagen for data-storytelling Infographics.pdf
AI Imagen for data-storytelling Infographics.pdf
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflictSupply chain analytics to combat the effects of Ukraine-Russia-conflict
Supply chain analytics to combat the effects of Ukraine-Russia-conflict
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 
Slip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp ClaimsSlip-and-fall Injuries: Top Workers' Comp Claims
Slip-and-fall Injuries: Top Workers' Comp Claims
 
How I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prisonHow I opened a fake bank account and didn't go to prison
How I opened a fake bank account and didn't go to prison
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理一比一原版西悉尼大学毕业证成绩单如何办理
一比一原版西悉尼大学毕业证成绩单如何办理
 
Machine Learning for Accident Severity Prediction
Machine Learning for Accident Severity PredictionMachine Learning for Accident Severity Prediction
Machine Learning for Accident Severity Prediction
 
basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理一比一原版阿德莱德大学毕业证成绩单如何办理
一比一原版阿德莱德大学毕业证成绩单如何办理
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
Artificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdfArtificial_General_Intelligence__storm_gen_article.pdf
Artificial_General_Intelligence__storm_gen_article.pdf
 
Easy and simple project file on mp online
Easy and simple project file on mp onlineEasy and simple project file on mp online
Easy and simple project file on mp online
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 

Apache Spark and Apache Ignite: Where Fast Data Meets the IoT with Denis Magda

  • 1. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Denis Magda GridGainProduct Manager Apache Ignite PMC Apache Ignite and Apache Spark Where Fast Data Meets the IoT http://ignite.apache.org #apacheignite #denismagda
  • 2. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Agenda • IoTDemands to Software • IoTSoftware Stack • Device OS/RTOS • Data Collection and Enrichment • HTAP Platform • Application APIs • Demo
  • 3. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. IoT Demands to Software
  • 4. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. IoT Demands to Software • Real-time ingestion • Real-time processing • Time-series support • Flexible Querying APIs – SQL – Full-text search – Geo-spatial • Analytics – BI – Machine Learning • High-availability • Simple scalability
  • 5. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. IoT Software Stack
  • 6. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. HTAP Platform Data Collectionand Enrichment Device OS / Real-Time OS Application APIs IoT Software Stack
  • 7. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. HTAP Platform Data Collectionand Enrichment Device OS / Real-Time OS Application APIs Apache IoT Software Stack
  • 8. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Device OS/RTOS
  • 9. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Apache MyNewt • Open Source RTOS – Cortex M0-M4 – MIPS & RISC-V • Networking – Bluetooth Low Energy – Wi-Fi – TCP/IP & UPD • Build & Package Management • Secure bootloader and signed images • Remote Firmware Upgrade
  • 10. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Data Collection and Enrichment
  • 11. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Data Collection and Enrichment • Spark Streaming – Fault-Tolerant Streams Processing • Data Collection – Sockets – Kafka – Flume • Data Enrichment – Spark API • Data Storage – Apache Ignite
  • 12. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. HTAP Platform
  • 13. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Use Case: Smart Metering and Utilities – delivers a comprehensive IOT platform 50+ Million Meters • SilverSpring Requirements – Migrate to in-memory processing – Add scalability & elasticity – Use open source technologies • SilverSpring Solution – SaaS Architecture • Multi-Tenancy • Advanced Security – Strong compute capabilities • Co-located in-memory processing – Demonstrated best • On-demand elasticity & scalability • ANSI-99 SQL Support • Transactional consistency GridGain Security SilverSpring IoT Platform
  • 14. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Apache Ignite HTAP Platfrom
  • 15. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Key-Value Data Grid • Distributed Key-Value Data Store • Data Reliability • High-Availability – Active replicas, automatic failover • Data Consistency – ACID distributed transactions
  • 16. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. In-Memory Data Grid: Partitioning Partitioned Cache Replicated Cache
  • 17. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Streaming to Ignite • Ignite Data Streamer – Fastest Data Injection – Automatic data partitioning • Stream Receivers – Custom logic execution – Additional data transformation • Stream Adapter – Socket – Kafka – Flink – RocketMQ – etc.
  • 18. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Application APIs
  • 19. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Apache Ignite SQL APIs • ANSI-99 SQL • Geo-spatial Queries • Full-text Search • Always Consistent • Fault Tolerant • Cross-Platform – JDBC and ODBC drivers – DML (INSERT, UPDATE, etc.) – DDL (CREATE, DROP, etc.)
  • 20. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Apache Ignite SQL Grid: Queries • Distributed Joins • Automatic Group By, Aggregations, Sorting • Cross-Cache Joins, Unions, etc. • Ad-Hoc SQL Support
  • 21. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Apache Ignite Compute Grid • Direct API for MapReduce • Direct API for ForkJoin • Zero Deployment • State Checkpoints • Load Balancing • Automatic Failover
  • 22. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Spark Shared RDDs • IgniteRDD – Share RDD across jobs on the host – Share RDD across jobs in the application – Share RDD globally • Faster SQL – In-Memory Indexes – SQL on top of Shared RDD
  • 23. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Machine Learning Grid
  • 24. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Demo
  • 25. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. Resources • Documentation: – Apache Ignite: https://apacheignite.readme.io/docs – Apache Spark Streaming: http://spark.apache.org/streaming/ – Ignite and Spark Integration: https://apacheignite- fs.readme.io/docs/ignite-for-spark – Apache MyNewt: http://mynewt.apache.org • Demo Source Code: – https://github.com/dmagda/IgniteSparkIoT
  • 26. Apache®, Apache Ignite, Ignite®, and the Apache Ignite logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. ANY QUESTIONS? Thank you for joining us. Follow the conversation. http://ignite.apache.org #apacheignite #denismagda