SlideShare a Scribd company logo
1 of 42
1
Rahul Gupta
© 2018 Hazelcast Inc. Confidential & Proprietary
2
DISTRIBUTED COMPUTING. SIMPLIFIED.
Hazelcast Jet is a distributed
computing platform for fast processing
of big data sets. Jet is based on a
parallel core engine allowing data-
intensive applications to operate at
near real-time speeds.
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Company Snapshot
Hazelcast founded
commercial open source
business model
Offices in Palo
Alto, London,
Istanbul
2008 2012 2013 2014 2015
JCache
released
Greg Luck
co-lead JCache
(JSR107)
Hazelcast introduces
JCache standard
for enterprise caching
40MM
Phone Homes per month
2016 2017
135 Subscribing
customers
Cash flow
positive
Gartner Market Guide for IMDG &
Leader in Forrester IMDG Wave
Report 2015
Office in BrnoHazelcast 1: First open
source IMDG
Hazelcast
Jet, fast Big
Data product
launched
Gartner Market
Guide for IMDG
© 2018 Hazelcast Inc. Confidential & Proprietary
Why Hazelcast Jet?
Performance – Low-latency at scale
Highly optimized reading from and writing to IMDG
Embeddable and light-weight
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Key Differentiators
PRODUCTIVE
ENTERPRISE GRADE STORAGEFLEXIBLE INTEGRATION
HIGH PERFORMANCE
• One solution provides both batch and stream
processing.
• Based on Hazelcast legendary ease-of-use.
• Variety of connectors enable easy integration into data
processing pipeline.
• Scaling, failure handling and recovery are all automated
for ease of operational management.
• Exactly-Once or At-Least Once guarantees.
• Jet team provides quick response to new feature
requests.
• Embeds Hazelcast IMDG as an operational storage for
enterprise-grade reliability and resilience.
• Most widely deployed IMDG with tens of thousands of
deployments in production worldwide.
• High-Performance integration eliminates network transit
latency penalties from moving data back and forth.
• Eliminates the integration issues between processing
and storage.
• Operates as shared application infrastructure or is
embedded directly in applications as a JAR- making
big data processing an application concern.
• Ideal for microservices with light-weight footprint --each
Jet processing job is launched within its own cluster
maximizing service isolation.
• Light-weight makes manipulation with Jet easy for both
developers and DevOps.
• Cloud and container ready. Pivotal Cloud Foundry,
OpenShift, Docker, Azure, AWS, Kubernetes, …
• Distributed computing platform enables fast processing
of big data.
• Parallel and low-latency core engine allows data-
intensive applications to operate at real-time speeds.
• Cooperative multi-threading architecture allows
operation of thousands of jobs simultaneously.
• Scales both vertically and horizontally.
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Key Competitive Differentiators
6
• High Performance | Industry Leading Performance
• Works great with Hazelcast IMDG | Source, Sink, Enrichment
• Very simple to program | Leverages existing standards
• Very simple to deploy | Embed 10MB jar or Client Server
• Works in every Cloud | Same as Hazelcast IMDG
• For Developers by Developers | Code it
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Product Overview
7
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Architecture
8
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Use Cases
Real-time Stream
processing
Distributed
In-Memory Computation
• Big Data in near real-time
• Plugged into data processing pipeline
• Distributed, in-memory computation
• Aggregating, joining multiple sources,
filtering, transforming, enriching
• Elastic scalability
• Super fast
• High availability
• Fault tolerant
• Simple, modern API
• Distributed, in-memory computation
• Transforming and querying
distributed data structures
• Computation close to the data
• Distributed clustering
• High availability
• Fault tolerance
Data-Processing
Microservices
• Data-processing microservices
• Isolation of services with many,
small clusters
• Service registry
• Network discovery
• Inter-process messaging
• Fully embeddable
• Spring Cloud, Boot Data Services
© 2018 Hazelcast Inc. Confidential & Proprietary
Jet Streaming Performance
*
* Spark had all performance options turn on including Tungsten
© 2018 Hazelcast Inc. Confidential & Proprietary
Performance - Throughput
*
* Spark had all performance options turn on including Tungsten
© 2018 Hazelcast Inc. Confidential & Proprietary
Performance - ForkJoin
• Jet is faster than JDK’s j.u.s implementation. The issue is in the JDK the
character stream reports "unknown size" and thus it cannot be parallelized
• If you first load the data into a List in RAM then run the JDK’s j.u.s it comes out a
little faster
© 2018 Hazelcast Inc. Confidential & Proprietary
Jet Application Deployment Options
IMDG IMDG
• No separate process to manage
• Great for microservices
• Great for OEM
• Simplest for Ops – nothing extra
Embedded
Application
Java API
Application
Java API
Application
Java API
Java Client
Application
Client-Server
Jet
Jet Jet
Jet
Jet Jet
Java Client
Application
Java Client
Application
Java Client
Application
• Separate Jet Cluster
• Scale Jet independent of applications
• Isolate Jet from application server lifecycle
• Managed by Ops
© 2018 Hazelcast Inc. Confidential & Proprietary
Jet with Hazelcast Deployment Choices
Jet Compute
Cluster
Hazelcast
IMDG Cluster
Sink Enrichment
Message
Broker
(Kafka)
Data
Enrichment
HDFS
Jet Cluster
Sink
Source /
Enrichment
Good when:
• Where source and sink are primarily Hazelcast
• Jet and Hazelcast have equivalent sizing needs
Good when:
• Where source and sink are primarily Hazelcast
• Where you want isolation of the Jet cluster
© 2018 Hazelcast Inc. Confidential & Proprietary
Stream Processing
• Support for events arriving out of order via Watermarks
• Sliding, Tumbling and Session window support
© 2018 Hazelcast Inc. Confidential & Proprietary
Fault Tolerance – Distributed State Snapshots
Exactly-Once, At-Least Once or No Guarantee to
optimize between performance and correctness
Distributed State Snapshots to back-up running
computations
Resilience with backups distributed and replicated
across the cluster to prevent losing data when member
fails
Simplicity as the snapshots are stored in embedded in-
memory structures. No further infrastructure is
necessary
© 2018 Hazelcast Inc. Confidential & Proprietary
Job Management & Fault Tolerance
• Job state and lifecycle saved to IMDG IMaps and benefit
from their performance, resilience, scale and persistence
• Automatic re-execution of part of the job in the event of a
failed worker
• Tolerant of loss of nodes, missing work will be recovered
from last snapshot and re-executed
• Cluster can be scaled without interrupting jobs – new jobs
benefit from the increased capacity
• State and snapshots can be persisted to resume after
cluster restart
© 2018 Hazelcast Inc. Confidential & Proprietary
Processing Guarantees
Guarantee Snapshots Performance
None No Fastest
At-Least Once Yes Slower
Exactly-Once Yes Slower
© 2018 Hazelcast Inc. Confidential & Proprietary
Data Input and Output
• Hazelcast Icache (JCache), (batch and streaming of
changes)
• Hazelcast IMap (batch and streaming of changes)
• Hazelcast IList (batch)
• HDFS (batch)
• Kafka (streaming)
• Socket (text encoding) (streaming)
• File (batch)
• FileWatcher (streaming – as new files appear)
• Custom, as sources and sinks are blocking Processors
© 2018 Hazelcast Inc. Confidential & Proprietary
Stream and Batch Processing
Maps, Caches
and Lists
Databases
IoT
Custom
Connector
Enterprise
Applications
Hazelcast IMDG
HDFS
Stream
Batch or
Stream
Batch or
Stream
Batch or
Stream
Batch
Ingest
Alerts
Enterprise
Applications
Interactive
Analytics
Hazelcast IMDG
Maps, Caches,
Lists
Output
Databases
Enrichment
Kafka
Stream
File /
File Watcher Batch
Socket
Stream
Kafka
HDFS
FileBatch
Database
Database
Events Stream
(via Striim CDC)
© 2018 Hazelcast Inc. Confidential & Proprietary
Business Use Cases
© 2018 Hazelcast Inc. Confidential & Proprietary
Field Equipment Monitoring and Analytics
Hazelcast Jet Enables Real-time Monitoring of Sensors and Processing of Streaming Data from
Field Equipment
Industries such as Oil & Gas use sensor data to automate decision making. Events reported from oil rig sensors are evaluated in
the field so that rig settings are adjusted in real time. Hazelcast Jet is ideal for applications that require near real-time insight.
Deployment simplicity makes it a optimal choice for both remote field and data center applications.
Why most processing solutions fall short… Why is Hazelcast Jet ideal?
Traditional data processing frameworks don’t allow direct embedding into
an application or independent system, forcing users to install them as a
secondary standalone server complicating deployment of the solution.
Jet can be used as an embedded library --avoiding need for separate server
tier. Jet doesn’t have any dependencies, so no further infrastructure
necessary to run a cluster. Jet allows users to build self-contained
applications.
Field data processing spans from sensors to on-rig processing, and from
there to processing within the data center --involving multiple execution
environments and hardware architectures. Traditional tools require
different runtimes (single-node, multi-node, different architectures),
resulting in higher system TCO.
Jet is a lightweight tool with a small memory footprint. There is just one
distribution to be used among all deployments. Running on a JVM, Jet can be
used on a variety of hardware platforms.
Traditional complex event processing solutions are built for single
machine/server deployments and cannot be scaled out easily.
Jet can be scaled out easily, as it’s built on top of a massively parallel,
distributed computing core (Hazelcast IMDG). It’s designed to scale out.
Most processing solutions experience growing latencies as the workload
throughput increases.
Jet is designed for low latency under the most grueling workloads with latency
remaining relatively flat even up to 5,000,000 aggregations per second.
© 2018 Hazelcast Inc. Confidential & Proprietary
Field Equipment Monitoring and Analytics Case
Study
CUSTOMER SUCCESS
Leading oil & gas system
integrator, specializing in
acquisition, persistence,
secure transportation and
dissemination of high-
frequency sensor data
10’s of 1000’s of events
per second are gathered
throughout the process
CHALLENGE
• Use sensor from oil wells data to automate decision-making:
- Keep latency low and consistent at scale, while system turns insights into decisions executed in a closed loop
- Simplify system deployment and maintenance in remote locations with limited bandwidth
- Use standards that extend system viability and maintenance lowering TCO
• Requires high application availability/performance for early issue detection to avoid production loss and
optimize well productivity
SOLUTION
• Hazelcast Jet is the processing backbone of application monitoring well sensors. API connects multiple
sensor data streams with varying formats and frequency. After joining and filtering, sliding and session
windows compute data insights to decisions
• Embedded Hazelcast IMDG is operational data store for easy scaling, whether on bare metal or in AWS
• Real-time analysis and correlation of event data from various devices and sensors throughout the process
allow immediate action. Jet adjusts rig settings in real time.
WHY HAZELCAST JET
• Performance and scalability
• In-memory data store with parallel processing enables scalable real-time analytics
• Open source, standards-based avoids vendor lock-in
© 2018 Hazelcast Inc. Confidential & Proprietary
Payment Processing
Hazelcast Jet Modernizes Payment Processing, Accelerating Bank Performance and Improving
Customer Experience
Hazelcast Jet is an ideal technology for payment processing systems as it offers the lowest end-to-end processing latencies on a
consistent basis. This is the result of Jet combining data processing and messaging in a single solution with elastically scalable
storage for caching data from 3rd party systems.
Why most processing solutions fall short… Why is Hazelcast Jet ideal?
Most processing solutions experience growing latencies as the workload
throughput increases.
Jet is designed for low latency under the most grueling workloads with latency
remaining relatively flat even up to 5,000,000 aggregations per second.
Most data processing solutions rely on 3rd party tools for messaging and
storage increasing deployment and maintenance complexity.
Jet embeds Hazelcast IMDG, which provides messaging and storage
capabilities in a single solution simplifying deployment and maintenance while
lowering TCO.
Traditional solutions for service orchestration are built for single
machine/server deployments and cannot be scaled out easily.
Jet scales easily, as it’s built on top of a massively parallel, distributed
computing core (Hazelcast IMDG). It’s designed to scale out.
The elasticity of a Jet cluster allows adding and removing computing and
storage resources in accordance with changing processing demand.
© 2018 Hazelcast Inc. Confidential & Proprietary
Payment Processing Case Study
CUSTOMER SUCCESS
A global information
technology solutions
company
Processing 10’s of 1,000’s
of payments per second
today.
Built-in scalability to
support future business.
CHALLENGE
• Before settling a transaction, payment processing systems check the merchant details by forwarding them to
the card’s issuing bank or association for verification, and carry out anti-fraud measures
• Each step in this pipeline requires the lowest possible latency to deliver a positive customer experience
• With 24/7 global operations and hard SLAs, resiliency and automatic recovery are a must-have
SOLUTION
• Within the payment processing application, Jet acts as the pipeline for each payment process step
• The payment management application orchestrates XML payment instructions and forwards them to the
respective card’s issuing bank or association for verification, then carries out anti-fraud measures before
settling transactions
• Multiple Jet processing jobs are pipeline components. Hazelcast IMDG distributed IMaps are used for
transaction ingestion and messaging
WHY HAZELCAST JET
• High-performance connectors between Jet and IMDG enable low-latency operations; consistent low latency
of the Hazelcast platform keeps the CGI payment management application within strictest SLA requirements
• Automatic recovery of the Jet cluster achieves high-availability even during failures
• Open source, standards-based avoids vendor lock-in
© 2018 Hazelcast Inc. Confidential & Proprietary
APIs
© 2018 Hazelcast Inc. Confidential & Proprietary
APIs
© 2018 Hazelcast Inc. Confidential & Proprietary
Pipeline API
• General purpose, declarative API: both powerful and simple to use
• Recommended as the best place to start using Jet
• Supports fork, join, cogroup, map, filter, flatmap, reduce, groupby
• Works with all sinks and sources
• Is a DSL which is put through a planner and converted to DAG plan
for execution
• Batch and Streaming (window support in 0.6)
See https://github.com/hazelcast/hazelcast-jet-code-samples
© 2018 Hazelcast Inc. Confidential & Proprietary
Pipeline API Code Sample
JetInstance jet = Jet.newJetInstance();
Pattern delimiter = Pattern.compile("W+");
Pipeline p = Pipeline.create();
p.drawFrom(Sources.<Long, String>readMap(BOOK_LINES))
.flatMap(e -> traverseArray(delimiter.split(e.getValue().toLowerCase()))) .filter(word ->
!word.isEmpty())
.groupBy(wholeItem(), counting())
.drainTo(Sinks.writeMap(COUNTS));
Job job = jet.newJob(p);
job.join();
© 2018 Hazelcast Inc. Confidential & Proprietary
Distributed java.util.stream API
• Jet adds distributed support for the java.util.stream API for
Hazelcast Map, List and Cache
• Supports all j.u.s. operations such as:
− map(), flatMap(), filter(), reduce(), collect(), sorted(), distinct()
• Lambda serialization is solved by creating Serializable versions of
the interfaces
• j.u.s streams are converted to Processor API (DAG) for execution
• Strictly a batch processing API
• Easiest place to start, but we recommend the Pipeline API to
exploit all features of Jet
See https://github.com/hazelcast/hazelcast-jet-code-samples
© 2018 Hazelcast Inc. Confidential & Proprietary
j.u.s API
JetInstance jet = Jet.newJetInstance();
Jet.newJetInstance();
IStreamMap<Long, String> lines = jet.getMap("lines");
Map<String, Long> counts = lines
.stream()
.flatMap(m -> Stream.of(PATTERN.split(m.getValue().toLowerCase())))
.filter(w -> !w.isEmpty())
.collect(DistributedCollectors.toIMap(”counts", w -> w,
w -> 1L, (left, right) -> left + right));
© 2018 Hazelcast Inc. Confidential & Proprietary
DAG API: Powerful, Low Level API
DAG describes how vertices are connected to each other:
DAG dag = new DAG();
// nil -> (docId, docName)
Vertex source = dag.newVertex("source", readMap(DOCID_NAME));
// (docId, docName) -> lines
Vertex docLines = dag.newVertex("doc-lines",
nonCooperative(flatMap((Entry<?, String> e) ->
traverseStream(docLines(e.getValue()))))
);
// line -> words
Vertex tokenize = dag.newVertex("tokenize",
flatMap((String line) -> traverseArray(delimiter.split(line.toLowerCase()))
.filter(word -> !word.isEmpty()))
);
// word -> (word, count)
Vertex accumulate = dag.newVertex("accumulate", accumulateByKey(wholeItem(),
AggregateOperations.counting()));
// (word, count) -> (word, count)
Vertex combine = dag.newVertex("combine", combineByKey(AggregateOperations.counting()));
// (word, count) -> nil
Vertex sink = dag.newVertex("sink", writeMap("counts"));
return dag.edge(between(source.localParallelism(1), docLines))
.edge(between(docLines.localParallelism(1), tokenize))
.edge(between(tokenize, accumulate).partitioned(wholeItem(), HASH_CODE))
.edge(between(accumulate, combine).distributed().partitioned(entryKey()))
.edge(between(combine, sink));
© 2018 Hazelcast Inc. Confidential & Proprietary
Building Custom Processors
• Unified API for sinks, sources and intermediate steps
• Not required to be thread safe
• Each Processor has an Inbox and Outbox per inbound and outbound
edge
• Two main methods to implement:
boolean tryProcess(int ordinal, Object item)
– Process incoming item and emit new items by populating the outbox
boolean complete()
– Called after all upstream processors are also completed. Typically used for
sources and batch operations such as group by and distinct
• Non-cooperative processors may block indefinitely
• Cooperative processors must respect Outbox when emitting and yield if
Outbox is already full
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Professional Support
© 2018 Hazelcast Inc. Confidential & Proprietary
Meet Your Business SLAs with
Hazelcast Jet Professional Support
100% Success Rate on Customer Issues
“As usual, the response was timely beyond
expectations, and very good technical content
returned. Exemplary support, hard to find in any
company...”
—Fortune 100 Financial
Services customer
FEATURE COMPARISON HAZELCAST JET PROFESSIONAL SUPPORT
24x7 Support Window
Service-Level Agreement 4 hours
Quarterly Review of Feature Requests P
Quarterly Review of the Hazelcast Jet Roadmap P
Email, IM, Phone Support Contacts 2
Hazelcast Jet Production Assurance for Initial Deployment/Charge
Control with Hazelcast Simulator P
Maintenance Patches P
Access to Hot Fix Patches P
IP Compliance Assurance with Black Duck ® Protex™ P
• Minimized downtime with 24/7 response to
support tickets, continuous delivery of
patches, and Hot Fix Patches
• Lower cost of operation of DevOps for your
Hazelcast Jet apps with simulation/
configuration testing
• Certified IP Compliance guarantees avoiding
legal exposure from open source
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Production Assurance for Initial
Deployment/Change Control with Hazelcast Simulator
Production simulation of your Hazelcast Jet
environment including hardware, virtualization,
operating system, configuration, feature usage,
topology, sizing and load
Includes assistance with ongoing simulation and
testing of your Hazelcast Jet platform and
deployed applications using Hazelcast Simulator
• Simulate the expected throughput/latency with
your specific requirements during pre-
production
• Test if Hazelcast Jet behaves as expected
when implementing new functionality, load
characteristics or expansion
• Standardize your application deployment
process with a battery of rigorous tests that we
will run for you before initial deployment and on
each upgrade
© 2018 Hazelcast Inc. Confidential & Proprietary
Hot Fix Patches
Personalized release of Hazelcast Jet software
dedicated to addressing your specific reported issue
Made to the exact version of Hazelcast Jet that you
are running with only the specific bug fix applied
Hot Fix Patches can thus be applied to your
production system with no or minimal testing
We reproduce the problem, and apply the patch in
our Simulator production simulation tool, to ensure we
go red before we go green
© 2018 Hazelcast Inc. Confidential & Proprietary
IP Compliance Assurance with
Black Duck® Protex™
• Full insight into Hazelcast
Jet IP compliance with
Black Duck® Protex™
reporting to help you
understand license
obligations, conflicts and
risks
• Black Duck Protex is the
leading solution for
managing open source
compliance
• Regular code scans before
minor and major releases
• We provide you a copy of
the report on demand
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet Product Roadmap
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast Jet 0.6 Features (03/2018)
40
Features Description
Distributed Computation
Distributed data processing framework. It’s based on directed acyclic graphs (DAGs) and in-memory
computation
Robust stream processing
Fault tolerance based on distributed snapshots. When there is a failure (network failure or split, node
failure), Jet uses the latest state snapshot and automatically restarts
Ex-Once and At-least Once Jet can be configured to at-least-once or exactly-once semantics to favor performance or consistency
Windowing with Event-time
Improved streaming support including windowing support with event-time semantics. Out of the box
support for tumbling, sliding and session window aggregations
High-Level Pipeline API Convenient API to implement data processing pipeline
Pivotal Cloud Foundry Tile Hazelcast Jet is available in Pivotal Cloud Foundry
Spring Integration Integration with Spring Framework and Spring Boot
High-Performance Hazelcast
Integrations
Integrates with both embedded and non-embedded Hazelcast instances to be used for reading and
writing data for distributed computation. Change event reader allows streaming from Hazelcast IMDG
File and Socket Connector Ability to read and write unbounded data from text sockets and files
Kafka and HDFS Connector Reads and writes data streams to the Kafka message broker and Hadoop Distributed File System
© 2018 Hazelcast Inc. Confidential & Proprietary
Hazelcast 2018 Jet Roadmap
41
Features Description
Elasticity Dynamic rescaling of the computation
Diagnostics and management Management and monitoring features for Jet
More Connectors JMS | JDBC
1.0 Maturity To reach 1.0 maturity in 2018
Thank you

More Related Content

What's hot

Hazelcast Deep Dive (Paris JUG-2)
Hazelcast Deep Dive (Paris JUG-2)Hazelcast Deep Dive (Paris JUG-2)
Hazelcast Deep Dive (Paris JUG-2)Emrah Kocaman
 
Hazelcast for Terracotta Users
Hazelcast for Terracotta UsersHazelcast for Terracotta Users
Hazelcast for Terracotta UsersHazelcast
 
Tez big datacamp-la-bikas_saha
Tez big datacamp-la-bikas_sahaTez big datacamp-la-bikas_saha
Tez big datacamp-la-bikas_sahaData Con LA
 
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudYARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudDataWorks Summit
 
Think Distributed: The Hazelcast Way
Think Distributed: The Hazelcast WayThink Distributed: The Hazelcast Way
Think Distributed: The Hazelcast WayRahul Gupta
 
Tez Data Processing over Yarn
Tez Data Processing over YarnTez Data Processing over Yarn
Tez Data Processing over YarnInMobi Technology
 
Pig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataPig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataDataWorks Summit
 
Hazelcast Essentials
Hazelcast EssentialsHazelcast Essentials
Hazelcast EssentialsRahul Gupta
 
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...DataWorks Summit
 
20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetup20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetupWei Ting Chen
 
What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?DataWorks Summit
 
Red Hat Ceph Storage: Past, Present and Future
Red Hat Ceph Storage: Past, Present and FutureRed Hat Ceph Storage: Past, Present and Future
Red Hat Ceph Storage: Past, Present and FutureRed_Hat_Storage
 
Apache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data ProcessingApache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data Processinghitesh1892
 
20151027 sahara + manila final
20151027 sahara + manila final20151027 sahara + manila final
20151027 sahara + manila finalWei Ting Chen
 
OLTP+OLAP=HTAP
 OLTP+OLAP=HTAP OLTP+OLAP=HTAP
OLTP+OLAP=HTAPEDB
 
Accelerating the Hadoop data stack with Apache Ignite, Spark and Bigtop
Accelerating the Hadoop data stack with Apache Ignite, Spark and BigtopAccelerating the Hadoop data stack with Apache Ignite, Spark and Bigtop
Accelerating the Hadoop data stack with Apache Ignite, Spark and BigtopIn-Memory Computing Summit
 
Building scalable applications with hazelcast
Building scalable applications with hazelcastBuilding scalable applications with hazelcast
Building scalable applications with hazelcastFuad Malikov
 
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARNHadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARNDataWorks Summit
 
Hazelcast For Beginners (Paris JUG-1)
Hazelcast For Beginners (Paris JUG-1)Hazelcast For Beginners (Paris JUG-1)
Hazelcast For Beginners (Paris JUG-1)Emrah Kocaman
 

What's hot (20)

Hazelcast Deep Dive (Paris JUG-2)
Hazelcast Deep Dive (Paris JUG-2)Hazelcast Deep Dive (Paris JUG-2)
Hazelcast Deep Dive (Paris JUG-2)
 
Hazelcast for Terracotta Users
Hazelcast for Terracotta UsersHazelcast for Terracotta Users
Hazelcast for Terracotta Users
 
Tez big datacamp-la-bikas_saha
Tez big datacamp-la-bikas_sahaTez big datacamp-la-bikas_saha
Tez big datacamp-la-bikas_saha
 
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And CloudYARN Containerized Services: Fading The Lines Between On-Prem And Cloud
YARN Containerized Services: Fading The Lines Between On-Prem And Cloud
 
Think Distributed: The Hazelcast Way
Think Distributed: The Hazelcast WayThink Distributed: The Hazelcast Way
Think Distributed: The Hazelcast Way
 
Tez Data Processing over Yarn
Tez Data Processing over YarnTez Data Processing over Yarn
Tez Data Processing over Yarn
 
Pig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big DataPig on Tez: Low Latency Data Processing with Big Data
Pig on Tez: Low Latency Data Processing with Big Data
 
Hazelcast Essentials
Hazelcast EssentialsHazelcast Essentials
Hazelcast Essentials
 
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
Deep Learning with DL4J on Apache Spark: Yeah it's Cool, but are You Doing it...
 
20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetup20150314 sahara intro and the future plan for open stack meetup
20150314 sahara intro and the future plan for open stack meetup
 
What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?What's the Hadoop-la about Kubernetes?
What's the Hadoop-la about Kubernetes?
 
Red Hat Ceph Storage: Past, Present and Future
Red Hat Ceph Storage: Past, Present and FutureRed Hat Ceph Storage: Past, Present and Future
Red Hat Ceph Storage: Past, Present and Future
 
Apache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data ProcessingApache Tez - Accelerating Hadoop Data Processing
Apache Tez - Accelerating Hadoop Data Processing
 
20151027 sahara + manila final
20151027 sahara + manila final20151027 sahara + manila final
20151027 sahara + manila final
 
OLTP+OLAP=HTAP
 OLTP+OLAP=HTAP OLTP+OLAP=HTAP
OLTP+OLAP=HTAP
 
Accelerating the Hadoop data stack with Apache Ignite, Spark and Bigtop
Accelerating the Hadoop data stack with Apache Ignite, Spark and BigtopAccelerating the Hadoop data stack with Apache Ignite, Spark and Bigtop
Accelerating the Hadoop data stack with Apache Ignite, Spark and Bigtop
 
February 2014 HUG : Hive On Tez
February 2014 HUG : Hive On TezFebruary 2014 HUG : Hive On Tez
February 2014 HUG : Hive On Tez
 
Building scalable applications with hazelcast
Building scalable applications with hazelcastBuilding scalable applications with hazelcast
Building scalable applications with hazelcast
 
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARNHadoop {Submarine} Project: Running Deep Learning Workloads on YARN
Hadoop {Submarine} Project: Running Deep Learning Workloads on YARN
 
Hazelcast For Beginners (Paris JUG-1)
Hazelcast For Beginners (Paris JUG-1)Hazelcast For Beginners (Paris JUG-1)
Hazelcast For Beginners (Paris JUG-1)
 

Similar to Distributed Computing Platform for Fast Processing of Big Data Sets

Open Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeOpen Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeApache Geode
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)Anthony Baker
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...HostedbyConfluent
 
Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013IBM Sverige
 
Running Stateful Apps on Kubernetes
Running Stateful Apps on KubernetesRunning Stateful Apps on Kubernetes
Running Stateful Apps on KubernetesYugabyte
 
Oracle GoldenGate Performance Tuning
Oracle GoldenGate Performance TuningOracle GoldenGate Performance Tuning
Oracle GoldenGate Performance TuningBobby Curtis
 
ArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUDArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUDGaret Keller
 
Iperconvergenza come migliora gli economics del tuo IT
Iperconvergenza come migliora gli economics del tuo ITIperconvergenza come migliora gli economics del tuo IT
Iperconvergenza come migliora gli economics del tuo ITNetApp
 
VMware Workspace ONE a synergie s Microsoftem
VMware Workspace ONE a synergie s MicrosoftemVMware Workspace ONE a synergie s Microsoftem
VMware Workspace ONE a synergie s MicrosoftemMarketingArrowECS_CZ
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Fran Navarro
 
An overview of reference architectures for Postgres
An overview of reference architectures for PostgresAn overview of reference architectures for Postgres
An overview of reference architectures for PostgresEDB
 
Toward Scalable and Powerful CloudStack
Toward Scalable and Powerful CloudStackToward Scalable and Powerful CloudStack
Toward Scalable and Powerful CloudStackTakashi Kanai
 
NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...
NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...
NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...NetApp
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and KubernetesAltoros
 
Oracle Database 19c - poslední z rodiny 12.2 a co přináší nového
Oracle Database 19c - poslední z rodiny 12.2 a co přináší novéhoOracle Database 19c - poslední z rodiny 12.2 a co přináší nového
Oracle Database 19c - poslední z rodiny 12.2 a co přináší novéhoMarketingArrowECS_CZ
 
Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances Amazon Web Services
 
times ten in-memory database for extreme performance
times ten in-memory database for extreme performancetimes ten in-memory database for extreme performance
times ten in-memory database for extreme performanceOracle Korea
 
Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...
Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...
Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...inside-BigData.com
 
Coherence RoadMap 2018
Coherence RoadMap 2018Coherence RoadMap 2018
Coherence RoadMap 2018harvraja
 

Similar to Distributed Computing Platform for Fast Processing of Big Data Sets (20)

Open Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache GeodeOpen Sourcing GemFire - Apache Geode
Open Sourcing GemFire - Apache Geode
 
An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)An Introduction to Apache Geode (incubating)
An Introduction to Apache Geode (incubating)
 
Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...Comparing three data ingestion approaches where Apache Kafka integrates with ...
Comparing three data ingestion approaches where Apache Kafka integrates with ...
 
Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013Application services in Private Clouds - IBM Smarter Business 2013
Application services in Private Clouds - IBM Smarter Business 2013
 
Running Stateful Apps on Kubernetes
Running Stateful Apps on KubernetesRunning Stateful Apps on Kubernetes
Running Stateful Apps on Kubernetes
 
Oracle GoldenGate Performance Tuning
Oracle GoldenGate Performance TuningOracle GoldenGate Performance Tuning
Oracle GoldenGate Performance Tuning
 
ArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUDArchivePod a legacy data solution when migrating to the #CLOUD
ArchivePod a legacy data solution when migrating to the #CLOUD
 
Iperconvergenza come migliora gli economics del tuo IT
Iperconvergenza come migliora gli economics del tuo ITIperconvergenza come migliora gli economics del tuo IT
Iperconvergenza come migliora gli economics del tuo IT
 
Geode Meetup Apachecon
Geode Meetup ApacheconGeode Meetup Apachecon
Geode Meetup Apachecon
 
VMware Workspace ONE a synergie s Microsoftem
VMware Workspace ONE a synergie s MicrosoftemVMware Workspace ONE a synergie s Microsoftem
VMware Workspace ONE a synergie s Microsoftem
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster
 
An overview of reference architectures for Postgres
An overview of reference architectures for PostgresAn overview of reference architectures for Postgres
An overview of reference architectures for Postgres
 
Toward Scalable and Powerful CloudStack
Toward Scalable and Powerful CloudStackToward Scalable and Powerful CloudStack
Toward Scalable and Powerful CloudStack
 
NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...
NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...
NetApp IT Efficiencies Gained with Flash, NetApp ONTAP, OnCommand Insight, Al...
 
Containers and Kubernetes
Containers and KubernetesContainers and Kubernetes
Containers and Kubernetes
 
Oracle Database 19c - poslední z rodiny 12.2 a co přináší nového
Oracle Database 19c - poslední z rodiny 12.2 a co přináší novéhoOracle Database 19c - poslední z rodiny 12.2 a co přináší nového
Oracle Database 19c - poslední z rodiny 12.2 a co přináší nového
 
Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances Introduction to Amazon EC2 F1 Instances
Introduction to Amazon EC2 F1 Instances
 
times ten in-memory database for extreme performance
times ten in-memory database for extreme performancetimes ten in-memory database for extreme performance
times ten in-memory database for extreme performance
 
Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...
Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...
Accelerating & Optimizing Machine Learning on VMware vSphere leveraging NVIDI...
 
Coherence RoadMap 2018
Coherence RoadMap 2018Coherence RoadMap 2018
Coherence RoadMap 2018
 

Recently uploaded

Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...OnePlan Solutions
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...OnePlan Solutions
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfAlina Yurenko
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作qr0udbr0
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....kzayra69
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)jennyeacort
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Andreas Granig
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureDinusha Kumarasiri
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesŁukasz Chruściel
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024StefanoLambiase
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfPower Karaoke
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software DevelopersVinodh Ram
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...soniya singh
 

Recently uploaded (20)

Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...Advancing Engineering with AI through the Next Generation of Strategic Projec...
Advancing Engineering with AI through the Next Generation of Strategic Projec...
 
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
Maximizing Efficiency and Profitability with OnePlan’s Professional Service A...
 
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdfGOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
GOING AOT WITH GRAALVM – DEVOXX GREECE.pdf
 
英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作英国UN学位证,北安普顿大学毕业证书1:1制作
英国UN学位证,北安普顿大学毕业证书1:1制作
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
Call Us🔝>༒+91-9711147426⇛Call In girls karol bagh (Delhi)
 
Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024Automate your Kamailio Test Calls - Kamailio World 2024
Automate your Kamailio Test Calls - Kamailio World 2024
 
Implementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with AzureImplementing Zero Trust strategy with Azure
Implementing Zero Trust strategy with Azure
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
Unveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New FeaturesUnveiling the Future: Sylius 2.0 New Features
Unveiling the Future: Sylius 2.0 New Features
 
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
Dealing with Cultural Dispersion — Stefano Lambiase — ICSE-SEIS 2024
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
The Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdfThe Evolution of Karaoke From Analog to App.pdf
The Evolution of Karaoke From Analog to App.pdf
 
Professional Resume Template for Software Developers
Professional Resume Template for Software DevelopersProfessional Resume Template for Software Developers
Professional Resume Template for Software Developers
 
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
Russian Call Girls in Karol Bagh Aasnvi ➡️ 8264348440 💋📞 Independent Escort S...
 

Distributed Computing Platform for Fast Processing of Big Data Sets

  • 2. © 2018 Hazelcast Inc. Confidential & Proprietary 2 DISTRIBUTED COMPUTING. SIMPLIFIED. Hazelcast Jet is a distributed computing platform for fast processing of big data sets. Jet is based on a parallel core engine allowing data- intensive applications to operate at near real-time speeds.
  • 3. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Company Snapshot Hazelcast founded commercial open source business model Offices in Palo Alto, London, Istanbul 2008 2012 2013 2014 2015 JCache released Greg Luck co-lead JCache (JSR107) Hazelcast introduces JCache standard for enterprise caching 40MM Phone Homes per month 2016 2017 135 Subscribing customers Cash flow positive Gartner Market Guide for IMDG & Leader in Forrester IMDG Wave Report 2015 Office in BrnoHazelcast 1: First open source IMDG Hazelcast Jet, fast Big Data product launched Gartner Market Guide for IMDG
  • 4. © 2018 Hazelcast Inc. Confidential & Proprietary Why Hazelcast Jet? Performance – Low-latency at scale Highly optimized reading from and writing to IMDG Embeddable and light-weight
  • 5. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Key Differentiators PRODUCTIVE ENTERPRISE GRADE STORAGEFLEXIBLE INTEGRATION HIGH PERFORMANCE • One solution provides both batch and stream processing. • Based on Hazelcast legendary ease-of-use. • Variety of connectors enable easy integration into data processing pipeline. • Scaling, failure handling and recovery are all automated for ease of operational management. • Exactly-Once or At-Least Once guarantees. • Jet team provides quick response to new feature requests. • Embeds Hazelcast IMDG as an operational storage for enterprise-grade reliability and resilience. • Most widely deployed IMDG with tens of thousands of deployments in production worldwide. • High-Performance integration eliminates network transit latency penalties from moving data back and forth. • Eliminates the integration issues between processing and storage. • Operates as shared application infrastructure or is embedded directly in applications as a JAR- making big data processing an application concern. • Ideal for microservices with light-weight footprint --each Jet processing job is launched within its own cluster maximizing service isolation. • Light-weight makes manipulation with Jet easy for both developers and DevOps. • Cloud and container ready. Pivotal Cloud Foundry, OpenShift, Docker, Azure, AWS, Kubernetes, … • Distributed computing platform enables fast processing of big data. • Parallel and low-latency core engine allows data- intensive applications to operate at real-time speeds. • Cooperative multi-threading architecture allows operation of thousands of jobs simultaneously. • Scales both vertically and horizontally.
  • 6. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Key Competitive Differentiators 6 • High Performance | Industry Leading Performance • Works great with Hazelcast IMDG | Source, Sink, Enrichment • Very simple to program | Leverages existing standards • Very simple to deploy | Embed 10MB jar or Client Server • Works in every Cloud | Same as Hazelcast IMDG • For Developers by Developers | Code it
  • 7. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Product Overview 7
  • 8. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Architecture 8
  • 9. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Use Cases Real-time Stream processing Distributed In-Memory Computation • Big Data in near real-time • Plugged into data processing pipeline • Distributed, in-memory computation • Aggregating, joining multiple sources, filtering, transforming, enriching • Elastic scalability • Super fast • High availability • Fault tolerant • Simple, modern API • Distributed, in-memory computation • Transforming and querying distributed data structures • Computation close to the data • Distributed clustering • High availability • Fault tolerance Data-Processing Microservices • Data-processing microservices • Isolation of services with many, small clusters • Service registry • Network discovery • Inter-process messaging • Fully embeddable • Spring Cloud, Boot Data Services
  • 10. © 2018 Hazelcast Inc. Confidential & Proprietary Jet Streaming Performance * * Spark had all performance options turn on including Tungsten
  • 11. © 2018 Hazelcast Inc. Confidential & Proprietary Performance - Throughput * * Spark had all performance options turn on including Tungsten
  • 12. © 2018 Hazelcast Inc. Confidential & Proprietary Performance - ForkJoin • Jet is faster than JDK’s j.u.s implementation. The issue is in the JDK the character stream reports "unknown size" and thus it cannot be parallelized • If you first load the data into a List in RAM then run the JDK’s j.u.s it comes out a little faster
  • 13. © 2018 Hazelcast Inc. Confidential & Proprietary Jet Application Deployment Options IMDG IMDG • No separate process to manage • Great for microservices • Great for OEM • Simplest for Ops – nothing extra Embedded Application Java API Application Java API Application Java API Java Client Application Client-Server Jet Jet Jet Jet Jet Jet Java Client Application Java Client Application Java Client Application • Separate Jet Cluster • Scale Jet independent of applications • Isolate Jet from application server lifecycle • Managed by Ops
  • 14. © 2018 Hazelcast Inc. Confidential & Proprietary Jet with Hazelcast Deployment Choices Jet Compute Cluster Hazelcast IMDG Cluster Sink Enrichment Message Broker (Kafka) Data Enrichment HDFS Jet Cluster Sink Source / Enrichment Good when: • Where source and sink are primarily Hazelcast • Jet and Hazelcast have equivalent sizing needs Good when: • Where source and sink are primarily Hazelcast • Where you want isolation of the Jet cluster
  • 15. © 2018 Hazelcast Inc. Confidential & Proprietary Stream Processing • Support for events arriving out of order via Watermarks • Sliding, Tumbling and Session window support
  • 16. © 2018 Hazelcast Inc. Confidential & Proprietary Fault Tolerance – Distributed State Snapshots Exactly-Once, At-Least Once or No Guarantee to optimize between performance and correctness Distributed State Snapshots to back-up running computations Resilience with backups distributed and replicated across the cluster to prevent losing data when member fails Simplicity as the snapshots are stored in embedded in- memory structures. No further infrastructure is necessary
  • 17. © 2018 Hazelcast Inc. Confidential & Proprietary Job Management & Fault Tolerance • Job state and lifecycle saved to IMDG IMaps and benefit from their performance, resilience, scale and persistence • Automatic re-execution of part of the job in the event of a failed worker • Tolerant of loss of nodes, missing work will be recovered from last snapshot and re-executed • Cluster can be scaled without interrupting jobs – new jobs benefit from the increased capacity • State and snapshots can be persisted to resume after cluster restart
  • 18. © 2018 Hazelcast Inc. Confidential & Proprietary Processing Guarantees Guarantee Snapshots Performance None No Fastest At-Least Once Yes Slower Exactly-Once Yes Slower
  • 19. © 2018 Hazelcast Inc. Confidential & Proprietary Data Input and Output • Hazelcast Icache (JCache), (batch and streaming of changes) • Hazelcast IMap (batch and streaming of changes) • Hazelcast IList (batch) • HDFS (batch) • Kafka (streaming) • Socket (text encoding) (streaming) • File (batch) • FileWatcher (streaming – as new files appear) • Custom, as sources and sinks are blocking Processors
  • 20. © 2018 Hazelcast Inc. Confidential & Proprietary Stream and Batch Processing Maps, Caches and Lists Databases IoT Custom Connector Enterprise Applications Hazelcast IMDG HDFS Stream Batch or Stream Batch or Stream Batch or Stream Batch Ingest Alerts Enterprise Applications Interactive Analytics Hazelcast IMDG Maps, Caches, Lists Output Databases Enrichment Kafka Stream File / File Watcher Batch Socket Stream Kafka HDFS FileBatch Database Database Events Stream (via Striim CDC)
  • 21. © 2018 Hazelcast Inc. Confidential & Proprietary Business Use Cases
  • 22. © 2018 Hazelcast Inc. Confidential & Proprietary Field Equipment Monitoring and Analytics Hazelcast Jet Enables Real-time Monitoring of Sensors and Processing of Streaming Data from Field Equipment Industries such as Oil & Gas use sensor data to automate decision making. Events reported from oil rig sensors are evaluated in the field so that rig settings are adjusted in real time. Hazelcast Jet is ideal for applications that require near real-time insight. Deployment simplicity makes it a optimal choice for both remote field and data center applications. Why most processing solutions fall short… Why is Hazelcast Jet ideal? Traditional data processing frameworks don’t allow direct embedding into an application or independent system, forcing users to install them as a secondary standalone server complicating deployment of the solution. Jet can be used as an embedded library --avoiding need for separate server tier. Jet doesn’t have any dependencies, so no further infrastructure necessary to run a cluster. Jet allows users to build self-contained applications. Field data processing spans from sensors to on-rig processing, and from there to processing within the data center --involving multiple execution environments and hardware architectures. Traditional tools require different runtimes (single-node, multi-node, different architectures), resulting in higher system TCO. Jet is a lightweight tool with a small memory footprint. There is just one distribution to be used among all deployments. Running on a JVM, Jet can be used on a variety of hardware platforms. Traditional complex event processing solutions are built for single machine/server deployments and cannot be scaled out easily. Jet can be scaled out easily, as it’s built on top of a massively parallel, distributed computing core (Hazelcast IMDG). It’s designed to scale out. Most processing solutions experience growing latencies as the workload throughput increases. Jet is designed for low latency under the most grueling workloads with latency remaining relatively flat even up to 5,000,000 aggregations per second.
  • 23. © 2018 Hazelcast Inc. Confidential & Proprietary Field Equipment Monitoring and Analytics Case Study CUSTOMER SUCCESS Leading oil & gas system integrator, specializing in acquisition, persistence, secure transportation and dissemination of high- frequency sensor data 10’s of 1000’s of events per second are gathered throughout the process CHALLENGE • Use sensor from oil wells data to automate decision-making: - Keep latency low and consistent at scale, while system turns insights into decisions executed in a closed loop - Simplify system deployment and maintenance in remote locations with limited bandwidth - Use standards that extend system viability and maintenance lowering TCO • Requires high application availability/performance for early issue detection to avoid production loss and optimize well productivity SOLUTION • Hazelcast Jet is the processing backbone of application monitoring well sensors. API connects multiple sensor data streams with varying formats and frequency. After joining and filtering, sliding and session windows compute data insights to decisions • Embedded Hazelcast IMDG is operational data store for easy scaling, whether on bare metal or in AWS • Real-time analysis and correlation of event data from various devices and sensors throughout the process allow immediate action. Jet adjusts rig settings in real time. WHY HAZELCAST JET • Performance and scalability • In-memory data store with parallel processing enables scalable real-time analytics • Open source, standards-based avoids vendor lock-in
  • 24. © 2018 Hazelcast Inc. Confidential & Proprietary Payment Processing Hazelcast Jet Modernizes Payment Processing, Accelerating Bank Performance and Improving Customer Experience Hazelcast Jet is an ideal technology for payment processing systems as it offers the lowest end-to-end processing latencies on a consistent basis. This is the result of Jet combining data processing and messaging in a single solution with elastically scalable storage for caching data from 3rd party systems. Why most processing solutions fall short… Why is Hazelcast Jet ideal? Most processing solutions experience growing latencies as the workload throughput increases. Jet is designed for low latency under the most grueling workloads with latency remaining relatively flat even up to 5,000,000 aggregations per second. Most data processing solutions rely on 3rd party tools for messaging and storage increasing deployment and maintenance complexity. Jet embeds Hazelcast IMDG, which provides messaging and storage capabilities in a single solution simplifying deployment and maintenance while lowering TCO. Traditional solutions for service orchestration are built for single machine/server deployments and cannot be scaled out easily. Jet scales easily, as it’s built on top of a massively parallel, distributed computing core (Hazelcast IMDG). It’s designed to scale out. The elasticity of a Jet cluster allows adding and removing computing and storage resources in accordance with changing processing demand.
  • 25. © 2018 Hazelcast Inc. Confidential & Proprietary Payment Processing Case Study CUSTOMER SUCCESS A global information technology solutions company Processing 10’s of 1,000’s of payments per second today. Built-in scalability to support future business. CHALLENGE • Before settling a transaction, payment processing systems check the merchant details by forwarding them to the card’s issuing bank or association for verification, and carry out anti-fraud measures • Each step in this pipeline requires the lowest possible latency to deliver a positive customer experience • With 24/7 global operations and hard SLAs, resiliency and automatic recovery are a must-have SOLUTION • Within the payment processing application, Jet acts as the pipeline for each payment process step • The payment management application orchestrates XML payment instructions and forwards them to the respective card’s issuing bank or association for verification, then carries out anti-fraud measures before settling transactions • Multiple Jet processing jobs are pipeline components. Hazelcast IMDG distributed IMaps are used for transaction ingestion and messaging WHY HAZELCAST JET • High-performance connectors between Jet and IMDG enable low-latency operations; consistent low latency of the Hazelcast platform keeps the CGI payment management application within strictest SLA requirements • Automatic recovery of the Jet cluster achieves high-availability even during failures • Open source, standards-based avoids vendor lock-in
  • 26. © 2018 Hazelcast Inc. Confidential & Proprietary APIs
  • 27. © 2018 Hazelcast Inc. Confidential & Proprietary APIs
  • 28. © 2018 Hazelcast Inc. Confidential & Proprietary Pipeline API • General purpose, declarative API: both powerful and simple to use • Recommended as the best place to start using Jet • Supports fork, join, cogroup, map, filter, flatmap, reduce, groupby • Works with all sinks and sources • Is a DSL which is put through a planner and converted to DAG plan for execution • Batch and Streaming (window support in 0.6) See https://github.com/hazelcast/hazelcast-jet-code-samples
  • 29. © 2018 Hazelcast Inc. Confidential & Proprietary Pipeline API Code Sample JetInstance jet = Jet.newJetInstance(); Pattern delimiter = Pattern.compile("W+"); Pipeline p = Pipeline.create(); p.drawFrom(Sources.<Long, String>readMap(BOOK_LINES)) .flatMap(e -> traverseArray(delimiter.split(e.getValue().toLowerCase()))) .filter(word -> !word.isEmpty()) .groupBy(wholeItem(), counting()) .drainTo(Sinks.writeMap(COUNTS)); Job job = jet.newJob(p); job.join();
  • 30. © 2018 Hazelcast Inc. Confidential & Proprietary Distributed java.util.stream API • Jet adds distributed support for the java.util.stream API for Hazelcast Map, List and Cache • Supports all j.u.s. operations such as: − map(), flatMap(), filter(), reduce(), collect(), sorted(), distinct() • Lambda serialization is solved by creating Serializable versions of the interfaces • j.u.s streams are converted to Processor API (DAG) for execution • Strictly a batch processing API • Easiest place to start, but we recommend the Pipeline API to exploit all features of Jet See https://github.com/hazelcast/hazelcast-jet-code-samples
  • 31. © 2018 Hazelcast Inc. Confidential & Proprietary j.u.s API JetInstance jet = Jet.newJetInstance(); Jet.newJetInstance(); IStreamMap<Long, String> lines = jet.getMap("lines"); Map<String, Long> counts = lines .stream() .flatMap(m -> Stream.of(PATTERN.split(m.getValue().toLowerCase()))) .filter(w -> !w.isEmpty()) .collect(DistributedCollectors.toIMap(”counts", w -> w, w -> 1L, (left, right) -> left + right));
  • 32. © 2018 Hazelcast Inc. Confidential & Proprietary DAG API: Powerful, Low Level API DAG describes how vertices are connected to each other: DAG dag = new DAG(); // nil -> (docId, docName) Vertex source = dag.newVertex("source", readMap(DOCID_NAME)); // (docId, docName) -> lines Vertex docLines = dag.newVertex("doc-lines", nonCooperative(flatMap((Entry<?, String> e) -> traverseStream(docLines(e.getValue())))) ); // line -> words Vertex tokenize = dag.newVertex("tokenize", flatMap((String line) -> traverseArray(delimiter.split(line.toLowerCase())) .filter(word -> !word.isEmpty())) ); // word -> (word, count) Vertex accumulate = dag.newVertex("accumulate", accumulateByKey(wholeItem(), AggregateOperations.counting())); // (word, count) -> (word, count) Vertex combine = dag.newVertex("combine", combineByKey(AggregateOperations.counting())); // (word, count) -> nil Vertex sink = dag.newVertex("sink", writeMap("counts")); return dag.edge(between(source.localParallelism(1), docLines)) .edge(between(docLines.localParallelism(1), tokenize)) .edge(between(tokenize, accumulate).partitioned(wholeItem(), HASH_CODE)) .edge(between(accumulate, combine).distributed().partitioned(entryKey())) .edge(between(combine, sink));
  • 33. © 2018 Hazelcast Inc. Confidential & Proprietary Building Custom Processors • Unified API for sinks, sources and intermediate steps • Not required to be thread safe • Each Processor has an Inbox and Outbox per inbound and outbound edge • Two main methods to implement: boolean tryProcess(int ordinal, Object item) – Process incoming item and emit new items by populating the outbox boolean complete() – Called after all upstream processors are also completed. Typically used for sources and batch operations such as group by and distinct • Non-cooperative processors may block indefinitely • Cooperative processors must respect Outbox when emitting and yield if Outbox is already full
  • 34. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Professional Support
  • 35. © 2018 Hazelcast Inc. Confidential & Proprietary Meet Your Business SLAs with Hazelcast Jet Professional Support 100% Success Rate on Customer Issues “As usual, the response was timely beyond expectations, and very good technical content returned. Exemplary support, hard to find in any company...” —Fortune 100 Financial Services customer FEATURE COMPARISON HAZELCAST JET PROFESSIONAL SUPPORT 24x7 Support Window Service-Level Agreement 4 hours Quarterly Review of Feature Requests P Quarterly Review of the Hazelcast Jet Roadmap P Email, IM, Phone Support Contacts 2 Hazelcast Jet Production Assurance for Initial Deployment/Charge Control with Hazelcast Simulator P Maintenance Patches P Access to Hot Fix Patches P IP Compliance Assurance with Black Duck ® Protex™ P • Minimized downtime with 24/7 response to support tickets, continuous delivery of patches, and Hot Fix Patches • Lower cost of operation of DevOps for your Hazelcast Jet apps with simulation/ configuration testing • Certified IP Compliance guarantees avoiding legal exposure from open source
  • 36. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Production Assurance for Initial Deployment/Change Control with Hazelcast Simulator Production simulation of your Hazelcast Jet environment including hardware, virtualization, operating system, configuration, feature usage, topology, sizing and load Includes assistance with ongoing simulation and testing of your Hazelcast Jet platform and deployed applications using Hazelcast Simulator • Simulate the expected throughput/latency with your specific requirements during pre- production • Test if Hazelcast Jet behaves as expected when implementing new functionality, load characteristics or expansion • Standardize your application deployment process with a battery of rigorous tests that we will run for you before initial deployment and on each upgrade
  • 37. © 2018 Hazelcast Inc. Confidential & Proprietary Hot Fix Patches Personalized release of Hazelcast Jet software dedicated to addressing your specific reported issue Made to the exact version of Hazelcast Jet that you are running with only the specific bug fix applied Hot Fix Patches can thus be applied to your production system with no or minimal testing We reproduce the problem, and apply the patch in our Simulator production simulation tool, to ensure we go red before we go green
  • 38. © 2018 Hazelcast Inc. Confidential & Proprietary IP Compliance Assurance with Black Duck® Protex™ • Full insight into Hazelcast Jet IP compliance with Black Duck® Protex™ reporting to help you understand license obligations, conflicts and risks • Black Duck Protex is the leading solution for managing open source compliance • Regular code scans before minor and major releases • We provide you a copy of the report on demand
  • 39. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet Product Roadmap
  • 40. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast Jet 0.6 Features (03/2018) 40 Features Description Distributed Computation Distributed data processing framework. It’s based on directed acyclic graphs (DAGs) and in-memory computation Robust stream processing Fault tolerance based on distributed snapshots. When there is a failure (network failure or split, node failure), Jet uses the latest state snapshot and automatically restarts Ex-Once and At-least Once Jet can be configured to at-least-once or exactly-once semantics to favor performance or consistency Windowing with Event-time Improved streaming support including windowing support with event-time semantics. Out of the box support for tumbling, sliding and session window aggregations High-Level Pipeline API Convenient API to implement data processing pipeline Pivotal Cloud Foundry Tile Hazelcast Jet is available in Pivotal Cloud Foundry Spring Integration Integration with Spring Framework and Spring Boot High-Performance Hazelcast Integrations Integrates with both embedded and non-embedded Hazelcast instances to be used for reading and writing data for distributed computation. Change event reader allows streaming from Hazelcast IMDG File and Socket Connector Ability to read and write unbounded data from text sockets and files Kafka and HDFS Connector Reads and writes data streams to the Kafka message broker and Hadoop Distributed File System
  • 41. © 2018 Hazelcast Inc. Confidential & Proprietary Hazelcast 2018 Jet Roadmap 41 Features Description Elasticity Dynamic rescaling of the computation Diagnostics and management Management and monitoring features for Jet More Connectors JMS | JDBC 1.0 Maturity To reach 1.0 maturity in 2018

Editor's Notes

  1. General idea across the slide deck: The value of Jet is in simplicity and integration with IMDG Big data isn’t enough, fast big data is what matters (information available in a near real-time fashion) Jet grabs the information from the data in near real-time and stores it to IMDG staying in-memory and distributed
  2. Hazelcast Jet is a distributed computing platform for fast processing of big data sets. Jet is based on a parallel, streaming core engine allowing data-intensive applications to operate at near real-time speeds. A general purpose distributed data processing engine built on Hazelcast and based on directed acyclic graphs to model data flow. IMDG as storage, Jet as processing engine (querying, transforming) -> those tasks go oftehn hand in hand. This is motivation for Jet. Define market demand for in-memory data processing (Motivation for Jet): 1/ Big and Fast Data 2/ Near real-time processing 3/ IMDG based — Jet comes with IMDG — Jet takes care about processing, IMDG about storage (in-mem speed, distributed). —> operational simplicity, products integrated
  3. Performance – For both stream (unbounded data) and batch (bounded data) processing, Jet is able to process data with an impressive throughput capacity, with very low latency even under increasing load Integration with IMDG – IMDG provides scalable in-memory data storage to be used during processing (as source, sink, operational storage for cached/temporary data). Hazelcast IMDG and Jet are engineered to work together for high performance. Simplicity – Jet is simple to deploy as it shares the lower stack with Hazelcast IMDG. Jet is application-embeddable for OEMs and microservices, making it is easier for manufacturers to build and maintain next generation systems.
  4. Jet vision = fast big data, real-time stream processor + fast batch processor How the ecosystem and looks like. Jet is part of the data-processing pipeline Two major deployment setups - stream and batch. Batch — A complete dataset is available before the processing starts. The data are submitted as one batch to be processed. Usually in Database, IMDG, on filesystem Data are loaded from (potentially multiple) sources, joined, transformed, cleaned. Then stored and/or left in IMDG for further in-memory usage. - Typically used for ETL (Extract-transfer-load) tasks. Stream - data potentially unbounded and infinite the goal is to process the data as soon as possible. Process = extract important information from the data. The whole processing stays in memory, therefore another systems may be alerted in near real-time. Data ingested to the system via message broker (Kafka) The streaming deployment is useful for use-cases, where real-time behaviour matters.
  5. Use-cases enabled by Jet Based on use-cases from web https://hazelcast.atlassian.net/wiki/display/JET/Reference+Use-Cases+and+Target+Audience
  6. - Performs at lowest latency. Important for real-time use cases (streaming) – after item is ingested, how fast do I get the results? Latency of Jet remains low at scale, opposed to competitors. Thanks to windowing design.
  7. Compared to Spark, Flink doing word count, data on HDFS/IMap, cluster of 10 nodes, 32 cores each…
  8. Hazelcast can be used in two different modes: Embedded Mode is very easy to setup and run. It can only be done with peer Java applications. Developers really like this mode during prototyping phases. Client-Server Mode is more work but efficient once an application is required to scale to larger volume of transactions or users as it separates the application from Hazelcast IMDG for better manageability. It can be done with different types of applications (C+, C#/.NET, Python, Scala, Node.js) using native clients provided by Hazelcast (C+, C#/.NET, Python, Scala, Node.js). Hazelcast Open Binary Protocol enables clients for any other language to be developed as needed (see Hazelcast IMDG Docs). With both client-server and embedded architecture, Hazelcast IMDG offers: + Elasticity and scalability + Transparent integration with backend databases using Map Store interfaces + Management of the cluster through your web browser with Management Center
  9. Jet is build on top of Hazelcast IMDG with the benefits of close integration. Hazelcast Jet and IMDG can run in these deployments: Every Jet cluster works on top of embedded Hazelcast in-memory data grid. Embedded deployment suits well for: Sharing processing state among Jet nodes Caching intermediate processing results Enriching processed events. Cache remote data (e.g. fact tables from database) on Jet nodes. Running advanced data processing tasks on top of Hazelcast data structures Development purposes. Since starting standalone Jet cluster is simple and fast. Remote deployment Share state or intermediate results among more Jet jobs Cache intermediate results (e.g. to show real-time processing stats on dashboard) Load input or store results of Jet computation in in-memory fashion to IMDG Isolation of processing and storage layer Jet x IMDG – product differentiation Both Jet and IMDG are for parallel and distributed computing, why Jet is more expensive? Is Hazelcast IMDG not good enough?
  10. Jet vision = fast big data, real-time stream processor + fast batch processor How the ecosystem and looks like. Jet is part of the data-processing pipeline Two major deployment setups - stream and batch. Batch — A complete dataset is available before the processing starts. The data are submitted as one batch to be processed. Usually in Database, IMDG, on filesystem Data are loaded from (potentially multiple) sources, joined, transformed, cleaned. Then stored and/or left in IMDG for further in-memory usage. - Typically used for ETL (Extract-transfer-load) tasks. Stream - data potentially unbounded and infinite the goal is to process the data as soon as possible. Process = extract important information from the data. The whole processing stays in memory, therefore another systems may be alerted in near real-time. Data ingested to the system via message broker (Kafka) The streaming deployment is useful for use-cases, where real-time behaviour matters.
  11. CUSTOMER SUCCESS Leading oil & gas system integrator, specializing in the acquisition, persistence, secure transportation and dissemination of high frequency sensor data. 10’s of 1000’s of events per second are gathered from oil rigs and various points in this process. CHALLENGE Industries such as Oil & Gas use sensor data to automate decision making; this requires: Keeping latency low and consistent at scale while system operates turning insights into decisions that are executed in a closed loop system. Simplifying deployment and maintenance of system in remote locations with limited bandwidth. Sticking to standards that extend system viability and maintenance lowering TCO; there are many nascent protocols and standards in the emerging IoT industry. Application high-availability and performance are required for early detection of issues to avoid costly production losses and optimize the productivity of wells. As the link between the oil rig site and the data center is usually limited, some processing has to be done in the field. This involves operating a processing infrastructure in the limited confines of the field locality. SOLUTION Hazelcast Jet is used as the processing backbone. The application makes use of Jet API to connect multiple sensor data streams with varying format and frequency. After joined and filtered, sliding and session windows are used to compute data insights that are turned into decisions. Hazelcast IMDG, which is embedded in Jet, is the operational data store, making the deployment easily scaled. This same Jet-based application is used in bare metal field installations as well as in the AWS cloud. The application gathers data from various devices and sensors throughout the oil exploration, drilling, and lifting process. The events are correlated and analyzed in real-time allowing actions to be taken for mitigating risks or maximizing opportunities. Events reported from oil rig sensors are evaluated in the field by Jet so that the settings of the rig are adjusted in real time, i.e., sensor data on torque and depth of the drill are combined in Jet to indicate bottom of the oil reservoir. WHY HAZELCAST JET Performance and scalability In-memory data store with parallel processing enables scalable real-time analytics Open source standards-based avoids vendor lock-in
  12. CUSTOMER SUCCESS A global information technology solutions company. processing of 10’s of 1,000’s of payments per second today and has built-in scalability with Hazelcast IMDG to support future business growth. CHALLENGE Payment processing systems check the details received from a merchant by forwarding them to the respective card’s issuing bank or card association for verification, and also carry out a series of anti-fraud measures before settling the transaction. All of these payment process steps must happen before the transaction can be cleared. Each step involved in the payment processing pipeline requires the lowest possible latency in order to deliver a positive customer experience. With 24/7 global operations and hard SLAs, resiliency and automatic recovery are a must have. SOLUTION Jet acts as the processing pipeline for each step of the payment process within the payment processing application. The payment management application acts as an orchestrator which analyses XML payment instructions and forwards them to the respective card’s issuing bank or card association for verification, and also carries out a series of anti-fraud measures against the transaction before settling the payment transaction. Multiple Jet processing jobs are involved as pipeline components. The distributed IMaps of Hazelcast IMDG are used for transaction ingestion and messaging. WHY HAZELCAST JET High-performance connectors between Jet and IMDG enable low-latency operations; consistent low latency of the Hazelcast platform allows the CGI payment management application to keep within strictest SLA requirements. Automatic recovery of the Jet cluster is an important feature to achieve high-availability even during failures. Open source standards-based avoids vendor lock-in.
  13. n addition to getting regular open source patch releases, Hazelcast support subscriptions provide customers with Hot Fix patches. Hot Fix patches are a special release of Hazelcast software dedicated to addressing your specific reported issue. They are made to the exact version of Hazelcast that you are running with only the specific bug fix applied. We then perform Hazelcast Production Assurance with a long production simulation using Hazelcast Simulator with a hardware, operating system and Hazelcast configuration as equivalent as possible to your own using your Hazelcast Simulator Profile. Hot Fix Patches can thus be applied to your production system with no or minimal testing, relying on our testing. We also go red before we go green – reproducing your issue on Simulator first before creating a fix. This way you and we have great confidence the issue is resolved by the Hot Fix.
  14. Every Hazelcast customer can rest assured knowing all the license obligations, conflicts and risks of Hazelcast open source with Black Duck® Protex™ reporting. Poor open source compliance can expose you to costly, time-consuming risks, including litigation and loss of IP. Black Duck Protex is the industry’s leading solution for managing open source license compliance. Protex integrates with existing development tools to automatically scan, identify, and inventory open source software, allowing you to understand license obligations, conflicts and risks. This enables you to mitigate these risks by enforcing license compliance and corporate policy requirements.
  15. Robust stream processing: Configurable At-least once or exactly-once OOTB experience – stored in memory (in embedded IMDG, so backed up in the same way as Hazelcast IMDG is). When node fails -> job restarted from last snapshot on the remaining nodes Outlook: Elasticity (when the node is added, extend computation to it), Usability enhancements - High-level API for Batch (0.5) and stream (0.6)
  16. Demo time