SlideShare a Scribd company logo
1 of 11
Download to read offline
A Future-proof Architecture
for
Streaming Data Analytics
This white paper explores strategies to leverage the steady flow of new, advanced
real-time streaming data analytics (RTSA) application development technologies.
It defines a thoughtful approach to capitalize on the window of opportunity to benefit
from the power of real-time decision making now, and still be able to move to new
and emerging technologies as they become enterprise ready.
Are You Ready For Real-time?
A
re you ready to start reaping the benefits of streaming
analytics, but are worried about investing in new technology
that could rapidly be surpassed or become outdated?
2
Technology is, and always will be, a moving target. So how can you
stay ahead of the curve, take advantage of the latest technologies,
while also ensuring that you don’t make a mistake in your selection of
a technology platform?
The answer lies in a strategy called “future proofing.” The term
“future proof” refers to the ability for something to continue to be of
value, well into the future. This gives you the confidence that it will
not quickly become obsolete. Obviously, future proofing provides a
significant advantage in any overall technology strategy.
In this paper, we explore the demand for streaming analytics, discuss
some of the available open source engines, the challenges of
adopting them, and introduce a future-proof platform that uses an
integrated development environment (IDE) (abstracted at a level
above the streaming analytics engine) — enabling you to ride the
future innovation wave.
Streaming analytics is fast becoming a must-have technology for
enterprises seeking to transform their analytic to take advantage of
“fast data” sources and build real-time or near real-time applications.
Gartner has predicted that the data processing market will grow to a
value of $55B by 2016, and that much of that value is held in the
data, waiting to be exploited. According to Gartner, 70% of
high-performing companies will manage their business processes
using real-time analytics or extreme collaboration by 2016.
Why? It’s all about acting on “perishable insight,” which is information
that firms can only detect and act upon at a moment’s notice.
Streaming analytics is the key to harnessing real-time flow of
perishable insights originating from the Internet-of-Things (IoT),
mobile phones, market data, sensors, Web clickstream, and
transactions. Real-time streaming is essential for Big Data analytics
because it delivers insights in real-time when they are most valuable.
With the Big Data environment evolving daily and flooding companies
with data, stream-processing capabilities are becoming increasingly
important. Streaming allows organizations to take action in real-time
and make critical decisions based on live data.
The Value of Now
70% of high-performing
companies will manage their
business processes using
real-time analytics or extreme
collaboration by 2016.
ACCORDING TO GARTNER
How Can You Benefit From RTSA?
3
The availability of real-time information sources and the IoT represent
a new source of revenue and profit, product innovation, and
investment opportunity. However, the real value comes from
harnessing the high volume, velocity, and range of variety of
streaming data being captured to process and apply analytics in
real-time.
Real-time streaming analytics enable enterprises to collect, integrate,
analyze, and visualize data as it is generated. It processes all the
activities of the data as the data is being produced, without
disrupting the activity of existing sources, storage, and enterprise
systems.
Here are a few examples of how leading enterprises are using
real-time streaming analytics to gain deeper insights and respond
more quickly to changing conditions and opportunities:
CUT PREVENTABLE LOSSES
Enterprises can cut preventable losses with real-time streaming
analytics. Examples include:
Financial Services
The signs or signals of risky and/or fraudulent transactions can be
imbedded in predictive intelligence models to alert firms to
suspicious activities.
Preventive Maintenance
Sensors can detect and communicate wear-and-tear before they
become machine failures that interrupt or cripple operations.
Medical
In urgent healthcare situations, patient monitoring systems require
real-time responses to alert professionals to urgent issues that
demand immediate attention or intervention.
Brand Reputation
Given the wide adoption and influence of social media, brands are
being discussed anywhere at any time. Automated monitoring can
alert brand managers of the activity, and give them time to contain,
control, or contribute to the brand conversation.
Disaster Warning Systems
Earthquakes, tsunamis, tornadoes, and other potentially
devastating events can often be detected through a variety of
monitoring and global sensing systems. Real-time streaming
analytics can bridge the time lag between detection and
communication, allowing authorities greater time to mobilize
emergency responses.
4
GAIN OPERATIONAL INSIGHTS
Allows businesses to take swift actions and ensure uninterrupted
operations. Examples include:
IT Systems and Network Monitoring
Traffic, load, and threat vulnerability – these can be seen and
controlled dynamically as conditions change.
Financial Transaction Processing
Authentications, validations, fraud detection must be processed in
real-time.
Manufacturing
Closed loop control systems can monitor processes for variations
and make adjustments to compensate for out-of-limit conditions in
real-time.
Field Assets Monitoring and Alerting
With stream analytics, remote human and machine activity can be
sensed and responded to instantly.
SEIZE NEW OPPORTUNITIES
Real-time analytics can stream data processed for one particular
purpose to new opportunities, delivering value and generating new
revenue streams. Examples include:
New Products
A major running shoe manufacturer has added sensors to its shoes
to track exercise routines, offering consumers valuable new health
insights.
Behavior Insights
Media companies and operators can analyze audience behavior to
provide insights to drive new services and business models.
Soil Insights
When outfitted with sensors, tractors can add value by providing
insights into soil conditions, such as humidity, temperature, and
composition to farmers.
5
The Big Questions
The Complexity of an RTSA Platform
Today, technology is finally building end-to-end components that
make streaming analytics a reality. Tools that can process and
analyze high-volume data streams are maturing rapidly, and new
entrants — both commercial and open source — are arriving on the
scene with surprising regularity. As a result, enterprise executives are
asking themselves the following relevant questions:
Streaming data is flowing by, and if it’s not captured, analyzed and
put to immediate use, the opportunity slips away. Consequently,
streaming analytics requires blazing fast ingestion, analysis, and
actions on multiple sources of fast data.
The streaming analytics platforms must be able to ingest, filter,
aggregate, enrich, and analyze a high throughput of data from
disparate live data sources to identify patterns, detect urgent
situations, and automate immediate actions. That’s the first
complexity — everything the platform must do, and how rapidly it must
do it.
The second major challenge is that real-time streaming platforms
don’t resemble the types of analytic applications that enterprises are
used to.
According to Forrester analyst Mike Gualtieri in the Impetus webinar,
Future Proofing Your Streaming Analytics Platform, the streaming
application programming model is unfamiliar to most application
developers. It’s a different paradigm from normal programming where
code execution controls data. In streaming applications, the incoming
data controls the code.
What platform design will best address the needs of the
market and drive rapid, large-scale adoption?
With so many new innovations continuously flowing into
this domain, how do I choose a platform?
Is it smart to wait for the space to settle and then see
who the clear “winner” is? Or, is there a way to get
started now?
In the streaming application
programming model, the
incoming data controls the
code, unlike normal
programming.
THE STREAMING
APPLICATION MODEL
6
The Allure of Open Source and the Dynamic
Nature of the Market
Several offerings on the market seek to establish themselves as Big
Data-processing platforms for large-scale data processing.
One of the best-known streaming technologies is Apache Storm.
Storm has been battle-tested by many of the largest enterprises in
the world, who routinely deal with large scale, real-time streaming
data sources, including:
Apache Storm is available as an open source software (OSS); as is
Spark Streaming — a comparatively new entrant for the Apache
Spark parallel-processing framework.
Cerner (Healthcare)
Groupon/Yahoo!/Yelp (Social media)
Aeris Communications (Telematics/IoT)
Verisign (Security)
Cloud Services
IoT
Data
Ingestion
Streaming
Analytics
Perishable
Insights
Historical
Insights
Data In-Motion
Data At-Rest
Hadoop
Analytical
Storage
Context
Enrichment
Future
Learning
IoT
The Internet
Applications
The Complex and Increasingly Expanding Data Map
7
Both Storm and Spark Streaming focus primarily on working directly
at the level of the stream processing platform and prove difficult to
integrate, often requiring additional tools. Despite the hype around
them, enterprises struggle with both offerings’ narrow focus and
seem to rarely get beyond the proof-of-concept (PoC) stage.
Why then, given the difficulty of incorporating these two open source
technologies, are they so popular? Enterprises typically lean toward
Apache Storm and Spark Streaming in the preliminary stages for
several reasons.
Both projects are well known in the open source ecosystem around
Hadoop. They are readily identifiable as alternative processing
engines, and therefore as MapReduce alternatives. As they are
available in open source, they can be easily acquired and installed to
support research and development (R&D), prototyping and PoC
projects.
Spark Streaming and Storm’s well-known capabilities have forced
companies to rethink their data-processing use cases and to consider
the possibilities of stream processing to reduce time-to-action. As
companies recognize the need for streaming analytics, the demand
for an enterprise-grade streaming platform in the market increases.
However, Storm and Spark Streaming are neither data-scientist nor
application-developer friendly. The most significant feature that they
lack is a graphical user interface (GUI) for building topologies. Without
this, only developers with an understanding of Java or Clojure can
build applications using these engines.
Data scientists and developers need intuitive visual tools to create
streaming apps. Programming is tedious, and with Spark Streaming
and Storm, developers are forced to manually account for scalability,
handle input data skews, hand-code fault tolerance for the applica-
tion and attempt to force event ordering/re-ordering.
Forrester's Wave report concedes that Storm and other open source
software technologies have generated a great deal of buzz, but
argues that they lack the support, functions, and amenities of
commercially-supported third-party offerings.
What should be done in this scenario?
The Need to Future-proof
8
One of the major obstacles that stands in the way of enterprises
deciding to move forward — to select a vendor and to jump into the
rapidly moving river of real-time streaming analytics — is the absence
of a future-proof guarantee. This is especially true right now as the
impact of innovation in this space is expected to continue, with
several new open source projects already underway.
The ideal real-time streaming analytics platform would have an
architecture with a common abstraction layer below the user
interface that allows the selection of one or more streaming engines
and change in engines — as your goals, requirements and strategies
change. This abstraction layer would also streamline upgrades and
embed new technologies into the existing system, based on
relevance and credibility, with time.
But can one access such state-of-the-art technology? YES. And there
is only one like it available on the market today. It’s been created by
Impetus Technologies and it is called StreamAnalytix.
A real-time streaming platform must meet the needs of data
scientists, developers and data center operations teams without
requiring extensive custom code or brittle integration of many
third-party components.
Maximizing the potential of data means analyzing data in motion as
well as data at rest — from the moment it is created and after it is
stored. Even better is to combine the two in an integrated lambda
architecture. Achieving such a scenario may be simpler than you
might think if you’ve got the right integration architecture in place,
and the right tools for building analytics and integrations.
As we stated earlier, the question companies are trying to answer is
this: How can we take advantage of new, innovative technologies
while protecting our investment?
We know that the trend in today’s Big Data world is to look for ways to
leverage open source. This is because open source drives the
potential of a vibrant stream of innovation coming from the global
community. As a result, breakthrough technology is available to
software developers at a low cost. This is not expected to slow down
anytime soon; in fact, it is picking up speed, and investment is flowing
into software companies that are pursuing this strategy.
The Ideal Platform
The technological advances
related to real-time streaming
analytics are moving and
changing as rapidly as data
itself.
ADVANCES IN REAL-TIME
STREAMING ANALYTICS
9
However, two main factors hinder adoption:
Firstly, unless software development is the industry you are in, you
may not have the extensive team of highly-experienced developers
required to code an application using open source, especially not
without an IDE that offers a GUI developers interface
And secondly, even if you did, you may fear that the speed with
which technology advances may leave you behind no matter what
The answer is three-fold. You need a platform that:
By providing a hybrid option, Impetus’ StreamAnalytix leverages the
best of both worlds from enterprise-class solutions and open source
while mitigating the disadvantages of both. In other words,
StreamAnalytix offers the reliability, manageability, vendor support
and professional services of an enterprise-class solution, while also
availing you of the community innovation, low-cost, lack of vendor
lock-in, and future proofing benefits that open source offers.
Provides an integrated development environment
Utilizes open source
Offers a level of abstraction above the open source engine that
insulates you from the changing technologies that could cause
disruptive change at the infrastructure level
StreamAnalytix offers the
reliability, manageability,
vendor supportand
professional services of an
enterprise-class solution,
while also availing you of the
community innovation,
low-cost, lack of vendor
lock-in, and future proofing
benefits that open source
offers.
THE STREAMANALYTIX
PLATFORM
10
Reliability
Large-scale efficiency
Robust intuitiveness
A future-proof and context-efficient abstraction architecture
The StreamAnalytix Solution
In an effort to protect you from the dilemma, risk, and expense of
building your own platform while not limiting or locking you into
up-and-coming future innovations, StreamAnalytix has abstracted the
graphical developer user interface above the level of the stream
processing engine.
StreamAnalytix (www.streamanalytix.com) is a state-of-the-art RTSA
platform combining enterprise class with open source. Built on an
engineering framework that articulates the essential requirements
for modern businesses, StreamAnalytix delivers:
Open, flexible, extensible, and easy-to-use, StreamAnalytix rides right
on the top of any standard Hadoop stack and is integrated with
Apache Storm.
It also works with other NoSQL options like Apache Cassandra and
Oracle NoSQL DB. Other choices for persistence and indexing can be
easily integrated.
ENTERPRISE GRADE - OPEN SOURCE BASED - STREAMING ANALYTIX PLATFORM
Unified Business Interfaces | Common Utilities | Smart Workflows
11
About StreamAnalytix
StreamAnalytix, an enterprise class real-time streaming analytics
platform, based on a best-of-breed Open Source stack, can help
organizations across industry verticals to quickly and reliably take
into production a wide range of streaming data applications.
It enables use cases in areas such as the IoT, sensor data analytics,
e-commerce and Internet advertising, security, fraud, insurance claim
validation, credit-line-management, call center analytics, and log
analytics.
It also enables enterprise IT and business transformation with
horizontal capabilities like Streaming ETL to speed up slow batch
processes to near-real-time.
Additionally, it is equipped to support multiple streaming engines and
customized extensions in order to deliver an unmatched level of
technology future proof investment protection.
Currently powered by Storm, StreamAnalytix is soon to be extended to
include Spark Streaming as an added underlying stream-processing
engine option. Additionally, Impetus Technologies has announced a
statement of direction that StreamAnalytix will not only continue to
support both Storm and Spark Streaming but will also evaluate other
technologies as they become available, thereby keeping you ahead of
the curve now and in the future.
To know more about the product and its architecture, visit:
www.streamanalytix.com.
About Impetus
Impetus is focused on creating big business impact through Big Data Solutions for
Fortune 1000 enterprises across multiple verticals. The company brings together a
unique mix of software products, consulting services, Data Science capabilities and
technology expertise. It offers full life-cycle services for Big Data implementations and
real-time streaming analytics, including technology strategy, solution architecture,
proof of concept, production implementation and on-going support to its clients.
To learn more, visit www.impetus.com or write to us at inquiry@impetus.com.
© 2015 Impetus Technologies, Inc.
All rights reserved. Product and
company names mentioned herein
may be trademarks of their
respective companies.
Feb 2015#81554
StreamAnalytix is equipped to
support multiple streaming
engines and customized
extensions in order to deliver
an unmatched level of
technology future-proof
investment protection
UNMATCHED, FUTURE-
PROOF TECHNOLOGY

More Related Content

What's hot

Turning Big Data to Business Advantage
Turning Big Data to Business AdvantageTurning Big Data to Business Advantage
Turning Big Data to Business AdvantageTeradata Aster
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to knowJane Brewer
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online caniceconsulting
 
The data ecosystem
The data ecosystemThe data ecosystem
The data ecosystemWGroup
 
IBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Software India
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sectorAnil Rana
 
AI & ML for Supply Chain Optimization
AI & ML for Supply Chain OptimizationAI & ML for Supply Chain Optimization
AI & ML for Supply Chain OptimizationShiSh Shridhar
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015Edgar Alejandro Villegas
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceSkillspeed
 
Latest trends in Business Analytics
Latest trends in Business AnalyticsLatest trends in Business Analytics
Latest trends in Business AnalyticsPuneet Bhalla
 
Governing Big Data : Principles and practices
Governing Big Data : Principles and practicesGoverning Big Data : Principles and practices
Governing Big Data : Principles and practicesPiyush Malik
 
What are Big Data, Data Science, and Data Analytics
 What are Big Data, Data Science, and Data Analytics What are Big Data, Data Science, and Data Analytics
What are Big Data, Data Science, and Data AnalyticsRay Business Technologies
 
K1 embedding big data & analytics into the business to deliver sustainable value
K1 embedding big data & analytics into the business to deliver sustainable valueK1 embedding big data & analytics into the business to deliver sustainable value
K1 embedding big data & analytics into the business to deliver sustainable valueDr. Wilfred Lin (Ph.D.)
 
Internet of things & predictive analytics
Internet of things & predictive analyticsInternet of things & predictive analytics
Internet of things & predictive analyticsPrasad Narasimhan
 

What's hot (20)

HP Vertica and IIS: Big Data = Big Literacy
HP Vertica and IIS: Big Data = Big LiteracyHP Vertica and IIS: Big Data = Big Literacy
HP Vertica and IIS: Big Data = Big Literacy
 
Turning Big Data to Business Advantage
Turning Big Data to Business AdvantageTurning Big Data to Business Advantage
Turning Big Data to Business Advantage
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to know
 
Big Data and Analytics - 2016 CFO
Big Data and Analytics - 2016 CFOBig Data and Analytics - 2016 CFO
Big Data and Analytics - 2016 CFO
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online
 
The data ecosystem
The data ecosystemThe data ecosystem
The data ecosystem
 
IoT Big Data Analytics Insights from Patents
IoT Big Data Analytics Insights from PatentsIoT Big Data Analytics Insights from Patents
IoT Big Data Analytics Insights from Patents
 
IBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big DataIBM Solutions Connect 2013 - Getting started with Big Data
IBM Solutions Connect 2013 - Getting started with Big Data
 
Big data analytics in banking sector
Big data analytics in banking sectorBig data analytics in banking sector
Big data analytics in banking sector
 
AI & ML for Supply Chain Optimization
AI & ML for Supply Chain OptimizationAI & ML for Supply Chain Optimization
AI & ML for Supply Chain Optimization
 
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015Analytics Trends 20145 -  Deloitte - us-da-analytics-analytics-trends-2015
Analytics Trends 20145 - Deloitte - us-da-analytics-analytics-trends-2015
 
BIg Data Trends in 2016
BIg Data Trends in 2016BIg Data Trends in 2016
BIg Data Trends in 2016
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in Finance
 
Latest trends in Business Analytics
Latest trends in Business AnalyticsLatest trends in Business Analytics
Latest trends in Business Analytics
 
Analytics 3.0: Opportunities for Healthcare
Analytics 3.0: Opportunities for HealthcareAnalytics 3.0: Opportunities for Healthcare
Analytics 3.0: Opportunities for Healthcare
 
Governing Big Data : Principles and practices
Governing Big Data : Principles and practicesGoverning Big Data : Principles and practices
Governing Big Data : Principles and practices
 
What are Big Data, Data Science, and Data Analytics
 What are Big Data, Data Science, and Data Analytics What are Big Data, Data Science, and Data Analytics
What are Big Data, Data Science, and Data Analytics
 
K1 embedding big data & analytics into the business to deliver sustainable value
K1 embedding big data & analytics into the business to deliver sustainable valueK1 embedding big data & analytics into the business to deliver sustainable value
K1 embedding big data & analytics into the business to deliver sustainable value
 
Internet of things & predictive analytics
Internet of things & predictive analyticsInternet of things & predictive analytics
Internet of things & predictive analytics
 
BigFastData
BigFastDataBigFastData
BigFastData
 

Viewers also liked

Connexions entre dispositius
Connexions entre dispositiusConnexions entre dispositius
Connexions entre dispositiusjuliagarriga
 
The responsibility to protect and the need to affect change: undocumented mi...
The responsibility to protect and the need to affect change:  undocumented mi...The responsibility to protect and the need to affect change:  undocumented mi...
The responsibility to protect and the need to affect change: undocumented mi...Jo Vearey
 
Visual research methods : some reflections
Visual research methods: some reflectionsVisual research methods: some reflections
Visual research methods : some reflectionsJo Vearey
 
Senedi ittifak - horozz.net
Senedi ittifak - horozz.netSenedi ittifak - horozz.net
Senedi ittifak - horozz.netAdnan Dan
 
Fact sheet about qb3
Fact sheet about qb3Fact sheet about qb3
Fact sheet about qb3neenakadaba
 
#BrusselsTogether at Crowdsourcing Week
#BrusselsTogether at Crowdsourcing Week#BrusselsTogether at Crowdsourcing Week
#BrusselsTogether at Crowdsourcing WeekXavier Damman
 
MIAMI - Toward Shared Prosperity As A Creative And Inclusive Global City
MIAMI - Toward Shared Prosperity As A Creative And Inclusive Global CityMIAMI - Toward Shared Prosperity As A Creative And Inclusive Global City
MIAMI - Toward Shared Prosperity As A Creative And Inclusive Global CityDoris Chang
 
Joseph szczygiel 201606 Publications
Joseph szczygiel 201606 PublicationsJoseph szczygiel 201606 Publications
Joseph szczygiel 201606 PublicationsJoseph SZCZYGIEL
 
Supply cchain Management of BMW
Supply cchain Management of BMWSupply cchain Management of BMW
Supply cchain Management of BMWAbu Sayem Rimon
 
7 Habits of Highly Effective Salespeople
7 Habits of Highly Effective Salespeople7 Habits of Highly Effective Salespeople
7 Habits of Highly Effective SalespeopleQamaru Dheen
 

Viewers also liked (14)

ICP Grant Yarde 2014
ICP Grant Yarde 2014ICP Grant Yarde 2014
ICP Grant Yarde 2014
 
Connexions entre dispositius
Connexions entre dispositiusConnexions entre dispositius
Connexions entre dispositius
 
The responsibility to protect and the need to affect change: undocumented mi...
The responsibility to protect and the need to affect change:  undocumented mi...The responsibility to protect and the need to affect change:  undocumented mi...
The responsibility to protect and the need to affect change: undocumented mi...
 
Visual research methods : some reflections
Visual research methods: some reflectionsVisual research methods: some reflections
Visual research methods : some reflections
 
Senedi ittifak - horozz.net
Senedi ittifak - horozz.netSenedi ittifak - horozz.net
Senedi ittifak - horozz.net
 
Fact sheet about qb3
Fact sheet about qb3Fact sheet about qb3
Fact sheet about qb3
 
#BrusselsTogether at Crowdsourcing Week
#BrusselsTogether at Crowdsourcing Week#BrusselsTogether at Crowdsourcing Week
#BrusselsTogether at Crowdsourcing Week
 
MIAMI - Toward Shared Prosperity As A Creative And Inclusive Global City
MIAMI - Toward Shared Prosperity As A Creative And Inclusive Global CityMIAMI - Toward Shared Prosperity As A Creative And Inclusive Global City
MIAMI - Toward Shared Prosperity As A Creative And Inclusive Global City
 
Joseph szczygiel 201606 Publications
Joseph szczygiel 201606 PublicationsJoseph szczygiel 201606 Publications
Joseph szczygiel 201606 Publications
 
Supply cchain Management of BMW
Supply cchain Management of BMWSupply cchain Management of BMW
Supply cchain Management of BMW
 
Efficacité personnelle
Efficacité personnelleEfficacité personnelle
Efficacité personnelle
 
Anti vegf' s in Ophthalmology
Anti vegf' s in OphthalmologyAnti vegf' s in Ophthalmology
Anti vegf' s in Ophthalmology
 
7 Habits of Highly Effective Salespeople
7 Habits of Highly Effective Salespeople7 Habits of Highly Effective Salespeople
7 Habits of Highly Effective Salespeople
 
シェアリングエコノミー推進に係る政府の取り組について(犬童周作)
シェアリングエコノミー推進に係る政府の取り組について(犬童周作)シェアリングエコノミー推進に係る政府の取り組について(犬童周作)
シェアリングエコノミー推進に係る政府の取り組について(犬童周作)
 

Similar to Future-proof-Architecture-for-Streaming-Data-Analytics-WhitePaper

7_considerations_final
7_considerations_final7_considerations_final
7_considerations_finalJane Roberts
 
These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...
These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...
These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...Bernard Marr
 
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING Goodbuzz Inc.
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraVin Malhotra
 
NI Automated Test Outlook 2016
NI Automated Test Outlook 2016NI Automated Test Outlook 2016
NI Automated Test Outlook 2016Hank Lydick
 
STS. Smarter devices. Smarter test systems.
STS. Smarter devices. Smarter test systems.STS. Smarter devices. Smarter test systems.
STS. Smarter devices. Smarter test systems.Hank Lydick
 
Forrester Wave - Big data streaming analytics platforms
Forrester Wave - Big data streaming analytics platformsForrester Wave - Big data streaming analytics platforms
Forrester Wave - Big data streaming analytics platformsIBM Software India
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Stuart Blair
 
it-CIO-Trend-Report-2017-The-Eight-Trends
it-CIO-Trend-Report-2017-The-Eight-Trendsit-CIO-Trend-Report-2017-The-Eight-Trends
it-CIO-Trend-Report-2017-The-Eight-TrendsDavid Glazer, MBA
 
Exciting it trends in 2015 why you should consider shifting and upgrading yo...
Exciting it trends in 2015  why you should consider shifting and upgrading yo...Exciting it trends in 2015  why you should consider shifting and upgrading yo...
Exciting it trends in 2015 why you should consider shifting and upgrading yo...lithanhall
 
Managing your Assets with Big Data Tools
Managing your Assets with Big Data ToolsManaging your Assets with Big Data Tools
Managing your Assets with Big Data ToolsMachinePulse
 
Data Has A Shelf Life: Why You Should Be Thinking About Real-Time Analytics
Data Has A Shelf Life: Why You Should Be Thinking About Real-Time AnalyticsData Has A Shelf Life: Why You Should Be Thinking About Real-Time Analytics
Data Has A Shelf Life: Why You Should Be Thinking About Real-Time AnalyticsBernard Marr
 
BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7Rohit Mittal
 
Analytics trends deloitte
Analytics trends deloitteAnalytics trends deloitte
Analytics trends deloitteMani Kansal
 
2015 Forrester Report
2015 Forrester Report2015 Forrester Report
2015 Forrester ReportAnthony Papp
 
CS309A Final Paper_KM_DD
CS309A Final Paper_KM_DDCS309A Final Paper_KM_DD
CS309A Final Paper_KM_DDDavid Darrough
 

Similar to Future-proof-Architecture-for-Streaming-Data-Analytics-WhitePaper (20)

7_considerations_final
7_considerations_final7_considerations_final
7_considerations_final
 
Transforming Big Data into business value
Transforming Big Data into business valueTransforming Big Data into business value
Transforming Big Data into business value
 
These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...
These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...
These Fascinating Examples Show Why Streaming Data And Real-Time Analytics Ma...
 
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING
 
Big Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin MalhotraBig Data & Analytics Trends 2016 Vin Malhotra
Big Data & Analytics Trends 2016 Vin Malhotra
 
NI Automated Test Outlook 2016
NI Automated Test Outlook 2016NI Automated Test Outlook 2016
NI Automated Test Outlook 2016
 
STS. Smarter devices. Smarter test systems.
STS. Smarter devices. Smarter test systems.STS. Smarter devices. Smarter test systems.
STS. Smarter devices. Smarter test systems.
 
Forrester Wave - Big data streaming analytics platforms
Forrester Wave - Big data streaming analytics platformsForrester Wave - Big data streaming analytics platforms
Forrester Wave - Big data streaming analytics platforms
 
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
Fast Data and Architecting the Digital Enterprise Fast Data drivers, componen...
 
it-CIO-Trend-Report-2017-The-Eight-Trends
it-CIO-Trend-Report-2017-The-Eight-Trendsit-CIO-Trend-Report-2017-The-Eight-Trends
it-CIO-Trend-Report-2017-The-Eight-Trends
 
Exciting it trends in 2015 why you should consider shifting and upgrading yo...
Exciting it trends in 2015  why you should consider shifting and upgrading yo...Exciting it trends in 2015  why you should consider shifting and upgrading yo...
Exciting it trends in 2015 why you should consider shifting and upgrading yo...
 
Managing your Assets with Big Data Tools
Managing your Assets with Big Data ToolsManaging your Assets with Big Data Tools
Managing your Assets with Big Data Tools
 
Data Has A Shelf Life: Why You Should Be Thinking About Real-Time Analytics
Data Has A Shelf Life: Why You Should Be Thinking About Real-Time AnalyticsData Has A Shelf Life: Why You Should Be Thinking About Real-Time Analytics
Data Has A Shelf Life: Why You Should Be Thinking About Real-Time Analytics
 
Unlocking big data
Unlocking big dataUnlocking big data
Unlocking big data
 
1 tihify
1 tihify1 tihify
1 tihify
 
BigData Analytics_1.7
BigData Analytics_1.7BigData Analytics_1.7
BigData Analytics_1.7
 
Analytics trends deloitte
Analytics trends deloitteAnalytics trends deloitte
Analytics trends deloitte
 
Forrester on Big Data
Forrester on Big DataForrester on Big Data
Forrester on Big Data
 
2015 Forrester Report
2015 Forrester Report2015 Forrester Report
2015 Forrester Report
 
CS309A Final Paper_KM_DD
CS309A Final Paper_KM_DDCS309A Final Paper_KM_DD
CS309A Final Paper_KM_DD
 

Future-proof-Architecture-for-Streaming-Data-Analytics-WhitePaper

  • 1. A Future-proof Architecture for Streaming Data Analytics This white paper explores strategies to leverage the steady flow of new, advanced real-time streaming data analytics (RTSA) application development technologies. It defines a thoughtful approach to capitalize on the window of opportunity to benefit from the power of real-time decision making now, and still be able to move to new and emerging technologies as they become enterprise ready.
  • 2. Are You Ready For Real-time? A re you ready to start reaping the benefits of streaming analytics, but are worried about investing in new technology that could rapidly be surpassed or become outdated? 2 Technology is, and always will be, a moving target. So how can you stay ahead of the curve, take advantage of the latest technologies, while also ensuring that you don’t make a mistake in your selection of a technology platform? The answer lies in a strategy called “future proofing.” The term “future proof” refers to the ability for something to continue to be of value, well into the future. This gives you the confidence that it will not quickly become obsolete. Obviously, future proofing provides a significant advantage in any overall technology strategy. In this paper, we explore the demand for streaming analytics, discuss some of the available open source engines, the challenges of adopting them, and introduce a future-proof platform that uses an integrated development environment (IDE) (abstracted at a level above the streaming analytics engine) — enabling you to ride the future innovation wave. Streaming analytics is fast becoming a must-have technology for enterprises seeking to transform their analytic to take advantage of “fast data” sources and build real-time or near real-time applications. Gartner has predicted that the data processing market will grow to a value of $55B by 2016, and that much of that value is held in the data, waiting to be exploited. According to Gartner, 70% of high-performing companies will manage their business processes using real-time analytics or extreme collaboration by 2016. Why? It’s all about acting on “perishable insight,” which is information that firms can only detect and act upon at a moment’s notice. Streaming analytics is the key to harnessing real-time flow of perishable insights originating from the Internet-of-Things (IoT), mobile phones, market data, sensors, Web clickstream, and transactions. Real-time streaming is essential for Big Data analytics because it delivers insights in real-time when they are most valuable. With the Big Data environment evolving daily and flooding companies with data, stream-processing capabilities are becoming increasingly important. Streaming allows organizations to take action in real-time and make critical decisions based on live data. The Value of Now 70% of high-performing companies will manage their business processes using real-time analytics or extreme collaboration by 2016. ACCORDING TO GARTNER
  • 3. How Can You Benefit From RTSA? 3 The availability of real-time information sources and the IoT represent a new source of revenue and profit, product innovation, and investment opportunity. However, the real value comes from harnessing the high volume, velocity, and range of variety of streaming data being captured to process and apply analytics in real-time. Real-time streaming analytics enable enterprises to collect, integrate, analyze, and visualize data as it is generated. It processes all the activities of the data as the data is being produced, without disrupting the activity of existing sources, storage, and enterprise systems. Here are a few examples of how leading enterprises are using real-time streaming analytics to gain deeper insights and respond more quickly to changing conditions and opportunities: CUT PREVENTABLE LOSSES Enterprises can cut preventable losses with real-time streaming analytics. Examples include: Financial Services The signs or signals of risky and/or fraudulent transactions can be imbedded in predictive intelligence models to alert firms to suspicious activities. Preventive Maintenance Sensors can detect and communicate wear-and-tear before they become machine failures that interrupt or cripple operations. Medical In urgent healthcare situations, patient monitoring systems require real-time responses to alert professionals to urgent issues that demand immediate attention or intervention. Brand Reputation Given the wide adoption and influence of social media, brands are being discussed anywhere at any time. Automated monitoring can alert brand managers of the activity, and give them time to contain, control, or contribute to the brand conversation. Disaster Warning Systems Earthquakes, tsunamis, tornadoes, and other potentially devastating events can often be detected through a variety of monitoring and global sensing systems. Real-time streaming analytics can bridge the time lag between detection and communication, allowing authorities greater time to mobilize emergency responses.
  • 4. 4 GAIN OPERATIONAL INSIGHTS Allows businesses to take swift actions and ensure uninterrupted operations. Examples include: IT Systems and Network Monitoring Traffic, load, and threat vulnerability – these can be seen and controlled dynamically as conditions change. Financial Transaction Processing Authentications, validations, fraud detection must be processed in real-time. Manufacturing Closed loop control systems can monitor processes for variations and make adjustments to compensate for out-of-limit conditions in real-time. Field Assets Monitoring and Alerting With stream analytics, remote human and machine activity can be sensed and responded to instantly. SEIZE NEW OPPORTUNITIES Real-time analytics can stream data processed for one particular purpose to new opportunities, delivering value and generating new revenue streams. Examples include: New Products A major running shoe manufacturer has added sensors to its shoes to track exercise routines, offering consumers valuable new health insights. Behavior Insights Media companies and operators can analyze audience behavior to provide insights to drive new services and business models. Soil Insights When outfitted with sensors, tractors can add value by providing insights into soil conditions, such as humidity, temperature, and composition to farmers.
  • 5. 5 The Big Questions The Complexity of an RTSA Platform Today, technology is finally building end-to-end components that make streaming analytics a reality. Tools that can process and analyze high-volume data streams are maturing rapidly, and new entrants — both commercial and open source — are arriving on the scene with surprising regularity. As a result, enterprise executives are asking themselves the following relevant questions: Streaming data is flowing by, and if it’s not captured, analyzed and put to immediate use, the opportunity slips away. Consequently, streaming analytics requires blazing fast ingestion, analysis, and actions on multiple sources of fast data. The streaming analytics platforms must be able to ingest, filter, aggregate, enrich, and analyze a high throughput of data from disparate live data sources to identify patterns, detect urgent situations, and automate immediate actions. That’s the first complexity — everything the platform must do, and how rapidly it must do it. The second major challenge is that real-time streaming platforms don’t resemble the types of analytic applications that enterprises are used to. According to Forrester analyst Mike Gualtieri in the Impetus webinar, Future Proofing Your Streaming Analytics Platform, the streaming application programming model is unfamiliar to most application developers. It’s a different paradigm from normal programming where code execution controls data. In streaming applications, the incoming data controls the code. What platform design will best address the needs of the market and drive rapid, large-scale adoption? With so many new innovations continuously flowing into this domain, how do I choose a platform? Is it smart to wait for the space to settle and then see who the clear “winner” is? Or, is there a way to get started now? In the streaming application programming model, the incoming data controls the code, unlike normal programming. THE STREAMING APPLICATION MODEL
  • 6. 6 The Allure of Open Source and the Dynamic Nature of the Market Several offerings on the market seek to establish themselves as Big Data-processing platforms for large-scale data processing. One of the best-known streaming technologies is Apache Storm. Storm has been battle-tested by many of the largest enterprises in the world, who routinely deal with large scale, real-time streaming data sources, including: Apache Storm is available as an open source software (OSS); as is Spark Streaming — a comparatively new entrant for the Apache Spark parallel-processing framework. Cerner (Healthcare) Groupon/Yahoo!/Yelp (Social media) Aeris Communications (Telematics/IoT) Verisign (Security) Cloud Services IoT Data Ingestion Streaming Analytics Perishable Insights Historical Insights Data In-Motion Data At-Rest Hadoop Analytical Storage Context Enrichment Future Learning IoT The Internet Applications The Complex and Increasingly Expanding Data Map
  • 7. 7 Both Storm and Spark Streaming focus primarily on working directly at the level of the stream processing platform and prove difficult to integrate, often requiring additional tools. Despite the hype around them, enterprises struggle with both offerings’ narrow focus and seem to rarely get beyond the proof-of-concept (PoC) stage. Why then, given the difficulty of incorporating these two open source technologies, are they so popular? Enterprises typically lean toward Apache Storm and Spark Streaming in the preliminary stages for several reasons. Both projects are well known in the open source ecosystem around Hadoop. They are readily identifiable as alternative processing engines, and therefore as MapReduce alternatives. As they are available in open source, they can be easily acquired and installed to support research and development (R&D), prototyping and PoC projects. Spark Streaming and Storm’s well-known capabilities have forced companies to rethink their data-processing use cases and to consider the possibilities of stream processing to reduce time-to-action. As companies recognize the need for streaming analytics, the demand for an enterprise-grade streaming platform in the market increases. However, Storm and Spark Streaming are neither data-scientist nor application-developer friendly. The most significant feature that they lack is a graphical user interface (GUI) for building topologies. Without this, only developers with an understanding of Java or Clojure can build applications using these engines. Data scientists and developers need intuitive visual tools to create streaming apps. Programming is tedious, and with Spark Streaming and Storm, developers are forced to manually account for scalability, handle input data skews, hand-code fault tolerance for the applica- tion and attempt to force event ordering/re-ordering. Forrester's Wave report concedes that Storm and other open source software technologies have generated a great deal of buzz, but argues that they lack the support, functions, and amenities of commercially-supported third-party offerings. What should be done in this scenario?
  • 8. The Need to Future-proof 8 One of the major obstacles that stands in the way of enterprises deciding to move forward — to select a vendor and to jump into the rapidly moving river of real-time streaming analytics — is the absence of a future-proof guarantee. This is especially true right now as the impact of innovation in this space is expected to continue, with several new open source projects already underway. The ideal real-time streaming analytics platform would have an architecture with a common abstraction layer below the user interface that allows the selection of one or more streaming engines and change in engines — as your goals, requirements and strategies change. This abstraction layer would also streamline upgrades and embed new technologies into the existing system, based on relevance and credibility, with time. But can one access such state-of-the-art technology? YES. And there is only one like it available on the market today. It’s been created by Impetus Technologies and it is called StreamAnalytix. A real-time streaming platform must meet the needs of data scientists, developers and data center operations teams without requiring extensive custom code or brittle integration of many third-party components. Maximizing the potential of data means analyzing data in motion as well as data at rest — from the moment it is created and after it is stored. Even better is to combine the two in an integrated lambda architecture. Achieving such a scenario may be simpler than you might think if you’ve got the right integration architecture in place, and the right tools for building analytics and integrations. As we stated earlier, the question companies are trying to answer is this: How can we take advantage of new, innovative technologies while protecting our investment? We know that the trend in today’s Big Data world is to look for ways to leverage open source. This is because open source drives the potential of a vibrant stream of innovation coming from the global community. As a result, breakthrough technology is available to software developers at a low cost. This is not expected to slow down anytime soon; in fact, it is picking up speed, and investment is flowing into software companies that are pursuing this strategy. The Ideal Platform The technological advances related to real-time streaming analytics are moving and changing as rapidly as data itself. ADVANCES IN REAL-TIME STREAMING ANALYTICS
  • 9. 9 However, two main factors hinder adoption: Firstly, unless software development is the industry you are in, you may not have the extensive team of highly-experienced developers required to code an application using open source, especially not without an IDE that offers a GUI developers interface And secondly, even if you did, you may fear that the speed with which technology advances may leave you behind no matter what The answer is three-fold. You need a platform that: By providing a hybrid option, Impetus’ StreamAnalytix leverages the best of both worlds from enterprise-class solutions and open source while mitigating the disadvantages of both. In other words, StreamAnalytix offers the reliability, manageability, vendor support and professional services of an enterprise-class solution, while also availing you of the community innovation, low-cost, lack of vendor lock-in, and future proofing benefits that open source offers. Provides an integrated development environment Utilizes open source Offers a level of abstraction above the open source engine that insulates you from the changing technologies that could cause disruptive change at the infrastructure level StreamAnalytix offers the reliability, manageability, vendor supportand professional services of an enterprise-class solution, while also availing you of the community innovation, low-cost, lack of vendor lock-in, and future proofing benefits that open source offers. THE STREAMANALYTIX PLATFORM
  • 10. 10 Reliability Large-scale efficiency Robust intuitiveness A future-proof and context-efficient abstraction architecture The StreamAnalytix Solution In an effort to protect you from the dilemma, risk, and expense of building your own platform while not limiting or locking you into up-and-coming future innovations, StreamAnalytix has abstracted the graphical developer user interface above the level of the stream processing engine. StreamAnalytix (www.streamanalytix.com) is a state-of-the-art RTSA platform combining enterprise class with open source. Built on an engineering framework that articulates the essential requirements for modern businesses, StreamAnalytix delivers: Open, flexible, extensible, and easy-to-use, StreamAnalytix rides right on the top of any standard Hadoop stack and is integrated with Apache Storm. It also works with other NoSQL options like Apache Cassandra and Oracle NoSQL DB. Other choices for persistence and indexing can be easily integrated. ENTERPRISE GRADE - OPEN SOURCE BASED - STREAMING ANALYTIX PLATFORM Unified Business Interfaces | Common Utilities | Smart Workflows
  • 11. 11 About StreamAnalytix StreamAnalytix, an enterprise class real-time streaming analytics platform, based on a best-of-breed Open Source stack, can help organizations across industry verticals to quickly and reliably take into production a wide range of streaming data applications. It enables use cases in areas such as the IoT, sensor data analytics, e-commerce and Internet advertising, security, fraud, insurance claim validation, credit-line-management, call center analytics, and log analytics. It also enables enterprise IT and business transformation with horizontal capabilities like Streaming ETL to speed up slow batch processes to near-real-time. Additionally, it is equipped to support multiple streaming engines and customized extensions in order to deliver an unmatched level of technology future proof investment protection. Currently powered by Storm, StreamAnalytix is soon to be extended to include Spark Streaming as an added underlying stream-processing engine option. Additionally, Impetus Technologies has announced a statement of direction that StreamAnalytix will not only continue to support both Storm and Spark Streaming but will also evaluate other technologies as they become available, thereby keeping you ahead of the curve now and in the future. To know more about the product and its architecture, visit: www.streamanalytix.com. About Impetus Impetus is focused on creating big business impact through Big Data Solutions for Fortune 1000 enterprises across multiple verticals. The company brings together a unique mix of software products, consulting services, Data Science capabilities and technology expertise. It offers full life-cycle services for Big Data implementations and real-time streaming analytics, including technology strategy, solution architecture, proof of concept, production implementation and on-going support to its clients. To learn more, visit www.impetus.com or write to us at inquiry@impetus.com. © 2015 Impetus Technologies, Inc. All rights reserved. Product and company names mentioned herein may be trademarks of their respective companies. Feb 2015#81554 StreamAnalytix is equipped to support multiple streaming engines and customized extensions in order to deliver an unmatched level of technology future-proof investment protection UNMATCHED, FUTURE- PROOF TECHNOLOGY