SlideShare a Scribd company logo
1 of 34
Download to read offline
SOLIX COMMON DATA PLATFORM:
Empowering the Data-driven Enterprise
Advanced Analytics and the Data-Driven Enterprise
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise1
EXECUTIVE SUMMARY
“You cannot manage what you cannot measure.”
Those prescient words from management guru Peter Drucker more than 30 years ago encapsulated the
evolution of enterprise software platforms and have paved the path to the era of the data-driven enterprise.
Data is remaking industries and reshaping the global economy. Those who embraced data found growth
areas and improved earnings. Organizations that ignore the promise of data can no longer survive in the
new world economy.
Today, Drucker’s words are as true as ever. To continue being data-driven, organizations must be able to
ingest and analyze new forms of data from constantly developing new sources. Those who are ready to mine
new data streams, such as social, IoT and more, are primed to transform economies and reap the benefits
with new growth opportunities. Businesses ready to leap into the fray are adopting Big Data. Big Data brings
together structured, semi-structured and unstructured data. When all forms of data are brought together,
it not only multiplies the value of every single piece of data, but it also presents a new set of challenges
around data storage, governance and consumption.
In today’s enterprise world, business users want to make real-time, data-driven decisions using the vast
amount of data available. Yet, IT departments are faced with the challenge of increasing storage and
Business Intelligence (BI) costs, complex governance and Information Lifecycle Management (ILM) for data,
which is now in the scale of petabytes and beyond. Unfortunately, current enterprise ready technology
offerings are not capable of managing this data tsunami, let alone take advantage of all the possibilities this
data offers. The tension created within organizations is clear.
	 As Forrester also points out in its research report, in the era of Big Data, traditional EDW is 		
	 failing to meet new business requirements, such as support for real-time and ad hoc customer 		
	 analytics, new sources of data, and self-service capabilities.1
The Solix Common Data Platform (CDP) allows organizations to embrace Big Data, while keeping the
challenges in check. The Solix CDP helps organizations leverage their existing infrastructure and allows
them to collect, store and analyze massive amounts of data from every source without sacrificing
governance, security or management. Further, with the Solix CDP all data keeps its original context and
structure, allowing organizations to ask complex questions and gain deep contextual insights from data at
any point. The Solix CDP creates a new paradigm fostering a meaningful, frictionless partnership between IT
departments and business users. IT departments can now become the guardians of data and business users
can become the owners and direct consumers of data.
Solix created the CDP to bring ILM to the Data Lake and innovation to the EDW. The Solix CDP is the next
evolution in the new enterprise blueprint, offering Enterprise Data Archiving and Enterprise Data Lake to
create an Advanced Analytics platform with unprecedented levels of ILM in a Big Data setting.
1
Forrester Report on The Next-Generation EDW is the Big Data Warehouse, August 2016
“
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise2
INTRODUCTION — SOLIX COMMON DATA PLATFORM
Solix CDP = Enterprise Archiving + Enterprise Data Lake + Information Governance
At the core of the Solix CDP are Enterprise Archive and The Enterprise Data Lake.
The Solix CDP utilizes the Solix Big Data Suite to provide comprehensive enterprise data management and
robust ILM. With the CDP, organizations can vastly expand the reach of analytics by creating an Advanced
Analytics platform. For the CIO, Enterprise Archiving offers a quick ROI that will ensure budgetary support
from the organization and dissolves the obstacles between Big Data and ILM.
The Solix CDP brings enterprise-grade capabilities to the Hadoop framework, addressing all shortcomings
of the Data Lake. Solix CDP provides uniform data collection, metadata management, ILM and secure data
access for Advanced Analytics.
The Solix CDP does this all while maximizing an organization’s existing infrastructure. With no need to
reboot the organization’s enterprise architecture, the Solix CDP harnesses the current architecture to
develop a new enterprise blueprint, capable of evolving with the business requirements of an organization.
The Solix CDP is also capable of evolution. As businesses stretch Hadoop to its limits, new Big Data
technologies will emerge. The Solix CDP is primed to adapt with them.
------------
------------------------
------------------------
-------------------------
-----------
SOLIX COMMON
DATA PLATFORM
ENTERPRISE
ARCHIVING
ENTERPRISEDATALAKE
INFO
R
M
ATION GOVERNANCE
The Solix CDP brings enterprise-grade capabilities to the Hadoop framework, addressing
all shortcomings of the Data Lake.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise3
WHY SOLIX COMMON DATA PLATFORM?
The need to become data-driven is clear. Transformation has hit every major industry and disruptors have
become powerhouses in the global economy based on their capabilities to mine data. Any organization
wanting to compete must become data-driven or it is destined to fail.
The current enterprise architecture offering, EDW, provides a canonical, top-down view of enterprise data to
meet end user requirements, but those views rarely satisfy the function-specific requirements of data-driven
applications.
	 As per Forrester research, Big Data platforms such as Hadoop have made Big Data architectures 		
	 more affordable, allowing companies to pursue new business insights for increased data-driven 		
	 competitive advantage.2
The Solix CDP, built on top of Hadoop distributions, enables data-driven organizations to gain more value
from their data because now data can be visualized in more specific ways.
The cost of relying on the EDW to collect and analyze all of this data would also exceed the budget of most
organizations. The Solix CDP is a uniform data collection system for structured, unstructured and semi-
structured data featuring low-cost data storage and Advanced Analytics. Solix CDP stores data “as-is” to
reduce costly Extract, Transform and Load (ETL) operations, as well as transforms data to feed downstream
NoSQL and analytics applications. Solix CDP enables organizations to create a true enterprise Data Lake
with full access to the data, rather than a data swamp where the data gets lost. This enables the CIO to find
a better solution than trying to collect and store all of the enterprise data in the expensive Tier 1 storage
and existing EDW architectural offerings.
The Solix CDP does not require costly infrastructure and offers the scalability and flexibility the Big Data
platform architecture provides, along with enterprise-grade governance and security. The Solix CDP lays
the foundation for information governance, efficient infrastructure utilization and Advanced Analytics at
petabyte scale.
The Solix CDP lays the foundation for information governance, efficient infrastructure
utilization and Advanced Analytics at petabyte scale.
“
2
Forrester Report on Big Data Fabric Drives Innovation and Growth, March 2016
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise4
Here is the comparison on how Solix CDP differs from a traditional Data Warehouse and a
Data Lake:
DATA WAREHOUSE DATA LAKE SOLIX CDP
Data Processed, Structured
Structured, Semi-structured/
Unstructured
Processed, Structured, Semi-structured/
Unstructured
Schema On write On read On read / On write
Storage Costs High Low Low
Scalability Low High High
Agility Low, Fixed configuration High, Configure & Reconfigure High, Configure & reconfigure
Metadata Repository Centralized MetaData Repository No Centralized MetaData Repository
Data Access Query Search Query + Search
Query Performance High Medium Medium
Security / Governance Mature Maturing Mature
Users Business Users Data Scientists
Business Users, Data Analysts, Data
Scientists
Role based Access Yes No Yes
ILM No No Yes
Regulatory Retention
Management
No No Yes
Legal Hold No No Yes
ROI High Low High
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise5
PRODUCT/ SOLUTION OVERVIEW OF THE SOLIX COMMON DATA PLATFORM
Built on top of Hadoop distributions, such as Cloudera CDH or Hortonworks HDP, the Solix CDP provides an
integrated suite of enterprise connectors through its Object Workbench to build a consolidated repository of
enterprise data and metadata. The Solix Analyst Workbench allows multiple teams and users to collaborate,
create virtual workspaces and projects to access the data without compromising on compliance and
security. Because it runs on top of both of the most popular Hadoop distributions, it eliminates one of the
basic questions behind the creation of a Hadoop stack, and it can bridge the two in environments where
both are being used.
The data-driven enterprise does not wait for the business question to develop and then use the data to
answer it. The data-driven organization uses Advanced Analytics and Business Intelligence to mine the data
for the questions and then the answers. With Solix CDP, business users can create data models and derive
the insights needed to move the organization forward. The self-service model takes IT out of the equation,
freeing it to focus on its work, while ensuring security and governance measures are also met. The Solix CDP
brings robust ILM to all data.
The Solix CDP ensures all data retains context by retaining its metadata, meaning its value is never lost.
This ensures the business questions being raised by the data are truly valid and the answers analysts find
are relevant.
The Solix CDP ensures all data retains context by retaining its metadata, meaning its
value is never lost.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise6
Enterprise Archiving
	 In the era of Big Data, Archiving is a No-Brainer Investment.3
Up to 80 percent of production data used by core applications is inactive. Data archiving has emerged as
an ILM best practice to meet data growth challenges. Solix CDP ensures that Enterprise Archiving improves
production application performance, reduces infrastructure costs and meets the regulatory and compliance
needs.
As part of Enterprise Archiving, application data running online is first moved into Tier 2 or Hadoop
infrastructure, and then purged from its source location, according to ILM policies. Data archiving best
practice requires that MOVE and PURGE processes be coordinated and validated. Enterprise Archiving
on Solix CDP ensures proper data governance since enterprise data is ingested and stored based on ILM
retention policies and business rules.
Archive data is classified for security and compliance requirements, such as legal hold, and universal access
is provided for business users through structured reports and full text search for business objects.
Semi/Unstructured Data
Universal Access
Native
Access
BI Reporting
Analytics
Solix Big Data
Suite
Archiving
Solix EDMS
Database
Archiving
Archive Database
DB
Active Data
Custom Apps
Structured Data
MOVE & COPY
MOVE, COPY, PRINT
Enterprise Business Record
Print Stream Capture
Search & Query Access
Retention Management and Legal Hold
SOLIX COMMON DATA PLATFORM
Semi-Active Data
(RDBMS)
InActive Data
(Hadoop ) Reporting / BI Tools
MACHINE
DATA
XML
VIDEOS EMAIL
IMAGES FILE
SHARE
Solix
BigData
Suite
Solix APM
(Repository, Query, Search)
3
Forrester Report on Vendor Landscape: Big Data Archiving, August 2015
Solix CDP ensures that Enterprise Archiving improves production application performance,
reduces infrastructure costs and meets the regulatory and compliance needs.
“
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise7
Enterprise Business Record (EBR)
An EBR is a de-normalized, point-in-time snapshot of a business transaction, which may include structured
or unstructured elements. The Solix CDP helps model, ingest and manage EBR data into a Hadoop optimized
file format that is fully accessible for text search or structured query.
Data can be ingested to build both a long-term Enterprise Archive and a transient Enterprise Data Lake. For
the archiving use case, older inactive data is moved from the source application to the Solix CDP. For the
Data Lake use case, current data can be transformed and then copied from the source application to the
Solix CDP.
EDW Augmentation
Currently enterprises are struggling to maintain costs associated with both storage and processing
capabilities around traditional EDW implementations. Offloading storage as well as costly ETL functions to a
commodity hardware such as Hadoop enables enterprises to focus on utilizing the existing Data Warehouse
infrastructure to its best ability in doing BI and Advanced Analytics.
Migrating warm or cold data from the EDW via archive onto low cost bulk storage system such as Hadoop
enables organizations to save millions on storage costs and significantly speeds up the processing power to
get more value from the data warehouse by extracting valuable insights at a quicker pace from the collected
data.
0 – 3 Years
3 – 10 Years
Complete business object
Denormalized structure
Point-in-Time snapshot
Decoupled from application
Search & Query
Retention Management
Legal Hold
Scheduler & CLI
Solix Application Portfolio Manager
Create
EBR
Ingest
into SBDS
After
Retention
Enterprise Application
Enterprise Business Record
AR Invoice
Transactions
Master Data
Reference Data
Attachments
Report Files
Solix Big Data Suite
Retention
Policies
Search Reports BI ReportsRole Based Security
LDAP/Active
Directory
Dashboards
SAP BO
Oracle BI
Crystal Reports
IBM Cognos
Currently enterprises are struggling to maintain costs associated with both storage and
processing capabilities around traditional EDW implementations.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise8
Enterprise Data Lake and Advanced Analytics
The Solix CDP based on Apache Hadoop establishes new capabilities for Advanced Analytics applications.
It stores data “as-is” eliminating the need for demanding ETL processes during ingestion. It captures and
maintains the metadata connected to each byte of data, which is half or more of the value of the data itself.
The Enterprise Data Lake may then be mined for critical business insights using text search, structured query
or further processing by downstream analytical applications. The Solix CDP utilizes either Hive or Spark
query frameworks dependent on the user requirements.
The Solix Enterprise Data Lake reduces the complexity and processing burden of staging EDW and analytics
applications and provides highly efficient, bulk storage of enterprise data for later use. Once resident within
HDFS, enterprise data may be more easily distilled and better described at petabyte-scale by business
analytics applications. This allows organizations to develop an enterprise architectural strategy that is
responsive to the business stakeholders without driving up the investment in hardware and software.
Information Governance
Analysts have warned that applying existing information governance practices to Big Data will result in
failure. Comprehensive information governance provided by Solix CDP establishes the control framework
necessary for proper data access control, data assessment, data discovery, data classification, data
validation, retention management, legal hold and privilege management.
DATA MART
AUDIO VIDEOS SOCIAL MEDIA
EMAIL DOCUMENTS
IMAGES
WORD DOCUMENTS
UNSTRUCTURED DATA
SEMI STRUCTURED DATA
JSON XML CSV
LOGS MACHINE DATA SENSORS
STRUCTURED DATA
CLOUD APPS ERP CUSTOM APPS
CRMDATA WAREHOUSE
DISCOVERY
SEARCH
STAGE
TRANSFORM
ARCHIVE
DATA LAKE
HIVEHIV
ANALYTICS REPORTINGDATA MINING
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise9
	
	 Forrester estimates that the average Hadoop repository doubles in size every year; some 		
	 implementations double in volume every month. More Hadoop silos are creating data challenges 	
	 around security, integration, governance and delivery.4
To achieve robust ILM, new security and governance measures must be put into place to match the variety
and complexity of the new data assets. The Solix CDP provides a true ILM continuum that addresses the
complexity of governance in the Big Data world, while ensuring governance for core enterprise applications
is not sacrificed. The Solix ILM framework manages the data within HDFS and provides an integrated
retention-management and legal-hold capabilities.
Structured and unstructured data from various data sources are migrated into HDFS with full data-validation
and audit reports. These reports provide the necessary defensibility and chain of custody for compliance
and data governance. ILM policies and business rules may be pre-configured to meet industry standard
compliance objectives, such as COBIT, or custom designed to meet more specific requirements.
Additionally, ILM also helps to solve the data growth problem by moving less frequently accessed data from
high-cost Tier 1 infrastructure to Hadoop, leveraging cheap commodity infrastructure. Relocating inactive
data to low-cost bulk data storage creates enormous infrastructure cost savings. Because governance, risk
and compliance concerns grow by the terabyte, the Solix CDP ensures ILM for data throughout its lifecycle.
IN
FORMATION GOVERNAN
CE
Create
Classify
Archive,
Retire
Secure,
Encrypt
Retain,Hold,Dispose
Monitor,
Assess
AUDITCONTROL
RECONCILE
4
Forrester Report on Big Data Fabric Drives Innovation and Growth, March 2016
“
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise10
COMPONENTS OF THE SOLIX COMMON DATA PLATFORM
Solix Object Workbench
Integrated Connectors
Solix Object Workbench provides integrated connectors that can extract and ingest vast amounts of data
“as-is” from an extensive set of enterprise data sources, including structured, semi-structured, unstructured
and streaming data sources. The Object Workbench provides functionality to copy, move, and transform data
from various data sources into the Solix CDP.
Extract, Transform and Load (ETL)
The Solix CDP Object Workbench also enables the ETL process to be undertaken as data is moved into the
Enterprise Data Lake. This provides the ability to transform complex application data into meaningful data in
a ready-to-use format from which the business user can gain immediate insight, with the use of BI tools.
Solix Virtual Printer
The Solix Virtual Printer provides functionality to capture print stream output from any application,
transform it into a PDF document, automatically ingest it into Hadoop, index it and make it available for
search access with full role-based security.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise11
The virtual printer can be used to supplement an archiving project by capturing key report output —
including all formatting — from the source application and storing it alongside the structured data.
The virtual printer can also be used to support a streamlined archiving approach called “print-and-purge.”
Using this approach, key documents, such as invoices or customer documents, are first “printed” by the
Solix Virtual printer and ingested into Hadoop, after which the underlying data from the source application
can be purged.
Real-Time and Streaming Data
	 Business users want data that’s integrated in real time from multiple sources, including legacy 		
	 data, social media, sensor data and weblogs, so they can make better decisions and increase		
	 their company’s competitiveness.5
Insights from enterprise data is now not restricted to formulated data repositories, which only contain
data at the end of its operational life. Huge amounts of data are now collected from both internet enabled
devices and also from terminals in real-time and streaming formats.
This data can also be captured via the Solix CDP Object Workbench, enabling teams to create views of data
to analyze and deliver actionable intelligence. This will enable enterprises across industries to become truly
data-driven.
Solix Big Data Suite
Solix Virtual Printer
Print Steam
Data LakeL k
ERP|CRM|HR|CustomApplications
DISCOVERY
SEARCH
STAGE
TRANSFORM
ARCHIVE
5
Forrester Report on Big Data Fabric Drives Innovation and Growth, March 2016
“
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise12
Solix Analyst Workbench
The Analyst Workbench is designed for business analysts, data scientists, and DBAs to securely access the
data within the Solix CDP and build virtual workspaces to manage analytics projects. All data within the
platform is automatically made searchable and reportable in a secure and governed manner.
Functionality included with the analyst workbench includes:
Data Lake Visualizer
The Data Lake Visualizer is a graphical inventory of the data contained in the lake. Using the visualizer the
data analyst can quickly find the data sets needed to complete their analytics assignment. Once the data
sets are identified they can be selected for inclusion in the analytics project.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise13
Virtual Projects and Workspaces
For each analytics assignment a virtual project can be created by the analyst. Within each project one or
more virtual workspaces can be created. The objects identified in the visualizer can then be virtually copied
into the workspace, eliminating the need to make physical copies of data.
Once the virtual workspace has been created, the data analyst can do data mashups by creating new
composite objects to support the analytics assignment.
Data Preparation
Data preparation is the foundation for becoming a data-driven organization. To properly use data it must
not only be collected from its ever-increasing variety of sources, it must also be put into a repository where
those varied forms can be used by the analysts.
	 As more organizations utilize Data Scientists and Advanced Business Analysts to 			
	 wrangle their data to enable digital transformation, the lack of proper tools hinders	
	 this progression. In fact, research shows that Data Scientists spend 80 percent of			
	 their time just cleaning data.
Every analytics project requires the proper preparation of the data set. The nature of the data — from
semi-structured data (such as log files), unstructured data (such as social, IoT) and structured data (such as
relational databases) – must be understood, organized and transformed quickly and efficiently. The Solix
CDP offers powerful, easy to use self-serve data preparation capabilities, including the ability to parse, clean,
join and enrich data, as well as populate missing information and calculate new metrics.
The Solix CDP utilizes the Spark framework. Spark runs in-memory within the cluster and provides machine
learning capabilities for faster and more advanced data preparation.
Search and Reporting Functionality
Solix CDP supports universal access to all enterprise data on a petabyte scale via text search, structured
query or further processing by downstream analytical applications. End users gain improved data-driven
results because their data is better able to be described.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise14
Information Governance
The Solix CDP provides the ability to govern all of the data within the Hadoop repository for compliance
and security. For example, automatically purge data based on a time-horizon, apply legal holds on files
and transactions, enforce Kerberos/LDAP authorization for user access and more. This level of security and
governance is able to be maintained because the CDP ensures all data retains its metadata.
Information governance establishes the control framework necessary for proper data access control, data
assessment, data discovery, data classification, data validation, retention management, legal hold and
privilege management.
Solix Common Data Platform API
To enable development of custom applications and integration with existing BI and Advanced Analytics
tools, the platform provides extensive APIs to access the unified repository. The API allows users to
seamlessly access data from the Data Lake to enable the data-driven enterprise.
Solix App Store
The Solix App Store makes inductive BI user-friendly. The App Store offers out-of-the-box analytics through
pre-integrated applications and also offers the opportunity to utilize third-party apps.
CURRENT
SOLIX ANALYTICS
APP STORE OFFERS
Cash-flow
Analytics
Sales/pipeline
Analytics
Integration with
standard BI tools
such as Tableau,
Splunk, etc.
An executive
Dashboard
Ad-hocreportingandSearchingTools
MarketingandSocialAnalytics
------------
------------------------
------------------------
------------------------
------------
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise15
DEPLOYMENT MODELS
The Solix CDP is enabled for a number of deployment models including bare metal infrastructure, data
center deployment, cloud infrastructure and also hosted multi-tenant deployment.
Solix never delivers a one-size-fits-all solution. The Solix team has experts ready to address customer needs
from IT, business use and financial perspectives. The Solix team will work to understand organizational
needs and then implement the best solution.
SPARK ON SOLIX CDP
The Solix CDP utilizes the Spark programing models. Spark runs in-memory within the cluster and does not
depend on the two-stage Hadoop MapReduce paradigm. Therefore, repeat access to data is much faster.
Spark relies on HDFS and runs on Hadoop YARN to be able to analyze the data stream.
Solix is committed to adding support for new Big Data tools as they appear in the fast-evolving Big Data
ecosystem. This future-proofs Solix CDP installations, an important commitment given the speed with which
the open source Big Data stack is evolving. It also simplifies the creation and evolution of an enterprise’s
Big Data environments.
BENEFITS OF THE SOLIX CDP
The benefits of the Solix CDP include:
•	 Combining the advantages of Hadoop with the ability to preserve the full metadata.
•	 Providing advanced ILM capabilities, including the ability to copy data from the data warehouse
and to archive older data.
•	 Supporting advanced data security, as well as third party analysis packages, including machine
learning and cognitive computing analysis of the data.
•	 Preserving all data in its original format and with full metadata and supporting established open
standard interfaces. It future-proofs the Data Lake, ensuring the data will be usable by the new
technologies and for new use cases that are as yet undefined.
•	 Providing a unified data governance layer from the time of data ingestion to use of data by
business users for operational insights and Advanced Analytics.
•	 Ability to utilize either Hive or Spark query frameworks dependent on the user requirements.
•	 Cloud, on-premise and hybrid deployment models.
•	 Working with all Hadoop distributions such as Cloudera and Hortonworks.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise16
The Solix CDP is the first solution to address all the data needs of an organization. From governance
to analytics, the Solix CDP works with an organization’s existing infrastructure to create a true ILM
continuum that ensures the onslaught of data can be an asset and not a hindrance to business growth and
development. The Solix CDP brings together the Enterprise Archiving, EDW and the Enterprise Data Lake
while preserving metadata, allowing for schema on read, analytics opportunities, low cost implementation
and maintenance as well as offering incredible scalability.
CONCLUSION
The era of game-changing digital disruption is here, and to thrive in this competitive environment,
organizations need leadership that can effectively leverage all the data to derive actionable insights to fuel
growth.
Reducing infrastructure costs, attaining operational efficiencies and deriving insights from BI and Advanced
Analytics is the desire of many organizations. Solix CDP maximizes the insights that can be achieved, while
reducing risk, ensuring compliance and governance to create a true ILM framework to lead organizations
into the future. The Solix CDP gives organizations all the tools necessary to lower the total cost of
ownership and satisfy the desire for return on investment.
Empowering the Data-driven Enterprise
Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise17
Solix Technologies, Inc.
4701 Patrick Henry Dr., Bldg 20
Santa Clara, CA 95054
Toll Free:	 +1.888.GO.SOLIX (+1.888.467.6549)
Telephone:	+1.408.654.6400
Fax:		 +1.408.562.0048
URL:		 http://www.solix.com
Copyright ©2016, Solix Technologies and/or its affiliates. All rights reserved.
This document is provided for information purposes only and the contents hereof are subject to change
without notice.
This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether
expressed orally or implied in law, including implied warranties and conditions of merchant- ability or
fitness for a particular purpose.
We specially disclaim any liability with respect to this document and no contractual obligations are formed
either directly or indirectly by this document. This document may not be reproduced or transmitted in any
form or by any means, electronic or mechanical, for any purpose, without our prior written permission.
Solix is a registered trademark of Solix Technologies and/or its affiliates. Other
names may be trademarks of their respectively.
Empowering the Data-driven Enterprise
The Next-Generation EDW Is The Big Data
Warehouse
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
by Noel Yuhanna
August 29, 2016
For Enterprise Architecture Professionals
forrester.com
Key Takeaways
Without Modernizing Your Current EDW
Platform, You Will Likely Fail
Business users are demanding faster, more real-
time, and integrated customer analytics from
multiple sources, so they can make better decisions
and increase their company’s competitiveness.
Current EDW platforms have gaps and limitations
that fail to meet these new requirements.
Forrester’s Big Data Warehouse Strategy
Extends The Existing EDW Framework
Based on interviews of customers and vendors,
Forrester has laid out an architecture to guide
enterprise architects in creating a big data
warehouse framework tailored to their firm’s
requirements to support both existing and new
actionable business insights.
You Need A Big Data Warehouse Strategy To
Succeed
Big data warehouse is a modern data
warehouse architecture that leverages traditional
and new data repositories, in-memory, cloud,
and other technologies.
Why Read This Report
EDW is not dead; it’s evolving! Enterprise data
warehouses have come a long way in delivering
value by predicting trends, minimizing churn,
and identifying new business opportunities.
However, in the era of big data, traditional EDW is
failing to meet new business requirements, such
as support for real-time and ad hoc customer
analytics, new sources of data, and self-service
capabilities. Enterprise architects should read this
report to learn how the new big data warehouse
addresses these gaps by delivering timely and
actionable insights to gain competitive edge and
enable innovation and growth.
2
6
10
12
13
© 2016 Forrester Research, Inc. Opinions reflect judgment at the time and are subject to change. Forrester®
,
Technographics®
, Forrester Wave, RoleView, TechRadar, and Total Economic Impact are trademarks of Forrester
Research, Inc. All other trademarks are the property of their respective companies. Unauthorized copying or
distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378
Forrester Research, Inc., 60 Acorn Park Drive, Cambridge, MA 02140 USA
+1 617-613-6000 | Fax: +1 617-613-5000 | forrester.com
Table Of Contents
EDW Has Been The Analytics Platform King
For Decades
But New Business Requirements Are
Changing EDW Requirements
EDW Technology Gaps Are Making
Enterprises Look Elsewhere
The Big Data Warehouse Extends The EDW
Platform
Big Data Fabric Connects The Superset Of
Your Data Sources — Including Your BDWs
The BDW Provides A Comprehensive View
And Integrated Analytics
The Major EDW Vendors Provide BDW
Components
BDW Use Cases Go Beyond Traditional
Analytics
Recommendations
Extend Your Current EDW Platforms Toward
A BDW Strategy
Supplemental Material
Notes & Resources
Forrester interviewed various customers in the
financial, oil and gas, retail, and healthcare
sectors.
Related Research Documents
Big Data Fabric Drives Innovation And Growth
The Forrester Wave™: Enterprise Data
Warehouse, Q4 2015
TechRadar™: Big Data, Q1 2016
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
by Noel Yuhanna
with Gene Leganza and Shreyas Warrier
August 29, 2016
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
2
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
EDW Has Been The Analytics Platform King For Decades
The enterprise data warehouse is an architecture, not a technology. The traditional EDW platform has
served and continues to serve a broad range of business users, including enterprise architecture (EA)
pros, feeding both analytical and operational systems. EDWs:
›› Organize and aggregate historical analytical data from functional domains. EDWs house
information from data subject areas such as customer, manufacturing, finance, and human
resources that align with key processes, applications, and roles. Most of the traditional EDW
platform has been built using relational database management system (DBMS) and columnar
database platforms using extract-transform-load (ETL), change data capture (CDC), and replication
technology (see Figure 1).
›› Offer a strong decision support framework. EDWs provide in-database analytics, predictive
models, and embedded business algorithms to drive business decisions.
›› Are central to a firm’s data ecosystem. The EDW is a proven ecosystem that supports integration
with data models and security frameworks, automation, and a broad range of business intelligence
(BI) and visualization tools.1
›› Provide the foundation for BI. EDWs support timely reports, ad hoc queries, and dashboards and
supply other analytics applications with trusted and integrated data. Many use the EDW to deliver
operational intelligence — in the form of query responses, reports, dashboards, charts, and other
analytic views — in support of various decision scenarios.
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
3
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
FIGURE 1 The Traditional Enterprise Data Warehouse Platform
Relational
Columnar
Business intelligence
Operational
reporting
Analytics
Predictive
analytics
Social
OLTP
CRM
ETL/CDC/replication
On-premises
• Modeling
• Data quality
• Security governance
transformation
• Integration
Hybrid
ERP
SaaS
Compute/processingStorage/persistenceSource
Cloud
But New Business Requirements Are Changing EDW Requirements
Today, business users are demanding real-time analytics that’s integrated from legacy, social, and
cloud sources, while business execs want self-service and autonomous access to fit-for-purpose
customer data insights. In our 2016 global survey, 59% of respondents stated that leveraging big data
and analytics was a critical or high priority (see Figure 2). But increasing data volume and dealing with
multimodel customer data are slowing down timely analytics and putting constraints on traditional
warehouse platforms, causing firms to revisit their EDW architectures. Businesses are reporting that
current EDW platforms:
›› Can’t share current data quickly enough for timely business decisions. With increasing big
data comes a major challenge for any enterprise: knowing what to look for and where, and then
making sense of it. In our survey, 30% of businesses reported growth of data volume and variety
affecting their BI strategy (see Figure 3). Firms are realizing that traditional data warehouses fall
short when it comes to real-time analytics.2
“With data explosion and increasing demand for real-time analytics by the business, we are
finding it challenging to support our LOB users. While we already use Hadoop, our traditional
data warehouses still are important for analytics, but we are now looking at modernizing that
architecture.” (Enterprise architect, oil and gas, North America)
›› Don’t support ad hoc and dynamic analytics for new customer trends. EDWs were built for a
limited set of uses, providing answers to known questions. But 27% of enterprises report that fast-
changing analytics and reporting requirements are one of the biggest challenges when orchestrating
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
4
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
their BI strategy, while 30% cite the growth and variety of their data. Processes using traditional
EDWs don’t scale well when you introduce ambiguity or add new and dynamic questions. EDWs
need to ingest, process, and curate data continuously and support dynamic insights.
“We are now looking to [build] a modern data warehouse that can provide insights to all kinds of
tough questions critical for our business to succeed. Including identifying business risks and
opportunity.” (Business analyst, financial services, Europe)
›› Don’t provide a self-service platform for strategic and operational decision-making. When
executives need to determine why something is happening or what the best course of action is,
they can’t wait for a data processing cycle to make data available. Analysts need to be able to
aggregate and prepare data sets without technology management’s involvement. Twenty-seven
percent of companies reported lack of end user self-service capabilities as one of the biggest
challenges in executing their BI strategy. Self-service customer analytics has become critical for
organizations to succeed.
“Self-service for all data is our long-term strategic direction, and we know it’ll take us some time
to get there, but we have to start somewhere. We have started to integrate our current EDW
appliances to Hadoop and in-memory to create [a] unified and integrated analytical platform.”
(Enterprise architect, financial services, North America)
FIGURE 2 Big Data And Analytics Have Become A Priority
0%
9%
28%
40%
19%
1%
Not on our agenda
Low priority
Moderate priority
High priority
Critical priority
Don’t know
“Which of the following initiatives are likely to be your organization’s top business priorities?”
(Better leverage big data and analytics in business decision-making)
Source: Forrester’s Global Business Technographics®
Data and Analytics Survey, 2016
Base: 3,343 data and analytics decision-makers
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
5
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
FIGURE 3 Data Growth And Variety Are Affecting Business Intelligence And Analytics Strategy
4%
16%
17%
17%
19%
20%
21%
22%
22%
25%
25%
25%
27%
30%
35%
Don’t know/does not apply
Lack of business C-level executive support
Widespread utilization of insights for
decision-making and planning
Inadequate change management programs
(communications, incentives, etc.)
Lack of access to data and insights
Lack of end user self-service capabilities
Lack of data standards
Legal and regulatory compliance
Inadequate or missing relevant internal skills
Poor data quality
Lack of adequate user training
Lack of alignment between IT and business
Fast-changing analytic and reporting requirements
Growth of data volume/variety
Data security and privacy
“What are the biggest challenges your firm faces when orchestrating
its business intelligence strategy?”
Source: Forrester’s Global Business Technographics®
Data and Analytics Survey, 2016
Base: 3,343 data and analytics decision-makers
EDW Technology Gaps Are Making Enterprises Look Elsewhere
While traditional data warehouses often took years to build, deploy, and reap benefits from, today’s
organizations want more simplified, agile, integrated, cost-effective, and automated solutions. Firms
are revisiting their EDW strategies, as they spend too much time loading, unloading, transforming,
securing, integrating, and curating customer data. Enterprises face:
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
6
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
›› A data volume explosion that’s affecting customer analytics. Traditional structured data
continues to grow rapidly, slowing down legacy data warehouse systems and affecting analytics
and timely insights. Regulatory requirements now mandate storing compliant data for several years,
and business growth is generating more data at a faster pace than ever before.
“We are experiencing tremendous data explosion for traditional data sets that’s impacting our data
warehouses. While we are still looking at improving the performance of existing data warehouses
for the short term, we are now starting to look at alternatives, both supplementary and replacement
as longer-term strategy.” (Enterprise architect, oil and gas, North America)
›› Data variety that’s making it harder to support using traditional warehouses. Business users
can’t easily spot patterns and trends in content such as documents, email, images, audio, and
social media. In addition, storing, processing, and accessing unstructured data in data warehouses
pushes the limits of traditional technologies and architectures, which were not designed to handle
such data types.3
›› Data speed that’s making it harder to keep up. New sources of data are coming in a lot faster,
such as sensor and machine data, log and clickstream data, cloud and software-as-a-service
(SaaS) data, and other streaming data. Storing, transforming, and processing such data requires
new technologies and systems to support new customer analytics, real-time analytics, and
operational intelligence reporting.4
“For us, real-time data sharing is critical internally among business users but also with various
partners that we engage with. Currently, not all of our data is available to everyone, but we are
looking at ways of expanding to support a more self-service real-time big data platform.” (Data
scientist, biotechnology company, North America)
The Big Data Warehouse Extends The EDW Platform
Firms are already using a variety of technologies in their big data strategy to support new, next-
generation analytics (see Figure 4). The big data warehouse (BDW) is a modern data warehouse
architecture that leverages traditional data warehouse architectures as well as modern big data
technologies (see Figure 5). Forrester defines the big data warehouse as:
A specialized, cohesive set of data repositories and platforms used to support a broad variety
of analytics running on-premises, in the cloud, or in a hybrid environment. BDW leverages both
traditional and new technologies such as Hadoop, columnar and row-based data warehouses,
ETL and streaming, and elastic in-memory and storage frameworks.
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
7
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
FIGURE 4 Cloud, Streaming, And Distributed In-Memory Are Already Part Of Firms’ Big Data Strategy
“Which of the following are included in your plans for big data?”
8%
16%
18%
22%
23%
23%
26%
26%
27%
28%
30%
33%
36%
40%
Don’t know
NoSQL other than Hadoop
A massively parallel processing (MPP)
data warehouse
Semantic technologies (ontology building,
search, autocuration, graph, etc.)
Hadoop (including Hbase or Accumulo)
Data anonymization or de-identification
Creating or building out a data lake
Marketing or digital data management platforms and
service providers that brand their offerings as . . .
Packaged analytics technologies that brand
themselves as big data
Unstructured data mining/analytics
Distributed in-memory databases, grids,
analytics tools
Streaming analytics/computing
Large-scale predictive modelling, data mining,
or other advanced analytics
Public cloud big data services
Source: Forrester’s Global Business Technographics®
Data and Analytics Survey, 2016
Base: 2,094 data and analytics decision-makers
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
8
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
FIGURE 5 Big Data Warehouse Architecture
Relational
Columnar
Apache Hadoop
Business
intelligence
Operational
reporting
Analytics
Predictive
analytics
Social
Devices
OLTP
ERP
ETL/CDC/replication
Machine learning
Data quality
Security
Governance
Hybrid
CRM
SaaS
Sensors Real-time
analytics
Streaming
Transformation
Integration
In-memory/
Apache Spark
Self-service
Ad hoc
interactions
Modeling
On-premises
Storage/compute
processing ManagementSources Use casesInteraction
Cloud
Big Data Fabric Connects The Superset Of Your Data Sources — Including Your BDWs
The big data warehouse is part of a larger big data fabric architecture, which embodies data from
multiple — potentially distributed — data sources, including BDWs and data lakes. The big data
fabric architecture enables integration, data quality, security, governance, data curation, data
preparation, and data management to support an end-to-end, real-time big data platform (see Figure
6).5
The two architectures:
›› Can exist separately but work best as complements. Multiple traditional EDWs, BDWs, and data
lakes have become the new norm to support the variety of analytical workloads. While both BDWs
and big data fabric architectures can exist independent of each other, typically firms leverage
both to deliver a blend of real-time and batch across various distributed enterprise data sets to
support broader use cases. For example, some financial services organizations use the BDW to
support mostly financial data analytics — leveraging columnar data warehouses, Hadoop, and ETL
technologies. The BDW also acts as a source within the big data fabric architecture that delivers
real-time customer analytics across BDW, Twitter, Salesforce, and clickstream data.
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
9
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
›› Vary significantly in the amount of data transformation required. We often see big data fabric
used for real-time analytical use cases that integrate data across many disparate sources, including
BDW, with the BDW used mostly for batch and near-real-time analytics for data stored in a data
warehouse and Hadoop clusters that require aggregation, transformation, and further processing
before becoming available to BI users or analytical processes. Exploration occurs within the fabric,
with transformations captured within the BDW.
FIGURE 6 Big Data Fabric Architecture Integrated With Big Data Warehouse
Big data fabric
Data ingestion
(streaming/replication/batch)
Processing and
persistence
On-premises sources Cloud sources
Hadoop
Spark
BDW
Hadoop
EDW
Spark
New York
Singapore
The BDW Provides A Comprehensive View And Integrated Analytics
A key component of the BDW architecture is the ability to leverage various specialized data
repositories such as traditional relational data warehouses, columnar data warehouses, and Hadoop.
Unlike traditional data warehouses, the BDW minimizes complexity and hides heterogeneity by
embodying a trusted model, supports all kinds of data types including unstructured data, and adapts
to changing business requirements more rapidly through a self-service platform. The BDW centralizes
administration of distributed data repositories, in-memory compute resources, metadata, storage,
access, and processing functions. It leverages new technologies such as:
›› Hadoop to support diverse data sets and distributed computing. By leveraging Hadoop, the
BDW enables organizations to deal with a wider variety of data structures than traditional EDWs.
Hadoop can also deal with extremely large data sets that are inappropriate for traditional EDW
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
10
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
platforms. Enterprise architects can choose to store data in relational, columnar, wide columns, or
Hadoop based on business needs. For example, a retailer leverages legacy structured data stored
in a traditional data warehouse, and Hadoop for clickstream data, and integrates them to deliver a
360-degree view of the customer for recommendations and churn analysis.
›› In-memory to enable faster customer analytical capabilities. A key component of the BDW is
the ability to use in-memory to deliver performance and faster access to business data. We are
heading toward having large memory platforms that will store petabytes in DRAM and Flash/SSD in
the coming years. For example, several retailers are using BDW to leverage customer-related data
to determine product discounting strategy, optimize product distribution across stores, and enable
personalized customer experiences.
›› Streaming engines to support new data channels for ingestion and processing. Market data,
clickstream, mobile devices, and sensors are new sources for analytical information that are not in
your existing data warehouse. Streaming technology boosts integrating, transforming, and curating
data on diverse data streams in real time.6
Integrating streaming technology with data platforms
such as Hadoop and Spark — as well as traditional data warehouses — has become critical. For
example, we see oil and gas industry firms leveraging streaming technology for insights into new
business opportunities, such as predicting staffing and resource requirements for various drilling
sites and performing machine failure analysis.
The Major EDW Vendors Provide BDW Components
From an implementation viewpoint, most enterprises are currently building BDW platforms themselves
by integrating their traditional data warehouses with Apache Spark, Hadoop, Storm, and in-memory
technologies. Forrester sees many enterprises already using an extract-Hadoop-load (EHL) approach to:
1.	 Extract data from various source systems such as traditional databases and flat files.
2.	Load data into Hadoop to perform aggregation and transformation using Apache Hadoop
ecosystem tools.
3.	 Finally load the result into the EDW platform.7
BDW Use Cases Go Beyond Traditional Analytics
Adoption of BDW architectures will accelerate as enterprises run into existing EDW challenges. But
building a BDW platform internally will require more time and effort, which will likely put pressure on
the overall business technology (BT) agenda. The good news is that solutions are starting to emerge
from vendors such as IBM, Microsoft, Oracle, SAP, Snowflake, and Teradata that provide some or all of
the components to build and deploy a BDW strategy.8
Enterprises are already using BDWs to support
social analytics, risk analysis, campaign analysis, fraud assessment, and pricing trends. The top BDW
use cases include:
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
11
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
›› Integrated analytics. A key challenge in the traditional EDW approach was that if data didn’t
exist in the warehouse, you couldn’t do any analytics — full stop. With BDW architecture, you can
perform integrated analytics across data warehouse and Hadoop clusters. Hadoop can store and
process large sets of semistructured and unstructured data, log files, and streaming data with ease.
For example, health research often requires looking at complex patient data and determining how
effective a treatment is likely to be based on factors like age, sex, and health status. The BDW
enables gathering and storing millions of data points in Hadoop and performing complex navigation
and modeling using traditional data warehouse and in-memory technology.
›› Internet-of-things (IoT) analytics. Traditional data warehouses don’t deal with IoT data.
However, the BDW offers the ability to store, process, and access large volumes of IoT data from
sensors and devices in Hadoop repositories efficiently through automation and machine learning
technologies. Manufacturers deal with highly sophisticated machinery to support their plants,
whether they’re building a car, airplane, or tire or bottling wine or soda. Every minute of machine
downtime can cost a manufacturer dearly. IoT analytics on BDW platforms enables manufacturers
to predict machine failures based on sensor data, minimizing or eliminating production slowdown.
›› Right-time business analytics. Traditional EDW architectures were based on mostly batch
processing, with ETL doing the heavy lifting of data from traditional systems to operational systems
to data warehouses. As a result, by the time data arrived in data warehouses, it was already 12 to
48 hours old. BDWs enable right-time analytics by leveraging streaming and replication with direct
access to data sources, whether on-premises or cloud, bypassing traditional ETL approaches. The
financial services industry has been an early adopter of BDW to support right-time analytics for
portfolio management, fraud detection, and asset management.
›› Adaptive, self-service analytics. Most EDWs use predefined data sources to deliver predictive
analytics, trends, and insights. The BDW enables organizations to dynamically leverage new data
sources quickly to deliver new insights. It enables self-service capabilities for business users to
ask complex and new questions so they can make more accurate decisions. The BDW adapts to
the new sources and can help correlate data using machine learning and adaptive intelligence. For
example, a major European bank recently built a BDW framework that business units now use to
support self-service for making better decisions on investments and risks. The platform represents
a major shift from the static reports the bank used previously.
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
12
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
Recommendations
Extend Your Current EDW Platforms Toward A BDW Strategy
Don’t throw away your existing EDW platform! The investments you have already made in EDWs will form
the foundation of the next-generation BDW strategy. However, attaining this demands that you rearchitect
your existing EDW platform and invest in new technologies to deliver on a new vision of right-time
analytics, self-service, and intelligent and contextualized customer analytics. Forrester recommends that
enterprise architects extend existing EDW platforms toward a BDW strategy by leveraging:
›› Hadoop for low-cost storage and processing of big data. Let Hadoop be the first stop for your
big data that has no other home in your data warehouse. Hadoop offers the ability to store very
large volumes of data (including unstructured data) more efficiently than traditional warehouses —
and at a fraction of cost. In addition, Hadoop helps you offload data from traditional warehouses
and leverage a distributed computing framework to perform transformation, aggregation, and
curation quickly.
›› In-memory technology to support right-time analytics. Without in-memory technology,
customer analytics, personalization, and right-time analytics will run slowly. This could cause you
to miss key trends like customer churn or miss the opportunity to offer new products and services
or identify weak markets. You can also use data from the BDW as part of the bigger big data
fabric framework that leverages distributed in-memory computing to deliver a broader enterprise
information fabric.
›› Hybrid platforms to support on-demand and scalable BDWs. Storing all of your data on-
premises need no longer be the default. Cloud platforms like those from Amazon Web Services,
Google, IBM, Microsoft (Azure), Oracle, and Rackspace offer pay-as-you-go facilities to store,
process, and access any amount of data.9
Hybrid is the new norm — look at utilizing both on-
premises and cloud data warehouse platforms as part of your BDW architecture, with a common
administration facility.
›› Vendor solutions that help achieve faster time-to-value. Data warehouse, Hadoop, and other big
data solutions from vendors such as Cloudera, Hortonworks, IBM, MapR Technologies, Microsoft,
Oracle, SAP, and Teradata can reduce time-to-value by automating and simplifying various BDW
functions and implementation steps. Look at vendors that support broader solutions and can
support your business data. Ask your vendor how it plans to provide the BDW vision. Review the
various components that the vendor has integrated and ask how it plans to fill any gaps.
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
13
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
Supplemental Material
Forrester’s Global Business Technographics® Data And Analytics Survey, 2016 was fielded in March
2016. This online survey included 3,343 respondents in Australia, Brazil, Canada, China, France,
Germany, India, New Zealand, the UK, and the US from companies with 100 or more employees.
Forrester’s Business Technographics ensures that the final survey population contains only those with
significant involvement in the planning, funding, and purchasing of business and technology products
and services. Research Now fielded this survey on behalf of Forrester. Survey respondent incentives
include points redeemable for gift certificates.
Please note that the brand questions included in this survey should not be used to measure market
share. The purpose of Forrester’s Business Technographics brand questions is to show usage of a
brand by a specific target audience at one point in time.
Engage With An Analyst
Gain greater confidence in your decisions by working with Forrester thought leaders to apply
our research to your specific business and technology initiatives.
Forrester’s research apps for iPhone®
and iPad®
Stay ahead of your competition no matter where you are.
Analyst Inquiry
To help you put research
into practice, connect
with an analyst to discuss
your questions in a
30-minute phone session
— or opt for a response
via email.
Learn more.
Analyst Advisory
Translate research into
action by working with
an analyst on a specific
engagement in the form
of custom strategy
sessions, workshops,
or speeches.
Learn more.
Webinar
Join our online sessions
on the latest research
affecting your business.
Each call includes analyst
Q&A and slides and is
available on-demand.
Learn more.
For Enterprise Architecture Professionals
The Next-Generation EDW Is The Big Data Warehouse
August 29, 2016
© 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law.
Citations@forrester.com or +1 866-367-7378
14
Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics
Endnotes
1
	Today, organizations still rely on EDW platforms to deliver actionable, timely, and trustworthy intelligence. EDW
technology organizes and aggregates analytical data from various functional domains and serves as a critical
repository for organizations’ operations. See the “The Forrester Wave™: Enterprise Data Warehouse, Q4 2015”
Forrester report.
2
	It takes a long time to measure a business process. Enterprise data hubs need to accommodate more data and an
infinite set of queries. See the “Create A Road Map For A Real-Time, Agile, Self-Service Data Platform” Forrester
report.
3
	 Data consumers — from casual data analysts to data scientists to your customers — are looking across a broad
variety of data today to find answers to their questions. See the “Compose Digital Data To Create A Symphony Of
Insight” Forrester report.
4
	 Data bottlenecks create business bottlenecks. The days of provisioning data to simply meet the requirements of
systems of record are over. Business stakeholders at the executive and line-of-business levels need data faster to
keep up with customers, competitors, and partners. See the “Create A Road Map For A Real-Time, Agile, Self-Service
Data Platform” Forrester report.
5
	 Forrester defines big data fabric as “bringing together disparate big data sources automatically, intelligently, and
securely, and processing them in a big data platform technology, such as Hadoop and Apache Spark, to deliver a
unified, trusted, and comprehensive view of customer and business data.” See the “Big Data Fabric Drives Innovation
And Growth” Forrester report.
6
	Streaming technology helps integrating, transforming, and curating data on diverse data streams in real time. See the
“The Forrester Wave™: Big Data Streaming Analytics, Q1 2016” Forrester report.
7
	 Forrester sees many enterprises already using an extract-Hadoop-load approach to extract data from various source
systems, such as IoT devices and cloud and traditional platforms, then load it into Hadoop, perform aggregation
and transformation, and finally load it into the EDW to support business analytics. See the “The Forrester Wave™:
Enterprise Data Warehouse, Q4 2015” Forrester report.
8
	 Most big data integration vendors focus on making classic processes faster with tools for moving data into a lake and
working with it there. Three innovative vendors — Looker Data Sciences, SnapLogic, and Snowflake Computing —
offer alternative approaches. See the “Breakout Vendors: Big Data Integration” Forrester report.
9
	 According to Forrester customer feedback, such cloud-based storage is typically over 20% less expensive than on-
premises deployment.
We work with business and technology leaders to develop
customer-obsessed strategies that drive growth.
Products and Services
›› Core research and tools
›› Data and analytics
›› Peer collaboration
›› Analyst engagement
›› Consulting
›› Events
Forrester Research (Nasdaq: FORR) is one of the most influential research and advisory firms in the world. We work with
business and technology leaders to develop customer-obsessed strategies that drive growth. Through proprietary
research, data, custom consulting, exclusive executive peer groups, and events, the Forrester experience is about a
singular and powerful purpose: to challenge the thinking of our clients to help them lead change in their organizations.
For more information, visit forrester.com.
Client support
For information on hard-copy or electronic reprints, please contact Client Support at
+1 866-367-7378, +1 617-613-5730, or clientsupport@forrester.com. We offer quantity
discounts and special pricing for academic and nonprofit institutions.
Forrester’s research and insights are tailored to your role and
critical business initiatives.
Roles We Serve
Marketing & Strategy
Professionals
CMO
B2B Marketing
B2C Marketing
Customer Experience
Customer Insights
eBusiness & Channel
Strategy
Technology Management
Professionals
CIO
Application Development
& Delivery
›› Enterprise Architecture
Infrastructure & Operations
Security & Risk
Sourcing & Vendor
Management
Technology Industry
Professionals
Analyst Relations
128005

More Related Content

What's hot

Dark Side Of Customer Analytics
Dark Side Of Customer AnalyticsDark Side Of Customer Analytics
Dark Side Of Customer Analyticspritishz
 
Toys R Us Japan Case Presentation
Toys R Us Japan Case PresentationToys R Us Japan Case Presentation
Toys R Us Japan Case PresentationJessica Gahtan
 
Group 4 scm assignment 2
Group 4 scm assignment 2Group 4 scm assignment 2
Group 4 scm assignment 2ZaidTergaon
 
Creating Your Data Governance Dashboard
Creating Your Data Governance DashboardCreating Your Data Governance Dashboard
Creating Your Data Governance DashboardTrillium Software
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothAdaryl "Bob" Wakefield, MBA
 
Intel 64 fund case analysis - Super Six Team lead by V.V.L.N. Sastry
Intel 64 fund case analysis - Super Six Team lead by V.V.L.N. SastryIntel 64 fund case analysis - Super Six Team lead by V.V.L.N. Sastry
Intel 64 fund case analysis - Super Six Team lead by V.V.L.N. SastryDr.V.V.L.N. Sastry
 
The michelin case
The michelin caseThe michelin case
The michelin caseSirris
 
Netflix Presentation
Netflix PresentationNetflix Presentation
Netflix PresentationTyler Massie
 
Nintendo Case Study
Nintendo Case StudyNintendo Case Study
Nintendo Case StudyÖzge Duman
 
Air India - Turnaround strategy
Air India - Turnaround strategyAir India - Turnaround strategy
Air India - Turnaround strategyMayank Agrawal
 
Progressive: Pay-As-You-Go Insurance
Progressive: Pay-As-You-Go InsuranceProgressive: Pay-As-You-Go Insurance
Progressive: Pay-As-You-Go InsuranceAnant Lodha
 
R Data Access from hdfs,spark,hive
R Data Access  from hdfs,spark,hiveR Data Access  from hdfs,spark,hive
R Data Access from hdfs,spark,hivearunkumar sadhasivam
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo
 
Harley davidson ppt
Harley davidson pptHarley davidson ppt
Harley davidson pptvishtime
 
Harley Davidson Case Study - Building Brand Communities
Harley Davidson Case Study - Building Brand CommunitiesHarley Davidson Case Study - Building Brand Communities
Harley Davidson Case Study - Building Brand CommunitiesCarmen Neghina
 

What's hot (20)

Dark Side Of Customer Analytics
Dark Side Of Customer AnalyticsDark Side Of Customer Analytics
Dark Side Of Customer Analytics
 
Toys R Us Japan Case Presentation
Toys R Us Japan Case PresentationToys R Us Japan Case Presentation
Toys R Us Japan Case Presentation
 
Blue Nile & Diamond Retailing
Blue Nile & Diamond RetailingBlue Nile & Diamond Retailing
Blue Nile & Diamond Retailing
 
Visual Data Vault
Visual Data VaultVisual Data Vault
Visual Data Vault
 
Group 4 scm assignment 2
Group 4 scm assignment 2Group 4 scm assignment 2
Group 4 scm assignment 2
 
Creating Your Data Governance Dashboard
Creating Your Data Governance DashboardCreating Your Data Governance Dashboard
Creating Your Data Governance Dashboard
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
 
Intel 64 fund case analysis - Super Six Team lead by V.V.L.N. Sastry
Intel 64 fund case analysis - Super Six Team lead by V.V.L.N. SastryIntel 64 fund case analysis - Super Six Team lead by V.V.L.N. Sastry
Intel 64 fund case analysis - Super Six Team lead by V.V.L.N. Sastry
 
The michelin case
The michelin caseThe michelin case
The michelin case
 
Netflix Presentation
Netflix PresentationNetflix Presentation
Netflix Presentation
 
Blue ocean strategy Ppt
Blue ocean strategy  Ppt Blue ocean strategy  Ppt
Blue ocean strategy Ppt
 
Nintendo Case Study
Nintendo Case StudyNintendo Case Study
Nintendo Case Study
 
Air India - Turnaround strategy
Air India - Turnaround strategyAir India - Turnaround strategy
Air India - Turnaround strategy
 
Progressive: Pay-As-You-Go Insurance
Progressive: Pay-As-You-Go InsuranceProgressive: Pay-As-You-Go Insurance
Progressive: Pay-As-You-Go Insurance
 
R Data Access from hdfs,spark,hive
R Data Access  from hdfs,spark,hiveR Data Access  from hdfs,spark,hive
R Data Access from hdfs,spark,hive
 
Harley davidson
Harley davidsonHarley davidson
Harley davidson
 
Dark side of customer analytics
Dark side of customer analyticsDark side of customer analytics
Dark side of customer analytics
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
Harley davidson ppt
Harley davidson pptHarley davidson ppt
Harley davidson ppt
 
Harley Davidson Case Study - Building Brand Communities
Harley Davidson Case Study - Building Brand CommunitiesHarley Davidson Case Study - Building Brand Communities
Harley Davidson Case Study - Building Brand Communities
 

Similar to Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise

Solix Big Data Suite
Solix Big Data SuiteSolix Big Data Suite
Solix Big Data SuiteLindaWatson19
 
The New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic QuadrantThe New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic QuadrantLindaWatson19
 
Solving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data SuiteSolving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data SuiteLindaWatson19
 
Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...LindaWatson19
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Denodo
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000Kartik Padmanabhan
 
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESBData Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESBDenodo
 
2016 Strata Conference New York - Vendor Briefings
2016 Strata Conference New York - Vendor Briefings2016 Strata Conference New York - Vendor Briefings
2016 Strata Conference New York - Vendor BriefingsDigital Enterprise Journal
 
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...Eric Javier Espino Man
 
Jump-Start the Enterprise Journey to the Cloud
Jump-Start the Enterprise Journey to the CloudJump-Start the Enterprise Journey to the Cloud
Jump-Start the Enterprise Journey to the CloudLindaWatson19
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Top 5 Emerging Trends in Data Integration
Top 5 Emerging Trends in Data IntegrationTop 5 Emerging Trends in Data Integration
Top 5 Emerging Trends in Data IntegrationJade Global
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Denodo
 
Open Source Ecosystem Future of Enterprise IT
Open Source Ecosystem Future of Enterprise ITOpen Source Ecosystem Future of Enterprise IT
Open Source Ecosystem Future of Enterprise ITandreas kuncoro
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperImpetus Technologies
 

Similar to Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise (20)

Solix Big Data Suite
Solix Big Data SuiteSolix Big Data Suite
Solix Big Data Suite
 
iri-highres
iri-highresiri-highres
iri-highres
 
The New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic QuadrantThe New Enterprise Blueprint featuring the Gartner Magic Quadrant
The New Enterprise Blueprint featuring the Gartner Magic Quadrant
 
Solving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data SuiteSolving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data Suite
 
Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...Top 10 guidelines for deploying modern data architecture for the data driven ...
Top 10 guidelines for deploying modern data architecture for the data driven ...
 
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
Analyst Webinar: Discover how a logical data fabric helps organizations avoid...
 
bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000bigdatasqloverview21jan2015-2408000
bigdatasqloverview21jan2015-2408000
 
IBM Cloud pak for data brochure
IBM Cloud pak for data   brochureIBM Cloud pak for data   brochure
IBM Cloud pak for data brochure
 
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESBData Integration Alternatives: When to use Data Virtualization, ETL, and ESB
Data Integration Alternatives: When to use Data Virtualization, ETL, and ESB
 
2016 Strata Conference New York - Vendor Briefings
2016 Strata Conference New York - Vendor Briefings2016 Strata Conference New York - Vendor Briefings
2016 Strata Conference New York - Vendor Briefings
 
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
 
Jump-Start the Enterprise Journey to the Cloud
Jump-Start the Enterprise Journey to the CloudJump-Start the Enterprise Journey to the Cloud
Jump-Start the Enterprise Journey to the Cloud
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Data Analytics - The Insight
Data Analytics - The InsightData Analytics - The Insight
Data Analytics - The Insight
 
Taming the data beast
Taming the data beastTaming the data beast
Taming the data beast
 
Top 5 Emerging Trends in Data Integration
Top 5 Emerging Trends in Data IntegrationTop 5 Emerging Trends in Data Integration
Top 5 Emerging Trends in Data Integration
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
 
Open Source Ecosystem Future of Enterprise IT
Open Source Ecosystem Future of Enterprise ITOpen Source Ecosystem Future of Enterprise IT
Open Source Ecosystem Future of Enterprise IT
 
Building a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White PaperBuilding a Big Data Analytics Platform- Impetus White Paper
Building a Big Data Analytics Platform- Impetus White Paper
 
pwc-data-mesh.pdf
pwc-data-mesh.pdfpwc-data-mesh.pdf
pwc-data-mesh.pdf
 

More from LindaWatson19

Data-driven Healthcare
Data-driven HealthcareData-driven Healthcare
Data-driven HealthcareLindaWatson19
 
How to Optimize ERP Upgrades
How to Optimize ERP UpgradesHow to Optimize ERP Upgrades
How to Optimize ERP UpgradesLindaWatson19
 
Solix Cloud – Managing Data Growth with Database Archiving and Application Re...
Solix Cloud – Managing Data Growth with Database Archiving and Application Re...Solix Cloud – Managing Data Growth with Database Archiving and Application Re...
Solix Cloud – Managing Data Growth with Database Archiving and Application Re...LindaWatson19
 
Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...
Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...
Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...LindaWatson19
 
Solix EDMS and Oracle Exadata: Transitioning to the Private Cloud
Solix EDMS and Oracle Exadata: Transitioning to the Private CloudSolix EDMS and Oracle Exadata: Transitioning to the Private Cloud
Solix EDMS and Oracle Exadata: Transitioning to the Private CloudLindaWatson19
 
Go Green to Save Green – Embracing Green Energy Practices
Go Green to Save Green – Embracing Green Energy PracticesGo Green to Save Green – Embracing Green Energy Practices
Go Green to Save Green – Embracing Green Energy PracticesLindaWatson19
 
The CIO guide to Big Data Archiving
The CIO guide to Big Data ArchivingThe CIO guide to Big Data Archiving
The CIO guide to Big Data ArchivingLindaWatson19
 
Extending Information Security to Non-Production Environments
Extending Information Security to Non-Production EnvironmentsExtending Information Security to Non-Production Environments
Extending Information Security to Non-Production EnvironmentsLindaWatson19
 
Enterprise Data Management “As-a-Service”
Enterprise Data Management “As-a-Service”Enterprise Data Management “As-a-Service”
Enterprise Data Management “As-a-Service”LindaWatson19
 
Database Archiving: The Key to Siebel Performance
Database Archiving: The Key to Siebel PerformanceDatabase Archiving: The Key to Siebel Performance
Database Archiving: The Key to Siebel PerformanceLindaWatson19
 
Data-driven Healthcare for the Pharmaceutical Industry
Data-driven Healthcare for the Pharmaceutical IndustryData-driven Healthcare for the Pharmaceutical Industry
Data-driven Healthcare for the Pharmaceutical IndustryLindaWatson19
 
Data-driven Healthcare for Providers
Data-driven Healthcare for ProvidersData-driven Healthcare for Providers
Data-driven Healthcare for ProvidersLindaWatson19
 
Data-driven Healthcare for Payers
Data-driven Healthcare for PayersData-driven Healthcare for Payers
Data-driven Healthcare for PayersLindaWatson19
 
Data-driven Healthcare for Manufacturers
Data-driven Healthcare for ManufacturersData-driven Healthcare for Manufacturers
Data-driven Healthcare for ManufacturersLindaWatson19
 
Data-driven Banking: Managing the Digital Transformation
Data-driven Banking: Managing the Digital TransformationData-driven Banking: Managing the Digital Transformation
Data-driven Banking: Managing the Digital TransformationLindaWatson19
 
Case Studies in Improving Application Performance With Solix Database Archivi...
Case Studies in Improving Application Performance With Solix Database Archivi...Case Studies in Improving Application Performance With Solix Database Archivi...
Case Studies in Improving Application Performance With Solix Database Archivi...LindaWatson19
 
Application Retirement – Road Map for Legacy Applications
Application Retirement – Road Map for Legacy ApplicationsApplication Retirement – Road Map for Legacy Applications
Application Retirement – Road Map for Legacy ApplicationsLindaWatson19
 
A New Frontier in Securing Sensitive Information – Taneja Group, April 2007
A New Frontier in Securing Sensitive Information – Taneja Group, April 2007A New Frontier in Securing Sensitive Information – Taneja Group, April 2007
A New Frontier in Securing Sensitive Information – Taneja Group, April 2007LindaWatson19
 
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...LindaWatson19
 

More from LindaWatson19 (19)

Data-driven Healthcare
Data-driven HealthcareData-driven Healthcare
Data-driven Healthcare
 
How to Optimize ERP Upgrades
How to Optimize ERP UpgradesHow to Optimize ERP Upgrades
How to Optimize ERP Upgrades
 
Solix Cloud – Managing Data Growth with Database Archiving and Application Re...
Solix Cloud – Managing Data Growth with Database Archiving and Application Re...Solix Cloud – Managing Data Growth with Database Archiving and Application Re...
Solix Cloud – Managing Data Growth with Database Archiving and Application Re...
 
Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...
Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...
Simplifying Enterprise Application Retirement with Solix ExAPPS – Industry’s ...
 
Solix EDMS and Oracle Exadata: Transitioning to the Private Cloud
Solix EDMS and Oracle Exadata: Transitioning to the Private CloudSolix EDMS and Oracle Exadata: Transitioning to the Private Cloud
Solix EDMS and Oracle Exadata: Transitioning to the Private Cloud
 
Go Green to Save Green – Embracing Green Energy Practices
Go Green to Save Green – Embracing Green Energy PracticesGo Green to Save Green – Embracing Green Energy Practices
Go Green to Save Green – Embracing Green Energy Practices
 
The CIO guide to Big Data Archiving
The CIO guide to Big Data ArchivingThe CIO guide to Big Data Archiving
The CIO guide to Big Data Archiving
 
Extending Information Security to Non-Production Environments
Extending Information Security to Non-Production EnvironmentsExtending Information Security to Non-Production Environments
Extending Information Security to Non-Production Environments
 
Enterprise Data Management “As-a-Service”
Enterprise Data Management “As-a-Service”Enterprise Data Management “As-a-Service”
Enterprise Data Management “As-a-Service”
 
Database Archiving: The Key to Siebel Performance
Database Archiving: The Key to Siebel PerformanceDatabase Archiving: The Key to Siebel Performance
Database Archiving: The Key to Siebel Performance
 
Data-driven Healthcare for the Pharmaceutical Industry
Data-driven Healthcare for the Pharmaceutical IndustryData-driven Healthcare for the Pharmaceutical Industry
Data-driven Healthcare for the Pharmaceutical Industry
 
Data-driven Healthcare for Providers
Data-driven Healthcare for ProvidersData-driven Healthcare for Providers
Data-driven Healthcare for Providers
 
Data-driven Healthcare for Payers
Data-driven Healthcare for PayersData-driven Healthcare for Payers
Data-driven Healthcare for Payers
 
Data-driven Healthcare for Manufacturers
Data-driven Healthcare for ManufacturersData-driven Healthcare for Manufacturers
Data-driven Healthcare for Manufacturers
 
Data-driven Banking: Managing the Digital Transformation
Data-driven Banking: Managing the Digital TransformationData-driven Banking: Managing the Digital Transformation
Data-driven Banking: Managing the Digital Transformation
 
Case Studies in Improving Application Performance With Solix Database Archivi...
Case Studies in Improving Application Performance With Solix Database Archivi...Case Studies in Improving Application Performance With Solix Database Archivi...
Case Studies in Improving Application Performance With Solix Database Archivi...
 
Application Retirement – Road Map for Legacy Applications
Application Retirement – Road Map for Legacy ApplicationsApplication Retirement – Road Map for Legacy Applications
Application Retirement – Road Map for Legacy Applications
 
A New Frontier in Securing Sensitive Information – Taneja Group, April 2007
A New Frontier in Securing Sensitive Information – Taneja Group, April 2007A New Frontier in Securing Sensitive Information – Taneja Group, April 2007
A New Frontier in Securing Sensitive Information – Taneja Group, April 2007
 
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
Enterprise Archiving with Apache Hadoop Featuring the 2015 Gartner Magic Quad...
 

Recently uploaded

Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxFurkanTasci3
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 

Recently uploaded (20)

Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
Data Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptxData Science Jobs and Salaries Analysis.pptx
Data Science Jobs and Salaries Analysis.pptx
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 

Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise

  • 1. SOLIX COMMON DATA PLATFORM: Empowering the Data-driven Enterprise Advanced Analytics and the Data-Driven Enterprise
  • 2. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise1 EXECUTIVE SUMMARY “You cannot manage what you cannot measure.” Those prescient words from management guru Peter Drucker more than 30 years ago encapsulated the evolution of enterprise software platforms and have paved the path to the era of the data-driven enterprise. Data is remaking industries and reshaping the global economy. Those who embraced data found growth areas and improved earnings. Organizations that ignore the promise of data can no longer survive in the new world economy. Today, Drucker’s words are as true as ever. To continue being data-driven, organizations must be able to ingest and analyze new forms of data from constantly developing new sources. Those who are ready to mine new data streams, such as social, IoT and more, are primed to transform economies and reap the benefits with new growth opportunities. Businesses ready to leap into the fray are adopting Big Data. Big Data brings together structured, semi-structured and unstructured data. When all forms of data are brought together, it not only multiplies the value of every single piece of data, but it also presents a new set of challenges around data storage, governance and consumption. In today’s enterprise world, business users want to make real-time, data-driven decisions using the vast amount of data available. Yet, IT departments are faced with the challenge of increasing storage and Business Intelligence (BI) costs, complex governance and Information Lifecycle Management (ILM) for data, which is now in the scale of petabytes and beyond. Unfortunately, current enterprise ready technology offerings are not capable of managing this data tsunami, let alone take advantage of all the possibilities this data offers. The tension created within organizations is clear. As Forrester also points out in its research report, in the era of Big Data, traditional EDW is failing to meet new business requirements, such as support for real-time and ad hoc customer analytics, new sources of data, and self-service capabilities.1 The Solix Common Data Platform (CDP) allows organizations to embrace Big Data, while keeping the challenges in check. The Solix CDP helps organizations leverage their existing infrastructure and allows them to collect, store and analyze massive amounts of data from every source without sacrificing governance, security or management. Further, with the Solix CDP all data keeps its original context and structure, allowing organizations to ask complex questions and gain deep contextual insights from data at any point. The Solix CDP creates a new paradigm fostering a meaningful, frictionless partnership between IT departments and business users. IT departments can now become the guardians of data and business users can become the owners and direct consumers of data. Solix created the CDP to bring ILM to the Data Lake and innovation to the EDW. The Solix CDP is the next evolution in the new enterprise blueprint, offering Enterprise Data Archiving and Enterprise Data Lake to create an Advanced Analytics platform with unprecedented levels of ILM in a Big Data setting. 1 Forrester Report on The Next-Generation EDW is the Big Data Warehouse, August 2016 “
  • 3. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise2 INTRODUCTION — SOLIX COMMON DATA PLATFORM Solix CDP = Enterprise Archiving + Enterprise Data Lake + Information Governance At the core of the Solix CDP are Enterprise Archive and The Enterprise Data Lake. The Solix CDP utilizes the Solix Big Data Suite to provide comprehensive enterprise data management and robust ILM. With the CDP, organizations can vastly expand the reach of analytics by creating an Advanced Analytics platform. For the CIO, Enterprise Archiving offers a quick ROI that will ensure budgetary support from the organization and dissolves the obstacles between Big Data and ILM. The Solix CDP brings enterprise-grade capabilities to the Hadoop framework, addressing all shortcomings of the Data Lake. Solix CDP provides uniform data collection, metadata management, ILM and secure data access for Advanced Analytics. The Solix CDP does this all while maximizing an organization’s existing infrastructure. With no need to reboot the organization’s enterprise architecture, the Solix CDP harnesses the current architecture to develop a new enterprise blueprint, capable of evolving with the business requirements of an organization. The Solix CDP is also capable of evolution. As businesses stretch Hadoop to its limits, new Big Data technologies will emerge. The Solix CDP is primed to adapt with them. ------------ ------------------------ ------------------------ ------------------------- ----------- SOLIX COMMON DATA PLATFORM ENTERPRISE ARCHIVING ENTERPRISEDATALAKE INFO R M ATION GOVERNANCE The Solix CDP brings enterprise-grade capabilities to the Hadoop framework, addressing all shortcomings of the Data Lake.
  • 4. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise3 WHY SOLIX COMMON DATA PLATFORM? The need to become data-driven is clear. Transformation has hit every major industry and disruptors have become powerhouses in the global economy based on their capabilities to mine data. Any organization wanting to compete must become data-driven or it is destined to fail. The current enterprise architecture offering, EDW, provides a canonical, top-down view of enterprise data to meet end user requirements, but those views rarely satisfy the function-specific requirements of data-driven applications. As per Forrester research, Big Data platforms such as Hadoop have made Big Data architectures more affordable, allowing companies to pursue new business insights for increased data-driven competitive advantage.2 The Solix CDP, built on top of Hadoop distributions, enables data-driven organizations to gain more value from their data because now data can be visualized in more specific ways. The cost of relying on the EDW to collect and analyze all of this data would also exceed the budget of most organizations. The Solix CDP is a uniform data collection system for structured, unstructured and semi- structured data featuring low-cost data storage and Advanced Analytics. Solix CDP stores data “as-is” to reduce costly Extract, Transform and Load (ETL) operations, as well as transforms data to feed downstream NoSQL and analytics applications. Solix CDP enables organizations to create a true enterprise Data Lake with full access to the data, rather than a data swamp where the data gets lost. This enables the CIO to find a better solution than trying to collect and store all of the enterprise data in the expensive Tier 1 storage and existing EDW architectural offerings. The Solix CDP does not require costly infrastructure and offers the scalability and flexibility the Big Data platform architecture provides, along with enterprise-grade governance and security. The Solix CDP lays the foundation for information governance, efficient infrastructure utilization and Advanced Analytics at petabyte scale. The Solix CDP lays the foundation for information governance, efficient infrastructure utilization and Advanced Analytics at petabyte scale. “ 2 Forrester Report on Big Data Fabric Drives Innovation and Growth, March 2016
  • 5. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise4 Here is the comparison on how Solix CDP differs from a traditional Data Warehouse and a Data Lake: DATA WAREHOUSE DATA LAKE SOLIX CDP Data Processed, Structured Structured, Semi-structured/ Unstructured Processed, Structured, Semi-structured/ Unstructured Schema On write On read On read / On write Storage Costs High Low Low Scalability Low High High Agility Low, Fixed configuration High, Configure & Reconfigure High, Configure & reconfigure Metadata Repository Centralized MetaData Repository No Centralized MetaData Repository Data Access Query Search Query + Search Query Performance High Medium Medium Security / Governance Mature Maturing Mature Users Business Users Data Scientists Business Users, Data Analysts, Data Scientists Role based Access Yes No Yes ILM No No Yes Regulatory Retention Management No No Yes Legal Hold No No Yes ROI High Low High
  • 6. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise5 PRODUCT/ SOLUTION OVERVIEW OF THE SOLIX COMMON DATA PLATFORM Built on top of Hadoop distributions, such as Cloudera CDH or Hortonworks HDP, the Solix CDP provides an integrated suite of enterprise connectors through its Object Workbench to build a consolidated repository of enterprise data and metadata. The Solix Analyst Workbench allows multiple teams and users to collaborate, create virtual workspaces and projects to access the data without compromising on compliance and security. Because it runs on top of both of the most popular Hadoop distributions, it eliminates one of the basic questions behind the creation of a Hadoop stack, and it can bridge the two in environments where both are being used. The data-driven enterprise does not wait for the business question to develop and then use the data to answer it. The data-driven organization uses Advanced Analytics and Business Intelligence to mine the data for the questions and then the answers. With Solix CDP, business users can create data models and derive the insights needed to move the organization forward. The self-service model takes IT out of the equation, freeing it to focus on its work, while ensuring security and governance measures are also met. The Solix CDP brings robust ILM to all data. The Solix CDP ensures all data retains context by retaining its metadata, meaning its value is never lost. This ensures the business questions being raised by the data are truly valid and the answers analysts find are relevant. The Solix CDP ensures all data retains context by retaining its metadata, meaning its value is never lost.
  • 7. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise6 Enterprise Archiving In the era of Big Data, Archiving is a No-Brainer Investment.3 Up to 80 percent of production data used by core applications is inactive. Data archiving has emerged as an ILM best practice to meet data growth challenges. Solix CDP ensures that Enterprise Archiving improves production application performance, reduces infrastructure costs and meets the regulatory and compliance needs. As part of Enterprise Archiving, application data running online is first moved into Tier 2 or Hadoop infrastructure, and then purged from its source location, according to ILM policies. Data archiving best practice requires that MOVE and PURGE processes be coordinated and validated. Enterprise Archiving on Solix CDP ensures proper data governance since enterprise data is ingested and stored based on ILM retention policies and business rules. Archive data is classified for security and compliance requirements, such as legal hold, and universal access is provided for business users through structured reports and full text search for business objects. Semi/Unstructured Data Universal Access Native Access BI Reporting Analytics Solix Big Data Suite Archiving Solix EDMS Database Archiving Archive Database DB Active Data Custom Apps Structured Data MOVE & COPY MOVE, COPY, PRINT Enterprise Business Record Print Stream Capture Search & Query Access Retention Management and Legal Hold SOLIX COMMON DATA PLATFORM Semi-Active Data (RDBMS) InActive Data (Hadoop ) Reporting / BI Tools MACHINE DATA XML VIDEOS EMAIL IMAGES FILE SHARE Solix BigData Suite Solix APM (Repository, Query, Search) 3 Forrester Report on Vendor Landscape: Big Data Archiving, August 2015 Solix CDP ensures that Enterprise Archiving improves production application performance, reduces infrastructure costs and meets the regulatory and compliance needs. “
  • 8. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise7 Enterprise Business Record (EBR) An EBR is a de-normalized, point-in-time snapshot of a business transaction, which may include structured or unstructured elements. The Solix CDP helps model, ingest and manage EBR data into a Hadoop optimized file format that is fully accessible for text search or structured query. Data can be ingested to build both a long-term Enterprise Archive and a transient Enterprise Data Lake. For the archiving use case, older inactive data is moved from the source application to the Solix CDP. For the Data Lake use case, current data can be transformed and then copied from the source application to the Solix CDP. EDW Augmentation Currently enterprises are struggling to maintain costs associated with both storage and processing capabilities around traditional EDW implementations. Offloading storage as well as costly ETL functions to a commodity hardware such as Hadoop enables enterprises to focus on utilizing the existing Data Warehouse infrastructure to its best ability in doing BI and Advanced Analytics. Migrating warm or cold data from the EDW via archive onto low cost bulk storage system such as Hadoop enables organizations to save millions on storage costs and significantly speeds up the processing power to get more value from the data warehouse by extracting valuable insights at a quicker pace from the collected data. 0 – 3 Years 3 – 10 Years Complete business object Denormalized structure Point-in-Time snapshot Decoupled from application Search & Query Retention Management Legal Hold Scheduler & CLI Solix Application Portfolio Manager Create EBR Ingest into SBDS After Retention Enterprise Application Enterprise Business Record AR Invoice Transactions Master Data Reference Data Attachments Report Files Solix Big Data Suite Retention Policies Search Reports BI ReportsRole Based Security LDAP/Active Directory Dashboards SAP BO Oracle BI Crystal Reports IBM Cognos Currently enterprises are struggling to maintain costs associated with both storage and processing capabilities around traditional EDW implementations.
  • 9. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise8 Enterprise Data Lake and Advanced Analytics The Solix CDP based on Apache Hadoop establishes new capabilities for Advanced Analytics applications. It stores data “as-is” eliminating the need for demanding ETL processes during ingestion. It captures and maintains the metadata connected to each byte of data, which is half or more of the value of the data itself. The Enterprise Data Lake may then be mined for critical business insights using text search, structured query or further processing by downstream analytical applications. The Solix CDP utilizes either Hive or Spark query frameworks dependent on the user requirements. The Solix Enterprise Data Lake reduces the complexity and processing burden of staging EDW and analytics applications and provides highly efficient, bulk storage of enterprise data for later use. Once resident within HDFS, enterprise data may be more easily distilled and better described at petabyte-scale by business analytics applications. This allows organizations to develop an enterprise architectural strategy that is responsive to the business stakeholders without driving up the investment in hardware and software. Information Governance Analysts have warned that applying existing information governance practices to Big Data will result in failure. Comprehensive information governance provided by Solix CDP establishes the control framework necessary for proper data access control, data assessment, data discovery, data classification, data validation, retention management, legal hold and privilege management. DATA MART AUDIO VIDEOS SOCIAL MEDIA EMAIL DOCUMENTS IMAGES WORD DOCUMENTS UNSTRUCTURED DATA SEMI STRUCTURED DATA JSON XML CSV LOGS MACHINE DATA SENSORS STRUCTURED DATA CLOUD APPS ERP CUSTOM APPS CRMDATA WAREHOUSE DISCOVERY SEARCH STAGE TRANSFORM ARCHIVE DATA LAKE HIVEHIV ANALYTICS REPORTINGDATA MINING
  • 10. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise9 Forrester estimates that the average Hadoop repository doubles in size every year; some implementations double in volume every month. More Hadoop silos are creating data challenges around security, integration, governance and delivery.4 To achieve robust ILM, new security and governance measures must be put into place to match the variety and complexity of the new data assets. The Solix CDP provides a true ILM continuum that addresses the complexity of governance in the Big Data world, while ensuring governance for core enterprise applications is not sacrificed. The Solix ILM framework manages the data within HDFS and provides an integrated retention-management and legal-hold capabilities. Structured and unstructured data from various data sources are migrated into HDFS with full data-validation and audit reports. These reports provide the necessary defensibility and chain of custody for compliance and data governance. ILM policies and business rules may be pre-configured to meet industry standard compliance objectives, such as COBIT, or custom designed to meet more specific requirements. Additionally, ILM also helps to solve the data growth problem by moving less frequently accessed data from high-cost Tier 1 infrastructure to Hadoop, leveraging cheap commodity infrastructure. Relocating inactive data to low-cost bulk data storage creates enormous infrastructure cost savings. Because governance, risk and compliance concerns grow by the terabyte, the Solix CDP ensures ILM for data throughout its lifecycle. IN FORMATION GOVERNAN CE Create Classify Archive, Retire Secure, Encrypt Retain,Hold,Dispose Monitor, Assess AUDITCONTROL RECONCILE 4 Forrester Report on Big Data Fabric Drives Innovation and Growth, March 2016 “
  • 11. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise10 COMPONENTS OF THE SOLIX COMMON DATA PLATFORM Solix Object Workbench Integrated Connectors Solix Object Workbench provides integrated connectors that can extract and ingest vast amounts of data “as-is” from an extensive set of enterprise data sources, including structured, semi-structured, unstructured and streaming data sources. The Object Workbench provides functionality to copy, move, and transform data from various data sources into the Solix CDP. Extract, Transform and Load (ETL) The Solix CDP Object Workbench also enables the ETL process to be undertaken as data is moved into the Enterprise Data Lake. This provides the ability to transform complex application data into meaningful data in a ready-to-use format from which the business user can gain immediate insight, with the use of BI tools. Solix Virtual Printer The Solix Virtual Printer provides functionality to capture print stream output from any application, transform it into a PDF document, automatically ingest it into Hadoop, index it and make it available for search access with full role-based security.
  • 12. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise11 The virtual printer can be used to supplement an archiving project by capturing key report output — including all formatting — from the source application and storing it alongside the structured data. The virtual printer can also be used to support a streamlined archiving approach called “print-and-purge.” Using this approach, key documents, such as invoices or customer documents, are first “printed” by the Solix Virtual printer and ingested into Hadoop, after which the underlying data from the source application can be purged. Real-Time and Streaming Data Business users want data that’s integrated in real time from multiple sources, including legacy data, social media, sensor data and weblogs, so they can make better decisions and increase their company’s competitiveness.5 Insights from enterprise data is now not restricted to formulated data repositories, which only contain data at the end of its operational life. Huge amounts of data are now collected from both internet enabled devices and also from terminals in real-time and streaming formats. This data can also be captured via the Solix CDP Object Workbench, enabling teams to create views of data to analyze and deliver actionable intelligence. This will enable enterprises across industries to become truly data-driven. Solix Big Data Suite Solix Virtual Printer Print Steam Data LakeL k ERP|CRM|HR|CustomApplications DISCOVERY SEARCH STAGE TRANSFORM ARCHIVE 5 Forrester Report on Big Data Fabric Drives Innovation and Growth, March 2016 “
  • 13. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise12 Solix Analyst Workbench The Analyst Workbench is designed for business analysts, data scientists, and DBAs to securely access the data within the Solix CDP and build virtual workspaces to manage analytics projects. All data within the platform is automatically made searchable and reportable in a secure and governed manner. Functionality included with the analyst workbench includes: Data Lake Visualizer The Data Lake Visualizer is a graphical inventory of the data contained in the lake. Using the visualizer the data analyst can quickly find the data sets needed to complete their analytics assignment. Once the data sets are identified they can be selected for inclusion in the analytics project.
  • 14. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise13 Virtual Projects and Workspaces For each analytics assignment a virtual project can be created by the analyst. Within each project one or more virtual workspaces can be created. The objects identified in the visualizer can then be virtually copied into the workspace, eliminating the need to make physical copies of data. Once the virtual workspace has been created, the data analyst can do data mashups by creating new composite objects to support the analytics assignment. Data Preparation Data preparation is the foundation for becoming a data-driven organization. To properly use data it must not only be collected from its ever-increasing variety of sources, it must also be put into a repository where those varied forms can be used by the analysts. As more organizations utilize Data Scientists and Advanced Business Analysts to wrangle their data to enable digital transformation, the lack of proper tools hinders this progression. In fact, research shows that Data Scientists spend 80 percent of their time just cleaning data. Every analytics project requires the proper preparation of the data set. The nature of the data — from semi-structured data (such as log files), unstructured data (such as social, IoT) and structured data (such as relational databases) – must be understood, organized and transformed quickly and efficiently. The Solix CDP offers powerful, easy to use self-serve data preparation capabilities, including the ability to parse, clean, join and enrich data, as well as populate missing information and calculate new metrics. The Solix CDP utilizes the Spark framework. Spark runs in-memory within the cluster and provides machine learning capabilities for faster and more advanced data preparation. Search and Reporting Functionality Solix CDP supports universal access to all enterprise data on a petabyte scale via text search, structured query or further processing by downstream analytical applications. End users gain improved data-driven results because their data is better able to be described.
  • 15. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise14 Information Governance The Solix CDP provides the ability to govern all of the data within the Hadoop repository for compliance and security. For example, automatically purge data based on a time-horizon, apply legal holds on files and transactions, enforce Kerberos/LDAP authorization for user access and more. This level of security and governance is able to be maintained because the CDP ensures all data retains its metadata. Information governance establishes the control framework necessary for proper data access control, data assessment, data discovery, data classification, data validation, retention management, legal hold and privilege management. Solix Common Data Platform API To enable development of custom applications and integration with existing BI and Advanced Analytics tools, the platform provides extensive APIs to access the unified repository. The API allows users to seamlessly access data from the Data Lake to enable the data-driven enterprise. Solix App Store The Solix App Store makes inductive BI user-friendly. The App Store offers out-of-the-box analytics through pre-integrated applications and also offers the opportunity to utilize third-party apps. CURRENT SOLIX ANALYTICS APP STORE OFFERS Cash-flow Analytics Sales/pipeline Analytics Integration with standard BI tools such as Tableau, Splunk, etc. An executive Dashboard Ad-hocreportingandSearchingTools MarketingandSocialAnalytics ------------ ------------------------ ------------------------ ------------------------ ------------
  • 16. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise15 DEPLOYMENT MODELS The Solix CDP is enabled for a number of deployment models including bare metal infrastructure, data center deployment, cloud infrastructure and also hosted multi-tenant deployment. Solix never delivers a one-size-fits-all solution. The Solix team has experts ready to address customer needs from IT, business use and financial perspectives. The Solix team will work to understand organizational needs and then implement the best solution. SPARK ON SOLIX CDP The Solix CDP utilizes the Spark programing models. Spark runs in-memory within the cluster and does not depend on the two-stage Hadoop MapReduce paradigm. Therefore, repeat access to data is much faster. Spark relies on HDFS and runs on Hadoop YARN to be able to analyze the data stream. Solix is committed to adding support for new Big Data tools as they appear in the fast-evolving Big Data ecosystem. This future-proofs Solix CDP installations, an important commitment given the speed with which the open source Big Data stack is evolving. It also simplifies the creation and evolution of an enterprise’s Big Data environments. BENEFITS OF THE SOLIX CDP The benefits of the Solix CDP include: • Combining the advantages of Hadoop with the ability to preserve the full metadata. • Providing advanced ILM capabilities, including the ability to copy data from the data warehouse and to archive older data. • Supporting advanced data security, as well as third party analysis packages, including machine learning and cognitive computing analysis of the data. • Preserving all data in its original format and with full metadata and supporting established open standard interfaces. It future-proofs the Data Lake, ensuring the data will be usable by the new technologies and for new use cases that are as yet undefined. • Providing a unified data governance layer from the time of data ingestion to use of data by business users for operational insights and Advanced Analytics. • Ability to utilize either Hive or Spark query frameworks dependent on the user requirements. • Cloud, on-premise and hybrid deployment models. • Working with all Hadoop distributions such as Cloudera and Hortonworks.
  • 17. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise16 The Solix CDP is the first solution to address all the data needs of an organization. From governance to analytics, the Solix CDP works with an organization’s existing infrastructure to create a true ILM continuum that ensures the onslaught of data can be an asset and not a hindrance to business growth and development. The Solix CDP brings together the Enterprise Archiving, EDW and the Enterprise Data Lake while preserving metadata, allowing for schema on read, analytics opportunities, low cost implementation and maintenance as well as offering incredible scalability. CONCLUSION The era of game-changing digital disruption is here, and to thrive in this competitive environment, organizations need leadership that can effectively leverage all the data to derive actionable insights to fuel growth. Reducing infrastructure costs, attaining operational efficiencies and deriving insights from BI and Advanced Analytics is the desire of many organizations. Solix CDP maximizes the insights that can be achieved, while reducing risk, ensuring compliance and governance to create a true ILM framework to lead organizations into the future. The Solix CDP gives organizations all the tools necessary to lower the total cost of ownership and satisfy the desire for return on investment.
  • 18. Empowering the Data-driven Enterprise Solix Common Data Platform: Advanced Analytics and the Data-Driven Enterprise17 Solix Technologies, Inc. 4701 Patrick Henry Dr., Bldg 20 Santa Clara, CA 95054 Toll Free: +1.888.GO.SOLIX (+1.888.467.6549) Telephone: +1.408.654.6400 Fax: +1.408.562.0048 URL: http://www.solix.com Copyright ©2016, Solix Technologies and/or its affiliates. All rights reserved. This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchant- ability or fitness for a particular purpose. We specially disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission. Solix is a registered trademark of Solix Technologies and/or its affiliates. Other names may be trademarks of their respectively. Empowering the Data-driven Enterprise
  • 19. The Next-Generation EDW Is The Big Data Warehouse Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics by Noel Yuhanna August 29, 2016 For Enterprise Architecture Professionals forrester.com Key Takeaways Without Modernizing Your Current EDW Platform, You Will Likely Fail Business users are demanding faster, more real- time, and integrated customer analytics from multiple sources, so they can make better decisions and increase their company’s competitiveness. Current EDW platforms have gaps and limitations that fail to meet these new requirements. Forrester’s Big Data Warehouse Strategy Extends The Existing EDW Framework Based on interviews of customers and vendors, Forrester has laid out an architecture to guide enterprise architects in creating a big data warehouse framework tailored to their firm’s requirements to support both existing and new actionable business insights. You Need A Big Data Warehouse Strategy To Succeed Big data warehouse is a modern data warehouse architecture that leverages traditional and new data repositories, in-memory, cloud, and other technologies. Why Read This Report EDW is not dead; it’s evolving! Enterprise data warehouses have come a long way in delivering value by predicting trends, minimizing churn, and identifying new business opportunities. However, in the era of big data, traditional EDW is failing to meet new business requirements, such as support for real-time and ad hoc customer analytics, new sources of data, and self-service capabilities. Enterprise architects should read this report to learn how the new big data warehouse addresses these gaps by delivering timely and actionable insights to gain competitive edge and enable innovation and growth.
  • 20. 2 6 10 12 13 © 2016 Forrester Research, Inc. Opinions reflect judgment at the time and are subject to change. Forrester® , Technographics® , Forrester Wave, RoleView, TechRadar, and Total Economic Impact are trademarks of Forrester Research, Inc. All other trademarks are the property of their respective companies. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 Forrester Research, Inc., 60 Acorn Park Drive, Cambridge, MA 02140 USA +1 617-613-6000 | Fax: +1 617-613-5000 | forrester.com Table Of Contents EDW Has Been The Analytics Platform King For Decades But New Business Requirements Are Changing EDW Requirements EDW Technology Gaps Are Making Enterprises Look Elsewhere The Big Data Warehouse Extends The EDW Platform Big Data Fabric Connects The Superset Of Your Data Sources — Including Your BDWs The BDW Provides A Comprehensive View And Integrated Analytics The Major EDW Vendors Provide BDW Components BDW Use Cases Go Beyond Traditional Analytics Recommendations Extend Your Current EDW Platforms Toward A BDW Strategy Supplemental Material Notes & Resources Forrester interviewed various customers in the financial, oil and gas, retail, and healthcare sectors. Related Research Documents Big Data Fabric Drives Innovation And Growth The Forrester Wave™: Enterprise Data Warehouse, Q4 2015 TechRadar™: Big Data, Q1 2016 For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics by Noel Yuhanna with Gene Leganza and Shreyas Warrier August 29, 2016
  • 21. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 2 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics EDW Has Been The Analytics Platform King For Decades The enterprise data warehouse is an architecture, not a technology. The traditional EDW platform has served and continues to serve a broad range of business users, including enterprise architecture (EA) pros, feeding both analytical and operational systems. EDWs: ›› Organize and aggregate historical analytical data from functional domains. EDWs house information from data subject areas such as customer, manufacturing, finance, and human resources that align with key processes, applications, and roles. Most of the traditional EDW platform has been built using relational database management system (DBMS) and columnar database platforms using extract-transform-load (ETL), change data capture (CDC), and replication technology (see Figure 1). ›› Offer a strong decision support framework. EDWs provide in-database analytics, predictive models, and embedded business algorithms to drive business decisions. ›› Are central to a firm’s data ecosystem. The EDW is a proven ecosystem that supports integration with data models and security frameworks, automation, and a broad range of business intelligence (BI) and visualization tools.1 ›› Provide the foundation for BI. EDWs support timely reports, ad hoc queries, and dashboards and supply other analytics applications with trusted and integrated data. Many use the EDW to deliver operational intelligence — in the form of query responses, reports, dashboards, charts, and other analytic views — in support of various decision scenarios.
  • 22. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 3 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics FIGURE 1 The Traditional Enterprise Data Warehouse Platform Relational Columnar Business intelligence Operational reporting Analytics Predictive analytics Social OLTP CRM ETL/CDC/replication On-premises • Modeling • Data quality • Security governance transformation • Integration Hybrid ERP SaaS Compute/processingStorage/persistenceSource Cloud But New Business Requirements Are Changing EDW Requirements Today, business users are demanding real-time analytics that’s integrated from legacy, social, and cloud sources, while business execs want self-service and autonomous access to fit-for-purpose customer data insights. In our 2016 global survey, 59% of respondents stated that leveraging big data and analytics was a critical or high priority (see Figure 2). But increasing data volume and dealing with multimodel customer data are slowing down timely analytics and putting constraints on traditional warehouse platforms, causing firms to revisit their EDW architectures. Businesses are reporting that current EDW platforms: ›› Can’t share current data quickly enough for timely business decisions. With increasing big data comes a major challenge for any enterprise: knowing what to look for and where, and then making sense of it. In our survey, 30% of businesses reported growth of data volume and variety affecting their BI strategy (see Figure 3). Firms are realizing that traditional data warehouses fall short when it comes to real-time analytics.2 “With data explosion and increasing demand for real-time analytics by the business, we are finding it challenging to support our LOB users. While we already use Hadoop, our traditional data warehouses still are important for analytics, but we are now looking at modernizing that architecture.” (Enterprise architect, oil and gas, North America) ›› Don’t support ad hoc and dynamic analytics for new customer trends. EDWs were built for a limited set of uses, providing answers to known questions. But 27% of enterprises report that fast- changing analytics and reporting requirements are one of the biggest challenges when orchestrating
  • 23. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 4 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics their BI strategy, while 30% cite the growth and variety of their data. Processes using traditional EDWs don’t scale well when you introduce ambiguity or add new and dynamic questions. EDWs need to ingest, process, and curate data continuously and support dynamic insights. “We are now looking to [build] a modern data warehouse that can provide insights to all kinds of tough questions critical for our business to succeed. Including identifying business risks and opportunity.” (Business analyst, financial services, Europe) ›› Don’t provide a self-service platform for strategic and operational decision-making. When executives need to determine why something is happening or what the best course of action is, they can’t wait for a data processing cycle to make data available. Analysts need to be able to aggregate and prepare data sets without technology management’s involvement. Twenty-seven percent of companies reported lack of end user self-service capabilities as one of the biggest challenges in executing their BI strategy. Self-service customer analytics has become critical for organizations to succeed. “Self-service for all data is our long-term strategic direction, and we know it’ll take us some time to get there, but we have to start somewhere. We have started to integrate our current EDW appliances to Hadoop and in-memory to create [a] unified and integrated analytical platform.” (Enterprise architect, financial services, North America) FIGURE 2 Big Data And Analytics Have Become A Priority 0% 9% 28% 40% 19% 1% Not on our agenda Low priority Moderate priority High priority Critical priority Don’t know “Which of the following initiatives are likely to be your organization’s top business priorities?” (Better leverage big data and analytics in business decision-making) Source: Forrester’s Global Business Technographics® Data and Analytics Survey, 2016 Base: 3,343 data and analytics decision-makers
  • 24. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 5 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics FIGURE 3 Data Growth And Variety Are Affecting Business Intelligence And Analytics Strategy 4% 16% 17% 17% 19% 20% 21% 22% 22% 25% 25% 25% 27% 30% 35% Don’t know/does not apply Lack of business C-level executive support Widespread utilization of insights for decision-making and planning Inadequate change management programs (communications, incentives, etc.) Lack of access to data and insights Lack of end user self-service capabilities Lack of data standards Legal and regulatory compliance Inadequate or missing relevant internal skills Poor data quality Lack of adequate user training Lack of alignment between IT and business Fast-changing analytic and reporting requirements Growth of data volume/variety Data security and privacy “What are the biggest challenges your firm faces when orchestrating its business intelligence strategy?” Source: Forrester’s Global Business Technographics® Data and Analytics Survey, 2016 Base: 3,343 data and analytics decision-makers EDW Technology Gaps Are Making Enterprises Look Elsewhere While traditional data warehouses often took years to build, deploy, and reap benefits from, today’s organizations want more simplified, agile, integrated, cost-effective, and automated solutions. Firms are revisiting their EDW strategies, as they spend too much time loading, unloading, transforming, securing, integrating, and curating customer data. Enterprises face:
  • 25. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 6 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics ›› A data volume explosion that’s affecting customer analytics. Traditional structured data continues to grow rapidly, slowing down legacy data warehouse systems and affecting analytics and timely insights. Regulatory requirements now mandate storing compliant data for several years, and business growth is generating more data at a faster pace than ever before. “We are experiencing tremendous data explosion for traditional data sets that’s impacting our data warehouses. While we are still looking at improving the performance of existing data warehouses for the short term, we are now starting to look at alternatives, both supplementary and replacement as longer-term strategy.” (Enterprise architect, oil and gas, North America) ›› Data variety that’s making it harder to support using traditional warehouses. Business users can’t easily spot patterns and trends in content such as documents, email, images, audio, and social media. In addition, storing, processing, and accessing unstructured data in data warehouses pushes the limits of traditional technologies and architectures, which were not designed to handle such data types.3 ›› Data speed that’s making it harder to keep up. New sources of data are coming in a lot faster, such as sensor and machine data, log and clickstream data, cloud and software-as-a-service (SaaS) data, and other streaming data. Storing, transforming, and processing such data requires new technologies and systems to support new customer analytics, real-time analytics, and operational intelligence reporting.4 “For us, real-time data sharing is critical internally among business users but also with various partners that we engage with. Currently, not all of our data is available to everyone, but we are looking at ways of expanding to support a more self-service real-time big data platform.” (Data scientist, biotechnology company, North America) The Big Data Warehouse Extends The EDW Platform Firms are already using a variety of technologies in their big data strategy to support new, next- generation analytics (see Figure 4). The big data warehouse (BDW) is a modern data warehouse architecture that leverages traditional data warehouse architectures as well as modern big data technologies (see Figure 5). Forrester defines the big data warehouse as: A specialized, cohesive set of data repositories and platforms used to support a broad variety of analytics running on-premises, in the cloud, or in a hybrid environment. BDW leverages both traditional and new technologies such as Hadoop, columnar and row-based data warehouses, ETL and streaming, and elastic in-memory and storage frameworks.
  • 26. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 7 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics FIGURE 4 Cloud, Streaming, And Distributed In-Memory Are Already Part Of Firms’ Big Data Strategy “Which of the following are included in your plans for big data?” 8% 16% 18% 22% 23% 23% 26% 26% 27% 28% 30% 33% 36% 40% Don’t know NoSQL other than Hadoop A massively parallel processing (MPP) data warehouse Semantic technologies (ontology building, search, autocuration, graph, etc.) Hadoop (including Hbase or Accumulo) Data anonymization or de-identification Creating or building out a data lake Marketing or digital data management platforms and service providers that brand their offerings as . . . Packaged analytics technologies that brand themselves as big data Unstructured data mining/analytics Distributed in-memory databases, grids, analytics tools Streaming analytics/computing Large-scale predictive modelling, data mining, or other advanced analytics Public cloud big data services Source: Forrester’s Global Business Technographics® Data and Analytics Survey, 2016 Base: 2,094 data and analytics decision-makers
  • 27. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 8 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics FIGURE 5 Big Data Warehouse Architecture Relational Columnar Apache Hadoop Business intelligence Operational reporting Analytics Predictive analytics Social Devices OLTP ERP ETL/CDC/replication Machine learning Data quality Security Governance Hybrid CRM SaaS Sensors Real-time analytics Streaming Transformation Integration In-memory/ Apache Spark Self-service Ad hoc interactions Modeling On-premises Storage/compute processing ManagementSources Use casesInteraction Cloud Big Data Fabric Connects The Superset Of Your Data Sources — Including Your BDWs The big data warehouse is part of a larger big data fabric architecture, which embodies data from multiple — potentially distributed — data sources, including BDWs and data lakes. The big data fabric architecture enables integration, data quality, security, governance, data curation, data preparation, and data management to support an end-to-end, real-time big data platform (see Figure 6).5 The two architectures: ›› Can exist separately but work best as complements. Multiple traditional EDWs, BDWs, and data lakes have become the new norm to support the variety of analytical workloads. While both BDWs and big data fabric architectures can exist independent of each other, typically firms leverage both to deliver a blend of real-time and batch across various distributed enterprise data sets to support broader use cases. For example, some financial services organizations use the BDW to support mostly financial data analytics — leveraging columnar data warehouses, Hadoop, and ETL technologies. The BDW also acts as a source within the big data fabric architecture that delivers real-time customer analytics across BDW, Twitter, Salesforce, and clickstream data.
  • 28. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 9 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics ›› Vary significantly in the amount of data transformation required. We often see big data fabric used for real-time analytical use cases that integrate data across many disparate sources, including BDW, with the BDW used mostly for batch and near-real-time analytics for data stored in a data warehouse and Hadoop clusters that require aggregation, transformation, and further processing before becoming available to BI users or analytical processes. Exploration occurs within the fabric, with transformations captured within the BDW. FIGURE 6 Big Data Fabric Architecture Integrated With Big Data Warehouse Big data fabric Data ingestion (streaming/replication/batch) Processing and persistence On-premises sources Cloud sources Hadoop Spark BDW Hadoop EDW Spark New York Singapore The BDW Provides A Comprehensive View And Integrated Analytics A key component of the BDW architecture is the ability to leverage various specialized data repositories such as traditional relational data warehouses, columnar data warehouses, and Hadoop. Unlike traditional data warehouses, the BDW minimizes complexity and hides heterogeneity by embodying a trusted model, supports all kinds of data types including unstructured data, and adapts to changing business requirements more rapidly through a self-service platform. The BDW centralizes administration of distributed data repositories, in-memory compute resources, metadata, storage, access, and processing functions. It leverages new technologies such as: ›› Hadoop to support diverse data sets and distributed computing. By leveraging Hadoop, the BDW enables organizations to deal with a wider variety of data structures than traditional EDWs. Hadoop can also deal with extremely large data sets that are inappropriate for traditional EDW
  • 29. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 10 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics platforms. Enterprise architects can choose to store data in relational, columnar, wide columns, or Hadoop based on business needs. For example, a retailer leverages legacy structured data stored in a traditional data warehouse, and Hadoop for clickstream data, and integrates them to deliver a 360-degree view of the customer for recommendations and churn analysis. ›› In-memory to enable faster customer analytical capabilities. A key component of the BDW is the ability to use in-memory to deliver performance and faster access to business data. We are heading toward having large memory platforms that will store petabytes in DRAM and Flash/SSD in the coming years. For example, several retailers are using BDW to leverage customer-related data to determine product discounting strategy, optimize product distribution across stores, and enable personalized customer experiences. ›› Streaming engines to support new data channels for ingestion and processing. Market data, clickstream, mobile devices, and sensors are new sources for analytical information that are not in your existing data warehouse. Streaming technology boosts integrating, transforming, and curating data on diverse data streams in real time.6 Integrating streaming technology with data platforms such as Hadoop and Spark — as well as traditional data warehouses — has become critical. For example, we see oil and gas industry firms leveraging streaming technology for insights into new business opportunities, such as predicting staffing and resource requirements for various drilling sites and performing machine failure analysis. The Major EDW Vendors Provide BDW Components From an implementation viewpoint, most enterprises are currently building BDW platforms themselves by integrating their traditional data warehouses with Apache Spark, Hadoop, Storm, and in-memory technologies. Forrester sees many enterprises already using an extract-Hadoop-load (EHL) approach to: 1. Extract data from various source systems such as traditional databases and flat files. 2. Load data into Hadoop to perform aggregation and transformation using Apache Hadoop ecosystem tools. 3. Finally load the result into the EDW platform.7 BDW Use Cases Go Beyond Traditional Analytics Adoption of BDW architectures will accelerate as enterprises run into existing EDW challenges. But building a BDW platform internally will require more time and effort, which will likely put pressure on the overall business technology (BT) agenda. The good news is that solutions are starting to emerge from vendors such as IBM, Microsoft, Oracle, SAP, Snowflake, and Teradata that provide some or all of the components to build and deploy a BDW strategy.8 Enterprises are already using BDWs to support social analytics, risk analysis, campaign analysis, fraud assessment, and pricing trends. The top BDW use cases include:
  • 30. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 11 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics ›› Integrated analytics. A key challenge in the traditional EDW approach was that if data didn’t exist in the warehouse, you couldn’t do any analytics — full stop. With BDW architecture, you can perform integrated analytics across data warehouse and Hadoop clusters. Hadoop can store and process large sets of semistructured and unstructured data, log files, and streaming data with ease. For example, health research often requires looking at complex patient data and determining how effective a treatment is likely to be based on factors like age, sex, and health status. The BDW enables gathering and storing millions of data points in Hadoop and performing complex navigation and modeling using traditional data warehouse and in-memory technology. ›› Internet-of-things (IoT) analytics. Traditional data warehouses don’t deal with IoT data. However, the BDW offers the ability to store, process, and access large volumes of IoT data from sensors and devices in Hadoop repositories efficiently through automation and machine learning technologies. Manufacturers deal with highly sophisticated machinery to support their plants, whether they’re building a car, airplane, or tire or bottling wine or soda. Every minute of machine downtime can cost a manufacturer dearly. IoT analytics on BDW platforms enables manufacturers to predict machine failures based on sensor data, minimizing or eliminating production slowdown. ›› Right-time business analytics. Traditional EDW architectures were based on mostly batch processing, with ETL doing the heavy lifting of data from traditional systems to operational systems to data warehouses. As a result, by the time data arrived in data warehouses, it was already 12 to 48 hours old. BDWs enable right-time analytics by leveraging streaming and replication with direct access to data sources, whether on-premises or cloud, bypassing traditional ETL approaches. The financial services industry has been an early adopter of BDW to support right-time analytics for portfolio management, fraud detection, and asset management. ›› Adaptive, self-service analytics. Most EDWs use predefined data sources to deliver predictive analytics, trends, and insights. The BDW enables organizations to dynamically leverage new data sources quickly to deliver new insights. It enables self-service capabilities for business users to ask complex and new questions so they can make more accurate decisions. The BDW adapts to the new sources and can help correlate data using machine learning and adaptive intelligence. For example, a major European bank recently built a BDW framework that business units now use to support self-service for making better decisions on investments and risks. The platform represents a major shift from the static reports the bank used previously.
  • 31. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 12 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics Recommendations Extend Your Current EDW Platforms Toward A BDW Strategy Don’t throw away your existing EDW platform! The investments you have already made in EDWs will form the foundation of the next-generation BDW strategy. However, attaining this demands that you rearchitect your existing EDW platform and invest in new technologies to deliver on a new vision of right-time analytics, self-service, and intelligent and contextualized customer analytics. Forrester recommends that enterprise architects extend existing EDW platforms toward a BDW strategy by leveraging: ›› Hadoop for low-cost storage and processing of big data. Let Hadoop be the first stop for your big data that has no other home in your data warehouse. Hadoop offers the ability to store very large volumes of data (including unstructured data) more efficiently than traditional warehouses — and at a fraction of cost. In addition, Hadoop helps you offload data from traditional warehouses and leverage a distributed computing framework to perform transformation, aggregation, and curation quickly. ›› In-memory technology to support right-time analytics. Without in-memory technology, customer analytics, personalization, and right-time analytics will run slowly. This could cause you to miss key trends like customer churn or miss the opportunity to offer new products and services or identify weak markets. You can also use data from the BDW as part of the bigger big data fabric framework that leverages distributed in-memory computing to deliver a broader enterprise information fabric. ›› Hybrid platforms to support on-demand and scalable BDWs. Storing all of your data on- premises need no longer be the default. Cloud platforms like those from Amazon Web Services, Google, IBM, Microsoft (Azure), Oracle, and Rackspace offer pay-as-you-go facilities to store, process, and access any amount of data.9 Hybrid is the new norm — look at utilizing both on- premises and cloud data warehouse platforms as part of your BDW architecture, with a common administration facility. ›› Vendor solutions that help achieve faster time-to-value. Data warehouse, Hadoop, and other big data solutions from vendors such as Cloudera, Hortonworks, IBM, MapR Technologies, Microsoft, Oracle, SAP, and Teradata can reduce time-to-value by automating and simplifying various BDW functions and implementation steps. Look at vendors that support broader solutions and can support your business data. Ask your vendor how it plans to provide the BDW vision. Review the various components that the vendor has integrated and ask how it plans to fill any gaps.
  • 32. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 13 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics Supplemental Material Forrester’s Global Business Technographics® Data And Analytics Survey, 2016 was fielded in March 2016. This online survey included 3,343 respondents in Australia, Brazil, Canada, China, France, Germany, India, New Zealand, the UK, and the US from companies with 100 or more employees. Forrester’s Business Technographics ensures that the final survey population contains only those with significant involvement in the planning, funding, and purchasing of business and technology products and services. Research Now fielded this survey on behalf of Forrester. Survey respondent incentives include points redeemable for gift certificates. Please note that the brand questions included in this survey should not be used to measure market share. The purpose of Forrester’s Business Technographics brand questions is to show usage of a brand by a specific target audience at one point in time. Engage With An Analyst Gain greater confidence in your decisions by working with Forrester thought leaders to apply our research to your specific business and technology initiatives. Forrester’s research apps for iPhone® and iPad® Stay ahead of your competition no matter where you are. Analyst Inquiry To help you put research into practice, connect with an analyst to discuss your questions in a 30-minute phone session — or opt for a response via email. Learn more. Analyst Advisory Translate research into action by working with an analyst on a specific engagement in the form of custom strategy sessions, workshops, or speeches. Learn more. Webinar Join our online sessions on the latest research affecting your business. Each call includes analyst Q&A and slides and is available on-demand. Learn more.
  • 33. For Enterprise Architecture Professionals The Next-Generation EDW Is The Big Data Warehouse August 29, 2016 © 2016 Forrester Research, Inc. Unauthorized copying or distributing is a violation of copyright law. Citations@forrester.com or +1 866-367-7378 14 Big Data Warehouses Drive Faster, Integrated, Self-Service Analytics Endnotes 1 Today, organizations still rely on EDW platforms to deliver actionable, timely, and trustworthy intelligence. EDW technology organizes and aggregates analytical data from various functional domains and serves as a critical repository for organizations’ operations. See the “The Forrester Wave™: Enterprise Data Warehouse, Q4 2015” Forrester report. 2 It takes a long time to measure a business process. Enterprise data hubs need to accommodate more data and an infinite set of queries. See the “Create A Road Map For A Real-Time, Agile, Self-Service Data Platform” Forrester report. 3 Data consumers — from casual data analysts to data scientists to your customers — are looking across a broad variety of data today to find answers to their questions. See the “Compose Digital Data To Create A Symphony Of Insight” Forrester report. 4 Data bottlenecks create business bottlenecks. The days of provisioning data to simply meet the requirements of systems of record are over. Business stakeholders at the executive and line-of-business levels need data faster to keep up with customers, competitors, and partners. See the “Create A Road Map For A Real-Time, Agile, Self-Service Data Platform” Forrester report. 5 Forrester defines big data fabric as “bringing together disparate big data sources automatically, intelligently, and securely, and processing them in a big data platform technology, such as Hadoop and Apache Spark, to deliver a unified, trusted, and comprehensive view of customer and business data.” See the “Big Data Fabric Drives Innovation And Growth” Forrester report. 6 Streaming technology helps integrating, transforming, and curating data on diverse data streams in real time. See the “The Forrester Wave™: Big Data Streaming Analytics, Q1 2016” Forrester report. 7 Forrester sees many enterprises already using an extract-Hadoop-load approach to extract data from various source systems, such as IoT devices and cloud and traditional platforms, then load it into Hadoop, perform aggregation and transformation, and finally load it into the EDW to support business analytics. See the “The Forrester Wave™: Enterprise Data Warehouse, Q4 2015” Forrester report. 8 Most big data integration vendors focus on making classic processes faster with tools for moving data into a lake and working with it there. Three innovative vendors — Looker Data Sciences, SnapLogic, and Snowflake Computing — offer alternative approaches. See the “Breakout Vendors: Big Data Integration” Forrester report. 9 According to Forrester customer feedback, such cloud-based storage is typically over 20% less expensive than on- premises deployment.
  • 34. We work with business and technology leaders to develop customer-obsessed strategies that drive growth. Products and Services ›› Core research and tools ›› Data and analytics ›› Peer collaboration ›› Analyst engagement ›› Consulting ›› Events Forrester Research (Nasdaq: FORR) is one of the most influential research and advisory firms in the world. We work with business and technology leaders to develop customer-obsessed strategies that drive growth. Through proprietary research, data, custom consulting, exclusive executive peer groups, and events, the Forrester experience is about a singular and powerful purpose: to challenge the thinking of our clients to help them lead change in their organizations. For more information, visit forrester.com. Client support For information on hard-copy or electronic reprints, please contact Client Support at +1 866-367-7378, +1 617-613-5730, or clientsupport@forrester.com. We offer quantity discounts and special pricing for academic and nonprofit institutions. Forrester’s research and insights are tailored to your role and critical business initiatives. Roles We Serve Marketing & Strategy Professionals CMO B2B Marketing B2C Marketing Customer Experience Customer Insights eBusiness & Channel Strategy Technology Management Professionals CIO Application Development & Delivery ›› Enterprise Architecture Infrastructure & Operations Security & Risk Sourcing & Vendor Management Technology Industry Professionals Analyst Relations 128005