SlideShare a Scribd company logo
1 of 27
Download to read offline
CW
IN
CAPGEMINI
WEEK OF
INNOVATION
NETWORKS
Fathoming Data for
Competitive Advantage
Gururaj Joshi, Bangalore, Sep 26th 2018
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 2
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 3
The Evolving 3rd Platform
At-Scale PersonalizationExponential Change
Autonomy
Data as a Service
Reference : https://www.idc.com
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 4
Data Deluge
By 2024 machine-to-machine connections will grow to 27 billion
By 2020, a quarter of a billion cars will be connected to the Internet
By 2020 Smartphone users alone are predicted to number over 6 billion
By 2020, devices that connect to the Internet are 50 billion.
By 2025; the amount of analyzed data that is “touched” by cognitive systems will grow by a factor 1.4ZB
By 2025, more than a quarter of data created will be real time in nature, and real-time IoT data will
make up more than 95% of this.
By 2025, an average connected person anywhere in the world will interact with connected
devices nearly 4,800 times per day
By 2025, nearly 20% of the data will be critical to our daily lives and nearly 10% of
that will be hypercritical.
Reference : https://www.idc.com
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 5
The Power of Data
Healthcare
investing billions in developing
new biometric sensors and
wearable technology that
tracks health and fitness.
Sports
Retail
retailers are constantly
finding cutting-edge ways
to draw insights from the
ever-increasing amount of
information available
about their customers
Government
and public
sector
services
Financial
services,
banking
and
insurance
Are adopting to the data first
approach to driving business
growth and enhancing its
services for customers.
Energy
Transportat
ion and
logistics
From the weather to the
condition of vehicles and
machinery, and data
analytics
enables businesses to drive
significant efficiencies
Agriculture
and farming
it’s possible to take more
than a million readings –
vastly increasing the
amount of data gathered
during exploration
several data-enabled
services that let farmers
benefit from crowdsourced,
real-time monitoring of
data collected
An increasing number of
cities are currently piloting
data analytics with the aim
of turning themselves into
‘smart cities’
Most elite sports have now
embraced data analytics &
its hard to think of any
area of sport that isn’t
embracing data
Businesses built
on data
A glance at the 10 most
valuable Fortune companies ,
Proves that their business
model are built on data, or are
heavily investing in data
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 6
every business should be a data business
data is becoming a key business asset, central to the success of every company, big or small.
Data is a valuable
asset and, as a
result, companies
are more hungry
for data than ever
before
As the world
becomes smarter
and smarter, data
becomes the key to
competitive
advantage.
every tiny piece of
data may very well
be valuable to some
extent or another.
every business
therefore needs a
robust data strategy,
Those without risk
being left behind.
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 7
The Path Of Wisdom
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018
Pathways to Big Data adoption: A strategic value-drivers based
approach
Strategic Value & Differentiation:
Data Monetization
Personalization & Intimacy:
Data Science
Scale, Variability and Flexibility:
Data Lakes 1
2
3
4
1
2
3
4
Speed and Responsiveness:
Data Streams
ValueDriversandBigDatalevers
The Data Explosion in the digital universe is transformational – and can unlock answers to
critical business needs and opportunities
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 9
Validity
TRUST OF
DATA
Volatility
RETENTION OF
DATA
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 10
What we seek about data …
S No Source Details
1 Volume – What is the volume of data coming for each source
1.1 Overall Size of dataset in One year (GBs)
1.2 Size of dataset in upload (Small, Medium, Large, Extra Large)
1.3 Avg number of Records of Dataset
1.4 Does dataset contains binary data (Capture limit of binary dataset)
2 Variety – Different types of data set
2.1 Types of Structured Data
2.1.1 RDBMS
2.1.2 XML
2.1.3 JSON
2.1.4 CSV
2.1.5 Capture the structure
2.2 Is the data in Semi-structured form?
2.3 Is the data in Completely unstructured?
2.4 Is the data in binary format?
2.5 Location of source data (On-premises, Private Cloud, Public Cloud, Hybrid)
2.6 What should be source provenance
2.7 On boarding timestamp
2.8 Source Identification( exact file name formats)
2.9 Source data header/ trailer format's available
2.10 Are there any delimiters at different places available
2.11 Does Business metadata captured
2.11.1 Business names
2.11.2 Descriptions
2.11.3 Tags
2.11.4 Quality
3 Velocity – Rate of data ingestion, transformation & visualization
3.1 Is the dataset expected to be real time
3.2 Is the dataset expected to be one time bulk ingested
3.3 Is the dataset expected to be ingested in incremental sizes
3.4 Is the dataset expected to be ingested in batch mode (Repeated small chunks of dataset)
3.5 Does data ingested by pull-based refreshes/ push-based refreshes
3.6 Frequency of Data Ingestion
3.7 What is expectancy of availability of data at Transient Data layer
3.8 What is expectancy of availability of data at Immutable Raw Data layer
3.9 What is expectancy of availability of data at Enriched Data layer
3.10 What is expectancy of availability of data at Trusted Data layer
3.11 What is expectancy of availability of data at Discovery Data layer
3.12 What is expectancy of availability of data at Visualization Layer
4 Veracity –
4.1 Are there any known anomalies in data set
4.2 Are there any known data cleaning activities required
4.3 Are there any known data formatting activities required
4.4
Are there any known data to be tokenized or masked to protect
personally identifiable information (PII)
4.5 Are there any known data to be tokenized or masked for sensitive data.
4.6
Data Classification (Open, Organizational, Internal, Restricted, Need
basis)
4.7 Data correctness and accurate for the intended use.
4.8 Are the data sets validated by owners for correctness
4.9 Are any specific Metadata to be captured
4.10
Does the data source make the data available easily or specific connector
needs to be built?
5 Volatility - Refers to shelf life of Data
5.1 How long is data valid
5.2 How long data needs to be stored
5.3 Is there point of data irrelevance available
5.4 Capture Data Audit (Lineage)
6 General
6.1 Sample data to be prepared / made available by the data provider .
6.2
Does Mapping Document (mappings, transformations and joins provided )
& is verified with source
6.3 Will there be any control file for each of the source files?
6.4
In a Scenario where the process is down for multiple days?
Is it required to load the backlogs of files or just the latest file?
6.5
The delimiters or special characters to be used should be confirmed in
appearance and their ASCII value provided.
6.6 What validations need to be Performed for source file names?
6.7
Do the file contains columns which indicates the start and end of field
values
6.8 Is it required to validate the header and trailer details?
6.9 Details of data upload failure intimation / Process followed
7 Security
7.1
Are there any known users groups identified (User classified in separate
groups)
7.2 Are there any known users identified as Owners (Full access to data)
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 11
Components of data architecture
Batch Ingestion /
Data Store
Batch Processing
Real Time Ingestion Stream Processing
Analytical Data Store Analytics & ReportingData Source
On-Premise
RDBMS
Static Files
Real-Time
Cloud Storages
Devices
Social
Events
Sensors
Shared
File, Queue, Blob Process Long-running /
Large Data Sets
Store / Buffer/ Reliable
delivery / real-time
messages
filtering, aggregating,
preparing the data
serve data for
analysis in
structured format /
low-latency NoSQL /
Interactive
distributed data
•Data Modeling Layer,
•Self-service BI
•Visualization tech
•Interactive data
exploration
Automated Workflows, Transform , Move between, Load processed, Push results to report or dashboardOrchestration
Securing access to dataSecurity Details on dataMeta Data Guidance for dataCatalog Right dataQuality
Data Store
Build/ train models /
NLP
Machine Learning
Data Services
(APIs)
Data Processing
Algorithms
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 12
Zones of Data
Data is organized into zones that serve specific functions. Each zone data is accessible and viewable
Why: Record Incoming Data Snapshot
Where : Transient Data (staging area)
What :
•Data first comes into the data lake.
•Basic quality checks
•Create new and different transformed data sets
•Create Data Catalog
•Create Meta Data (Automated )
•Provide access on need basis
How :
•Data can come by Manual, Stream, Batch
•Raw data is categorized by Source
Why: Source snapshot for Analytics &
Discovery
Where : Raw Data (Immutable Raw data
area in original form)
What :
•Data is immutable
•Data can be tokenized or masked to protect
personally identifiable information (PII) or other
sensitive data.
•Data is in original format
•Restricted / Very Limited Access is provided
How :
•Data can come from source & in format for
analytics
•Raw data is categorized by Source
Why: Provide insights
Where : Enriched Data (Refined Data )
What :
•Create refined data sets from raw data / Trusted
data
•Perform data enrichment
•Perform data quality checks
•Define new structures for common data models
• Democratizing access
How :
Data is added after transformation and processing
Enriched data is categorized by Destination
Why: Single Source of Truth
Where : Trusted Data
What :
•Create Trusted data sets from raw data/ Enriched Data
•Create Master Data / Reference Data
•Democratizing access
How :
Data is added after transformation & processing
Trusted data is categorized by Destination
Ingestion Storage
Trusted Data
Enriched Data
Data
Discovery
Transformed Data
Reference Data
Master Data
Raw Data
Immutable
Manual
Stream
Batch
Transient Data
Data ingestion
snapshot
Transient Data
Why: Value Discovery
Where : Data Discovery Zone (Sandbox Area)
What :
•Create Trusted data sets from raw data/ Enriched Data
•Play with Data
•More Analytical Models (by Data Scientist & Machine
learning)
• Limited Access
How :
•Data is in multiple formats
•Slice N Dice
•Data is categorized on need basis
Mask /Tokenize to
protect sensitive data,
Data ingestion
snapshot
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 13
Folder Structure
Raw data is categorized by Source
Enriched and Trusted data is categorized by Destination
Raw_Data_Zone
Org_Internal
Business Unit
India
Consumer
Legacy
2017
Jan
01 (Day)
02
Feb
2018 Jan
01 (Day)
02
Order
Products
Unclassified
USA
Business_unit
Org_External
Org_Collabrati
on
Open_Collecti
on
DataZone Source Classification Platform
Geography
Entity Year Month DayBusiness Unit
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018
Metadata structure
Why Metadata:
•Choose data from the right place and time.
•Different data sets require different types of preparation
•Identify any data causing error in any stage from ingestion through to the processing
•Restricted access to sensitive data.
•Use by downstream programs
•Create an environment of continuous quality improvement.
Technical:
Technical metadata captures about the nature of data, which is easier to find and understand attributes.
Type of Data (Text, JSON, Avro, Parquet, XML, etc)
Schema Structure (Fields and their types)
Size
Operational:
Operational metadata captured from processes about the data genealogy, which is easier to find and understand course of flow.
Data Source
Filename
Time of creation
Time of acquisition
File size
MD5 Hash (Redundancy checks make sure the transmission was not corrupted)
Watermark / special identifiers
Lineage
Quality
Profile
Provenance
# of Rejected Records
Job Status (Success/ Failure/ Partial)
Number of bad fields or bad records
Job Process Time
Business:
Business metadata captures what the data means to the end user to make data fields easier to find and understand
Business names
Descriptions
Tags
Quality
Masking rules
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018
Data Catalog Data catalog serves as system of registration and system of discovery for enterprise data assets
Annotate
Data
Assets
Remove
Data
Assets
Discover
Data
Assets
Register
Data
Assets
Connect
Data
Assets
Manage
Data
Assets
Provision
Data
Catalog
One data catalog per
organization Register key structural
metadata such as names,
types, experts, & locations
from the data source
Discover data assets
with easy search & filter
for the indexed metadata
(Any property in catalog,
annotations etc)
Remove data assets
from the data catalog.
Provide annotations ( information
as in descriptions , tags,
taxonomy , documentation ) for
the data assets.
Open data assets in
integrated client tools
& non-integrated tool
Control the visibility
to specific users or to
members of specific
groups.
Data Sources
Enable Data
Democratization
Connect People to
data and empower
them
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 16
Data Quality Dimensions
Data Quality
Completeness
Uniqueness
Timeliness
Validity
Accuracy
Consistency
17Fathoming Data for Competitive advantage | Gururaj Joshi | Sep 26,1018 © 2018 Capgemini. All rights reserved.
Private
Security as a Critical Foundation
▪ Information requiring
the highest security,
such as financial
transactions,
personnel files,
medical records, and
military intelligence
▪ Information that the
originator wants to
protect, such as trade
secrets, customer
lists, and confidential
memos
▪ . Account information
that, if breached,
could lead to or aid in
identity theft
▪ Information such as
emails that might be
discoverable in
litigation or subject to
a retention rule
▪ Information such as
an email address on a
YouTube upload
Compliance-
driven
Confidential CustodialLockdown
The percentage of data requiring security will be near 90% by 2025, and this data falls into five categories
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 18
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 19
INFRASTRUCTURE
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 20
Open Source Tools….
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 21
Advanced Analytics
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 22
Cognitive Tools…..
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 23
Business models for data economy
Store & Host
Filter/Refine
Enhance / Enrich
Simplify Access
Consult /Advise
Collect & Supply
Gather and sell raw data
Hold onto someone else’s data for them
Strip out problematic records or data fields or release interesting data subsets
Blend in other datasets to create a new and interesting picture
Help people cherry-pick the data they want in the
format they prefer
Provide guidance on others’ data efforts
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 24
A Checklist for Success
• Business-Benefit Priority List
• Architectural Oversight
• Security Strategy
• Compute , I/O and Memory Model
• Workforce Skillset Evaluation
• Operations Plan
• Disaster Recovery Plan
• Communications Plan
• Monetization Plan
finding ways for data to make our lives better that we didn’t imagine even a few years ago
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 25
Questions & Answers
CW
IN
CAPGEMINI
WEEK OF
INNOVATION
NETWORKS
Thank You!
Phone: +91 9590019491
Gururaj.Joshi@capgemini.com
Gururaj Joshi
Enterprise Architect
@mgururaji
Speaker 1
Photo
© 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018
This message contains information that may be privileged or confidential and is
the property of the Capgemini Group.
Copyright © 2018 Capgemini. All rights reserved.
A global leader in consulting, technology services and digital transformation, Capgemini is
at the forefront of innovation to address the entire breadth of clients’ opportunities in the
evolving world of cloud, digital and platforms. Building on its strong 50-year heritage and
deep industry-specific expertise, Capgemini enables organizations to realize their business
ambitions through an array of services from strategy to operations. Capgemini is driven
by the conviction that the business value of technology comes from and through people.
It is a multicultural company of 200,000 team members in over 40 countries. The Group
reported 2017 global revenues of EUR 12.8 billion.
About Capgemini
Learn more about us at
www.capgemini.com

More Related Content

What's hot

Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsConnected Data World
 
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...DataWorks Summit
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Cambridge Semantics
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingCambridge Semantics
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKUlf Mattsson
 
Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]ercan5
 
Syngenta's Predictive Analytics Platform for Seeds R&D
Syngenta's Predictive Analytics Platform for Seeds R&DSyngenta's Predictive Analytics Platform for Seeds R&D
Syngenta's Predictive Analytics Platform for Seeds R&DMichael Swanson
 
Introduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIMC Institute
 
Ai presentatie
Ai presentatieAi presentatie
Ai presentatieLunaDuFour
 
What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a projectRichardPierce28
 
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyCambridge Semantics
 
Maximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformMaximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformNeo4j
 
Hds ucp sap hana infographic v6[1]
Hds ucp  sap hana infographic v6[1]Hds ucp  sap hana infographic v6[1]
Hds ucp sap hana infographic v6[1]Barbara Götz
 
Business intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lakeBusiness intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lakeData Science Thailand
 
Accelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesCambridge Semantics
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricCambridge Semantics
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowaleCapgemini
 
DataStreams : Corporate Overview
DataStreams : Corporate OverviewDataStreams : Corporate Overview
DataStreams : Corporate OverviewDataStreams
 

What's hot (20)

Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analytics
 
Shortest path routing
 Shortest path routing Shortest path routing
Shortest path routing
 
Semantic AI
Semantic AISemantic AI
Semantic AI
 
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
Streamline Data Governance with Egeria: The Industry's First Open Metadata St...
 
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
Anzo Smart Data Lake 4.0 - a Data Lake Platform for the Enterprise Informatio...
 
Modern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail BankingModern Data Discovery and Integration in Retail Banking
Modern Data Discovery and Integration in Retail Banking
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
 
Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]Tiger graph 2021 corporate overview [read only]
Tiger graph 2021 corporate overview [read only]
 
Syngenta's Predictive Analytics Platform for Seeds R&D
Syngenta's Predictive Analytics Platform for Seeds R&DSyngenta's Predictive Analytics Platform for Seeds R&D
Syngenta's Predictive Analytics Platform for Seeds R&D
 
Introduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data Science
 
Ai presentatie
Ai presentatieAi presentatie
Ai presentatie
 
What are the 6 elements of a project
What are the 6 elements of a projectWhat are the 6 elements of a project
What are the 6 elements of a project
 
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital StrategyFrom Data Lakes to the Data Fabric: Our Vision for Digital Strategy
From Data Lakes to the Data Fabric: Our Vision for Digital Strategy
 
Maximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data PlatformMaximize the Value of Your Data: Neo4j Graph Data Platform
Maximize the Value of Your Data: Neo4j Graph Data Platform
 
Hds ucp sap hana infographic v6[1]
Hds ucp  sap hana infographic v6[1]Hds ucp  sap hana infographic v6[1]
Hds ucp sap hana infographic v6[1]
 
Business intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lakeBusiness intelligence 3.0 and the data lake
Business intelligence 3.0 and the data lake
 
Accelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success StoriesAccelerating Insight - Smart Data Lake Customer Success Stories
Accelerating Insight - Smart Data Lake Customer Success Stories
 
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data FabricUsing Cloud Automation Technologies to Deliver an Enterprise Data Fabric
Using Cloud Automation Technologies to Deliver an Enterprise Data Fabric
 
CWIN17 India / Bigdata architecture yashowardhan sowale
CWIN17 India / Bigdata architecture  yashowardhan sowaleCWIN17 India / Bigdata architecture  yashowardhan sowale
CWIN17 India / Bigdata architecture yashowardhan sowale
 
DataStreams : Corporate Overview
DataStreams : Corporate OverviewDataStreams : Corporate Overview
DataStreams : Corporate Overview
 

Similar to Fathoming Data for Competitive Advantage

Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo
 
Saama Presents Is your Big Data Solution Ready for Streaming
Saama Presents Is your Big Data Solution Ready for StreamingSaama Presents Is your Big Data Solution Ready for Streaming
Saama Presents Is your Big Data Solution Ready for StreamingSaama
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsDenodo
 
Delivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data FabricDelivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data FabricDenodo
 
Frontiers in Alternative Data : Techniques and Use Cases
Frontiers in Alternative Data : Techniques and Use CasesFrontiers in Alternative Data : Techniques and Use Cases
Frontiers in Alternative Data : Techniques and Use CasesQuantUniversity
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIDenodo
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesAshraf Uddin
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationDatabricks
 
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughtonReal-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughtonSynerzip
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewDenodo
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Matt Stubbs
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Jeffrey T. Pollock
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Capgemini
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationDenodo
 
How Big Data Shaping The Supply Chain
How Big Data Shaping The Supply ChainHow Big Data Shaping The Supply Chain
How Big Data Shaping The Supply ChainHafizullah Mohd Amin
 
[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...
[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...
[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...DataScienceConferenc1
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorDataWorks Summit
 

Similar to Fathoming Data for Competitive Advantage (20)

Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
Saama Presents Is your Big Data Solution Ready for Streaming
Saama Presents Is your Big Data Solution Ready for StreamingSaama Presents Is your Big Data Solution Ready for Streaming
Saama Presents Is your Big Data Solution Ready for Streaming
 
Logical Data Fabric: Architectural Components
Logical Data Fabric: Architectural ComponentsLogical Data Fabric: Architectural Components
Logical Data Fabric: Architectural Components
 
Delivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data FabricDelivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data Fabric
 
Frontiers in Alternative Data : Techniques and Use Cases
Frontiers in Alternative Data : Techniques and Use CasesFrontiers in Alternative Data : Techniques and Use Cases
Frontiers in Alternative Data : Techniques and Use Cases
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Big Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture CapabilitiesBig Data: Its Characteristics And Architecture Capabilities
Big Data: Its Characteristics And Architecture Capabilities
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughtonReal-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
Real-Time With AI – The Convergence Of Big Data And AI by Colin MacNaughton
 
How a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 ViewHow a Logical Data Fabric Enhances the Customer 360 View
How a Logical Data Fabric Enhances the Customer 360 View
 
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
 
Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry Big Data Analytics in light of Financial Industry
Big Data Analytics in light of Financial Industry
 
Fast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow PresentationFast Data Strategy Houston Roadshow Presentation
Fast Data Strategy Houston Roadshow Presentation
 
How Big Data Shaping The Supply Chain
How Big Data Shaping The Supply ChainHow Big Data Shaping The Supply Chain
How Big Data Shaping The Supply Chain
 
[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...
[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...
[DSC Adria 23] Thomas Miebach A modern, business focused data strategy with C...
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services Sector
 

More from Capgemini

Top Healthcare Trends 2022
Top Healthcare Trends 2022Top Healthcare Trends 2022
Top Healthcare Trends 2022Capgemini
 
Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022Capgemini
 
Commercial Banking Trends book 2022
Commercial Banking Trends book 2022Commercial Banking Trends book 2022
Commercial Banking Trends book 2022Capgemini
 
Top Trends in Payments 2022
Top Trends in Payments 2022Top Trends in Payments 2022
Top Trends in Payments 2022Capgemini
 
Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022Capgemini
 
Retail Banking Trends book 2022
Retail Banking Trends book 2022Retail Banking Trends book 2022
Retail Banking Trends book 2022Capgemini
 
Top Life Insurance Trends 2022
Top Life Insurance Trends 2022Top Life Insurance Trends 2022
Top Life Insurance Trends 2022Capgemini
 
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーですキャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーですCapgemini
 
Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021Capgemini
 
Life Insurance Top Trends 2021
Life Insurance Top Trends 2021Life Insurance Top Trends 2021
Life Insurance Top Trends 2021Capgemini
 
Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021Capgemini
 
Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021Capgemini
 
Top Trends in Payments: 2021
Top Trends in Payments: 2021Top Trends in Payments: 2021
Top Trends in Payments: 2021Capgemini
 
Health Insurance Top Trends 2021
Health Insurance Top Trends 2021Health Insurance Top Trends 2021
Health Insurance Top Trends 2021Capgemini
 
Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021Capgemini
 
Capgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous PlanningCapgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous PlanningCapgemini
 
Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020Capgemini
 
Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020Capgemini
 
Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020Capgemini
 
Top Trends in Payments: 2020
Top Trends in Payments: 2020Top Trends in Payments: 2020
Top Trends in Payments: 2020Capgemini
 

More from Capgemini (20)

Top Healthcare Trends 2022
Top Healthcare Trends 2022Top Healthcare Trends 2022
Top Healthcare Trends 2022
 
Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022
 
Commercial Banking Trends book 2022
Commercial Banking Trends book 2022Commercial Banking Trends book 2022
Commercial Banking Trends book 2022
 
Top Trends in Payments 2022
Top Trends in Payments 2022Top Trends in Payments 2022
Top Trends in Payments 2022
 
Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022
 
Retail Banking Trends book 2022
Retail Banking Trends book 2022Retail Banking Trends book 2022
Retail Banking Trends book 2022
 
Top Life Insurance Trends 2022
Top Life Insurance Trends 2022Top Life Insurance Trends 2022
Top Life Insurance Trends 2022
 
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーですキャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
 
Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021
 
Life Insurance Top Trends 2021
Life Insurance Top Trends 2021Life Insurance Top Trends 2021
Life Insurance Top Trends 2021
 
Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021
 
Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021
 
Top Trends in Payments: 2021
Top Trends in Payments: 2021Top Trends in Payments: 2021
Top Trends in Payments: 2021
 
Health Insurance Top Trends 2021
Health Insurance Top Trends 2021Health Insurance Top Trends 2021
Health Insurance Top Trends 2021
 
Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021
 
Capgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous PlanningCapgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous Planning
 
Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020
 
Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020
 
Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020
 
Top Trends in Payments: 2020
Top Trends in Payments: 2020Top Trends in Payments: 2020
Top Trends in Payments: 2020
 

Recently uploaded

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Recently uploaded (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Fathoming Data for Competitive Advantage

  • 1. CW IN CAPGEMINI WEEK OF INNOVATION NETWORKS Fathoming Data for Competitive Advantage Gururaj Joshi, Bangalore, Sep 26th 2018
  • 2. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 2
  • 3. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 3 The Evolving 3rd Platform At-Scale PersonalizationExponential Change Autonomy Data as a Service Reference : https://www.idc.com
  • 4. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 4 Data Deluge By 2024 machine-to-machine connections will grow to 27 billion By 2020, a quarter of a billion cars will be connected to the Internet By 2020 Smartphone users alone are predicted to number over 6 billion By 2020, devices that connect to the Internet are 50 billion. By 2025; the amount of analyzed data that is “touched” by cognitive systems will grow by a factor 1.4ZB By 2025, more than a quarter of data created will be real time in nature, and real-time IoT data will make up more than 95% of this. By 2025, an average connected person anywhere in the world will interact with connected devices nearly 4,800 times per day By 2025, nearly 20% of the data will be critical to our daily lives and nearly 10% of that will be hypercritical. Reference : https://www.idc.com
  • 5. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 5 The Power of Data Healthcare investing billions in developing new biometric sensors and wearable technology that tracks health and fitness. Sports Retail retailers are constantly finding cutting-edge ways to draw insights from the ever-increasing amount of information available about their customers Government and public sector services Financial services, banking and insurance Are adopting to the data first approach to driving business growth and enhancing its services for customers. Energy Transportat ion and logistics From the weather to the condition of vehicles and machinery, and data analytics enables businesses to drive significant efficiencies Agriculture and farming it’s possible to take more than a million readings – vastly increasing the amount of data gathered during exploration several data-enabled services that let farmers benefit from crowdsourced, real-time monitoring of data collected An increasing number of cities are currently piloting data analytics with the aim of turning themselves into ‘smart cities’ Most elite sports have now embraced data analytics & its hard to think of any area of sport that isn’t embracing data Businesses built on data A glance at the 10 most valuable Fortune companies , Proves that their business model are built on data, or are heavily investing in data
  • 6. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 6 every business should be a data business data is becoming a key business asset, central to the success of every company, big or small. Data is a valuable asset and, as a result, companies are more hungry for data than ever before As the world becomes smarter and smarter, data becomes the key to competitive advantage. every tiny piece of data may very well be valuable to some extent or another. every business therefore needs a robust data strategy, Those without risk being left behind.
  • 7. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 7 The Path Of Wisdom
  • 8. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 Pathways to Big Data adoption: A strategic value-drivers based approach Strategic Value & Differentiation: Data Monetization Personalization & Intimacy: Data Science Scale, Variability and Flexibility: Data Lakes 1 2 3 4 1 2 3 4 Speed and Responsiveness: Data Streams ValueDriversandBigDatalevers The Data Explosion in the digital universe is transformational – and can unlock answers to critical business needs and opportunities
  • 9. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 9 Validity TRUST OF DATA Volatility RETENTION OF DATA
  • 10. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 10 What we seek about data … S No Source Details 1 Volume – What is the volume of data coming for each source 1.1 Overall Size of dataset in One year (GBs) 1.2 Size of dataset in upload (Small, Medium, Large, Extra Large) 1.3 Avg number of Records of Dataset 1.4 Does dataset contains binary data (Capture limit of binary dataset) 2 Variety – Different types of data set 2.1 Types of Structured Data 2.1.1 RDBMS 2.1.2 XML 2.1.3 JSON 2.1.4 CSV 2.1.5 Capture the structure 2.2 Is the data in Semi-structured form? 2.3 Is the data in Completely unstructured? 2.4 Is the data in binary format? 2.5 Location of source data (On-premises, Private Cloud, Public Cloud, Hybrid) 2.6 What should be source provenance 2.7 On boarding timestamp 2.8 Source Identification( exact file name formats) 2.9 Source data header/ trailer format's available 2.10 Are there any delimiters at different places available 2.11 Does Business metadata captured 2.11.1 Business names 2.11.2 Descriptions 2.11.3 Tags 2.11.4 Quality 3 Velocity – Rate of data ingestion, transformation & visualization 3.1 Is the dataset expected to be real time 3.2 Is the dataset expected to be one time bulk ingested 3.3 Is the dataset expected to be ingested in incremental sizes 3.4 Is the dataset expected to be ingested in batch mode (Repeated small chunks of dataset) 3.5 Does data ingested by pull-based refreshes/ push-based refreshes 3.6 Frequency of Data Ingestion 3.7 What is expectancy of availability of data at Transient Data layer 3.8 What is expectancy of availability of data at Immutable Raw Data layer 3.9 What is expectancy of availability of data at Enriched Data layer 3.10 What is expectancy of availability of data at Trusted Data layer 3.11 What is expectancy of availability of data at Discovery Data layer 3.12 What is expectancy of availability of data at Visualization Layer 4 Veracity – 4.1 Are there any known anomalies in data set 4.2 Are there any known data cleaning activities required 4.3 Are there any known data formatting activities required 4.4 Are there any known data to be tokenized or masked to protect personally identifiable information (PII) 4.5 Are there any known data to be tokenized or masked for sensitive data. 4.6 Data Classification (Open, Organizational, Internal, Restricted, Need basis) 4.7 Data correctness and accurate for the intended use. 4.8 Are the data sets validated by owners for correctness 4.9 Are any specific Metadata to be captured 4.10 Does the data source make the data available easily or specific connector needs to be built? 5 Volatility - Refers to shelf life of Data 5.1 How long is data valid 5.2 How long data needs to be stored 5.3 Is there point of data irrelevance available 5.4 Capture Data Audit (Lineage) 6 General 6.1 Sample data to be prepared / made available by the data provider . 6.2 Does Mapping Document (mappings, transformations and joins provided ) & is verified with source 6.3 Will there be any control file for each of the source files? 6.4 In a Scenario where the process is down for multiple days? Is it required to load the backlogs of files or just the latest file? 6.5 The delimiters or special characters to be used should be confirmed in appearance and their ASCII value provided. 6.6 What validations need to be Performed for source file names? 6.7 Do the file contains columns which indicates the start and end of field values 6.8 Is it required to validate the header and trailer details? 6.9 Details of data upload failure intimation / Process followed 7 Security 7.1 Are there any known users groups identified (User classified in separate groups) 7.2 Are there any known users identified as Owners (Full access to data)
  • 11. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 11 Components of data architecture Batch Ingestion / Data Store Batch Processing Real Time Ingestion Stream Processing Analytical Data Store Analytics & ReportingData Source On-Premise RDBMS Static Files Real-Time Cloud Storages Devices Social Events Sensors Shared File, Queue, Blob Process Long-running / Large Data Sets Store / Buffer/ Reliable delivery / real-time messages filtering, aggregating, preparing the data serve data for analysis in structured format / low-latency NoSQL / Interactive distributed data •Data Modeling Layer, •Self-service BI •Visualization tech •Interactive data exploration Automated Workflows, Transform , Move between, Load processed, Push results to report or dashboardOrchestration Securing access to dataSecurity Details on dataMeta Data Guidance for dataCatalog Right dataQuality Data Store Build/ train models / NLP Machine Learning Data Services (APIs) Data Processing Algorithms
  • 12. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 12 Zones of Data Data is organized into zones that serve specific functions. Each zone data is accessible and viewable Why: Record Incoming Data Snapshot Where : Transient Data (staging area) What : •Data first comes into the data lake. •Basic quality checks •Create new and different transformed data sets •Create Data Catalog •Create Meta Data (Automated ) •Provide access on need basis How : •Data can come by Manual, Stream, Batch •Raw data is categorized by Source Why: Source snapshot for Analytics & Discovery Where : Raw Data (Immutable Raw data area in original form) What : •Data is immutable •Data can be tokenized or masked to protect personally identifiable information (PII) or other sensitive data. •Data is in original format •Restricted / Very Limited Access is provided How : •Data can come from source & in format for analytics •Raw data is categorized by Source Why: Provide insights Where : Enriched Data (Refined Data ) What : •Create refined data sets from raw data / Trusted data •Perform data enrichment •Perform data quality checks •Define new structures for common data models • Democratizing access How : Data is added after transformation and processing Enriched data is categorized by Destination Why: Single Source of Truth Where : Trusted Data What : •Create Trusted data sets from raw data/ Enriched Data •Create Master Data / Reference Data •Democratizing access How : Data is added after transformation & processing Trusted data is categorized by Destination Ingestion Storage Trusted Data Enriched Data Data Discovery Transformed Data Reference Data Master Data Raw Data Immutable Manual Stream Batch Transient Data Data ingestion snapshot Transient Data Why: Value Discovery Where : Data Discovery Zone (Sandbox Area) What : •Create Trusted data sets from raw data/ Enriched Data •Play with Data •More Analytical Models (by Data Scientist & Machine learning) • Limited Access How : •Data is in multiple formats •Slice N Dice •Data is categorized on need basis Mask /Tokenize to protect sensitive data, Data ingestion snapshot
  • 13. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 13 Folder Structure Raw data is categorized by Source Enriched and Trusted data is categorized by Destination Raw_Data_Zone Org_Internal Business Unit India Consumer Legacy 2017 Jan 01 (Day) 02 Feb 2018 Jan 01 (Day) 02 Order Products Unclassified USA Business_unit Org_External Org_Collabrati on Open_Collecti on DataZone Source Classification Platform Geography Entity Year Month DayBusiness Unit
  • 14. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 Metadata structure Why Metadata: •Choose data from the right place and time. •Different data sets require different types of preparation •Identify any data causing error in any stage from ingestion through to the processing •Restricted access to sensitive data. •Use by downstream programs •Create an environment of continuous quality improvement. Technical: Technical metadata captures about the nature of data, which is easier to find and understand attributes. Type of Data (Text, JSON, Avro, Parquet, XML, etc) Schema Structure (Fields and their types) Size Operational: Operational metadata captured from processes about the data genealogy, which is easier to find and understand course of flow. Data Source Filename Time of creation Time of acquisition File size MD5 Hash (Redundancy checks make sure the transmission was not corrupted) Watermark / special identifiers Lineage Quality Profile Provenance # of Rejected Records Job Status (Success/ Failure/ Partial) Number of bad fields or bad records Job Process Time Business: Business metadata captures what the data means to the end user to make data fields easier to find and understand Business names Descriptions Tags Quality Masking rules
  • 15. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 Data Catalog Data catalog serves as system of registration and system of discovery for enterprise data assets Annotate Data Assets Remove Data Assets Discover Data Assets Register Data Assets Connect Data Assets Manage Data Assets Provision Data Catalog One data catalog per organization Register key structural metadata such as names, types, experts, & locations from the data source Discover data assets with easy search & filter for the indexed metadata (Any property in catalog, annotations etc) Remove data assets from the data catalog. Provide annotations ( information as in descriptions , tags, taxonomy , documentation ) for the data assets. Open data assets in integrated client tools & non-integrated tool Control the visibility to specific users or to members of specific groups. Data Sources Enable Data Democratization Connect People to data and empower them
  • 16. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 16 Data Quality Dimensions Data Quality Completeness Uniqueness Timeliness Validity Accuracy Consistency
  • 17. 17Fathoming Data for Competitive advantage | Gururaj Joshi | Sep 26,1018 © 2018 Capgemini. All rights reserved. Private Security as a Critical Foundation ▪ Information requiring the highest security, such as financial transactions, personnel files, medical records, and military intelligence ▪ Information that the originator wants to protect, such as trade secrets, customer lists, and confidential memos ▪ . Account information that, if breached, could lead to or aid in identity theft ▪ Information such as emails that might be discoverable in litigation or subject to a retention rule ▪ Information such as an email address on a YouTube upload Compliance- driven Confidential CustodialLockdown The percentage of data requiring security will be near 90% by 2025, and this data falls into five categories
  • 18. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 18
  • 19. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 19 INFRASTRUCTURE
  • 20. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 20 Open Source Tools….
  • 21. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 21 Advanced Analytics
  • 22. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 22 Cognitive Tools…..
  • 23. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 23 Business models for data economy Store & Host Filter/Refine Enhance / Enrich Simplify Access Consult /Advise Collect & Supply Gather and sell raw data Hold onto someone else’s data for them Strip out problematic records or data fields or release interesting data subsets Blend in other datasets to create a new and interesting picture Help people cherry-pick the data they want in the format they prefer Provide guidance on others’ data efforts
  • 24. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 24 A Checklist for Success • Business-Benefit Priority List • Architectural Oversight • Security Strategy • Compute , I/O and Memory Model • Workforce Skillset Evaluation • Operations Plan • Disaster Recovery Plan • Communications Plan • Monetization Plan finding ways for data to make our lives better that we didn’t imagine even a few years ago
  • 25. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 25 Questions & Answers
  • 26. CW IN CAPGEMINI WEEK OF INNOVATION NETWORKS Thank You! Phone: +91 9590019491 Gururaj.Joshi@capgemini.com Gururaj Joshi Enterprise Architect @mgururaji Speaker 1 Photo
  • 27. © 2018 Capgemini. All rights reserved.Fathoming Data for Competitive Advantage | Gururaj Joshi | Sep 26,2018 This message contains information that may be privileged or confidential and is the property of the Capgemini Group. Copyright © 2018 Capgemini. All rights reserved. A global leader in consulting, technology services and digital transformation, Capgemini is at the forefront of innovation to address the entire breadth of clients’ opportunities in the evolving world of cloud, digital and platforms. Building on its strong 50-year heritage and deep industry-specific expertise, Capgemini enables organizations to realize their business ambitions through an array of services from strategy to operations. Capgemini is driven by the conviction that the business value of technology comes from and through people. It is a multicultural company of 200,000 team members in over 40 countries. The Group reported 2017 global revenues of EUR 12.8 billion. About Capgemini Learn more about us at www.capgemini.com