SlideShare a Scribd company logo
INTRODUCTION
The volume, variety, velocity and veracity of big data are getting increasingly complex
each passing day. The way the data is stored, processed, managed and shared with
decision-makers is getting impacted by this complexity and to tackle the same, a
revolutionary approach to data management has come into picture. A data lake.
©Denave
©Denave
WHAT IS A
DATA LAKE?
©Denave
As the name suggests, data lake is a large reservoir of data – structured or
unstructured, fed through disparate channels. The data is fed through channels in an
ad-hoc manner into these data lakes, however, owing to the predefined set of rules or
schema, correlation between the database is established automatically to help with
the extraction of meaningful information.
It provides high level of flexibility in terms of interaction with and leverage of the data.
In general, data lakes are used to store data when you’ve a constant stream of
unstructured data coming into, such as, web interactions, product logs, IoT sensors,
app usage etc.
Simply put, along with the on-premises data, it is the real-time data which fills up the
data lake, upon which are then used the principles of machine learning and analytics
to make it relational.
The global data lakes market is expected to grow at a CAGR of
approximately 28% during the forecast period 2017-2023, to
touch $12.01 billion by 2024 & $+14.01 billion by 2026. 1,2&3
©Denave
DATA LAKES &
SALES ECOSYSTEM
UNDERSTANDING
THE CORRELATION
Sales is no more just about the product alone. Rather, it is more about getting
connected to the customer – at a deeper level. In order to do so, organisations are
becoming data-driven in every sense and they rely heavily on agile technologies to
help them with the analysis and management of data.
With a large wealth of customer data at their disposal, it is that data analysis backed
sales action which acts as a game-changer for organisations. To get the differentiating
edge, firms need to use the customer data in the best possible manner to make
hyper-impactful outreach and better sales interactions.
Following elucidates some of the challenges of sales ecosystem which are solved
by Data lakes:
Need for the Data in Native Format
It may be difficult to believe but since the amount of data being generated by
enterprises is extremely huge, a major portion of that data is discarded. Remaining
small portion is then stored in the data warehouse – for a few years. This happens
owing to the storage capacity, structure restriction, associated costs etc. and most of
the time it happens because enterprises don’t know what to be done with that data,
esp. machine-generated or historical data. Hence, it is dumped away, thus putting a
limitation on the extent of analytics application that can happen.
With data lakes, enterprises save the data without fretting about the structure,
intended use etc. Simply put, you may not know why you are saving the data, but
you’d still do so with the thought that you may need that data someday – thus,
getting as much data as possible in its native format.
©Denave
With data lakes, enterprises save the data without fretting about the structure,
intended use etc. Simply put, you may not know why you are saving the data, but
you’d still do so with the thought that you may need that data someday – thus,
getting as much data as possible in its native format.
Quashing Data Silos
With data, the case is such that the number of teams or departments you have, that
much variety of data would be there. That database is not centralised and instead
remain in silos because it turns out to be expensive as well as time consuming to
share that data with one another. Consider the case where a department may need
the data from another department – their requirement would be a specific segment of
data in a particular format, Therefore, the department which owns the data will be
conducting all ETL exercises to be able to extract and package the data in line with
the requirement. Extra work equals to extra time and also extra expenses and hence
the delivery team would want to shy away from getting any data request in the first
place or to delay and excuse any request which has already been made, resulting in
data silos.
It is like having an immense wealth with you but not being able to use it owing to the
labours it is going to take to spend it.
This issue is tackled head on with data lakes because there the data ingestion is
almost frictionless since it accepts data without any processing, thus allows for a
deeper data leverage to all with a centralised and transparent access process. Data
ownership doesn’t remain a barrier any more with data lakes.
©Denave
©Denave
ADVANTAGES OF
DATA LAKES
The acceptance of need for data lakes itself makes visible a lot of benefits which
salespeople can accrue, let’s dig deeper and see what all as an organisation are you
set to gain if your database strategies include leveraging data lakes:
Say good-bye to silos and fragmentation
With data lakes, you get a unified view of everything which comprises the customer
experience – all the data from all the platforms, departments, teams, delivery
channels.
Better preparedness for the customer journey
Since you’ve better knowledge about the customer buying cycle, the high or low
points of his journey, quite naturally the decision-making process is much more
impactful than before.
High yielding campaigns
You’re better equipped than before for generating and assessing campaigns. The
agility and autonomy are rendered by the tools which not only helps you to measure
but also to optimise your marketing investments in real time.
Power to predict
Sitting on the historical data which you’d have otherwise dumped or would have
never been able to analyse, provides you with the power to analyse and establish
predictive models for customer behaviour.
©Denave
All touch point control
You’re also able to trace the journey and behaviour specific to each customer
touchpoint which allows you to rank the point of conversions accurately. Accordingly,
you can adjust your focus on the touchpoints.
Fertile environment for new tools or methodologies
development
Since you own the storage of immense amount of data, you can generate your own
technological environment and a service-oriented architecture which can make the
development of new tools possible.
Intelligent and faster decisions
With dynamic dashboards which work upon varied kinds of data accumulated from
myriad number of sources – external or internal, you get an unmatched business
intelligence right at your tips and thus, decision-making in real-time becomes
possible.
If we put together all the benefits in one basket, the highlight of data lake adoption
would be the provision of an agile 360-degree view data-driven marketing
operations and a faster (and intelligent) response system to any business need.
These will automatically translate into improved customer interactions, enhanced
R&D innovation possibilities and augmented operational efficiencies.
©Denave
©Denave
GETTING THE
RIGHT FIT
Getting convinced of the immense advantages of data lakes is one thing but
understanding where it fits in your business needs and vision is another thing and
probably the more important one.
A quick assessment of the latter will help in getting the optimal resolution for
your business:
Understand your data type
What is the type of data which forms the majority of your data input/ accumulation? If
it is structured data in generality – traditional, tabular format – data being generated
by HR systems, traditional CRMs etc. then data warehouse is a good option.
Data lakes are beneficial in case if the data you have is unstructured or semi-
structured and it’s getting generated constantly (like a person crosses an in-store ad
and the data is captured by the intelligent display unit to get records of user views
and behaviour). In such a constant stream of data, hundreds of terabytes are
generated and occupy the storage in a matter of time.
In short, your structured data may be more suited for RDBMS while the sensor or
SAAS based data would be more suited for data lakes. If you would use a data lake for
the prior stated data variety then the costs would be exorbitant and unrequired.
©Denave
Understand your data expectation
If the post storage needs from the data are fixed and you need it for specific use
cases, then you’d structure the data in advance for the requite processing. For such
kind of expectation from the data, a data warehouse is best suited, however,
structuring the data has its own costs attached and it certainly limits your ability to
repurpose the data for any other use later on.
With data lakes, comes in the flexibility. You can store now and decide later what you
really want to extract out of that data.
For a set of predefined reports, you may not bother having a data lake but to have
the freedom to analyse and accordingly scale up or innovate, data lakes would be the
best bet.
Understand your resources
While DB technicians or IT team would be able to get a database system in place and
then business analysts can leverage self-service to extract reports out of it, data lakes
would require a different level of expertise. Skilled big data engineers and data
scientists would have to be invested in along with sophisticated tools to form a data
puddle and then a data pond and finally a data lake.
Analyse your data acquisition process
If the data acquisition is happening from multiple sources and is a complex process in
itself -it would mean that you’d be spending a large overhead on ETL techniques in
order to render that data suitable for data warehouse. If you’re constantly spending a
fortune because of a complex data acquisition and thus a cost intensive process, then
it might be time to switch to data lake.
Once you take the plunge into data lakes based on your requirements, remember the
governance part as well and involve the legal & privacy team for required due
diligence.
©Denave
©Denave
USING IT
RIGHT
Getting the right platform and then getting the data loaded onto it are the first steps
but the real job begins post that.
The interface plays a critical role in adoption and usage of data lake. The scope of
data lake is huge enough that the normal IT function, which is adequate to manage a
data warehouse, can’t be expected to cater to such a wide variety of data as well as a
huge community of users.
Therefore, self service becomes an essential feature for the data lake interface.
It carries these two integral aspects:
• To ensure the access grant in accordance with the level of expertise
• To ensure that people are able to find the right data that they are looking for
For example – a business analysts would need a cooked form of data and a
completely raw data set would not make sense to him/ her, though getting the same
unprocessed data would be necessary for a data scientist. Hence, the interface will
have to have bifurcated zones for different data requirements of divergent set of
audience, such as, partially processed data, raw data etc.
©Denave
CONCLUSION
The whole process of getting the infrastructure in place for the data lake, organising
the data lake (creating zones for various user communities as per their expertise
level), putting up the catalogue of data assets to enable self-service and eventually,
opening it for the users – this complete journey is like a success roadmap for data
lakes.
Such kind of centralisation of data allows for multiple forms of analysis such as
analysing the conversion funnel for improvement areas, creation of a precise
recommendation engine etc. – everything facilitating engaging customer experiences
and explosive business growth.
©Denave
Sources:
https://www.marketwatch.com/press-release/data-lakes-market-
2018-global-size-share-statistics-opportunities-growth-trends-
industry-analysis-and-regional-forecast-to-2023-2018-08-21
https://www.marketwatch.com/press-release/data-lakes-market-to-
touch-an-aggregate-of-1201-billion-by-2024-growing-at-a-cagr-of-
278-2019-04-18
https://medium.com/datadriveninvestor/data-lakes-market-2018-
2025-top-key-players-like-microsoft-informatica-teradata-capgemini-
b6f6a86e2fc8
©Denave
For more sales insights, visit
www.denave.com/resources
©Denave

More Related Content

What's hot

Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data Lake
Caserta
 
Become Data Driven With Hadoop as-a-Service
Become Data Driven With Hadoop as-a-ServiceBecome Data Driven With Hadoop as-a-Service
Become Data Driven With Hadoop as-a-Service
Mammoth Data
 
Better Architecture for Data: Adaptable, Scalable, and Smart
Better Architecture for Data: Adaptable, Scalable, and SmartBetter Architecture for Data: Adaptable, Scalable, and Smart
Better Architecture for Data: Adaptable, Scalable, and Smart
Paul Boal
 
data warehouse vs data lake
data warehouse vs data lakedata warehouse vs data lake
data warehouse vs data lake
Polestarsolutions
 
Big dataplatform operationalstrategy
Big dataplatform operationalstrategyBig dataplatform operationalstrategy
Big dataplatform operationalstrategy
Himanshu Bari
 
How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management
Abhishek Sood
 
Veritas corporate brochure emea
Veritas corporate brochure emeaVeritas corporate brochure emea
Veritas corporate brochure emea
Hayatollah Ayoubi
 
Augmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoptionAugmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoption
Polestarsolutions
 
Big datarevealed hadoop catalog
Big datarevealed hadoop catalogBig datarevealed hadoop catalog
Big datarevealed hadoop catalog
Steven Meister
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
Big Data User Group Karlsruhe/Stuttgart
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security Alliance
Information Security Awareness Group
 
Oracle sql plsql & dw
Oracle sql plsql & dwOracle sql plsql & dw
Oracle sql plsql & dw
Sateesh Kumar Sarvasiddi
 
Stream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White PaperStream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White Paper
Impetus Technologies
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
Caserta
 
Smarter Management for Your Data Growth
Smarter Management for Your Data GrowthSmarter Management for Your Data Growth
Smarter Management for Your Data Growth
RainStor
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
Sourabh Saxena
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
Snehali Chake
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data Warehousing
Alex Meadows
 
Big Data
Big DataBig Data
Big Data
Faisal Ahmed
 

What's hot (20)

Setting Up the Data Lake
Setting Up the Data LakeSetting Up the Data Lake
Setting Up the Data Lake
 
Become Data Driven With Hadoop as-a-Service
Become Data Driven With Hadoop as-a-ServiceBecome Data Driven With Hadoop as-a-Service
Become Data Driven With Hadoop as-a-Service
 
Better Architecture for Data: Adaptable, Scalable, and Smart
Better Architecture for Data: Adaptable, Scalable, and SmartBetter Architecture for Data: Adaptable, Scalable, and Smart
Better Architecture for Data: Adaptable, Scalable, and Smart
 
data warehouse vs data lake
data warehouse vs data lakedata warehouse vs data lake
data warehouse vs data lake
 
Big dataplatform operationalstrategy
Big dataplatform operationalstrategyBig dataplatform operationalstrategy
Big dataplatform operationalstrategy
 
How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management How 3 trends are shaping analytics and data management
How 3 trends are shaping analytics and data management
 
Veritas corporate brochure emea
Veritas corporate brochure emeaVeritas corporate brochure emea
Veritas corporate brochure emea
 
Augmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoptionAugmented analytics will push the analytics adoption
Augmented analytics will push the analytics adoption
 
Big datarevealed hadoop catalog
Big datarevealed hadoop catalogBig datarevealed hadoop catalog
Big datarevealed hadoop catalog
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Big data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security AllianceBig data analysis concepts and references by Cloud Security Alliance
Big data analysis concepts and references by Cloud Security Alliance
 
Oracle sql plsql & dw
Oracle sql plsql & dwOracle sql plsql & dw
Oracle sql plsql & dw
 
Stream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White PaperStream Meets Batch for Smarter Analytics- Impetus White Paper
Stream Meets Batch for Smarter Analytics- Impetus White Paper
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Smarter Management for Your Data Growth
Smarter Management for Your Data GrowthSmarter Management for Your Data Growth
Smarter Management for Your Data Growth
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
 
Introduction To Data Warehousing
Introduction To Data WarehousingIntroduction To Data Warehousing
Introduction To Data Warehousing
 
Big Data
Big DataBig Data
Big Data
 
Gartner Predicts 2018
Gartner Predicts 2018Gartner Predicts 2018
Gartner Predicts 2018
 

Similar to WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM

DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
Data lake ppt
Data lake pptData lake ppt
Data lake ppt
SwarnaLatha177
 
Data warehouse
Data warehouseData warehouse
Data warehouseMR Z
 
Optimising Data Lakes for Financial Services
Optimising Data Lakes for Financial ServicesOptimising Data Lakes for Financial Services
Optimising Data Lakes for Financial Services
Andrew Carr
 
Enterprise Data Lake
Enterprise Data LakeEnterprise Data Lake
Enterprise Data Lake
sambiswal
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
Sun Technologies
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
Thomas Kelly, PMP
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 
Data Analytics.pptx
Data Analytics.pptxData Analytics.pptx
Data Analytics.pptx
Rapyder Cloud Solutions
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
VijayaLakshmi514
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
Cognizant
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
DougSchoemaker
 
Dataware housing
Dataware housingDataware housing
Dataware housingwork
 
Snowflake Time Travel.pdf
Snowflake Time Travel.pdfSnowflake Time Travel.pdf
Snowflake Time Travel.pdf
VishnuGone
 
Data Science.pdf
Data Science.pdfData Science.pdf
Data Science.pdf
Umar khan
 
Harness the power of data
Harness the power of dataHarness the power of data
Harness the power of data
Harsha MV
 

Similar to WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM (20)

DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Data lake ppt
Data lake pptData lake ppt
Data lake ppt
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
IT Ready - DW: 1st Day
IT Ready - DW: 1st Day IT Ready - DW: 1st Day
IT Ready - DW: 1st Day
 
Optimising Data Lakes for Financial Services
Optimising Data Lakes for Financial ServicesOptimising Data Lakes for Financial Services
Optimising Data Lakes for Financial Services
 
Big data rmoug
Big data rmougBig data rmoug
Big data rmoug
 
Enterprise Data Lake
Enterprise Data LakeEnterprise Data Lake
Enterprise Data Lake
 
Benefits of a data lake
Benefits of a data lake Benefits of a data lake
Benefits of a data lake
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data Analytics.pptx
Data Analytics.pptxData Analytics.pptx
Data Analytics.pptx
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Semantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data LakeSemantic 'Radar' Steers Users to Insights in the Data Lake
Semantic 'Radar' Steers Users to Insights in the Data Lake
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
 
Dataware housing
Dataware housingDataware housing
Dataware housing
 
Snowflake Time Travel.pdf
Snowflake Time Travel.pdfSnowflake Time Travel.pdf
Snowflake Time Travel.pdf
 
Data Science.pdf
Data Science.pdfData Science.pdf
Data Science.pdf
 
Data virtualization
Data virtualizationData virtualization
Data virtualization
 
Harness the power of data
Harness the power of dataHarness the power of data
Harness the power of data
 
Abstract
AbstractAbstract
Abstract
 

Recently uploaded

RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
BBPMedia1
 
Premium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern BusinessesPremium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern Businesses
SynapseIndia
 
Business Valuation Principles for Entrepreneurs
Business Valuation Principles for EntrepreneursBusiness Valuation Principles for Entrepreneurs
Business Valuation Principles for Entrepreneurs
Ben Wann
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
Adam Smith
 
Bài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.doc
Bài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.docBài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.doc
Bài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.doc
daothibichhang1
 
The key differences between the MDR and IVDR in the EU
The key differences between the MDR and IVDR in the EUThe key differences between the MDR and IVDR in the EU
The key differences between the MDR and IVDR in the EU
Allensmith572606
 
Project File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdfProject File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdf
RajPriye
 
FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134
LR1709MUSIC
 
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdfikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
agatadrynko
 
Mastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnapMastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnap
Norma Mushkat Gaffin
 
Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024
FelixPerez547899
 
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBdCree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
creerey
 
The Parable of the Pipeline a book every new businessman or business student ...
The Parable of the Pipeline a book every new businessman or business student ...The Parable of the Pipeline a book every new businessman or business student ...
The Parable of the Pipeline a book every new businessman or business student ...
awaisafdar
 
Digital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and TemplatesDigital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and Templates
Aurelien Domont, MBA
 
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Lviv Startup Club
 
What is the TDS Return Filing Due Date for FY 2024-25.pdf
What is the TDS Return Filing Due Date for FY 2024-25.pdfWhat is the TDS Return Filing Due Date for FY 2024-25.pdf
What is the TDS Return Filing Due Date for FY 2024-25.pdf
seoforlegalpillers
 
-- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month ---- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month --
NZSG
 
Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...
Lviv Startup Club
 
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdfModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
fisherameliaisabella
 
Set off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptxSet off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptx
HARSHITHV26
 

Recently uploaded (20)

RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...
 
Premium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern BusinessesPremium MEAN Stack Development Solutions for Modern Businesses
Premium MEAN Stack Development Solutions for Modern Businesses
 
Business Valuation Principles for Entrepreneurs
Business Valuation Principles for EntrepreneursBusiness Valuation Principles for Entrepreneurs
Business Valuation Principles for Entrepreneurs
 
The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...The Influence of Marketing Strategy and Market Competition on Business Perfor...
The Influence of Marketing Strategy and Market Competition on Business Perfor...
 
Bài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.doc
Bài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.docBài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.doc
Bài tập - Tiếng anh 11 Global Success UNIT 1 - Bản HS.doc
 
The key differences between the MDR and IVDR in the EU
The key differences between the MDR and IVDR in the EUThe key differences between the MDR and IVDR in the EU
The key differences between the MDR and IVDR in the EU
 
Project File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdfProject File Report BBA 6th semester.pdf
Project File Report BBA 6th semester.pdf
 
FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134FINAL PRESENTATION.pptx12143241324134134
FINAL PRESENTATION.pptx12143241324134134
 
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdfikea_woodgreen_petscharity_cat-alogue_digital.pdf
ikea_woodgreen_petscharity_cat-alogue_digital.pdf
 
Mastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnapMastering B2B Payments Webinar from BlueSnap
Mastering B2B Payments Webinar from BlueSnap
 
Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024Company Valuation webinar series - Tuesday, 4 June 2024
Company Valuation webinar series - Tuesday, 4 June 2024
 
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBdCree_Rey_BrandIdentityKit.PDF_PersonalBd
Cree_Rey_BrandIdentityKit.PDF_PersonalBd
 
The Parable of the Pipeline a book every new businessman or business student ...
The Parable of the Pipeline a book every new businessman or business student ...The Parable of the Pipeline a book every new businessman or business student ...
The Parable of the Pipeline a book every new businessman or business student ...
 
Digital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and TemplatesDigital Transformation and IT Strategy Toolkit and Templates
Digital Transformation and IT Strategy Toolkit and Templates
 
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
Evgen Osmak: Methods of key project parameters estimation: from the shaman-in...
 
What is the TDS Return Filing Due Date for FY 2024-25.pdf
What is the TDS Return Filing Due Date for FY 2024-25.pdfWhat is the TDS Return Filing Due Date for FY 2024-25.pdf
What is the TDS Return Filing Due Date for FY 2024-25.pdf
 
-- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month ---- June 2024 is National Volunteer Month --
-- June 2024 is National Volunteer Month --
 
Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...Kseniya Leshchenko: Shared development support service model as the way to ma...
Kseniya Leshchenko: Shared development support service model as the way to ma...
 
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdfModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
ModelingMarketingStrategiesMKS.CollumbiaUniversitypdf
 
Set off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptxSet off and carry forward of losses and assessment of individuals.pptx
Set off and carry forward of losses and assessment of individuals.pptx
 

WHAT IS A DATA LAKE? Know DATA LAKES & SALES ECOSYSTEM

  • 1.
  • 2. INTRODUCTION The volume, variety, velocity and veracity of big data are getting increasingly complex each passing day. The way the data is stored, processed, managed and shared with decision-makers is getting impacted by this complexity and to tackle the same, a revolutionary approach to data management has come into picture. A data lake. ©Denave
  • 4. ©Denave As the name suggests, data lake is a large reservoir of data – structured or unstructured, fed through disparate channels. The data is fed through channels in an ad-hoc manner into these data lakes, however, owing to the predefined set of rules or schema, correlation between the database is established automatically to help with the extraction of meaningful information. It provides high level of flexibility in terms of interaction with and leverage of the data. In general, data lakes are used to store data when you’ve a constant stream of unstructured data coming into, such as, web interactions, product logs, IoT sensors, app usage etc. Simply put, along with the on-premises data, it is the real-time data which fills up the data lake, upon which are then used the principles of machine learning and analytics to make it relational. The global data lakes market is expected to grow at a CAGR of approximately 28% during the forecast period 2017-2023, to touch $12.01 billion by 2024 & $+14.01 billion by 2026. 1,2&3
  • 5. ©Denave DATA LAKES & SALES ECOSYSTEM UNDERSTANDING THE CORRELATION
  • 6. Sales is no more just about the product alone. Rather, it is more about getting connected to the customer – at a deeper level. In order to do so, organisations are becoming data-driven in every sense and they rely heavily on agile technologies to help them with the analysis and management of data. With a large wealth of customer data at their disposal, it is that data analysis backed sales action which acts as a game-changer for organisations. To get the differentiating edge, firms need to use the customer data in the best possible manner to make hyper-impactful outreach and better sales interactions. Following elucidates some of the challenges of sales ecosystem which are solved by Data lakes: Need for the Data in Native Format It may be difficult to believe but since the amount of data being generated by enterprises is extremely huge, a major portion of that data is discarded. Remaining small portion is then stored in the data warehouse – for a few years. This happens owing to the storage capacity, structure restriction, associated costs etc. and most of the time it happens because enterprises don’t know what to be done with that data, esp. machine-generated or historical data. Hence, it is dumped away, thus putting a limitation on the extent of analytics application that can happen. With data lakes, enterprises save the data without fretting about the structure, intended use etc. Simply put, you may not know why you are saving the data, but you’d still do so with the thought that you may need that data someday – thus, getting as much data as possible in its native format. ©Denave
  • 7. With data lakes, enterprises save the data without fretting about the structure, intended use etc. Simply put, you may not know why you are saving the data, but you’d still do so with the thought that you may need that data someday – thus, getting as much data as possible in its native format. Quashing Data Silos With data, the case is such that the number of teams or departments you have, that much variety of data would be there. That database is not centralised and instead remain in silos because it turns out to be expensive as well as time consuming to share that data with one another. Consider the case where a department may need the data from another department – their requirement would be a specific segment of data in a particular format, Therefore, the department which owns the data will be conducting all ETL exercises to be able to extract and package the data in line with the requirement. Extra work equals to extra time and also extra expenses and hence the delivery team would want to shy away from getting any data request in the first place or to delay and excuse any request which has already been made, resulting in data silos. It is like having an immense wealth with you but not being able to use it owing to the labours it is going to take to spend it. This issue is tackled head on with data lakes because there the data ingestion is almost frictionless since it accepts data without any processing, thus allows for a deeper data leverage to all with a centralised and transparent access process. Data ownership doesn’t remain a barrier any more with data lakes. ©Denave
  • 9. The acceptance of need for data lakes itself makes visible a lot of benefits which salespeople can accrue, let’s dig deeper and see what all as an organisation are you set to gain if your database strategies include leveraging data lakes: Say good-bye to silos and fragmentation With data lakes, you get a unified view of everything which comprises the customer experience – all the data from all the platforms, departments, teams, delivery channels. Better preparedness for the customer journey Since you’ve better knowledge about the customer buying cycle, the high or low points of his journey, quite naturally the decision-making process is much more impactful than before. High yielding campaigns You’re better equipped than before for generating and assessing campaigns. The agility and autonomy are rendered by the tools which not only helps you to measure but also to optimise your marketing investments in real time. Power to predict Sitting on the historical data which you’d have otherwise dumped or would have never been able to analyse, provides you with the power to analyse and establish predictive models for customer behaviour. ©Denave
  • 10. All touch point control You’re also able to trace the journey and behaviour specific to each customer touchpoint which allows you to rank the point of conversions accurately. Accordingly, you can adjust your focus on the touchpoints. Fertile environment for new tools or methodologies development Since you own the storage of immense amount of data, you can generate your own technological environment and a service-oriented architecture which can make the development of new tools possible. Intelligent and faster decisions With dynamic dashboards which work upon varied kinds of data accumulated from myriad number of sources – external or internal, you get an unmatched business intelligence right at your tips and thus, decision-making in real-time becomes possible. If we put together all the benefits in one basket, the highlight of data lake adoption would be the provision of an agile 360-degree view data-driven marketing operations and a faster (and intelligent) response system to any business need. These will automatically translate into improved customer interactions, enhanced R&D innovation possibilities and augmented operational efficiencies. ©Denave
  • 12. Getting convinced of the immense advantages of data lakes is one thing but understanding where it fits in your business needs and vision is another thing and probably the more important one. A quick assessment of the latter will help in getting the optimal resolution for your business: Understand your data type What is the type of data which forms the majority of your data input/ accumulation? If it is structured data in generality – traditional, tabular format – data being generated by HR systems, traditional CRMs etc. then data warehouse is a good option. Data lakes are beneficial in case if the data you have is unstructured or semi- structured and it’s getting generated constantly (like a person crosses an in-store ad and the data is captured by the intelligent display unit to get records of user views and behaviour). In such a constant stream of data, hundreds of terabytes are generated and occupy the storage in a matter of time. In short, your structured data may be more suited for RDBMS while the sensor or SAAS based data would be more suited for data lakes. If you would use a data lake for the prior stated data variety then the costs would be exorbitant and unrequired. ©Denave
  • 13. Understand your data expectation If the post storage needs from the data are fixed and you need it for specific use cases, then you’d structure the data in advance for the requite processing. For such kind of expectation from the data, a data warehouse is best suited, however, structuring the data has its own costs attached and it certainly limits your ability to repurpose the data for any other use later on. With data lakes, comes in the flexibility. You can store now and decide later what you really want to extract out of that data. For a set of predefined reports, you may not bother having a data lake but to have the freedom to analyse and accordingly scale up or innovate, data lakes would be the best bet. Understand your resources While DB technicians or IT team would be able to get a database system in place and then business analysts can leverage self-service to extract reports out of it, data lakes would require a different level of expertise. Skilled big data engineers and data scientists would have to be invested in along with sophisticated tools to form a data puddle and then a data pond and finally a data lake. Analyse your data acquisition process If the data acquisition is happening from multiple sources and is a complex process in itself -it would mean that you’d be spending a large overhead on ETL techniques in order to render that data suitable for data warehouse. If you’re constantly spending a fortune because of a complex data acquisition and thus a cost intensive process, then it might be time to switch to data lake. Once you take the plunge into data lakes based on your requirements, remember the governance part as well and involve the legal & privacy team for required due diligence. ©Denave
  • 15. Getting the right platform and then getting the data loaded onto it are the first steps but the real job begins post that. The interface plays a critical role in adoption and usage of data lake. The scope of data lake is huge enough that the normal IT function, which is adequate to manage a data warehouse, can’t be expected to cater to such a wide variety of data as well as a huge community of users. Therefore, self service becomes an essential feature for the data lake interface. It carries these two integral aspects: • To ensure the access grant in accordance with the level of expertise • To ensure that people are able to find the right data that they are looking for For example – a business analysts would need a cooked form of data and a completely raw data set would not make sense to him/ her, though getting the same unprocessed data would be necessary for a data scientist. Hence, the interface will have to have bifurcated zones for different data requirements of divergent set of audience, such as, partially processed data, raw data etc. ©Denave
  • 16. CONCLUSION The whole process of getting the infrastructure in place for the data lake, organising the data lake (creating zones for various user communities as per their expertise level), putting up the catalogue of data assets to enable self-service and eventually, opening it for the users – this complete journey is like a success roadmap for data lakes. Such kind of centralisation of data allows for multiple forms of analysis such as analysing the conversion funnel for improvement areas, creation of a precise recommendation engine etc. – everything facilitating engaging customer experiences and explosive business growth. ©Denave
  • 18. For more sales insights, visit www.denave.com/resources ©Denave