Big data sketch-and-possible-usecases2

•Download as PPTX, PDF•

0 likes•111 views

Dmitri Apassov

big data use cases and architecture

Travel

GOAL: DATA DRIVEN
BUSINESS DECISIONS and
ACTIONS
BASE: SMART COLLECTION and STORING OF DATA
Buzzwords: Hadoop, Document Databases, Columnar, datalake
PATH: ACTIONABLE INTERACTIVE INFOGRAPHICS
Buzzwords: dashboards, Predictive/prescriptive analytics, self-service
BI, Machine Learning

NOT ELIMINATING – AUGMENTING!
We leave current DWH operational and intact
Not Revolution – EVOLUTION!
Of storage
Visualisation
Decision making
In the direction of
Business with truly
crossfunctional team
A shift from traditional reporting to BI and Data Science
Only raw data persists, computations and visualisations are ”as need arise”
New architecture and software:
modern analytical tools
Machine learning
Graph databases
OVERALL STRATEGY DECISIONS

Challenges to overcome
• in BASE: Volume, Variety, Complexity, Security
• in PATH: Resourses, Ownership, Question Repository, Design
Can be overcome by:
• Right tech platforms
• Right competence
• Crossfunctional team

How one should view data
STORAGE TRANSFORMATION VISUALISATION
Essentially, a file in
a folder on disk.
Essentially, make new file on disk
or in memory
0
5
Essentially, ”playing” the
file an appropriate player
conceptDWHdatalake
Mdf, ldf files. Only relational or
dimensional data
SQL, SSIS, C#
Anything. Files,
databases, KV-
stores etc
Rich programmatic
interface
Tools to design
and publish
reports
may be moved
to the cloud
is provided in
the cloud
ODBC

Hadoop: redundant cluster file system + MapReduce
Hadoop: A yellow stuffed elephant
In Cutting's own words:
“The name my kid gave a
stuffed yellow elephant.
Short, relatively easy to
spell and pronounce,
meaningless, and not used
elsewhere: those are my
naming criteria. Kids are
good at generating such.
Googol is a kid’s term”
Why a non-related
meaningless name
In Cutting own words:
“The rules of names for
software is they're
meaningless because
sometimes the use of a
particular piece of software
drifts, and if your name is too
closely associated with that, it
could end up being wrong
over time"
Doug Cutting with the famous elephant

Modern Cloud Architecture
STORAGE TRANSFORMATION VISUALISATION
Sources:
Files, Pictures,
databases
push
Azure
cloud:
Here we store all possivle
data formats within the
organization with Azure
Tech Stack.
Exists on top of
HDinsight
Can
consume
data from
diverse
sources
Python/Java.
Just with a few lines of code: create and
persist resilient distributed datasets
Transform into dataframes
Aggregate as needed
Interactive
notebooks (internal
marketplace of
ideas)
Modern visualisation
tools: Tableau or PowerBI
ODBC
interface/native
connector
Exists on top of
spark in AzureExists on top
of spark in
Azure
Conductor of cluster
resourses and
distributed
calculations
HDInsight (Hadoop) for files
DocumentDB for rich docs
BLOB storage for media
SQL Azure for tabular data

Source files Data lake
Change in spark
jobs
New reports
Change in source
system: column types,
encoding, extra fields
Supplier rebuilds his file
export: column
types,encoding, extra
fields

Blog and video resources for Azure cloud
services
https://www.youtube.com/playlist?list=PLeIihrNL8cl4BiKiD-
VSTah_XZqmaJR3p
http://www.youtube.com/channel/UCRzsq7k4-kT-h3TDUBQ82-w
https://blogs.msdn.microsoft.com/azuredatalake/

What's hot

Making Big Data Analytics with Hadoop fast & easy (webinar slides)Yellowfin

obzen Business Analytics(Big Data with R, Hadoop)Jinsup

BigDataShankar R

The Six pillars for Building big data analytics ecosystemstaimur hafeez

Big Data Hadoop TechnologyRahul Sharma

Scaling Face Recognition with Big DataBogdan Bocse

How to boost your datamanagement with Dremio ?Vincent Terrasi

Big Data Analytics with HadoopPhilippe Julio

Big data analytic platformJesse Wang

Introducing the Big Data Ecosystem with Caserta Concepts & TalendCaserta

Exploring Big Data Analytics ToolsMultisoft Virtual Academy

Modern data warehouseStephen Alex

Relational Technologies Under Siege: Will Handsome Newcomers Displace the St...Neil Raden

Big data managementzeba khanam

Hadoop,Big Data Analytics and MoreTrendwise Analytics

Chattanooga Hadoop Meetup - Hadoop 101 - November 2014Josh Patterson

Cloud as a Data PlatformAndrei Savu

WhatisbigdataandwhylearnhadoopEdureka!

ROI of Big Data Analytics Native on HadoopDataWorks Summit

BigData AnalyticsMayank Kumar Sharma

What's hot (20)

Making Big Data Analytics with Hadoop fast & easy (webinar slides)

obzen Business Analytics(Big Data with R, Hadoop)

BigData

The Six pillars for Building big data analytics ecosystems

Big Data Hadoop Technology

Scaling Face Recognition with Big Data

How to boost your datamanagement with Dremio ?

Big Data Analytics with Hadoop

Big data analytic platform

Introducing the Big Data Ecosystem with Caserta Concepts & Talend

Exploring Big Data Analytics Tools

Modern data warehouse

Relational Technologies Under Siege: Will Handsome Newcomers Displace the St...

Big data management

Hadoop,Big Data Analytics and More

Chattanooga Hadoop Meetup - Hadoop 101 - November 2014

Cloud as a Data Platform

Whatisbigdataandwhylearnhadoop

ROI of Big Data Analytics Native on Hadoop

BigData Analytics

Similar to Big data sketch-and-possible-usecases2

Big Data Practice_Planning_steps_RKRajesh Jayarman

Rajesh Angadi Brochure Rajesh Angadi

Differentiate Big Data vs Data Warehouse use cases for a cloud solutionJames Serra

Using Machine Learning with HDInsightEng Teong Cheah

Big data and apache hadoop adoptionfaizrashid1995

Learn About Big Data and Hadoop The Most Significant ResourceAssignment Help

Modern data warehouseStephen Alex

The future of Big Data toolingData Science Society

SQL vs NoSQL: Big Data Adoption & Success in the EnterpriseAnita Luthra

Hd insight overviewvhrocca

FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)GeeksLab Odessa

WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsJane Roberts

Introduction to Microsoft Azure HD Insight by Dattatrey Sindhol HARMAN Services

Big-Data Hadoop Tutorials - MindScripts Technologies, Pune amrutupre

HadoopMayuri Gupta

Hadoop & Data Warehouse Mohit Srivastava

Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02BIWUG

How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans

Hitachi Data Systems Hadoop SolutionHitachi Vantara

Better Together: The New Data Management OrchestraCloudera, Inc.

Similar to Big data sketch-and-possible-usecases2 (20)

Big Data Practice_Planning_steps_RK

Rajesh Angadi Brochure

Differentiate Big Data vs Data Warehouse use cases for a cloud solution

Using Machine Learning with HDInsight

Big data and apache hadoop adoption

Learn About Big Data and Hadoop The Most Significant Resource

Modern data warehouse

The future of Big Data tooling

SQL vs NoSQL: Big Data Adoption & Success in the Enterprise

Hd insight overview

FOSS Sea 2014_DataWarehouse & BigData_Владимир Слободянюк ( Luxoft)

WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts

Introduction to Microsoft Azure HD Insight by Dattatrey Sindhol

Big-Data Hadoop Tutorials - MindScripts Technologies, Pune

Hadoop

Hadoop & Data Warehouse

Spsbepoelmanssharepointbigdataclean 150421080105-conversion-gate02

How to build your own Delve: combining machine learning, big data and SharePoint

Hitachi Data Systems Hadoop Solution

Better Together: The New Data Management Orchestra

Recently uploaded

DARK TRAVEL AGENCY presented by Khuda BuxBeEducate

Italia Lucca 1 Un tesoro nascosto tra le sue murasandamichaela *

A Comprehensive Guide to The Types of Dubai Residence Visas.pdfDisha Global Tours

Exploring Sicily Your Comprehensive Ebook Travel GuideTime for Sicily

best weekend places near delhi where you should visit.pdftour guide

Dubai Call Girls O528786472 Call Girls Dubai Big Juicyhf8803863

VIP Call Girls in Noida 9711199012 Escorts in Greater Noida,Msankitnayak356677

Call Girls In Munirka 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICECall Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

8377087607 Full Enjoy @24/7 Call Girls in INA Market Dilli Hatt Delhi NCRdollysharma2066

(8264348440) 🔝 Call Girls In Nand Nagri 🔝 Delhi NCRsoniya singh

Enjoy ➥8448380779▻ Call Girls In Sector 74 Noida Escorts Delhi NCRStunning ➥8448380779▻ Call Girls In Hauz Khas Delhi NCR

Moving to Italy - A Relocation RollercoasterStefSmulders1

Call Girls In Panjim Mariott Resort ₰8588052666₰ North ...nishakur201

Enjoy ➥8448380779▻ Call Girls In Sector 62 Noida Escorts Delhi NCRStunning ➥8448380779▻ Call Girls In Hauz Khas Delhi NCR

Hoi An Ancient Town, Vietnam (越南會安古鎮).ppsxChung Yen Chang

"Fly with Ease: Booking Your Flights with Air Europa"flyn goo

Akshay Mehndiratta Summer Special Light Meal Ideas From Across India.pptxAkshay Mehndiratta

Call Girls 🫤 Connaught Place ➡️ 9999965857 ➡️ Delhi 🫦 Russian Escorts FULL ...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Inspirational Quotes About Italy and FoodKasia Chojecki

Recently uploaded (19)

DARK TRAVEL AGENCY presented by Khuda Bux

Italia Lucca 1 Un tesoro nascosto tra le sue mura

A Comprehensive Guide to The Types of Dubai Residence Visas.pdf

Exploring Sicily Your Comprehensive Ebook Travel Guide

best weekend places near delhi where you should visit.pdf

Dubai Call Girls O528786472 Call Girls Dubai Big Juicy

VIP Call Girls in Noida 9711199012 Escorts in Greater Noida,Ms

Call Girls In Munirka 📱 9999965857 🤩 Delhi 🫦 HOT AND SEXY VVIP 🍎 SERVICE

8377087607 Full Enjoy @24/7 Call Girls in INA Market Dilli Hatt Delhi NCR

(8264348440) 🔝 Call Girls In Nand Nagri 🔝 Delhi NCR

Enjoy ➥8448380779▻ Call Girls In Sector 74 Noida Escorts Delhi NCR

Moving to Italy - A Relocation Rollercoaster

Call Girls In Panjim Mariott Resort ₰8588052666₰ North ...

Enjoy ➥8448380779▻ Call Girls In Sector 62 Noida Escorts Delhi NCR

Hoi An Ancient Town, Vietnam (越南會安古鎮).ppsx

"Fly with Ease: Booking Your Flights with Air Europa"

Akshay Mehndiratta Summer Special Light Meal Ideas From Across India.pptx

Call Girls 🫤 Connaught Place ➡️ 9999965857 ➡️ Delhi 🫦 Russian Escorts FULL ...

Inspirational Quotes About Italy and Food

Big data sketch-and-possible-usecases2

1. BigData sketch and possible usecases

2. GOAL: DATA DRIVEN BUSINESS DECISIONS and ACTIONS BASE: SMART COLLECTION and STORING OF DATA Buzzwords: Hadoop, Document Databases, Columnar, datalake PATH: ACTIONABLE INTERACTIVE INFOGRAPHICS Buzzwords: dashboards, Predictive/prescriptive analytics, self-service BI, Machine Learning

3. NOT ELIMINATING – AUGMENTING! We leave current DWH operational and intact Not Revolution – EVOLUTION! Of storage Visualisation Decision making In the direction of Business with truly crossfunctional team A shift from traditional reporting to BI and Data Science Only raw data persists, computations and visualisations are ”as need arise” New architecture and software: modern analytical tools Machine learning Graph databases OVERALL STRATEGY DECISIONS

4. Challenges to overcome • in BASE: Volume, Variety, Complexity, Security • in PATH: Resourses, Ownership, Question Repository, Design Can be overcome by: • Right tech platforms • Right competence • Crossfunctional team

5. How one should view data STORAGE TRANSFORMATION VISUALISATION Essentially, a file in a folder on disk. Essentially, make new file on disk or in memory 0 5 Essentially, ”playing” the file an appropriate player conceptDWHdatalake Mdf, ldf files. Only relational or dimensional data SQL, SSIS, C# Anything. Files, databases, KV- stores etc Rich programmatic interface Tools to design and publish reports may be moved to the cloud is provided in the cloud ODBC

6. Hadoop: redundant cluster file system + MapReduce Hadoop: A yellow stuffed elephant In Cutting's own words: “The name my kid gave a stuffed yellow elephant. Short, relatively easy to spell and pronounce, meaningless, and not used elsewhere: those are my naming criteria. Kids are good at generating such. Googol is a kid’s term” Why a non-related meaningless name In Cutting own words: “The rules of names for software is they're meaningless because sometimes the use of a particular piece of software drifts, and if your name is too closely associated with that, it could end up being wrong over time" Doug Cutting with the famous elephant

7. Modern Cloud Architecture STORAGE TRANSFORMATION VISUALISATION Sources: Files, Pictures, databases push Azure cloud: Here we store all possivle data formats within the organization with Azure Tech Stack. Exists on top of HDinsight Can consume data from diverse sources Python/Java. Just with a few lines of code: create and persist resilient distributed datasets Transform into dataframes Aggregate as needed Interactive notebooks (internal marketplace of ideas) Modern visualisation tools: Tableau or PowerBI ODBC interface/native connector Exists on top of spark in AzureExists on top of spark in Azure Conductor of cluster resourses and distributed calculations HDInsight (Hadoop) for files DocumentDB for rich docs BLOB storage for media SQL Azure for tabular data

8. Source files Data lake Change in spark jobs New reports Change in source system: column types, encoding, extra fields Supplier rebuilds his file export: column types,encoding, extra fields

10. Blog and video resources for Azure cloud services https://www.youtube.com/playlist?list=PLeIihrNL8cl4BiKiD- VSTah_XZqmaJR3p http://www.youtube.com/channel/UCRzsq7k4-kT-h3TDUBQ82-w https://blogs.msdn.microsoft.com/azuredatalake/

11. STREAMING DATA TO COLLECT: SNAPSHOT DATA TO COLLECT: Possible use cases Customer profile Social networks Aggregated snapshots Transactions ”as they come” Transactional history face recognition Geodata geolocation Segmentation+targeting Fraudulent transactions Churn Personalized support immediate risk recalculation Click interactions+log analysis Customer lifetime score BENEFITS-SOLUTIONS- ACTIONS

Big data sketch-and-possible-usecases2

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Big data sketch-and-possible-usecases2

Similar to Big data sketch-and-possible-usecases2 (20)

Recently uploaded

Recently uploaded (19)

Big data sketch-and-possible-usecases2