SlideShare a Scribd company logo
1 of 21
Yekesa Kosuru
HERE.com
Nokia
Hadoop Innovation Summit February 20 & 21, San Diego
2013
Phases of Big Data Challenges
@ Nokia
11
• Phases of Big Data Challenges @Nokia
– Who we are
– Big data platform
– Use case data flows
– High level architecture
–Challenges
• Phases of challenges
Agenda
22
Accelerometer
GPS
Water
Proof
12h
Battery
Bluetooth 2GB Storage
Barometer
NFC
Gyroscope
Magnetometer
Who we are – disrupting the future
3
Apps
Smart Data
Platform
Content
PositionsMaps TrafficPlaces Directions Guidance
Location Platform, Enabling
Contextually Rich Mobile Experiences
44
5
Big Data
Analytics
…to Be Made
Available for Analysis
Enabling feedback loops for continuous improvement,
Location Optimized Experience, CRM, etc..!
Big Data Flows and Differentiates
…on All Supported
Platforms…
Nokia
Account
We Collect
User Data…
5
Click to edit Master title style
Phase 0
66
2008 – ‘10
BuildTechnology
Platform,
GetData
7
Business Challenges
• Data silos, no unique identifiers, missing semantics
• Multiple sources - overlapping, conflicting
• Timely processing of large volumes & velocity of data
• Partial, insufficient, inaccurate, inconsistent.. data
• Data/wire formats, Security, privacy and other policies
unknown
Central Big Data Platform created
8
…to verify Map accuracy and create
Motion Graph
Using different big data sets
Reports
Analytical
DBMS
Analytics Cluster
Data Asset
Catalog
Analytical
DBMS
Dashboards
Data Discovery
Interactive
Queries
Batch
Queries
Web Applications
Activity
Logs
VShards
(NoSQL)
Reference Data
Device Applications
Probes
3rd Party
Device
User Profile
POI, Map
Activity
Sensor
DataIntake
ETL,datacrunching,
attribution,ML
Algorithms
Aggregation
HDFS
9
Analytical
DBMS
Big Data Analytics Platform Data Flows
Technology Platform
10
Hadoop R
VShards
(KV)
SDK,
Scribe, FTP
Hive, Pig
Analytical
DBMS
Export/
Import
Workflow
Engine
Config./
Deploy
Monitor Alerts
Data
Pipeline
Scheduler
Security/Kerberos & ACL
On-Premise & Cloud Infrastructure
11
Data Platform
Self Serve
Tools
ETL, Agg
Machine Learning
Data Quality
Data Asset
Catalog
Data, Metadata, Operational Data
Collect Ingest Organize Analyze Deliver
Technology Platform
Click to edit Master title style
Phase 1 –2012
1212
2008 – ‘10
BuildTechnology
Platform,
GetData
2011
EnhancePlatform,
MoreData,
SimpleAnalytics,
DataCrunching
2012
PB’sofData,
HundredsofUsers
ThousandsofJobs
ComplexAnalytics,
MultipleClusters
13
2012 Production Statistics
• 10’s PB of data all across Nokia
• Multi-tenant, multi-petabyte analytics cluster
• 10-20K+ jobs per day
• 600+ internal users
• 300M+ KV queries
• Terabytes flowing in every day
• Multiple data centers around the world
14
Challenges With Big Data
• Complex eco-system of technologies - many moving
parts, slower deploy cycles, data integration is complex
• Capacity & Scale Issues – Provision for peaks or sustained,
storage or compute ?
• DBMS great for performance & data management, but
cant scale - price/performance & ACIDity
• Hadoop great for ETL, but poor on query performance &
data management, not interactive
• Data and Metadata fragmentation
15
Big Data Capacity Issues
• Spikey Workloads
• Capacity Provisioning
– Peaks
– Sustained loads
• How many clusters ?
– SLA/Adhoc/Research
– Multiple data centers
– Data duplication
• Tenancy – single/multi
• TOC
– Hadoop can get expensive -
storage & computed tightly
coupled, idle machines
16
Cloud helps with some issues
• Operational & IT complexity reduced – API based spin up
& tear down – rapid deployments, faster cycles
• Pay for what is used
• Capacity issues mitigated - idle machines or peaks not
an issue – elastically scale up and down
• De-coupled Storage and Compute makes sense
• Stateless architecture, recycle slow/bad machines, no
need for rolling upgrades, instead do rolling replace
Click to edit Master title style
Phase 2
1717
2012
PB’sofData,
HundredsofUsers
ThousandsofJobs
Simple&Complex
Analytics
2008 – ‘10
BuildTechnology
Platform,
GetData
17
2011
EnhancePlatform,
MoreData,
SimpleAnalytics
2013
StillPending
Challenges
18
Still Pending
• Data and Metadata fragmentation, need deeper
integration into all tools/frameworks
• Advanced Analytics - Data science problems are hard &
inefficient to implement in Map Reduce/RDBMS
19
Complex Analytics
• Mathematicians think terms of Arrays not Map Reduce
• Data science tools can’t efficiently handle big data
• Data partitioning is naïve, indexing wont scale
Big Data Technologies for Future
21
THANK YOU
Yekesa Kosuru yekesa.kosuru@nokia.com

More Related Content

What's hot

A brief history of data warehousing
A brief history of data warehousingA brief history of data warehousing
A brief history of data warehousingRob Winters
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsEmbarcadero Technologies
 
Big Data Warehousing Meetup with Riak
Big Data Warehousing Meetup with RiakBig Data Warehousing Meetup with Riak
Big Data Warehousing Meetup with RiakCaserta
 
Resume_Pratik_15012017
Resume_Pratik_15012017Resume_Pratik_15012017
Resume_Pratik_15012017Pratik Awasthi
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"Rob Winters
 
2009/11 Database Architechs Presentation
2009/11   Database Architechs Presentation2009/11   Database Architechs Presentation
2009/11 Database Architechs PresentationDatabase Architechs
 
Terumo Medical Integrated Business Analytics at its Best
Terumo Medical Integrated Business Analytics at its BestTerumo Medical Integrated Business Analytics at its Best
Terumo Medical Integrated Business Analytics at its BestAlithya
 
Design Decisions for Embedding BI into Your Application
Design Decisions for Embedding BI into Your ApplicationDesign Decisions for Embedding BI into Your Application
Design Decisions for Embedding BI into Your ApplicationMia Yuan Cao
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupScott Mitchell
 
Delivering Insights from 20M+ Smart Homes with 500M+ Devices
Delivering Insights from 20M+ Smart Homes with 500M+ DevicesDelivering Insights from 20M+ Smart Homes with 500M+ Devices
Delivering Insights from 20M+ Smart Homes with 500M+ DevicesDatabricks
 
Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsQuontra Solutions
 
MohitKalra_Resume
MohitKalra_ResumeMohitKalra_Resume
MohitKalra_ResumeMohit Kalra
 
AnishNSheth_Business_Intelligence_Architect
AnishNSheth_Business_Intelligence_ArchitectAnishNSheth_Business_Intelligence_Architect
AnishNSheth_Business_Intelligence_ArchitectAnish Sheth
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 

What's hot (20)

A brief history of data warehousing
A brief history of data warehousingA brief history of data warehousing
A brief history of data warehousing
 
Bishakha Gupta Resume
Bishakha Gupta ResumeBishakha Gupta Resume
Bishakha Gupta Resume
 
Driving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data AssetsDriving Business Value Through Agile Data Assets
Driving Business Value Through Agile Data Assets
 
Oracle Data Integrator
Oracle Data Integrator Oracle Data Integrator
Oracle Data Integrator
 
Big Data Warehousing Meetup with Riak
Big Data Warehousing Meetup with RiakBig Data Warehousing Meetup with Riak
Big Data Warehousing Meetup with Riak
 
Dilchand Kumar_Resume
Dilchand Kumar_ResumeDilchand Kumar_Resume
Dilchand Kumar_Resume
 
Resume_Pratik_15012017
Resume_Pratik_15012017Resume_Pratik_15012017
Resume_Pratik_15012017
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"
 
2009/11 Database Architechs Presentation
2009/11   Database Architechs Presentation2009/11   Database Architechs Presentation
2009/11 Database Architechs Presentation
 
Terumo Medical Integrated Business Analytics at its Best
Terumo Medical Integrated Business Analytics at its BestTerumo Medical Integrated Business Analytics at its Best
Terumo Medical Integrated Business Analytics at its Best
 
Design Decisions for Embedding BI into Your Application
Design Decisions for Embedding BI into Your ApplicationDesign Decisions for Embedding BI into Your Application
Design Decisions for Embedding BI into Your Application
 
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User GroupBig Data and BI Tools - BI Reporting for Bay Area Startups User Group
Big Data and BI Tools - BI Reporting for Bay Area Startups User Group
 
Delivering Insights from 20M+ Smart Homes with 500M+ Devices
Delivering Insights from 20M+ Smart Homes with 500M+ DevicesDelivering Insights from 20M+ Smart Homes with 500M+ Devices
Delivering Insights from 20M+ Smart Homes with 500M+ Devices
 
Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra Solutions
 
MohitKalra_Resume
MohitKalra_ResumeMohitKalra_Resume
MohitKalra_Resume
 
Rajesh CV
Rajesh CVRajesh CV
Rajesh CV
 
Rest and Hateoas APIs
Rest and Hateoas APIsRest and Hateoas APIs
Rest and Hateoas APIs
 
AnishNSheth_Business_Intelligence_Architect
AnishNSheth_Business_Intelligence_ArchitectAnishNSheth_Business_Intelligence_Architect
AnishNSheth_Business_Intelligence_Architect
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 
AhmedWasfi2015
AhmedWasfi2015AhmedWasfi2015
AhmedWasfi2015
 

Similar to Phases of Big Data Challenges @ Nokia

ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...DATAVERSITY
 
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Vladi Vexler
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Architecting Agile Data Applications for Scale
Architecting Agile Data Applications for ScaleArchitecting Agile Data Applications for Scale
Architecting Agile Data Applications for ScaleDatabricks
 
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesLogical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesDenodo
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureVenu Anuganti
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Denodo
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization Denodo
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyAlluxio, Inc.
 
TECHunplugged Austin 2016
TECHunplugged Austin 2016TECHunplugged Austin 2016
TECHunplugged Austin 2016Chris Evans
 
SharePoint Best Practices Conference 2013
SharePoint Best Practices Conference 2013SharePoint Best Practices Conference 2013
SharePoint Best Practices Conference 2013Mike Brannon
 
Scaling db infra_pay_pal
Scaling db infra_pay_palScaling db infra_pay_pal
Scaling db infra_pay_palpramod garre
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMichael Hiskey
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxjKool
 

Similar to Phases of Big Data Challenges @ Nokia (20)

ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
 
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015 Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
Data Modeling and Scale Out - ScaleBase + 451-Group webinar 30.4.2015
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Architecting Agile Data Applications for Scale
Architecting Agile Data Applications for ScaleArchitecting Agile Data Applications for Scale
Architecting Agile Data Applications for Scale
 
Logical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business OutcomesLogical Data Fabric and Data Mesh – Driving Business Outcomes
Logical Data Fabric and Data Mesh – Driving Business Outcomes
 
SQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data ArchitectureSQL, NoSQL, BigData in Data Architecture
SQL, NoSQL, BigData in Data Architecture
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Retail & CPG
Retail & CPGRetail & CPG
Retail & CPG
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
 
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio JourneyModernizing Global Shared Data Analytics Platform and our Alluxio Journey
Modernizing Global Shared Data Analytics Platform and our Alluxio Journey
 
TECHunplugged Austin 2016
TECHunplugged Austin 2016TECHunplugged Austin 2016
TECHunplugged Austin 2016
 
Accelerating Data Warehouse Modernization
Accelerating Data Warehouse ModernizationAccelerating Data Warehouse Modernization
Accelerating Data Warehouse Modernization
 
SharePoint Best Practices Conference 2013
SharePoint Best Practices Conference 2013SharePoint Best Practices Conference 2013
SharePoint Best Practices Conference 2013
 
Scaling db infra_pay_pal
Scaling db infra_pay_palScaling db infra_pay_pal
Scaling db infra_pay_pal
 
Meta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinarMeta scale kognitio hadoop webinar
Meta scale kognitio hadoop webinar
 
How jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStaxHow jKool Analyzes Streaming Data in Real Time with DataStax
How jKool Analyzes Streaming Data in Real Time with DataStax
 

More from Innovation Enterprise

Marketing Technology Organizational Models
Marketing Technology Organizational ModelsMarketing Technology Organizational Models
Marketing Technology Organizational ModelsInnovation Enterprise
 
Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...
Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...
Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...Innovation Enterprise
 
Beyond the Basics: Leveraging S&OP to Deliver Results, Newell Rubbermaid
Beyond the Basics: Leveraging S&OP to Deliver Results, Newell RubbermaidBeyond the Basics: Leveraging S&OP to Deliver Results, Newell Rubbermaid
Beyond the Basics: Leveraging S&OP to Deliver Results, Newell RubbermaidInnovation Enterprise
 
CHAINalytics, Empowering Fact Based Decisions Across Your Supply Chain
CHAINalytics, Empowering Fact Based Decisions Across Your Supply ChainCHAINalytics, Empowering Fact Based Decisions Across Your Supply Chain
CHAINalytics, Empowering Fact Based Decisions Across Your Supply ChainInnovation Enterprise
 
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...Innovation Enterprise
 
One Version of the Truth, Driving S&OP from detailed planning tools, Freescale
One Version of the Truth, Driving S&OP from detailed planning tools, FreescaleOne Version of the Truth, Driving S&OP from detailed planning tools, Freescale
One Version of the Truth, Driving S&OP from detailed planning tools, FreescaleInnovation Enterprise
 
Making Sales and Operations Planning a Truly Collaborative Process, Dick Ling
Making Sales and Operations Planning a Truly Collaborative Process, Dick LingMaking Sales and Operations Planning a Truly Collaborative Process, Dick Ling
Making Sales and Operations Planning a Truly Collaborative Process, Dick LingInnovation Enterprise
 
Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...
Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...
Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...Innovation Enterprise
 
Strengthen the Processes to reach another level of excellence, Satish Sandhir
Strengthen the Processes to reach another level of excellence, Satish SandhirStrengthen the Processes to reach another level of excellence, Satish Sandhir
Strengthen the Processes to reach another level of excellence, Satish SandhirInnovation Enterprise
 
How to Keep S&OP From Getting "Stuck", Oliver Wight, JDA
How to Keep S&OP From Getting "Stuck", Oliver Wight, JDAHow to Keep S&OP From Getting "Stuck", Oliver Wight, JDA
How to Keep S&OP From Getting "Stuck", Oliver Wight, JDAInnovation Enterprise
 
Cisco Strategic Planning The Journey, Cisco
Cisco Strategic Planning The Journey, CiscoCisco Strategic Planning The Journey, Cisco
Cisco Strategic Planning The Journey, CiscoInnovation Enterprise
 
Sales and Operations Planning, Supported by Demand Management Capability, Sus...
Sales and Operations Planning, Supported by Demand Management Capability, Sus...Sales and Operations Planning, Supported by Demand Management Capability, Sus...
Sales and Operations Planning, Supported by Demand Management Capability, Sus...Innovation Enterprise
 
Enablers for Maturing your S&OP Processes, SherTrack
Enablers for Maturing your S&OP Processes, SherTrackEnablers for Maturing your S&OP Processes, SherTrack
Enablers for Maturing your S&OP Processes, SherTrackInnovation Enterprise
 
Sales, Inventory & Operations Planning During High Growth, GMCR
Sales, Inventory & Operations Planning During High Growth, GMCRSales, Inventory & Operations Planning During High Growth, GMCR
Sales, Inventory & Operations Planning During High Growth, GMCRInnovation Enterprise
 
Predicting The Future With Big Data: No Crystal Ball Required, TrendSpottr
Predicting The Future With Big Data: No Crystal Ball Required, TrendSpottrPredicting The Future With Big Data: No Crystal Ball Required, TrendSpottr
Predicting The Future With Big Data: No Crystal Ball Required, TrendSpottrInnovation Enterprise
 
Big Data in Education, Desire2Learn Inc
Big Data in Education, Desire2Learn IncBig Data in Education, Desire2Learn Inc
Big Data in Education, Desire2Learn IncInnovation Enterprise
 

More from Innovation Enterprise (20)

Marketing Technology Organizational Models
Marketing Technology Organizational ModelsMarketing Technology Organizational Models
Marketing Technology Organizational Models
 
BI, INC - BI, INC, Boeing
BI, INC - BI, INC, BoeingBI, INC - BI, INC, Boeing
BI, INC - BI, INC, Boeing
 
Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...
Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...
Bridging the Gap between Budgets & Reality Oracle's Next Generation S&OP Solu...
 
Beyond the Basics: Leveraging S&OP to Deliver Results, Newell Rubbermaid
Beyond the Basics: Leveraging S&OP to Deliver Results, Newell RubbermaidBeyond the Basics: Leveraging S&OP to Deliver Results, Newell Rubbermaid
Beyond the Basics: Leveraging S&OP to Deliver Results, Newell Rubbermaid
 
CHAINalytics, Empowering Fact Based Decisions Across Your Supply Chain
CHAINalytics, Empowering Fact Based Decisions Across Your Supply ChainCHAINalytics, Empowering Fact Based Decisions Across Your Supply Chain
CHAINalytics, Empowering Fact Based Decisions Across Your Supply Chain
 
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
 
One Version of the Truth, Driving S&OP from detailed planning tools, Freescale
One Version of the Truth, Driving S&OP from detailed planning tools, FreescaleOne Version of the Truth, Driving S&OP from detailed planning tools, Freescale
One Version of the Truth, Driving S&OP from detailed planning tools, Freescale
 
Making Sales and Operations Planning a Truly Collaborative Process, Dick Ling
Making Sales and Operations Planning a Truly Collaborative Process, Dick LingMaking Sales and Operations Planning a Truly Collaborative Process, Dick Ling
Making Sales and Operations Planning a Truly Collaborative Process, Dick Ling
 
Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...
Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...
Building a Fast and Flexible Consumer-Driven Supply Chain, Stanley Black & De...
 
Strengthen the Processes to reach another level of excellence, Satish Sandhir
Strengthen the Processes to reach another level of excellence, Satish SandhirStrengthen the Processes to reach another level of excellence, Satish Sandhir
Strengthen the Processes to reach another level of excellence, Satish Sandhir
 
How to Keep S&OP From Getting "Stuck", Oliver Wight, JDA
How to Keep S&OP From Getting "Stuck", Oliver Wight, JDAHow to Keep S&OP From Getting "Stuck", Oliver Wight, JDA
How to Keep S&OP From Getting "Stuck", Oliver Wight, JDA
 
S&OP Innovation, Marietta
S&OP Innovation, MariettaS&OP Innovation, Marietta
S&OP Innovation, Marietta
 
Cisco Strategic Planning The Journey, Cisco
Cisco Strategic Planning The Journey, CiscoCisco Strategic Planning The Journey, Cisco
Cisco Strategic Planning The Journey, Cisco
 
Sales and Operations Planning, Supported by Demand Management Capability, Sus...
Sales and Operations Planning, Supported by Demand Management Capability, Sus...Sales and Operations Planning, Supported by Demand Management Capability, Sus...
Sales and Operations Planning, Supported by Demand Management Capability, Sus...
 
Enablers for Maturing your S&OP Processes, SherTrack
Enablers for Maturing your S&OP Processes, SherTrackEnablers for Maturing your S&OP Processes, SherTrack
Enablers for Maturing your S&OP Processes, SherTrack
 
S&OP, Kinaxis
S&OP, KinaxisS&OP, Kinaxis
S&OP, Kinaxis
 
Sales, Inventory & Operations Planning During High Growth, GMCR
Sales, Inventory & Operations Planning During High Growth, GMCRSales, Inventory & Operations Planning During High Growth, GMCR
Sales, Inventory & Operations Planning During High Growth, GMCR
 
Predicting The Future With Big Data: No Crystal Ball Required, TrendSpottr
Predicting The Future With Big Data: No Crystal Ball Required, TrendSpottrPredicting The Future With Big Data: No Crystal Ball Required, TrendSpottr
Predicting The Future With Big Data: No Crystal Ball Required, TrendSpottr
 
Big Data Toronto, Unata
Big Data Toronto, UnataBig Data Toronto, Unata
Big Data Toronto, Unata
 
Big Data in Education, Desire2Learn Inc
Big Data in Education, Desire2Learn IncBig Data in Education, Desire2Learn Inc
Big Data in Education, Desire2Learn Inc
 

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 

Phases of Big Data Challenges @ Nokia

  • 1. Yekesa Kosuru HERE.com Nokia Hadoop Innovation Summit February 20 & 21, San Diego 2013 Phases of Big Data Challenges @ Nokia 11
  • 2. • Phases of Big Data Challenges @Nokia – Who we are – Big data platform – Use case data flows – High level architecture –Challenges • Phases of challenges Agenda 22
  • 4. Apps Smart Data Platform Content PositionsMaps TrafficPlaces Directions Guidance Location Platform, Enabling Contextually Rich Mobile Experiences 44
  • 5. 5 Big Data Analytics …to Be Made Available for Analysis Enabling feedback loops for continuous improvement, Location Optimized Experience, CRM, etc..! Big Data Flows and Differentiates …on All Supported Platforms… Nokia Account We Collect User Data… 5
  • 6. Click to edit Master title style Phase 0 66 2008 – ‘10 BuildTechnology Platform, GetData
  • 7. 7 Business Challenges • Data silos, no unique identifiers, missing semantics • Multiple sources - overlapping, conflicting • Timely processing of large volumes & velocity of data • Partial, insufficient, inaccurate, inconsistent.. data • Data/wire formats, Security, privacy and other policies unknown Central Big Data Platform created
  • 8. 8 …to verify Map accuracy and create Motion Graph Using different big data sets
  • 9. Reports Analytical DBMS Analytics Cluster Data Asset Catalog Analytical DBMS Dashboards Data Discovery Interactive Queries Batch Queries Web Applications Activity Logs VShards (NoSQL) Reference Data Device Applications Probes 3rd Party Device User Profile POI, Map Activity Sensor DataIntake ETL,datacrunching, attribution,ML Algorithms Aggregation HDFS 9 Analytical DBMS Big Data Analytics Platform Data Flows
  • 10. Technology Platform 10 Hadoop R VShards (KV) SDK, Scribe, FTP Hive, Pig Analytical DBMS Export/ Import Workflow Engine Config./ Deploy Monitor Alerts Data Pipeline Scheduler Security/Kerberos & ACL On-Premise & Cloud Infrastructure
  • 11. 11 Data Platform Self Serve Tools ETL, Agg Machine Learning Data Quality Data Asset Catalog Data, Metadata, Operational Data Collect Ingest Organize Analyze Deliver Technology Platform
  • 12. Click to edit Master title style Phase 1 –2012 1212 2008 – ‘10 BuildTechnology Platform, GetData 2011 EnhancePlatform, MoreData, SimpleAnalytics, DataCrunching 2012 PB’sofData, HundredsofUsers ThousandsofJobs ComplexAnalytics, MultipleClusters
  • 13. 13 2012 Production Statistics • 10’s PB of data all across Nokia • Multi-tenant, multi-petabyte analytics cluster • 10-20K+ jobs per day • 600+ internal users • 300M+ KV queries • Terabytes flowing in every day • Multiple data centers around the world
  • 14. 14 Challenges With Big Data • Complex eco-system of technologies - many moving parts, slower deploy cycles, data integration is complex • Capacity & Scale Issues – Provision for peaks or sustained, storage or compute ? • DBMS great for performance & data management, but cant scale - price/performance & ACIDity • Hadoop great for ETL, but poor on query performance & data management, not interactive • Data and Metadata fragmentation
  • 15. 15 Big Data Capacity Issues • Spikey Workloads • Capacity Provisioning – Peaks – Sustained loads • How many clusters ? – SLA/Adhoc/Research – Multiple data centers – Data duplication • Tenancy – single/multi • TOC – Hadoop can get expensive - storage & computed tightly coupled, idle machines
  • 16. 16 Cloud helps with some issues • Operational & IT complexity reduced – API based spin up & tear down – rapid deployments, faster cycles • Pay for what is used • Capacity issues mitigated - idle machines or peaks not an issue – elastically scale up and down • De-coupled Storage and Compute makes sense • Stateless architecture, recycle slow/bad machines, no need for rolling upgrades, instead do rolling replace
  • 17. Click to edit Master title style Phase 2 1717 2012 PB’sofData, HundredsofUsers ThousandsofJobs Simple&Complex Analytics 2008 – ‘10 BuildTechnology Platform, GetData 17 2011 EnhancePlatform, MoreData, SimpleAnalytics 2013 StillPending Challenges
  • 18. 18 Still Pending • Data and Metadata fragmentation, need deeper integration into all tools/frameworks • Advanced Analytics - Data science problems are hard & inefficient to implement in Map Reduce/RDBMS
  • 19. 19 Complex Analytics • Mathematicians think terms of Arrays not Map Reduce • Data science tools can’t efficiently handle big data • Data partitioning is naïve, indexing wont scale
  • 20. Big Data Technologies for Future
  • 21. 21 THANK YOU Yekesa Kosuru yekesa.kosuru@nokia.com