SlideShare a Scribd company logo
1
But how do I GET the
data?
Transparency Camp 2014
Shooju is a Web-Based Data Platform
2
• Consolidate your internal and external data sources
• Make all data searchable from one place
• Provide continuous updating
• Seamlessly integrate with tools and applications
• Share data across your entire organization
• Save time and energy while reducing errors and
problems with version control
Shooju saves time, improves data quality and enhances
data sharing across your entire organization
The Analytical Process
3
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
The Analytical Process
4
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
The Analytical Process
5
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
The Analytical Process
6
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
your product
The Analytical Process
7
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
your product
The Fun Part 
The Analytical Process
8
Data
Data Data
Data
Data
Data
Data
Data
Data
Data
Data
Data
some place
your tool of choice
your product
The Not Fun Part 
Big data vs. small data
9
A boring 2 x 2
10
The harsh 80/20 reality
11
Most organizations spend more time collecting,
cleaning, downloading, managing and
wrangling data than they do conducting analysis
Three ways to get data
• API
– Good
– Bad
• Scraping
• Manual
12
Defined as ETL (Extract,
Transform, Load) process
Method comparison
13
TechnicalExpertiserequired
Time (and annoyance)
Manual
Scraping
API
14
Average cost curve of data collection
Manual Collection
AverageCost
Number of times data is collected
15
Average cost curve of data collection
Manual Collection
AverageCost
Number of times data is collected
Scraping
16
Average cost curve of data collection
Manual Collection
AverageCost
Number of times data is collected
Scraping
API
How do you get your data?
What do you like?
What don’t you like?
17
Once the data is scraped, where can it go?
• CSV
• XLS
• DBF
• SQL
• NoSQL
• Many others
18
Where does your data go when you collect it?
19
1 Appendix
Shooju Value Added
Cost Savings
By saving analyst time and energy, Shooju allows analysts to do more with less,
reducing data management costs and putting more focus on high-value analysis.
Added Quality
Automating data processes internally will ensure that your data is accurate, up-to-date
and consistent across your entire organization.
Enhanced Decision Making
Having more accurate data available faster with more analyst time left for analysis
leads to enhanced decision making.
21
Cost
Savings
Added
Quality
Enhanced
Decision
Making
Shooju
Value
Added
22
Shooju
Sources
Excel
Add-In
& Other Tools
Custom
BI Apps
Web
Search
Auto-
Import
Drivers
# of analysts retrieving
time saved in retrieval
# of sources
frequency of retrieval
# of analysts refreshing
time saved in tool refresh
# of sources
frequency of refresh
time to integrate data
analysts contributing data
# of tools created
analyst upload time
# of analysts searching
time saved in search
# of sources
frequency of search
5 analysts
65 min / source
22 sources
18 times / year
11 analysts
74 min / source
22 sources
14 times / year
9 min / source
22 sources
32 times / year
$97k
(14%)
$73k
(10%)
$248k
(35%)
$702kTotal:
Cost Savings
13 analysts
14 wk of dev. saved
8 analysts contributing
2 apps created
$284k
(41%)
40 min 10 times / year
Sample Cost Savings
Cost Savings Added Quality Enhanced Decision MakingShooju Value Added
* Based on real 40-person
organization. Assumed
annual wages vary
between $30k and $140k.
$410k
savings
equivalent to
10% of HR
spend*
Shooju speeds up
custom BI application
development by making
all data natively
accessible and
continuously updated in
any BI tool or custom
app.
USD (%)
Added Quality: The Three “Cs”
23
Cost Savings Added QualityShooju Value Added
Consistency
Shooju ensures that all analysts are using the same data
across all their tools and applications. By allowing
analysts to upload their own data to the platform, internal
data as well as external data now flows seamlessly -
without messy spreadsheet links.
Currency
By automatically pulling in the latest source data through the
Shooju importer layer, Shooju ensures that all of your
spreadsheets and models are populated with the latest data.
Our native plugins for Excel, Access and all your other tools
allow data to flow through directly without any need for the
analyst to download or copy and paste.
Correctness
The more data is touched by human hands, the more prone it is to errors. By streamlining
workflows and automating work processes, Shooju eliminates most of these errors, saving
time and ensuring that the data you rely on is more accurate.
Enhanced Decision Making
We support any data source
24
Ask us about non-mainstream data
sources that traditional data providers
don’t carry.
Shooju Data Process
25
Shooju vs. Custom Data Warehouse
Custom Data
Warehouse Shooju
Design Custom “Plug-and-play”
Cost 7+ digits 5-6 digits
Rollout timeline Months / Years Hours
Scalability Minimal Infinite
Flexibility Low High
Maintenance High Low
Stakeholders IT controlled Analyst run / IT maintained
Tool and app support Clunky, requiring IT Native tool support
26
Data warehouse projects are costly, time consuming and
result in inflexible systems with low adoption rates
Shooju vs. Off-the-shelf Data Management*
Off-the-shelf
Data Management* Shooju
Service focus Data provision/management Process improvement
Prepackaged data feeds Many None
Custom data feeds None (not natively supported) Included(all feeds are custom)
Internal data integration Weeks (high consulting fees) Days (included in service)
Process flexibility Low High
Analyst learning curve Weeks Hours
Ease of migrating off Very difficult/impossible Easy
Annual fee 6-7 digits 5-6 digits
27
Data management* solutions focus on generic data
provision rather than process improvement and limit
analysts to a closed and inflexible data ecosystem.
* Top-ranked providers in the EnergyRisk Data Management category include: Morningstar, ZE Power Group, SunGard, Allegro, Pioneer
Solutions, SAS, and InteractiveData. See http://www.slideshare.net/Allegrodev/energy-risk-magazines-etrm-software-rankings-2013

More Related Content

What's hot

Empowering Customers with Personalized Insights
Empowering Customers with Personalized InsightsEmpowering Customers with Personalized Insights
Empowering Customers with Personalized Insights
Cloudera, Inc.
 
The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
The Benefits of Predictive and Proactive Support for an Enterprise Data HubThe Benefits of Predictive and Proactive Support for an Enterprise Data Hub
The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
Cloudera, Inc.
 
Data Migrations powered by GalenETL
Data Migrations powered by GalenETLData Migrations powered by GalenETL
Data Migrations powered by GalenETL
Julie Champagne
 
Quelles nouveautés avec la version 6.5 de Splunk Enterprise
Quelles nouveautés avec la version 6.5 de Splunk EnterpriseQuelles nouveautés avec la version 6.5 de Splunk Enterprise
Quelles nouveautés avec la version 6.5 de Splunk Enterprise
Splunk
 
Big Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with HadoopBig Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with HadoopDataWorks Summit
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
ExtraHop Networks
 
ODSC data science to DataOps
ODSC data science to DataOpsODSC data science to DataOps
ODSC data science to DataOps
Christopher Bergh
 
Tableau - Make your SEO data work for you!
Tableau - Make your SEO data work for you!Tableau - Make your SEO data work for you!
Tableau - Make your SEO data work for you!
Renco Smeding
 
Should You Invest In DataOps Services?
Should You Invest In DataOps Services?Should You Invest In DataOps Services?
Should You Invest In DataOps Services?
Enov8
 
Florida's Natural Growers BI case study
Florida's Natural Growers BI case studyFlorida's Natural Growers BI case study
Florida's Natural Growers BI case study
Dimensional Insight
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
Sense Corp
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
Khaled Mosharraf
 
Architectural Health Check for Postgres
Architectural Health Check for PostgresArchitectural Health Check for Postgres
Architectural Health Check for Postgres
EDB
 
2015-11-13 Data for Administrative Professionals
2015-11-13  Data for Administrative Professionals2015-11-13  Data for Administrative Professionals
2015-11-13 Data for Administrative ProfessionalsTara E. Browne, DTM
 
925 plenary rexer_using our laptop
925 plenary rexer_using our laptop925 plenary rexer_using our laptop
925 plenary rexer_using our laptop
Rising Media, Inc.
 
Taking Splunk to the Next Level - Management Breakout Session
Taking Splunk to the Next Level - Management Breakout SessionTaking Splunk to the Next Level - Management Breakout Session
Taking Splunk to the Next Level - Management Breakout Session
Splunk
 
Why Data Science Projects Fail?
Why Data Science Projects Fail?Why Data Science Projects Fail?
Why Data Science Projects Fail?
Ethan Ram
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
Denodo
 
Everything you wanted to know about data ops
Everything you wanted to know about data opsEverything you wanted to know about data ops
Everything you wanted to know about data ops
Enov8
 

What's hot (19)

Empowering Customers with Personalized Insights
Empowering Customers with Personalized InsightsEmpowering Customers with Personalized Insights
Empowering Customers with Personalized Insights
 
The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
The Benefits of Predictive and Proactive Support for an Enterprise Data HubThe Benefits of Predictive and Proactive Support for an Enterprise Data Hub
The Benefits of Predictive and Proactive Support for an Enterprise Data Hub
 
Data Migrations powered by GalenETL
Data Migrations powered by GalenETLData Migrations powered by GalenETL
Data Migrations powered by GalenETL
 
Quelles nouveautés avec la version 6.5 de Splunk Enterprise
Quelles nouveautés avec la version 6.5 de Splunk EnterpriseQuelles nouveautés avec la version 6.5 de Splunk Enterprise
Quelles nouveautés avec la version 6.5 de Splunk Enterprise
 
Big Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with HadoopBig Data Business Wins: Real-time Inventory Tracking with Hadoop
Big Data Business Wins: Real-time Inventory Tracking with Hadoop
 
How to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT OperationsHow to Use Big Data to Transform IT Operations
How to Use Big Data to Transform IT Operations
 
ODSC data science to DataOps
ODSC data science to DataOpsODSC data science to DataOps
ODSC data science to DataOps
 
Tableau - Make your SEO data work for you!
Tableau - Make your SEO data work for you!Tableau - Make your SEO data work for you!
Tableau - Make your SEO data work for you!
 
Should You Invest In DataOps Services?
Should You Invest In DataOps Services?Should You Invest In DataOps Services?
Should You Invest In DataOps Services?
 
Florida's Natural Growers BI case study
Florida's Natural Growers BI case studyFlorida's Natural Growers BI case study
Florida's Natural Growers BI case study
 
Why Data Science Projects Fail
Why Data Science Projects FailWhy Data Science Projects Fail
Why Data Science Projects Fail
 
Data quality management Basic
Data quality management BasicData quality management Basic
Data quality management Basic
 
Architectural Health Check for Postgres
Architectural Health Check for PostgresArchitectural Health Check for Postgres
Architectural Health Check for Postgres
 
2015-11-13 Data for Administrative Professionals
2015-11-13  Data for Administrative Professionals2015-11-13  Data for Administrative Professionals
2015-11-13 Data for Administrative Professionals
 
925 plenary rexer_using our laptop
925 plenary rexer_using our laptop925 plenary rexer_using our laptop
925 plenary rexer_using our laptop
 
Taking Splunk to the Next Level - Management Breakout Session
Taking Splunk to the Next Level - Management Breakout SessionTaking Splunk to the Next Level - Management Breakout Session
Taking Splunk to the Next Level - Management Breakout Session
 
Why Data Science Projects Fail?
Why Data Science Projects Fail?Why Data Science Projects Fail?
Why Data Science Projects Fail?
 
Data Virtualization Modernizes Biobanking
Data Virtualization Modernizes BiobankingData Virtualization Modernizes Biobanking
Data Virtualization Modernizes Biobanking
 
Everything you wanted to know about data ops
Everything you wanted to know about data opsEverything you wanted to know about data ops
Everything you wanted to know about data ops
 

Viewers also liked

OBIEE 11.1.1.7: Upgrade y Nuevas Características
OBIEE 11.1.1.7: Upgrade y Nuevas CaracterísticasOBIEE 11.1.1.7: Upgrade y Nuevas Características
OBIEE 11.1.1.7: Upgrade y Nuevas Características
Edelweiss Kammermann
 
Incredible ODI tips to work with Hyperion tools that you ever wanted to know
Incredible ODI tips to work with Hyperion tools that you ever wanted to knowIncredible ODI tips to work with Hyperion tools that you ever wanted to know
Incredible ODI tips to work with Hyperion tools that you ever wanted to know
Rodrigo Radtke de Souza
 
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business AnalyticsOracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Mark Rittman
 
Tableau Best Practices for OBIEE
Tableau Best Practices for OBIEETableau Best Practices for OBIEE
Tableau Best Practices for OBIEE
BI Connector
 
How to solve complex business requirements with Oracle Data Integrator?
How to solve complex business requirements with Oracle Data Integrator?How to solve complex business requirements with Oracle Data Integrator?
How to solve complex business requirements with Oracle Data Integrator?
Gurcan Orhan
 
Logical DB Design (OOP)
Logical DB Design (OOP)Logical DB Design (OOP)
Logical DB Design (OOP)
Traitet Thepbandansuk
 
A microservice approach for legacy modernisation
A microservice approach for legacy modernisationA microservice approach for legacy modernisation
A microservice approach for legacy modernisation
luisw19
 
Empowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Empowering Business Users: OBIEE 12c Visual Analyzer and Data MashupEmpowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Empowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Edelweiss Kammermann
 
OUG Ireland Meet-up - Updates from Oracle Open World 2016
OUG Ireland Meet-up - Updates from Oracle Open World 2016OUG Ireland Meet-up - Updates from Oracle Open World 2016
OUG Ireland Meet-up - Updates from Oracle Open World 2016
Brendan Tierney
 
Ranges, ranges everywhere (Oracle SQL)
Ranges, ranges everywhere (Oracle SQL)Ranges, ranges everywhere (Oracle SQL)
Ranges, ranges everywhere (Oracle SQL)
Stew Ashton
 
OUG Ireland Meet-up 12th January
OUG Ireland Meet-up 12th JanuaryOUG Ireland Meet-up 12th January
OUG Ireland Meet-up 12th January
Brendan Tierney
 
How to read a data model
How to read a data modelHow to read a data model
How to read a data model
sanksh
 
Row Pattern Matching in Oracle Database 12c
Row Pattern Matching in Oracle Database 12cRow Pattern Matching in Oracle Database 12c
Row Pattern Matching in Oracle Database 12c
Stew Ashton
 
Data modelling 101
Data modelling 101Data modelling 101
Data modelling 101
Christopher Bradley
 
Data Modeling PPT
Data Modeling PPTData Modeling PPT
Data Modeling PPT
Trinath
 
SQL: The one language to rule all your data
SQL: The one language to rule all your dataSQL: The one language to rule all your data
SQL: The one language to rule all your data
Brendan Tierney
 

Viewers also liked (20)

OBIEE 11.1.1.7: Upgrade y Nuevas Características
OBIEE 11.1.1.7: Upgrade y Nuevas CaracterísticasOBIEE 11.1.1.7: Upgrade y Nuevas Características
OBIEE 11.1.1.7: Upgrade y Nuevas Características
 
Data modeling
Data modelingData modeling
Data modeling
 
Incredible ODI tips to work with Hyperion tools that you ever wanted to know
Incredible ODI tips to work with Hyperion tools that you ever wanted to knowIncredible ODI tips to work with Hyperion tools that you ever wanted to know
Incredible ODI tips to work with Hyperion tools that you ever wanted to know
 
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business AnalyticsOracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
Oracle BI Hybrid BI : Mode 1 + Mode 2, Cloud + On-Premise Business Analytics
 
Tableau Best Practices for OBIEE
Tableau Best Practices for OBIEETableau Best Practices for OBIEE
Tableau Best Practices for OBIEE
 
How to solve complex business requirements with Oracle Data Integrator?
How to solve complex business requirements with Oracle Data Integrator?How to solve complex business requirements with Oracle Data Integrator?
How to solve complex business requirements with Oracle Data Integrator?
 
Logical DB Design (OOP)
Logical DB Design (OOP)Logical DB Design (OOP)
Logical DB Design (OOP)
 
A microservice approach for legacy modernisation
A microservice approach for legacy modernisationA microservice approach for legacy modernisation
A microservice approach for legacy modernisation
 
Empowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Empowering Business Users: OBIEE 12c Visual Analyzer and Data MashupEmpowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
Empowering Business Users: OBIEE 12c Visual Analyzer and Data Mashup
 
OUG Ireland Meet-up - Updates from Oracle Open World 2016
OUG Ireland Meet-up - Updates from Oracle Open World 2016OUG Ireland Meet-up - Updates from Oracle Open World 2016
OUG Ireland Meet-up - Updates from Oracle Open World 2016
 
Ranges, ranges everywhere (Oracle SQL)
Ranges, ranges everywhere (Oracle SQL)Ranges, ranges everywhere (Oracle SQL)
Ranges, ranges everywhere (Oracle SQL)
 
OUG Ireland Meet-up 12th January
OUG Ireland Meet-up 12th JanuaryOUG Ireland Meet-up 12th January
OUG Ireland Meet-up 12th January
 
Domain model
Domain modelDomain model
Domain model
 
How to read a data model
How to read a data modelHow to read a data model
How to read a data model
 
Row Pattern Matching in Oracle Database 12c
Row Pattern Matching in Oracle Database 12cRow Pattern Matching in Oracle Database 12c
Row Pattern Matching in Oracle Database 12c
 
Data modelling 101
Data modelling 101Data modelling 101
Data modelling 101
 
Data models
Data modelsData models
Data models
 
Dbms models
Dbms modelsDbms models
Dbms models
 
Data Modeling PPT
Data Modeling PPTData Modeling PPT
Data Modeling PPT
 
SQL: The one language to rule all your data
SQL: The one language to rule all your dataSQL: The one language to rule all your data
SQL: The one language to rule all your data
 

Similar to But how do I GET the data? Transparency Camp 2014

data_blending
data_blendingdata_blending
data_blendingsubit1615
 
DataOps , cbuswaw April '23
DataOps , cbuswaw April '23DataOps , cbuswaw April '23
DataOps , cbuswaw April '23
Jason Packer
 
How Can You Implement DataOps In Your Existing Workflow?
How Can You Implement DataOps In Your Existing Workflow?How Can You Implement DataOps In Your Existing Workflow?
How Can You Implement DataOps In Your Existing Workflow?
Enov8
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
GautamPopli1
 
Starting Your Modern DataOps Journey
Starting Your Modern DataOps JourneyStarting Your Modern DataOps Journey
Starting Your Modern DataOps Journey
CloverDX
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation Slides
SlideTeam
 
Latest trends in Business Analytics
Latest trends in Business AnalyticsLatest trends in Business Analytics
Latest trends in Business Analytics
Puneet Bhalla
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
Ryan Gross
 
Modernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your DataModernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your Data
Precisely
 
Toad Business Intelligence Suite
Toad Business Intelligence Suite Toad Business Intelligence Suite
Toad Business Intelligence Suite
pound78
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
itnewsafrica
 
MTX Portland Office 365 Strategic Capabilities Sep2017
MTX Portland Office 365 Strategic Capabilities Sep2017MTX Portland Office 365 Strategic Capabilities Sep2017
MTX Portland Office 365 Strategic Capabilities Sep2017
Owen Allen
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo
 
TB8568_8568_Presentation
TB8568_8568_PresentationTB8568_8568_Presentation
TB8568_8568_PresentationRonnie Falgout
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
Mark Constable
 
Big Data Analytics Architecture Powerpoint Presentation Slides
Big Data Analytics Architecture Powerpoint Presentation SlidesBig Data Analytics Architecture Powerpoint Presentation Slides
Big Data Analytics Architecture Powerpoint Presentation Slides
SlideTeam
 
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Splunk
 
Intro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent SoftwareIntro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent Software
rafeq
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?
Xpand IT
 

Similar to But how do I GET the data? Transparency Camp 2014 (20)

data_blending
data_blendingdata_blending
data_blending
 
DataOps , cbuswaw April '23
DataOps , cbuswaw April '23DataOps , cbuswaw April '23
DataOps , cbuswaw April '23
 
How Can You Implement DataOps In Your Existing Workflow?
How Can You Implement DataOps In Your Existing Workflow?How Can You Implement DataOps In Your Existing Workflow?
How Can You Implement DataOps In Your Existing Workflow?
 
Breed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptxBreed data scientists_ A Presentation.pptx
Breed data scientists_ A Presentation.pptx
 
Starting Your Modern DataOps Journey
Starting Your Modern DataOps JourneyStarting Your Modern DataOps Journey
Starting Your Modern DataOps Journey
 
Big Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation SlidesBig Data Tools PowerPoint Presentation Slides
Big Data Tools PowerPoint Presentation Slides
 
Latest trends in Business Analytics
Latest trends in Business AnalyticsLatest trends in Business Analytics
Latest trends in Business Analytics
 
Big Data in Azure
Big Data in AzureBig Data in Azure
Big Data in Azure
 
Data summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data opsData summit connect fall 2020 - rise of data ops
Data summit connect fall 2020 - rise of data ops
 
Modernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your DataModernize your Infrastructure and Mobilize Your Data
Modernize your Infrastructure and Mobilize Your Data
 
Toad Business Intelligence Suite
Toad Business Intelligence Suite Toad Business Intelligence Suite
Toad Business Intelligence Suite
 
Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
MTX Portland Office 365 Strategic Capabilities Sep2017
MTX Portland Office 365 Strategic Capabilities Sep2017MTX Portland Office 365 Strategic Capabilities Sep2017
MTX Portland Office 365 Strategic Capabilities Sep2017
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
 
TB8568_8568_Presentation
TB8568_8568_PresentationTB8568_8568_Presentation
TB8568_8568_Presentation
 
Advanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project DeliveryAdvanced Project Data Analytics for Improved Project Delivery
Advanced Project Data Analytics for Improved Project Delivery
 
Big Data Analytics Architecture Powerpoint Presentation Slides
Big Data Analytics Architecture Powerpoint Presentation SlidesBig Data Analytics Architecture Powerpoint Presentation Slides
Big Data Analytics Architecture Powerpoint Presentation Slides
 
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
Data-Driven DevOps: Mining Machine Data for 'Metrics that Matter' in a DevOps...
 
Intro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent SoftwareIntro of Key Features of Soft CAAT Ent Software
Intro of Key Features of Soft CAAT Ent Software
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?
 

Recently uploaded

Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
Cheryl Hung
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
g2nightmarescribd
 

Recently uploaded (20)

Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Generating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using SmithyGenerating a custom Ruby SDK for your web service or Rails API using Smithy
Generating a custom Ruby SDK for your web service or Rails API using Smithy
 

But how do I GET the data? Transparency Camp 2014

  • 1. 1 But how do I GET the data? Transparency Camp 2014
  • 2. Shooju is a Web-Based Data Platform 2 • Consolidate your internal and external data sources • Make all data searchable from one place • Provide continuous updating • Seamlessly integrate with tools and applications • Share data across your entire organization • Save time and energy while reducing errors and problems with version control Shooju saves time, improves data quality and enhances data sharing across your entire organization
  • 3. The Analytical Process 3 Data Data Data Data Data Data Data Data Data Data Data Data
  • 4. The Analytical Process 4 Data Data Data Data Data Data Data Data Data Data Data Data some place
  • 5. The Analytical Process 5 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice
  • 6. The Analytical Process 6 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice your product
  • 7. The Analytical Process 7 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice your product The Fun Part 
  • 8. The Analytical Process 8 Data Data Data Data Data Data Data Data Data Data Data Data some place your tool of choice your product The Not Fun Part 
  • 9. Big data vs. small data 9
  • 10. A boring 2 x 2 10
  • 11. The harsh 80/20 reality 11 Most organizations spend more time collecting, cleaning, downloading, managing and wrangling data than they do conducting analysis
  • 12. Three ways to get data • API – Good – Bad • Scraping • Manual 12 Defined as ETL (Extract, Transform, Load) process
  • 14. 14 Average cost curve of data collection Manual Collection AverageCost Number of times data is collected
  • 15. 15 Average cost curve of data collection Manual Collection AverageCost Number of times data is collected Scraping
  • 16. 16 Average cost curve of data collection Manual Collection AverageCost Number of times data is collected Scraping API
  • 17. How do you get your data? What do you like? What don’t you like? 17
  • 18. Once the data is scraped, where can it go? • CSV • XLS • DBF • SQL • NoSQL • Many others 18
  • 19. Where does your data go when you collect it? 19
  • 21. Shooju Value Added Cost Savings By saving analyst time and energy, Shooju allows analysts to do more with less, reducing data management costs and putting more focus on high-value analysis. Added Quality Automating data processes internally will ensure that your data is accurate, up-to-date and consistent across your entire organization. Enhanced Decision Making Having more accurate data available faster with more analyst time left for analysis leads to enhanced decision making. 21 Cost Savings Added Quality Enhanced Decision Making Shooju Value Added
  • 22. 22 Shooju Sources Excel Add-In & Other Tools Custom BI Apps Web Search Auto- Import Drivers # of analysts retrieving time saved in retrieval # of sources frequency of retrieval # of analysts refreshing time saved in tool refresh # of sources frequency of refresh time to integrate data analysts contributing data # of tools created analyst upload time # of analysts searching time saved in search # of sources frequency of search 5 analysts 65 min / source 22 sources 18 times / year 11 analysts 74 min / source 22 sources 14 times / year 9 min / source 22 sources 32 times / year $97k (14%) $73k (10%) $248k (35%) $702kTotal: Cost Savings 13 analysts 14 wk of dev. saved 8 analysts contributing 2 apps created $284k (41%) 40 min 10 times / year Sample Cost Savings Cost Savings Added Quality Enhanced Decision MakingShooju Value Added * Based on real 40-person organization. Assumed annual wages vary between $30k and $140k. $410k savings equivalent to 10% of HR spend* Shooju speeds up custom BI application development by making all data natively accessible and continuously updated in any BI tool or custom app. USD (%)
  • 23. Added Quality: The Three “Cs” 23 Cost Savings Added QualityShooju Value Added Consistency Shooju ensures that all analysts are using the same data across all their tools and applications. By allowing analysts to upload their own data to the platform, internal data as well as external data now flows seamlessly - without messy spreadsheet links. Currency By automatically pulling in the latest source data through the Shooju importer layer, Shooju ensures that all of your spreadsheets and models are populated with the latest data. Our native plugins for Excel, Access and all your other tools allow data to flow through directly without any need for the analyst to download or copy and paste. Correctness The more data is touched by human hands, the more prone it is to errors. By streamlining workflows and automating work processes, Shooju eliminates most of these errors, saving time and ensuring that the data you rely on is more accurate. Enhanced Decision Making
  • 24. We support any data source 24 Ask us about non-mainstream data sources that traditional data providers don’t carry.
  • 26. Shooju vs. Custom Data Warehouse Custom Data Warehouse Shooju Design Custom “Plug-and-play” Cost 7+ digits 5-6 digits Rollout timeline Months / Years Hours Scalability Minimal Infinite Flexibility Low High Maintenance High Low Stakeholders IT controlled Analyst run / IT maintained Tool and app support Clunky, requiring IT Native tool support 26 Data warehouse projects are costly, time consuming and result in inflexible systems with low adoption rates
  • 27. Shooju vs. Off-the-shelf Data Management* Off-the-shelf Data Management* Shooju Service focus Data provision/management Process improvement Prepackaged data feeds Many None Custom data feeds None (not natively supported) Included(all feeds are custom) Internal data integration Weeks (high consulting fees) Days (included in service) Process flexibility Low High Analyst learning curve Weeks Hours Ease of migrating off Very difficult/impossible Easy Annual fee 6-7 digits 5-6 digits 27 Data management* solutions focus on generic data provision rather than process improvement and limit analysts to a closed and inflexible data ecosystem. * Top-ranked providers in the EnergyRisk Data Management category include: Morningstar, ZE Power Group, SunGard, Allegro, Pioneer Solutions, SAS, and InteractiveData. See http://www.slideshare.net/Allegrodev/energy-risk-magazines-etrm-software-rankings-2013

Editor's Notes

  1. ----- Meeting Notes (5/30/12 21:35) ----- hey there
  2. ----- Meeting Notes (5/30/12 21:35) ----- hey there