SlideShare a Scribd company logo
1© 2017 Cloudera, Inc. All rights reserved.
Digital Government
Data + Government Isn’t Enough
Trey Causey
Senior Product Manager, Socrata
@treycausey
#wrangleconf | Powered by Cloudera
2© 2017 Cloudera, Inc. All rights reserved.
3© 2017 Cloudera, Inc. All rights reserved.
4© 2017 Cloudera, Inc. All rights reserved.
5© 2017 Cloudera, Inc. All rights reserved.
6© 2017 Cloudera, Inc. All rights reserved.
7© 2017 Cloudera, Inc. All rights reserved.
Adoption and compliance
8© 2017 Cloudera, Inc. All rights reserved.
9© 2017 Cloudera, Inc. All rights reserved.
10© 2017 Cloudera, Inc. All rights reserved.
FOIA requests down
50%.
Requests for officer-
involved shooting data to
zero.
11© 2017 Cloudera, Inc. All rights reserved.
Open data success stories
12© 2017 Cloudera, Inc. All rights reserved.
NYC
Transparency
13© 2017 Cloudera, Inc. All rights reserved.
14© 2017 Cloudera, Inc. All rights reserved.
15© 2017 Cloudera, Inc. All rights reserved.
New Orleans
Safety & efficiency
16© 2017 Cloudera, Inc. All rights reserved.
“Nothing we did required big data or fancy machines or big tech investments. We are a city of less
than 400,000 and we are strained for resources. If we can do it, anyone can do it.”
- Oliver Wise, Director, Office of Performance & Analytics
17© 2017 Cloudera, Inc. All rights reserved.
18© 2017 Cloudera, Inc. All rights reserved.
Jackson, MS
Transparency &
accountability
19© 2017 Cloudera, Inc. All rights reserved.
2014 2016
20© 2017 Cloudera, Inc. All rights reserved.
21© 2017 Cloudera, Inc. All rights reserved.
22© 2017 Cloudera, Inc. All rights reserved.
Data isn’t enough
23© 2017 Cloudera, Inc. All rights reserved.
Compliance != compelling.
24© 2017 Cloudera, Inc. All rights reserved.
Data-informed decision-
making needs (open) data.
25© 2017 Cloudera, Inc. All rights reserved.
Data science in
government is a whole
different ball game.
26© 2017 Cloudera, Inc. All rights reserved.
All eyes are on you.
27© 2017 Cloudera, Inc. All rights reserved.
28© 2017 Cloudera, Inc. All rights reserved.
Data is locked up in legacy
systems.
29© 2017 Cloudera, Inc. All rights reserved.
Tooling is complicated and
data is (very) messy.
30© 2017 Cloudera, Inc. All rights reserved.
Your audience is everyone.
31© 2017 Cloudera, Inc. All rights reserved.
data
+ mission
+ strategy
= successful open data
Thank you!
@treycausey

More Related Content

What's hot

Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
DataStax
 
The Journey to Success with Big Data
The Journey to Success with Big DataThe Journey to Success with Big Data
The Journey to Success with Big Data
Cloudera, Inc.
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent Offers
Cloudera, Inc.
 
Data Science in Enterprise
Data Science in EnterpriseData Science in Enterprise
Data Science in Enterprise
Josh Yeh
 
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest MindsCase study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Happiest Minds Technologies
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
Cloudera, Inc.
 
Rethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubRethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data Hub
Cloudera, Inc.
 
Multi Cloud Data Integration- Retail
Multi Cloud Data Integration- RetailMulti Cloud Data Integration- Retail
Multi Cloud Data Integration- Retail
alanwaler
 
IoT-Enabled Predictive Maintenance
IoT-Enabled Predictive MaintenanceIoT-Enabled Predictive Maintenance
IoT-Enabled Predictive Maintenance
Cloudera, Inc.
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud RealitiesWebinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
DataStax
 
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
DataStax
 
How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18
Cloudera, Inc.
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid CloudHow to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
DataStax
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Cloudera, Inc.
 
Protecting health and life science organizations from breaches and ransomware
Protecting health and life science organizations from breaches and ransomwareProtecting health and life science organizations from breaches and ransomware
Protecting health and life science organizations from breaches and ransomware
Cloudera, Inc.
 
Building Data Science Teams: A Moneyball Approach
Building Data Science Teams: A Moneyball ApproachBuilding Data Science Teams: A Moneyball Approach
Building Data Science Teams: A Moneyball Approach
joshwills
 
Multi Cloud Data Integration- Manufacturing Industry
Multi Cloud Data Integration- Manufacturing IndustryMulti Cloud Data Integration- Manufacturing Industry
Multi Cloud Data Integration- Manufacturing Industry
alanwaler
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoopDr. Wilfred Lin (Ph.D.)
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
DataStax
 
Building Data Science Teams
Building Data Science TeamsBuilding Data Science Teams
Building Data Science Teams
EMC
 

What's hot (20)

Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
Webinar - Case Study: ProtectWise enhances network security with DataStax alw...
 
The Journey to Success with Big Data
The Journey to Success with Big DataThe Journey to Success with Big Data
The Journey to Success with Big Data
 
The Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent OffersThe Big Picture: Real-time Data is Defining Intelligent Offers
The Big Picture: Real-time Data is Defining Intelligent Offers
 
Data Science in Enterprise
Data Science in EnterpriseData Science in Enterprise
Data Science in Enterprise
 
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest MindsCase study: Hadoop as ELT for Leading US Retailer - Happiest Minds
Case study: Hadoop as ELT for Leading US Retailer - Happiest Minds
 
Optimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big DataOptimizing Regulatory Compliance with Big Data
Optimizing Regulatory Compliance with Big Data
 
Rethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data HubRethink Analytics with an Enterprise Data Hub
Rethink Analytics with an Enterprise Data Hub
 
Multi Cloud Data Integration- Retail
Multi Cloud Data Integration- RetailMulti Cloud Data Integration- Retail
Multi Cloud Data Integration- Retail
 
IoT-Enabled Predictive Maintenance
IoT-Enabled Predictive MaintenanceIoT-Enabled Predictive Maintenance
IoT-Enabled Predictive Maintenance
 
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud RealitiesWebinar  |  Aligning GDPR Requirements with Today's Hybrid Cloud Realities
Webinar | Aligning GDPR Requirements with Today's Hybrid Cloud Realities
 
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
How Virtual Reality and Machine Learning Are Powering the New Age of Network ...
 
How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18
 
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid CloudHow to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
How to Power Innovation with Geo-Distributed Data Management in Hybrid Cloud
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Protecting health and life science organizations from breaches and ransomware
Protecting health and life science organizations from breaches and ransomwareProtecting health and life science organizations from breaches and ransomware
Protecting health and life science organizations from breaches and ransomware
 
Building Data Science Teams: A Moneyball Approach
Building Data Science Teams: A Moneyball ApproachBuilding Data Science Teams: A Moneyball Approach
Building Data Science Teams: A Moneyball Approach
 
Multi Cloud Data Integration- Manufacturing Industry
Multi Cloud Data Integration- Manufacturing IndustryMulti Cloud Data Integration- Manufacturing Industry
Multi Cloud Data Integration- Manufacturing Industry
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
Webinar: How Active Everywhere Database Architecture Accelerates Hybrid Cloud...
 
Building Data Science Teams
Building Data Science TeamsBuilding Data Science Teams
Building Data Science Teams
 

Similar to Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017

Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!
DATAVERSITY
 
Garantice la continuidad de su negocio Damian Prieto
Garantice la continuidad de su negocio   Damian PrietoGarantice la continuidad de su negocio   Damian Prieto
Garantice la continuidad de su negocio Damian Prieto
Cristian Garcia G.
 
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and GovernanceGeek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
IDERA Software
 
BDNA joins Flexera
BDNA joins FlexeraBDNA joins Flexera
BDNA joins Flexera
Flexera
 
Denodo DataFest 2017: Company Leadership from Data Leadership
Denodo DataFest 2017: Company Leadership from Data LeadershipDenodo DataFest 2017: Company Leadership from Data Leadership
Denodo DataFest 2017: Company Leadership from Data Leadership
Denodo
 
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
A Data Privacy & Security Year in Review: Top 10 Trends and PredictionsA Data Privacy & Security Year in Review: Top 10 Trends and Predictions
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
Delphix
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customers
Nick Smith
 
Data Analytics in Cyber Security
Data Analytics in Cyber SecurityData Analytics in Cyber Security
Data Analytics in Cyber Security
DNIF
 
Data Analytics in Cyber Security
Data Analytics in Cyber Security Data Analytics in Cyber Security
Data Analytics in Cyber Security
Siddhant Mishra
 
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
Alert Logic
 
Observability in DevOps with Modern Distributed Applications
Observability in DevOps with Modern Distributed ApplicationsObservability in DevOps with Modern Distributed Applications
Observability in DevOps with Modern Distributed Applications
Dave Mangot
 
Keynote: Art of the Possible - Moore
Keynote: Art of the Possible - MooreKeynote: Art of the Possible - Moore
Keynote: Art of the Possible - Moore
Neo4j
 
From Rogue One to Rebel Alliance: Building Developers into Security Champions
From Rogue One to Rebel Alliance: Building Developers into Security Champions From Rogue One to Rebel Alliance: Building Developers into Security Champions
From Rogue One to Rebel Alliance: Building Developers into Security Champions
Digital Transformation EXPO Event Series
 
Session 2.6 semantic data governance for regulatory compliance
Session 2.6   semantic data governance for regulatory complianceSession 2.6   semantic data governance for regulatory compliance
Session 2.6 semantic data governance for regulatory compliance
semanticsconference
 
Creating an EDGE - Enterprise Data Governance Experience
Creating an EDGE - Enterprise Data Governance ExperienceCreating an EDGE - Enterprise Data Governance Experience
Creating an EDGE - Enterprise Data Governance Experience
DATAVERSITY
 
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
TrustArc
 
graymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdfgraymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdf
GrayMatter Software Services
 
Veritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Solution Day 2017, France, keynote by Mike PalmerVeritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Technologies LLC
 
Why Your Approach To Data Governance Needs a Major Update
Why Your Approach To Data Governance Needs a Major UpdateWhy Your Approach To Data Governance Needs a Major Update
Why Your Approach To Data Governance Needs a Major Update
Delphix
 
5 Steps to Prepare for Digital Transformation & Real-Time Analytics
5 Steps to Prepare for Digital Transformation & Real-Time Analytics 5 Steps to Prepare for Digital Transformation & Real-Time Analytics
5 Steps to Prepare for Digital Transformation & Real-Time Analytics
DATUM LLC
 

Similar to Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017 (20)

Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!Getting Started with Data Governance? Use Process Models!
Getting Started with Data Governance? Use Process Models!
 
Garantice la continuidad de su negocio Damian Prieto
Garantice la continuidad de su negocio   Damian PrietoGarantice la continuidad de su negocio   Damian Prieto
Garantice la continuidad de su negocio Damian Prieto
 
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and GovernanceGeek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
Geek Sync | Tackling Key GDPR Challenges with Data Modeling and Governance
 
BDNA joins Flexera
BDNA joins FlexeraBDNA joins Flexera
BDNA joins Flexera
 
Denodo DataFest 2017: Company Leadership from Data Leadership
Denodo DataFest 2017: Company Leadership from Data LeadershipDenodo DataFest 2017: Company Leadership from Data Leadership
Denodo DataFest 2017: Company Leadership from Data Leadership
 
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
A Data Privacy & Security Year in Review: Top 10 Trends and PredictionsA Data Privacy & Security Year in Review: Top 10 Trends and Predictions
A Data Privacy & Security Year in Review: Top 10 Trends and Predictions
 
Data & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customersData & Analytic Innovations: 5 lessons from our customers
Data & Analytic Innovations: 5 lessons from our customers
 
Data Analytics in Cyber Security
Data Analytics in Cyber SecurityData Analytics in Cyber Security
Data Analytics in Cyber Security
 
Data Analytics in Cyber Security
Data Analytics in Cyber Security Data Analytics in Cyber Security
Data Analytics in Cyber Security
 
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
10 Step Guide to Cloud Security - 10th Magnitude - CSS Dallas Azure
 
Observability in DevOps with Modern Distributed Applications
Observability in DevOps with Modern Distributed ApplicationsObservability in DevOps with Modern Distributed Applications
Observability in DevOps with Modern Distributed Applications
 
Keynote: Art of the Possible - Moore
Keynote: Art of the Possible - MooreKeynote: Art of the Possible - Moore
Keynote: Art of the Possible - Moore
 
From Rogue One to Rebel Alliance: Building Developers into Security Champions
From Rogue One to Rebel Alliance: Building Developers into Security Champions From Rogue One to Rebel Alliance: Building Developers into Security Champions
From Rogue One to Rebel Alliance: Building Developers into Security Champions
 
Session 2.6 semantic data governance for regulatory compliance
Session 2.6   semantic data governance for regulatory complianceSession 2.6   semantic data governance for regulatory compliance
Session 2.6 semantic data governance for regulatory compliance
 
Creating an EDGE - Enterprise Data Governance Experience
Creating an EDGE - Enterprise Data Governance ExperienceCreating an EDGE - Enterprise Data Governance Experience
Creating an EDGE - Enterprise Data Governance Experience
 
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
Benchmarking Your GDPR Compliance: Will You Make the Grade? [TrustArc Webinar...
 
graymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdfgraymatter-pentaho-consulting-services-.pdf
graymatter-pentaho-consulting-services-.pdf
 
Veritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Solution Day 2017, France, keynote by Mike PalmerVeritas Solution Day 2017, France, keynote by Mike Palmer
Veritas Solution Day 2017, France, keynote by Mike Palmer
 
Why Your Approach To Data Governance Needs a Major Update
Why Your Approach To Data Governance Needs a Major UpdateWhy Your Approach To Data Governance Needs a Major Update
Why Your Approach To Data Governance Needs a Major Update
 
5 Steps to Prepare for Digital Transformation & Real-Time Analytics
5 Steps to Prepare for Digital Transformation & Real-Time Analytics 5 Steps to Prepare for Digital Transformation & Real-Time Analytics
5 Steps to Prepare for Digital Transformation & Real-Time Analytics
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
Cloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
Cloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
Cloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
Cloudera, Inc.
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 
Cloudera SDX
Cloudera SDXCloudera SDX
Cloudera SDX
 

Recently uploaded

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
Neo4j
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
GraphSummit Singapore | Graphing Success: Revolutionising Organisational Stru...
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 

Digital Government: Data + Government Isn't Enough | Wrangle Conference 2017

Editor's Notes

  1. We're SaaS business providing cloud-based solutions for data-driven government. Data-as-a-service platform and cloud applications for government agencies  We make government data discoverable, reusable, and actionable I’m a social scientist turned data scientist turned product manager, so I think a lot about:  how humans generate data and how that data gets encoded, and then how that encoded data gets turned into models. Here I am at our annual employee summit – I was in the middle of talking to Dave Doyle, the City of Seattle open data program manager, who had just given closing remarks, and didn’t realize a group photo was forming. Or, as a coworker said to me after seeing this photo “man you must really love that laptop" I own several of our backend services –- all of the way that our customers ingress data onto our platform – as well as a chunk of our machine learning infrastructure.
  2. What do we build to enable open data? Open data portals – - web interfacet to a cloud-based data-as-a-service platform that lets agencies provide open data to their communities with built-in APIs, search, etc. making the data discoverable and resuable.  - this is probably what most people think of when they think of open data - lots of cities, counties, and states, some NGOs and federal agencies as well
  3. Open data powers applications as well that help agencies plan and communicate with stakeholders: - here’s the City of Seattle’s open budget  - Lets anyone explore the budget in nitty gritty detail, interactively
  4. Becoming more popular – performance programs - lets agencies provide transparency and accountability for the goals they've set for programs and initiatives - think of it as a public dashboard with KPIs that anyone can check in on - Budgets and performance dashboards are changes in the consumption experience for the raw data that open data portals host - represent the overall maturation of open data, as we move from data for data's sake to solving specific problems with open data and putting open data in the path of government work, rather than a destination where data lives
  5. - Even Steve Ballmer is getting in on the action Side project to help Americans understand the flow of money in government No LA Clippers salary cap explorer – but he’ll have 21 million or so freed up without Chris Paul in the upcoming season. However, USA facts is having the same problems with engagement that I'll talk about later in this talk – at the national governor's association last week, he said after a big flurry of publicity, they're only getting about 4000 visitors a day. Just putting data up for people to consume often doesn't produce a ton of engagement
  6. What’s the state of open data in 2017?
  7. - Broke the rules of giving a talk by discussing open data so far without really talking about what it is and what we mean by it Open data as both an idea and a practice really picked up steam during the Obama Administration – but the concept has been around for a while. So what do we mean? Data can be open in a couple of ways –  machine readable, available programmatically – this means data in widely accepted formats like CSV, JSON, XML, not locked up in a PDF or stored in physical copies that require a FOIA to get at It also means APIs that allow programmatic retrieval, and to enable developers to build applications with the data – if you've used a non-city-created transit application to find out when your bus/train is coming, you've benefited from this kind of open data. permissively licensed for reuse This is obviously a stickier issue – as licensing always is – and differs from agency to agency, but there is a generally agreed upon idea that open data should be available for reuse by anyone that wants to use it (at the very least non-commercially) but often this means for commercial reuse as well. Some examples of data reuse in applications include restaurant inspections in Yelp reviews or data used by Zillow for estimating housing value. 
  8. Sunlight Foundation, a non-profit dedicated to making governments accountable and transparent, has published guidelines for open data, many of which have been adopted by government agencies in creating their open data polices. Over the past 10 years or so, we've seen a number of municipalities create policies or legislation mandating that any data that can be made open be made open. The implementation and wording differs from agency to agency, but a growing number. Open Data Policies dot org – branched off from sunlight -- lists 101 current open data policies at the city, county, and state levels Federal level: DATA Act: Digital Accountability and Transparency Act (2014) – how does the government spend its money?
  9. This widespread adoption has produced real results for many agencies. The city of Chicago saw their freedom of information act requests drop by 50% when they launched their open data portal. Chicago is a real leader in open data – constantly pushing us – and open sourcing a lot of their work for other governments to use. They recently relaunched their open data portal (data.cityofchicago.org) to great success. The Dallas PD saw requests for data on officer-involved shootings drop to zero (dallasopendata.com) when they started releasing that data on their open data portal. Fulfilling requests for records and freedom of information act requests are resource-intensive. They’re usually time-bound and failure to comply carries a penalty.
  10. Beyond time saving and request-fulfilling efficiency, Where are we with open data? What are some of the success stories? Hopefully the data scientists in the crowd will allow me a bit of selection on the dependent variable for a moment.
  11. NYC has one of the most active open data programs in the country, thanks to a combination of a very talented staff and a legislated primary goal for all city agencies to share open data. The Department of Information Technology and Telecommunications (best acronym -- DoITT) and MODA (the Mayor’s Office of Data Analytics) manage a tremendously complicated program with great success – acting as central hubs for many of the city's agencies and their data. NYC also has stringent retention policies, so a lot of moving pieces to manage. A favorite dataset of mine is the NYC tree census of all trees in NYC – whenever we're testing out geospatial features, we usually use that dataset as one of our testers. 
  12. - Probably the most common thing many people think of when they think of Open Data – citizens serving as independent watchdogs, finding inefficiencies, injustices, and just plain mistakes. “Transparency” Ben Wellington is a quantitative analyst (who runs the popular blog I Quant NY) at Two Sigma – which if you’ve been to a recent PyData or SciPy conference will be familiar did this analysis using NYC open data.  Something worth over $33,000 in this picture – and it’s not the Mercedes.   Using NYC Open Data, found 84 tickets over a 4.5-month period -- $33,000 a year in fines, not including towing fees. One block over, another hydrant generating 24,000 a year – over 55,000 a year on two blocks
  13. Open data for the same reason you crowdsource things Can’t think of all possible questions to ask, instead, rely on motivated individuals to ask them Of course, this cost the city 55,000 a year! This has a bit of a bizarre twist, as there’s some confusion over whether these cars were parked legally or not. Using Google Maps – there is a protected bike lane between the cars and the fire hydrant.
  14. New Orleans is a real leader in performance management – a city with a number of well-known and lesser-known problems that really embraced tying its decision-making to open data.
  15. - NOLA combined data from the American Housing Survey, American Community Survey (both from the US Census) and FD admin data Office of Performance & Analytics identified homes most at-risk for fire Distributed 8,000 smoke detectors, reducing fire deaths & injuries A family of 11 later that year escaped a house fire after a smoke alarm went off at 3am – that had been installed as part of this program Also increased operational efficiency of FD by modeling where fires are most likely to occur
  16. Blight is a well-known problem in New Orleans, which has experienced natural disasters and social crises over the past century – homes left abandoned, boarded up in a city that is always at risk of being reclaimed by the earth.  Blight is problematic for a number of reasons – crime, pest infestation, public safety, and that lot is blocking new development / revenue collection / depressing home values elsewhere.  - Blight affected up to ¼ of all residential addresses in NOLA after Katrina Formed BlightSTAT – cross-department task force to reduce blight using data Decreased blight by 30% especially impressive when benchmarked against peer cities where abandoment rates are climbing cut response time from initial inspection to a hearing in half – over 3 months reduction
  17. - Jackson, MS – like many cities and like New Orleans, plagued by aging infrastructure - the kind of infrastructure that sees schoolbuses fall into sinkholes TWICE in three years – these are different buses on different streets. - more than half of the city's bridges in dire need of repairs Last year, one of these bridges targeted for repairs totally collapsed due to flooding – after closing, thankfully, but an example of the critical state of affairs. Estimates to repair Jackson's infrastructure range from 750 million to 1 billion dollars – that's 5-6 times the annual revenue of the city all told. City has one lever it can turn to generate revenue --  1 percent sales tax voted in, with revenue reserved for capital projects
  18. Jackson knew that the only way to keep the public's trust was to be open and transparent about how this revenue was being used and how it was tracking to meet its goals This slide, which you saw earlier, is actually part of Jackson's performance program – called JackStats Of particular interest given the bus-sinkhole problem is Operation Orange Cone By using 311data to more efficiently dispatch repair crews and identify problem areas, Filled over 69,000 potholes in 2 years, a 60% increase in pothole-filling compared to before Operation Orange Cone.  Some of these complaints dated to 2010! AND also decreased 311 calls
  19. Now residents can track the progress of Operation Orange Cone in a number of places and see how the project is doing on time and on budget, as well as getting up-to-date information about what streets are planned for resurfacing. Operation Orange Cone was supposed to be a two-week pilot program, but has run continuously for 2 years now.
  20. From the agency’s perspective, you need a strategy for success. Open data programs are most successful when they see high engagement with constituents and residents. Engagement is key – the mayor/governor wants to see that these open data programs are providing value. One way they do that is via constituent engagement. Checking the open data box doesn’t drive eyeballs. Need to figure out what problem you're solving and who you're solving it for. (Selfishly, I might say that open data programs need a good product manager but that’s probably another talk). Just releasing open data for civic hackers only helps a small portion of the population. When Jackson, MS needed to fix its roads to keep buses from falling into sinkholes, they made the data part of the planning process, the decisions that were made, and the reporting on the actions that resulted from those decisions. Putting a CSV on an open data portal doesn’t drive engagement, and is at worst a good way to have stale, out-of-date data that no one trusts or relies on. Each of the previous success stories began with some problem to be solved or goal to be accomplished – that’s what makes open data compelling. Data is only part of the story – what happens with the data is the magic of open data.
  21. Data-informed decision-making (credit to Greg Reda at PyData Seattle) needs data. For that data to be effective, that data needs to be up to date and authoritative. Government information workers need to be able to trust that data and integrate it into their workflows. “Open data portals” are a red herring – it shouldn’t be where old CSVs go to retire – it should be where government workers look for data they need to do their job.
  22. For every data scientist with a phd working at the NSA or the Census, there’s an analyst working at the department of public works in a city that has to work on 20 projects at once. They’re working with limited resources, limited time, and a public that often doesn’t care a whole lot about how busy they are. Public service isn’t just a catchphrase – there’s real service here.
  23. Your work is out there for the public to see. When your audience is everyone, you have to show your work. This means opening your data and explaining your models.  Sometimes your work is going to be featured in the local news, and there will often be press releases about it.  On the other hand, your work is going to be out there for everyone to see! I bet a lot of you don't get to talk about your work.
  24. Over 30,000 emails published for anyone to read. Released every Friday. This used to be a FOIA request that the local news outlets would make every week, now they just release them automatically to save them the trouble. Complex workflow that is half-automated, half-manual (can't release PII, constituent information, etc.).
  25. Government moves slowly – and upgrade cycles are no exception. Getting data from one agency to another is a real challenge. Talking to an analyst at a major west coast city – people drive around different lots in the city to verify permitting status, then literally fill out forms by hand, which are then delivered to a central office for data entry at a later date. By the time they’re digitized, who knows what has changed? Budgets run by department, but problems span departments – there’s no “department of homelessness”, but the problem requires action across agencies. Each agency has its own budget, and may use a different database or ERP system to track their data. It’s not as simple as a JOIN.
  26. You can’t just use that new library you read about on Hacker News into production – IT departments vary wildly in their permissiveness, and changes to infrastructure are extremely difficult. CIOs or CDOs may make software decisions for entire departments. You’re probably not going to be using Spark to build a predictive machine learning model that runs on a cluster and scales to millions of predictions a minute. Some agencies allow users to submit data via forms – and it’s messy. Different languages, profanity, and then some. I was speaking to an open data program manager from a city in the south and he was telling me that when they were cleaning up this data, retention laws meant that they had to go in and indicate where data had been changed from its original form. You think data scientists in the private sector complain about how hard it is to clean their data…
  27. Accessibility is a compliance matter – data you release and websites you host it on have to be accessible to people with disabilities, and there’s a shifting landscape of standards. You have no idea who will be accessing your data – it’s open! – and you can’t assume that they’ll be subject matter experts. Metadata, documentation are key – but of course they are time consuming. On the other hand – your audience is everyone! You get to work on things that affect all sorts of people in very real ways. Your predictive models have real impact – like saving lives through fire detector placement.
  28. For every data scientist with a phd working at the NSA or the Census, there’s an analyst working at the department of public works in a city that has to work on 20 projects at once. They’re working with limited resources, limited time, and a public that often doesn’t care a whole lot about how busy they are. Public service isn’t just a catchphrase – there’s real service here.