SlideShare a Scribd company logo
1 of 16
Evangelizing data at media companies
Stan Dyro
Introduction
Who am I?
● Stan Dyro. - Lead Data Engineer at the Los Angeles Times
● Former Lead Engineer at VideoAmp
● Technologist, programmer
● Creative problem solver
Why you should care about data?
● Software is eating the world
● and.. “Data is the fuel”
● We’re hiring!
Technology Direction
● As technology leaders or individual contributors, we have a lot of decisions:
Leaders Engineers Product
Who to hire Who to connect with How you help your company
Structure of teams Which companies to work
for
How to make your company
better
What culture to set Which tools to learn How to help your teams
What’s important and what’s not
● Teams are important. Teams are important.
● People stick around for good bosses. Studies bear this out
● People value the little things.
What’s important and what’s not
The opposite of microaggressions are microprotections:
● Highly paid tech professionals will value contributions over money.
Examples
● Upward mobility
● Ability to learn
● Ability to be proud of what they do
For example:
● Our culture at the LA Times is to “inform, engage and empower.”
● Unless you’re FAANG or flush with VC cash, $400,000 dollar salaries are not an option,
so apply what makes you different
Measure all the things
So let’s talk about data.
Now that you’ve built or worked on helping your great team.
What can you measure about your business?
● Your Customer
● Your Web Traffic
● Your Finances
Structure of a data stack
● Presentation - usually visual tools BI tools, static visualizations
● Databases - Relational data, big data, caching
● Workflow - Workflow tools, programming languages, code repositories
● Storage - Scalable storage, cloud storage options
Why it’s easier than ever
It’s simple.
Open Source.
We stand on the backs of giants in our field. Hadoop. Spark.
Hive. Google Cloud, AWS. The infrastructure is at our
fingertips.
Easy to launch a software business for cheap. So focus on
what matters: your customers.
Data Tools
Databases
● Relational databases
● Data warehouses
● Data lake
Programming Languages
● SQL
● Python
● Ruby
● Javascript / node.js
● C#
● Java
● Scala
It’s easy to get overwhelmed
Don’t get distracted by the shiny things
Tools - Data Analytics
Business Intelligence
● Tableau Online
● Looker
● Power BI
Visualization
● Tableau
● R Studio
● Python libraries
● D3 / Javascript
Tools - Software
Desktop SQL Clients
● dbForge
● DataGrip
● DataRow (web based client)
● S3 / Cloud Storage Browser
● Transmit (Mac)
● S3 Browser
Tools - Data Engineering
ETL / Job Runners
● Informatica
● Segment
● Airflow
● Luigi
● Custom Solution
Automation / API Integration
● Zapier
● Mulesoft
● Apigee
Tools - Data Science
Notebooks / Collaboration
● Databricks
● Spark on EMR
Cloud Data Warehouses
● Redshift Spectrum
● Snowflake
Scalable Key/Value Databases
● Cassandra
● DynamoDB

More Related Content

Similar to Data Con LA 2019 - The challenges of data science for veteran media organizations by Stan Dyro

Analytics-Enabled Experiences: The New Secret Weapon
Analytics-Enabled Experiences: The New Secret WeaponAnalytics-Enabled Experiences: The New Secret Weapon
Analytics-Enabled Experiences: The New Secret Weapon
Databricks
 

Similar to Data Con LA 2019 - The challenges of data science for veteran media organizations by Stan Dyro (20)

The ABCs of Treating Data as Product
The ABCs of Treating Data as ProductThe ABCs of Treating Data as Product
The ABCs of Treating Data as Product
 
Agile methods and dw mha
Agile methods and dw mhaAgile methods and dw mha
Agile methods and dw mha
 
Running a small, high tech consulting firm - lessons learned
Running a small, high tech consulting firm - lessons learnedRunning a small, high tech consulting firm - lessons learned
Running a small, high tech consulting firm - lessons learned
 
Data and data scientists are not equal to money david hoyle
Data and data scientists are not equal to money   david hoyleData and data scientists are not equal to money   david hoyle
Data and data scientists are not equal to money david hoyle
 
Building successful data science teams
Building successful data science teamsBuilding successful data science teams
Building successful data science teams
 
Large drupal site builds a workshop for sxsw interactive - march 17, 2015
Large drupal site builds   a workshop for sxsw interactive - march 17, 2015Large drupal site builds   a workshop for sxsw interactive - march 17, 2015
Large drupal site builds a workshop for sxsw interactive - march 17, 2015
 
Jarod Sickler and Morley Tooke - DITA Support Portals: A One Stop Shop to Giv...
Jarod Sickler and Morley Tooke - DITA Support Portals: A One Stop Shop to Giv...Jarod Sickler and Morley Tooke - DITA Support Portals: A One Stop Shop to Giv...
Jarod Sickler and Morley Tooke - DITA Support Portals: A One Stop Shop to Giv...
 
Webinar | Good Guys vs. Bad Data: How to Be a Data Quality Hero
Webinar | Good Guys vs. Bad Data: How to Be a Data Quality HeroWebinar | Good Guys vs. Bad Data: How to Be a Data Quality Hero
Webinar | Good Guys vs. Bad Data: How to Be a Data Quality Hero
 
apidays New York 2023 - How to Make Your Docs Stand Apart, Ash Arnwine, Nylas
apidays New York 2023 - How to Make Your Docs Stand Apart, Ash Arnwine, Nylasapidays New York 2023 - How to Make Your Docs Stand Apart, Ash Arnwine, Nylas
apidays New York 2023 - How to Make Your Docs Stand Apart, Ash Arnwine, Nylas
 
Transition to a modern data platform
Transition to a modern data platform Transition to a modern data platform
Transition to a modern data platform
 
Architecting for analytics
Architecting for analyticsArchitecting for analytics
Architecting for analytics
 
AppDynamics User Group
AppDynamics User GroupAppDynamics User Group
AppDynamics User Group
 
Analytics-Enabled Experiences: The New Secret Weapon
Analytics-Enabled Experiences: The New Secret WeaponAnalytics-Enabled Experiences: The New Secret Weapon
Analytics-Enabled Experiences: The New Secret Weapon
 
Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
How To Run A Successful BI Project with Hadoop
How To Run A Successful BI Project with HadoopHow To Run A Successful BI Project with Hadoop
How To Run A Successful BI Project with Hadoop
 
Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1Data Science Environment with R on openSUSE Leap 15.1
Data Science Environment with R on openSUSE Leap 15.1
 
Tangenz big data
Tangenz big dataTangenz big data
Tangenz big data
 
Time to Fly - Why Predictive Analytics is Going Mainstream
Time to Fly - Why Predictive Analytics is Going MainstreamTime to Fly - Why Predictive Analytics is Going Mainstream
Time to Fly - Why Predictive Analytics is Going Mainstream
 
Infotachus Private Limited
Infotachus Private LimitedInfotachus Private Limited
Infotachus Private Limited
 
iXora Solution Ltd. Presentation
iXora Solution Ltd. PresentationiXora Solution Ltd. Presentation
iXora Solution Ltd. Presentation
 

More from Data Con LA

Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA
 

More from Data Con LA (20)

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup Showcase
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendations
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI Ethics
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learning
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWS
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Data Con LA 2019 - The challenges of data science for veteran media organizations by Stan Dyro

  • 1. Evangelizing data at media companies Stan Dyro
  • 2. Introduction Who am I? ● Stan Dyro. - Lead Data Engineer at the Los Angeles Times ● Former Lead Engineer at VideoAmp ● Technologist, programmer ● Creative problem solver Why you should care about data? ● Software is eating the world ● and.. “Data is the fuel” ● We’re hiring!
  • 3. Technology Direction ● As technology leaders or individual contributors, we have a lot of decisions: Leaders Engineers Product Who to hire Who to connect with How you help your company Structure of teams Which companies to work for How to make your company better What culture to set Which tools to learn How to help your teams
  • 4. What’s important and what’s not ● Teams are important. Teams are important. ● People stick around for good bosses. Studies bear this out ● People value the little things.
  • 5. What’s important and what’s not The opposite of microaggressions are microprotections: ● Highly paid tech professionals will value contributions over money. Examples ● Upward mobility ● Ability to learn ● Ability to be proud of what they do For example: ● Our culture at the LA Times is to “inform, engage and empower.” ● Unless you’re FAANG or flush with VC cash, $400,000 dollar salaries are not an option, so apply what makes you different
  • 6. Measure all the things So let’s talk about data. Now that you’ve built or worked on helping your great team. What can you measure about your business? ● Your Customer ● Your Web Traffic ● Your Finances
  • 7. Structure of a data stack ● Presentation - usually visual tools BI tools, static visualizations ● Databases - Relational data, big data, caching ● Workflow - Workflow tools, programming languages, code repositories ● Storage - Scalable storage, cloud storage options
  • 8. Why it’s easier than ever It’s simple. Open Source. We stand on the backs of giants in our field. Hadoop. Spark. Hive. Google Cloud, AWS. The infrastructure is at our fingertips. Easy to launch a software business for cheap. So focus on what matters: your customers.
  • 9. Data Tools Databases ● Relational databases ● Data warehouses ● Data lake Programming Languages ● SQL ● Python ● Ruby ● Javascript / node.js ● C# ● Java ● Scala
  • 10. It’s easy to get overwhelmed
  • 11.
  • 12. Don’t get distracted by the shiny things
  • 13. Tools - Data Analytics Business Intelligence ● Tableau Online ● Looker ● Power BI Visualization ● Tableau ● R Studio ● Python libraries ● D3 / Javascript
  • 14. Tools - Software Desktop SQL Clients ● dbForge ● DataGrip ● DataRow (web based client) ● S3 / Cloud Storage Browser ● Transmit (Mac) ● S3 Browser
  • 15. Tools - Data Engineering ETL / Job Runners ● Informatica ● Segment ● Airflow ● Luigi ● Custom Solution Automation / API Integration ● Zapier ● Mulesoft ● Apigee
  • 16. Tools - Data Science Notebooks / Collaboration ● Databricks ● Spark on EMR Cloud Data Warehouses ● Redshift Spectrum ● Snowflake Scalable Key/Value Databases ● Cassandra ● DynamoDB