SlideShare a Scribd company logo
1 of 17
Download to read offline
“You cant manage
what you don’t
measure”
JUST AS IT’S BEING USED IN OTHER BUSINESSES, BIG DATA IS UTILIZED TO
MAKE BASEBALL TEAMS BETTER AND ORGANIZATIONS MORE PROFITABLE
How is Baseball using Big Data
Analytics?
https://www.youtube.com/watch?v=yGf6LNWY9AI
Sabermetrics=“Moneyball” is essentially the term
(stemming from the book written by Michael Lewis)
for using advanced statistics to determine how to
build a team, what strategies to use, and more.
“Our ability to know what’s going to happen, when
it’s going to happen, how much cash we’re going to
generate on the revenue side, allows us to plan
accordingly. That’s a tremendous value proposition
to ownership and executives.”
What is there to analyze?
 162 games per team per season (not including pre-
season, playoffs or minor league games) 162*30 mlb
teams=4860 regular season games.
 There are 85 common player stats measured (BA, OBP,
HR… ERA, W, L…) and then each of those stats is
broken down against teams, lefty vs righty, day vs night,
home vs away….
 All of these stats are kept in data banks for every single
pitch thrown, …700,000+ pitches in 2014... Also keep
data on fans, food, drinks, promotions…
 First 135 years of Baseball are a combined 2GB of data
Today each game has almost 1TB of data collected.
That’s a 10 Million fold increase in data collected. And
its predicted that upwards of 7TB per game will be
collected.
What do you do with the data?
 Win at an unfair game!
 2002 Oakland A’s
 103-59
 2014-15 Astros
 Shifts
 Predict injuries
 3D Snapshots 10-15 gigs of data
 $1.4 Billion in knee injuries for MLB in 2014
 https://www.youtube.com/watch?v=8avavYawsA8
Question??
 There are 4 types of analytics that companies can use
to aid their business:
1. Prescriptive- Takes data and reveals what actions
should be taken
2. Predictive- Takes data and gives an analysis of likely
scenarios of what might happen
3. Diagnostic- A look at past performance to determine
what happened and why
4. Descriptive- What is happening now based on
incoming data
 MLB uses all 4. What type of analytics does your
company use that helps it gain a competitive
advantage?
Tools
 Statcast by MLBAM
 Uses Amazon Web Services
 Captures on field data
 Quickly analyzes and codes
 Pitch Rx
 Tracks every pitch
 Uses Camera Triangulation
 Field Fx
 Records all field plays using camera feeds and
object-recognition software
 BaseRuns Estimator
 Estimates the number of runs a team should
score given their offensive statistics and the
number of runs a hitter or pitcher creates or
allows
Source: Whitman School of Business, Syracuse University
Data from the Player Tracking System (Statcast) overlaid on video
of the Panik-Hosmer play. The red section on the right shows that if
Hosmer had maintained his speed instead of diving to the bag,
he would have been safe by about a foot.
Tools in use
Cloud
EC2
Compute
Pwr behind
solution
Amazon S3
Storage
7Tb per
Game
Amazon
Elastic
Cache
Temp
Memory,
Fast
Retrieval
AWS Lambda
 Used for Raw Data
Manipulation to Create
“On the Fly” Metrics
 Creates More Insight in
to Plays
Amazon DynamoDB
 Allows for powerful
queries.
 Supports fast retrieval
of information
Dedicated
Connection
Discussion Question
Given the emergence of complex
technological tools, how can companies
with smaller budgets stay competitive with
companies that have deep pockets?
3 Key Differences in
Data
• Volume- More data
across the internet
every second vs what
was stored on the
entire internet 20 years
ago
• Velocity-Real time
data i.e. cell phone
location data
• Variety-Large amounts
of data being created
on every topic of
business
Data Types
Structured Data
 G = games • Number of games a
player participated in (out of 162
games in a season)
 AB = at bats • Number of times a
batter was hitting and either got a
hit or got out (does not include
walks or reaching base on an
error)
 R = runs • Number of runs the
player scored
 H = hit • Number of times a player
hit the ball or got on base or hit a
home run (sum of 1B, 2B, 3B, HR)
Unstructured Data
 Social Media updates tied to
baseball games/players
 Video
 Photos
 Open ended surveys
What are some examples of how
Unstructured Data is used in your
company?
Online Reviews
Facebook “Likes”
http://www.marketingpilgrim.com/2013/08
/infographic-major-league-baseballs-top-
social-media-performers.html
Structured data and unstructured data can be
combined to gain insights into new categories.
Big Data Acquisition:
 Data harvest from meticulous record-keeping
 (on-base percentage, batting average, slugging/fastball percentages, RBIs, stolen bases, etc…)
 Employ analytics experts: utilize their skillset to build team, field, and manage players
 Expand use: ticketing, promotions, fan-team relationships, concessions and products
 Milwaukee Brewers analyze each email received by teams to better understand fans
 Analyze who the occasional attendees are and how to get them to buy tickets more often
 Boston Red Sox developing concessions heat-map (geo-locating proximity fans to hotdog stands)
 Tracks type, quantity, frequency, and locations of concession purchases
 2014 App “IdealSeat” allows fans to choose seats based upon likelihood of catching foul balls
 Adjust and re-target focus of data sets (player field positions, t-shirt prices) as needed
 Q: What other venues or industries could benefit by a similar depth of big-data analysis?
Big Data Governance: Organization
 Effective governance is equal parts: organization and security
 Historic Organization (Waterfall Model): Garbage-In / Garbage-
Out
 Integration of data as it arrives into repository for use
 Indiscriminate harvest; lack of profiling/prioritizing data lengthens
time to organize/use
 Without organizing, data mismatches can damage customer
relations
 (i.e. coupons for women’s shoes sent to male customers)
 Understanding data before to employment is key
 Effective Governance: beyond scrubbing and deletion, focus
on ensuring accuracy
 Identify custodians (who's accountable for data consistency,
accuracy, and archival)
 Develop criteria policies (standards and procedures for use,
purpose, and by who?)
 Enact policy controls and audit (enforcement of policies and
accountability for custodians)
Big Data Governance: Security
 Security Issues:
 Financial and Reputational
 Too much data with too many vulnerabilities can be
catastrophic
 2015 Breach at Office of Personnel Management:
 Personal Records, PII (names, addresses,
etc…), Security Clearance details of 21M
citizens
 5.6M sets of fingerprints stolen
 2014 Breach at Home Depot:
 46M credit cards hacked
 Big Data poses Big Risks:
 Big gains can be realized IF security risks are properly
mitigated AND the data harvest is properly organized
Conclusion
 Big data is utilized to make teams better and organizations
more profitable
 4 Types of Analytics
1. Prescriptive
2. Predictive
3. Diagnostic
4. Descriptive
 Many tools available to analyze data
 Statcast by MLBAM, Pitch Rx, Field Fx
 Data Types
 Structured & Unstructured
 Effective Governance can ensure accuracy
Questions

More Related Content

What's hot

Understanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business IntelligenceUnderstanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business IntelligenceSeta Wicaksana
 
Big data Presentation
Big data PresentationBig data Presentation
Big data PresentationAswadmehar
 
Data analytics in banking sector
Data analytics in banking sectorData analytics in banking sector
Data analytics in banking sectorSnigdhaGupta23
 
Types of Data Validation
Types of Data ValidationTypes of Data Validation
Types of Data ValidationMetric Fox
 
Unit i big data introduction
Unit  i big data introductionUnit  i big data introduction
Unit i big data introductionSujaMaryD
 
Big data analytics in sports industry
Big data analytics in sports industryBig data analytics in sports industry
Big data analytics in sports industryPromptCloud
 
Data Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceData Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceDATAVERSITY
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industryskewdlogix
 
Data Management PowerPoint Presentation Slides
Data Management PowerPoint Presentation Slides Data Management PowerPoint Presentation Slides
Data Management PowerPoint Presentation Slides SlideTeam
 
Uses of analytics in the field of Banking
Uses of analytics in the field of BankingUses of analytics in the field of Banking
Uses of analytics in the field of BankingNiveditasri N
 
Recommender systems
Recommender systemsRecommender systems
Recommender systemsTamer Rezk
 
Data Warehouse Design Project
Data Warehouse Design ProjectData Warehouse Design Project
Data Warehouse Design ProjectPradeep Yamala
 
Data Science Use cases in Banking
Data Science Use cases in BankingData Science Use cases in Banking
Data Science Use cases in BankingArul Bharathi
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftAmazon Web Services
 
Lead scoring case study presentation
Lead scoring case study presentationLead scoring case study presentation
Lead scoring case study presentationMithul Murugaadev
 
BI Consultancy - Data, Analytics and Strategy
BI Consultancy - Data, Analytics and StrategyBI Consultancy - Data, Analytics and Strategy
BI Consultancy - Data, Analytics and StrategyShivam Dhawan
 

What's hot (20)

TELECOMMUNICATION (2).pptx
TELECOMMUNICATION (2).pptxTELECOMMUNICATION (2).pptx
TELECOMMUNICATION (2).pptx
 
Understanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business IntelligenceUnderstanding big data and data analytics-Business Intelligence
Understanding big data and data analytics-Business Intelligence
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Data analytics in banking sector
Data analytics in banking sectorData analytics in banking sector
Data analytics in banking sector
 
Types of Data Validation
Types of Data ValidationTypes of Data Validation
Types of Data Validation
 
Unit i big data introduction
Unit  i big data introductionUnit  i big data introduction
Unit i big data introduction
 
Big data analytics in sports industry
Big data analytics in sports industryBig data analytics in sports industry
Big data analytics in sports industry
 
Data Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-ServiceData Catalogues - Architecting for Collaboration & Self-Service
Data Catalogues - Architecting for Collaboration & Self-Service
 
Churn in the Telecommunications Industry
Churn in the Telecommunications IndustryChurn in the Telecommunications Industry
Churn in the Telecommunications Industry
 
Strategy For Data Quality
Strategy For Data QualityStrategy For Data Quality
Strategy For Data Quality
 
Data Management PowerPoint Presentation Slides
Data Management PowerPoint Presentation Slides Data Management PowerPoint Presentation Slides
Data Management PowerPoint Presentation Slides
 
Uses of analytics in the field of Banking
Uses of analytics in the field of BankingUses of analytics in the field of Banking
Uses of analytics in the field of Banking
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
Data analytics
Data analyticsData analytics
Data analytics
 
Data Warehouse Design Project
Data Warehouse Design ProjectData Warehouse Design Project
Data Warehouse Design Project
 
Data Science Use cases in Banking
Data Science Use cases in BankingData Science Use cases in Banking
Data Science Use cases in Banking
 
Big Data Trends
Big Data TrendsBig Data Trends
Big Data Trends
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
 
Lead scoring case study presentation
Lead scoring case study presentationLead scoring case study presentation
Lead scoring case study presentation
 
BI Consultancy - Data, Analytics and Strategy
BI Consultancy - Data, Analytics and StrategyBI Consultancy - Data, Analytics and Strategy
BI Consultancy - Data, Analytics and Strategy
 

Similar to Big data and MLB

Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...Shrikant Mandlik
 
D A T A &A N A L Y T I C SThomas H. DavenportWhat B.docx
D A T A  &A N A L Y T I C SThomas H. DavenportWhat B.docxD A T A  &A N A L Y T I C SThomas H. DavenportWhat B.docx
D A T A &A N A L Y T I C SThomas H. DavenportWhat B.docxwhittemorelucilla
 
Data warehouse Soccer Project
Data warehouse Soccer Project Data warehouse Soccer Project
Data warehouse Soccer Project Sagar Singh
 
Big Data BizViz Sports Analytics
Big Data BizViz Sports AnalyticsBig Data BizViz Sports Analytics
Big Data BizViz Sports AnalyticsBig Data BizViz LLC
 
Sports and Big data
Sports and Big dataSports and Big data
Sports and Big dataDeZyre
 
Identifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining ApproachIdentifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining ApproachJoelDabady
 
Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...
Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...
Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...AnalyticsConf
 
Technology and open knowledge in sports statistics
Technology and open knowledge in sports statisticsTechnology and open knowledge in sports statistics
Technology and open knowledge in sports statisticsdwiederman
 
Senior Project Research Paper
Senior Project Research PaperSenior Project Research Paper
Senior Project Research Papercrissy498
 
IPL auction q1_q2.docx
IPL auction q1_q2.docxIPL auction q1_q2.docx
IPL auction q1_q2.docxAlivaMishra4
 
Webinar - How to Create an Effective Compensation Data Strategy
Webinar - How to Create an Effective Compensation Data StrategyWebinar - How to Create an Effective Compensation Data Strategy
Webinar - How to Create an Effective Compensation Data StrategyPayScale, Inc.
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Simplilearn
 
Managing web analytics
Managing web analyticsManaging web analytics
Managing web analyticsMashMetrics
 
7 Data Analytics Dashboards for Small Business
7 Data Analytics Dashboards for Small Business7 Data Analytics Dashboards for Small Business
7 Data Analytics Dashboards for Small BusinessTrent Meyer
 

Similar to Big data and MLB (20)

Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
Sports Analytics: Market Shares, Strategy, and Forecasts, Worldwide, 2015 to ...
 
D A T A &A N A L Y T I C SThomas H. DavenportWhat B.docx
D A T A  &A N A L Y T I C SThomas H. DavenportWhat B.docxD A T A  &A N A L Y T I C SThomas H. DavenportWhat B.docx
D A T A &A N A L Y T I C SThomas H. DavenportWhat B.docx
 
Data warehouse Soccer Project
Data warehouse Soccer Project Data warehouse Soccer Project
Data warehouse Soccer Project
 
Big Data BizViz Sports Analytics
Big Data BizViz Sports AnalyticsBig Data BizViz Sports Analytics
Big Data BizViz Sports Analytics
 
Sports and Big data
Sports and Big dataSports and Big data
Sports and Big data
 
srproj_anthon_w16
srproj_anthon_w16srproj_anthon_w16
srproj_anthon_w16
 
Identifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining ApproachIdentifying Key Factors in Winning MLB Games Using a Data-Mining Approach
Identifying Key Factors in Winning MLB Games Using a Data-Mining Approach
 
Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...
Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...
Alex Kornilov: Building Big Data Company in Sports-Betting Industry - BETEGY ...
 
Technology and open knowledge in sports statistics
Technology and open knowledge in sports statisticsTechnology and open knowledge in sports statistics
Technology and open knowledge in sports statistics
 
Senior Project Research Paper
Senior Project Research PaperSenior Project Research Paper
Senior Project Research Paper
 
Sexiest of the Sexiest Job Profile: Sports Analyst
Sexiest of the Sexiest Job Profile: Sports AnalystSexiest of the Sexiest Job Profile: Sports Analyst
Sexiest of the Sexiest Job Profile: Sports Analyst
 
IPL auction q1_q2.docx
IPL auction q1_q2.docxIPL auction q1_q2.docx
IPL auction q1_q2.docx
 
Intro.pptx
Intro.pptxIntro.pptx
Intro.pptx
 
The Future of Analytics
The Future of AnalyticsThe Future of Analytics
The Future of Analytics
 
Webinar - How to Create an Effective Compensation Data Strategy
Webinar - How to Create an Effective Compensation Data StrategyWebinar - How to Create an Effective Compensation Data Strategy
Webinar - How to Create an Effective Compensation Data Strategy
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
Managing web analytics
Managing web analyticsManaging web analytics
Managing web analytics
 
Big Data for Big Sports
Big Data for Big SportsBig Data for Big Sports
Big Data for Big Sports
 
7 Data Analytics Dashboards for Small Business
7 Data Analytics Dashboards for Small Business7 Data Analytics Dashboards for Small Business
7 Data Analytics Dashboards for Small Business
 
Big data
Big dataBig data
Big data
 

Recently uploaded

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAbdelrhman abooda
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubaihf8803863
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 

Recently uploaded (20)

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptxAmazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
Amazon TQM (2) Amazon TQM (2)Amazon TQM (2).pptx
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls DubaiDubai Call Girls Wifey O52&786472 Call Girls Dubai
Dubai Call Girls Wifey O52&786472 Call Girls Dubai
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 

Big data and MLB

  • 1. “You cant manage what you don’t measure” JUST AS IT’S BEING USED IN OTHER BUSINESSES, BIG DATA IS UTILIZED TO MAKE BASEBALL TEAMS BETTER AND ORGANIZATIONS MORE PROFITABLE
  • 2. How is Baseball using Big Data Analytics? https://www.youtube.com/watch?v=yGf6LNWY9AI Sabermetrics=“Moneyball” is essentially the term (stemming from the book written by Michael Lewis) for using advanced statistics to determine how to build a team, what strategies to use, and more. “Our ability to know what’s going to happen, when it’s going to happen, how much cash we’re going to generate on the revenue side, allows us to plan accordingly. That’s a tremendous value proposition to ownership and executives.”
  • 3. What is there to analyze?  162 games per team per season (not including pre- season, playoffs or minor league games) 162*30 mlb teams=4860 regular season games.  There are 85 common player stats measured (BA, OBP, HR… ERA, W, L…) and then each of those stats is broken down against teams, lefty vs righty, day vs night, home vs away….  All of these stats are kept in data banks for every single pitch thrown, …700,000+ pitches in 2014... Also keep data on fans, food, drinks, promotions…  First 135 years of Baseball are a combined 2GB of data Today each game has almost 1TB of data collected. That’s a 10 Million fold increase in data collected. And its predicted that upwards of 7TB per game will be collected.
  • 4. What do you do with the data?  Win at an unfair game!  2002 Oakland A’s  103-59  2014-15 Astros  Shifts  Predict injuries  3D Snapshots 10-15 gigs of data  $1.4 Billion in knee injuries for MLB in 2014  https://www.youtube.com/watch?v=8avavYawsA8
  • 5. Question??  There are 4 types of analytics that companies can use to aid their business: 1. Prescriptive- Takes data and reveals what actions should be taken 2. Predictive- Takes data and gives an analysis of likely scenarios of what might happen 3. Diagnostic- A look at past performance to determine what happened and why 4. Descriptive- What is happening now based on incoming data  MLB uses all 4. What type of analytics does your company use that helps it gain a competitive advantage?
  • 6. Tools  Statcast by MLBAM  Uses Amazon Web Services  Captures on field data  Quickly analyzes and codes  Pitch Rx  Tracks every pitch  Uses Camera Triangulation  Field Fx  Records all field plays using camera feeds and object-recognition software  BaseRuns Estimator  Estimates the number of runs a team should score given their offensive statistics and the number of runs a hitter or pitcher creates or allows Source: Whitman School of Business, Syracuse University Data from the Player Tracking System (Statcast) overlaid on video of the Panik-Hosmer play. The red section on the right shows that if Hosmer had maintained his speed instead of diving to the bag, he would have been safe by about a foot.
  • 7. Tools in use Cloud EC2 Compute Pwr behind solution Amazon S3 Storage 7Tb per Game Amazon Elastic Cache Temp Memory, Fast Retrieval AWS Lambda  Used for Raw Data Manipulation to Create “On the Fly” Metrics  Creates More Insight in to Plays Amazon DynamoDB  Allows for powerful queries.  Supports fast retrieval of information Dedicated Connection
  • 8. Discussion Question Given the emergence of complex technological tools, how can companies with smaller budgets stay competitive with companies that have deep pockets?
  • 9. 3 Key Differences in Data • Volume- More data across the internet every second vs what was stored on the entire internet 20 years ago • Velocity-Real time data i.e. cell phone location data • Variety-Large amounts of data being created on every topic of business
  • 10. Data Types Structured Data  G = games • Number of games a player participated in (out of 162 games in a season)  AB = at bats • Number of times a batter was hitting and either got a hit or got out (does not include walks or reaching base on an error)  R = runs • Number of runs the player scored  H = hit • Number of times a player hit the ball or got on base or hit a home run (sum of 1B, 2B, 3B, HR) Unstructured Data  Social Media updates tied to baseball games/players  Video  Photos  Open ended surveys
  • 11. What are some examples of how Unstructured Data is used in your company? Online Reviews Facebook “Likes”
  • 13. Big Data Acquisition:  Data harvest from meticulous record-keeping  (on-base percentage, batting average, slugging/fastball percentages, RBIs, stolen bases, etc…)  Employ analytics experts: utilize their skillset to build team, field, and manage players  Expand use: ticketing, promotions, fan-team relationships, concessions and products  Milwaukee Brewers analyze each email received by teams to better understand fans  Analyze who the occasional attendees are and how to get them to buy tickets more often  Boston Red Sox developing concessions heat-map (geo-locating proximity fans to hotdog stands)  Tracks type, quantity, frequency, and locations of concession purchases  2014 App “IdealSeat” allows fans to choose seats based upon likelihood of catching foul balls  Adjust and re-target focus of data sets (player field positions, t-shirt prices) as needed  Q: What other venues or industries could benefit by a similar depth of big-data analysis?
  • 14. Big Data Governance: Organization  Effective governance is equal parts: organization and security  Historic Organization (Waterfall Model): Garbage-In / Garbage- Out  Integration of data as it arrives into repository for use  Indiscriminate harvest; lack of profiling/prioritizing data lengthens time to organize/use  Without organizing, data mismatches can damage customer relations  (i.e. coupons for women’s shoes sent to male customers)  Understanding data before to employment is key  Effective Governance: beyond scrubbing and deletion, focus on ensuring accuracy  Identify custodians (who's accountable for data consistency, accuracy, and archival)  Develop criteria policies (standards and procedures for use, purpose, and by who?)  Enact policy controls and audit (enforcement of policies and accountability for custodians)
  • 15. Big Data Governance: Security  Security Issues:  Financial and Reputational  Too much data with too many vulnerabilities can be catastrophic  2015 Breach at Office of Personnel Management:  Personal Records, PII (names, addresses, etc…), Security Clearance details of 21M citizens  5.6M sets of fingerprints stolen  2014 Breach at Home Depot:  46M credit cards hacked  Big Data poses Big Risks:  Big gains can be realized IF security risks are properly mitigated AND the data harvest is properly organized
  • 16. Conclusion  Big data is utilized to make teams better and organizations more profitable  4 Types of Analytics 1. Prescriptive 2. Predictive 3. Diagnostic 4. Descriptive  Many tools available to analyze data  Statcast by MLBAM, Pitch Rx, Field Fx  Data Types  Structured & Unstructured  Effective Governance can ensure accuracy