Submit Search
Upload
Using R for Social Media and Sports Analytics
•
2 likes
•
1,852 views
Ajay Ohri
Follow
sports analytics using R.Bay R User Group Presentation by Sqor
Read less
Read more
Data & Analytics
Report
Share
Report
Share
1 of 9
Download now
Download to read offline
Recommended
Analyze this
Analyze this
Ajay Ohri
R and Data Science
R and Data Science
Revolution Analytics
Data Analytics with R and SQL Server
Data Analytics with R and SQL Server
Stéphane Fréchette
Introduction to Analytics with Azure Notebooks and Python
Introduction to Analytics with Azure Notebooks and Python
Jen Stirrup
Distributed Deep Learning + others for Spark Meetup
Distributed Deep Learning + others for Spark Meetup
Vijay Srinivas Agneeswaran, Ph.D
Building a scalable data science platform with R
Building a scalable data science platform with R
Revolution Analytics
Machine Learning with Spark
Machine Learning with Spark
elephantscale
Scalable Collaborative Filtering Recommendation Algorithms on Apache Spark
Scalable Collaborative Filtering Recommendation Algorithms on Apache Spark
Evan Casey
Recommended
Analyze this
Analyze this
Ajay Ohri
R and Data Science
R and Data Science
Revolution Analytics
Data Analytics with R and SQL Server
Data Analytics with R and SQL Server
Stéphane Fréchette
Introduction to Analytics with Azure Notebooks and Python
Introduction to Analytics with Azure Notebooks and Python
Jen Stirrup
Distributed Deep Learning + others for Spark Meetup
Distributed Deep Learning + others for Spark Meetup
Vijay Srinivas Agneeswaran, Ph.D
Building a scalable data science platform with R
Building a scalable data science platform with R
Revolution Analytics
Machine Learning with Spark
Machine Learning with Spark
elephantscale
Scalable Collaborative Filtering Recommendation Algorithms on Apache Spark
Scalable Collaborative Filtering Recommendation Algorithms on Apache Spark
Evan Casey
Data Science with Spark
Data Science with Spark
Krishna Sankar
Python for Data Science with Anaconda
Python for Data Science with Anaconda
Travis Oliphant
Distributed processing of large graphs in python
Distributed processing of large graphs in python
Jose Quesada (hiring)
Using the search engine as recommendation engine
Using the search engine as recommendation engine
Lars Marius Garshol
Agile data science with scala
Agile data science with scala
Andy Petrella
Big Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache Spark
Kenny Bastani
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Andy Petrella
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the Masses
Alice Zheng
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
Spark Summit
What is a distributed data science pipeline. how with apache spark and friends.
What is a distributed data science pipeline. how with apache spark and friends.
Andy Petrella
Gephi, Graphx, and Giraph
Gephi, Graphx, and Giraph
Doug Needham
Pandas UDF: Scalable Analysis with Python and PySpark
Pandas UDF: Scalable Analysis with Python and PySpark
Li Jin
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and Distributed
Turi, Inc.
Follow the money with graphs
Follow the money with graphs
Stanka Dalekova
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners
Jen Stirrup
EDHREC @ Data Science MD
EDHREC @ Data Science MD
Donald Miner
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
MLconf
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark
Turi, Inc.
Data Science at Scale: Using Apache Spark for Data Science at Bitly
Data Science at Scale: Using Apache Spark for Data Science at Bitly
Sarah Guido
The MADlib Analytics Library
The MADlib Analytics Library
EMC
SPSNYC2019 - What is Common Data Model and how to use it?
SPSNYC2019 - What is Common Data Model and how to use it?
Nicolas Georgeault
San Francisco Atlassian User Group - February 2014
San Francisco Atlassian User Group - February 2014
Nicholas Muldoon
More Related Content
What's hot
Data Science with Spark
Data Science with Spark
Krishna Sankar
Python for Data Science with Anaconda
Python for Data Science with Anaconda
Travis Oliphant
Distributed processing of large graphs in python
Distributed processing of large graphs in python
Jose Quesada (hiring)
Using the search engine as recommendation engine
Using the search engine as recommendation engine
Lars Marius Garshol
Agile data science with scala
Agile data science with scala
Andy Petrella
Big Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache Spark
Kenny Bastani
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
Andy Petrella
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the Masses
Alice Zheng
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
Spark Summit
What is a distributed data science pipeline. how with apache spark and friends.
What is a distributed data science pipeline. how with apache spark and friends.
Andy Petrella
Gephi, Graphx, and Giraph
Gephi, Graphx, and Giraph
Doug Needham
Pandas UDF: Scalable Analysis with Python and PySpark
Pandas UDF: Scalable Analysis with Python and PySpark
Li Jin
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and Distributed
Turi, Inc.
Follow the money with graphs
Follow the money with graphs
Stanka Dalekova
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners
Jen Stirrup
EDHREC @ Data Science MD
EDHREC @ Data Science MD
Donald Miner
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
MLconf
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark
Turi, Inc.
Data Science at Scale: Using Apache Spark for Data Science at Bitly
Data Science at Scale: Using Apache Spark for Data Science at Bitly
Sarah Guido
The MADlib Analytics Library
The MADlib Analytics Library
EMC
What's hot
(20)
Data Science with Spark
Data Science with Spark
Python for Data Science with Anaconda
Python for Data Science with Anaconda
Distributed processing of large graphs in python
Distributed processing of large graphs in python
Using the search engine as recommendation engine
Using the search engine as recommendation engine
Agile data science with scala
Agile data science with scala
Big Graph Analytics on Neo4j with Apache Spark
Big Graph Analytics on Neo4j with Apache Spark
Towards a rebirth of data science (by Data Fellas)
Towards a rebirth of data science (by Data Fellas)
The Challenges of Bringing Machine Learning to the Masses
The Challenges of Bringing Machine Learning to the Masses
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
What is a distributed data science pipeline. how with apache spark and friends.
What is a distributed data science pipeline. how with apache spark and friends.
Gephi, Graphx, and Giraph
Gephi, Graphx, and Giraph
Pandas UDF: Scalable Analysis with Python and PySpark
Pandas UDF: Scalable Analysis with Python and PySpark
Making Machine Learning Scale: Single Machine and Distributed
Making Machine Learning Scale: Single Machine and Distributed
Follow the money with graphs
Follow the money with graphs
CuRious about R in Power BI? End to end R in Power BI for beginners
CuRious about R in Power BI? End to end R in Power BI for beginners
EDHREC @ Data Science MD
EDHREC @ Data Science MD
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark
Data Science at Scale: Using Apache Spark for Data Science at Bitly
Data Science at Scale: Using Apache Spark for Data Science at Bitly
The MADlib Analytics Library
The MADlib Analytics Library
Similar to Using R for Social Media and Sports Analytics
SPSNYC2019 - What is Common Data Model and how to use it?
SPSNYC2019 - What is Common Data Model and how to use it?
Nicolas Georgeault
San Francisco Atlassian User Group - February 2014
San Francisco Atlassian User Group - February 2014
Nicholas Muldoon
Measuring Programmer Performance with SourceKibitzer EyeQ
Measuring Programmer Performance with SourceKibitzer EyeQ
Mark Kofman
How Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platformwith Databricks
Grega Kespret
RightScale Webinar: Get Top Performance for Your Games
RightScale Webinar: Get Top Performance for Your Games
RightScale
Snowplow: open source game analytics powered by AWS
Snowplow: open source game analytics powered by AWS
Giuseppe Gaviani
Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and Spotfire
Lou Bajuk
Software Development & Architecture @ LinkedIn
Software Development & Architecture @ LinkedIn
C4Media
Splunk bangalore user group 2020-06-01
Splunk bangalore user group 2020-06-01
NiketNilay
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
DeNA
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
MLconf
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Kent Graziano
Software Development Services
Software Development Services
Cassie Nguyen
Redefining the Role of IT in a Self-Help Data Integration Environment
Redefining the Role of IT in a Self-Help Data Integration Environment
UNIFI Software
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Data Con LA
Designing a pragmatic back-end service for mobile games
Designing a pragmatic back-end service for mobile games
iFunFactory Inc.
Yuvaraj
Yuvaraj
Yuvraj Kothandan
Thinking DevOps in the Era of the Cloud - Demi Ben-Ari
Thinking DevOps in the Era of the Cloud - Demi Ben-Ari
Demi Ben-Ari
RootandLeaves.pptx
RootandLeaves.pptx
Root and Leaves
Manage the Velocity of Change with Cloud Computing
Manage the Velocity of Change with Cloud Computing
Janine Sneed
Similar to Using R for Social Media and Sports Analytics
(20)
SPSNYC2019 - What is Common Data Model and how to use it?
SPSNYC2019 - What is Common Data Model and how to use it?
San Francisco Atlassian User Group - February 2014
San Francisco Atlassian User Group - February 2014
Measuring Programmer Performance with SourceKibitzer EyeQ
Measuring Programmer Performance with SourceKibitzer EyeQ
How Celtra Optimizes its Advertising Platformwith Databricks
How Celtra Optimizes its Advertising Platformwith Databricks
RightScale Webinar: Get Top Performance for Your Games
RightScale Webinar: Get Top Performance for Your Games
Snowplow: open source game analytics powered by AWS
Snowplow: open source game analytics powered by AWS
Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and Spotfire
Software Development & Architecture @ LinkedIn
Software Development & Architecture @ LinkedIn
Splunk bangalore user group 2020-06-01
Splunk bangalore user group 2020-06-01
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Can We Make Maps from Videos? ~From AI Algorithm to Engineering for Continuou...
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Venkatesh Ramanathan, Data Scientist, PayPal at MLconf ATL 2017
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Software Development Services
Software Development Services
Redefining the Role of IT in a Self-Help Data Integration Environment
Redefining the Role of IT in a Self-Help Data Integration Environment
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Big Data Day LA 2016/ Big Data Track - Apply R in Enterprise Applications, Lo...
Designing a pragmatic back-end service for mobile games
Designing a pragmatic back-end service for mobile games
Yuvaraj
Yuvaraj
Thinking DevOps in the Era of the Cloud - Demi Ben-Ari
Thinking DevOps in the Era of the Cloud - Demi Ben-Ari
RootandLeaves.pptx
RootandLeaves.pptx
Manage the Velocity of Change with Cloud Computing
Manage the Velocity of Change with Cloud Computing
More from Ajay Ohri
Introduction to R ajay Ohri
Introduction to R ajay Ohri
Ajay Ohri
Introduction to R
Introduction to R
Ajay Ohri
Social Media and Fake News in the 2016 Election
Social Media and Fake News in the 2016 Election
Ajay Ohri
Pyspark
Pyspark
Ajay Ohri
Download Python for R Users pdf for free
Download Python for R Users pdf for free
Ajay Ohri
Install spark on_windows10
Install spark on_windows10
Ajay Ohri
Ajay ohri Resume
Ajay ohri Resume
Ajay Ohri
Statistics for data scientists
Statistics for data scientists
Ajay Ohri
National seminar on emergence of internet of things (io t) trends and challe...
National seminar on emergence of internet of things (io t) trends and challe...
Ajay Ohri
Tools and techniques for data science
Tools and techniques for data science
Ajay Ohri
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
Ajay Ohri
Training in Analytics and Data Science
Training in Analytics and Data Science
Ajay Ohri
Tradecraft
Tradecraft
Ajay Ohri
Software Testing for Data Scientists
Software Testing for Data Scientists
Ajay Ohri
Craps
Craps
Ajay Ohri
A Data Science Tutorial in Python
A Data Science Tutorial in Python
Ajay Ohri
How does cryptography work? by Jeroen Ooms
How does cryptography work? by Jeroen Ooms
Ajay Ohri
Kush stats alpha
Kush stats alpha
Ajay Ohri
Summer school python in spanish
Summer school python in spanish
Ajay Ohri
Introduction to sas in spanish
Introduction to sas in spanish
Ajay Ohri
More from Ajay Ohri
(20)
Introduction to R ajay Ohri
Introduction to R ajay Ohri
Introduction to R
Introduction to R
Social Media and Fake News in the 2016 Election
Social Media and Fake News in the 2016 Election
Pyspark
Pyspark
Download Python for R Users pdf for free
Download Python for R Users pdf for free
Install spark on_windows10
Install spark on_windows10
Ajay ohri Resume
Ajay ohri Resume
Statistics for data scientists
Statistics for data scientists
National seminar on emergence of internet of things (io t) trends and challe...
National seminar on emergence of internet of things (io t) trends and challe...
Tools and techniques for data science
Tools and techniques for data science
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
Training in Analytics and Data Science
Training in Analytics and Data Science
Tradecraft
Tradecraft
Software Testing for Data Scientists
Software Testing for Data Scientists
Craps
Craps
A Data Science Tutorial in Python
A Data Science Tutorial in Python
How does cryptography work? by Jeroen Ooms
How does cryptography work? by Jeroen Ooms
Kush stats alpha
Kush stats alpha
Summer school python in spanish
Summer school python in spanish
Introduction to sas in spanish
Introduction to sas in spanish
Recently uploaded
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
Boston Institute of Analytics
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
HyderabadDolls
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
HyderabadDolls
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
ronsairoathenadugay
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
kumargunjan9515
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
ThinkInnovation
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
vershagrag
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
Graham Ware
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
HyderabadDolls
Call Girls in G.T.B. Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in G.T.B. Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
Rajesh Mondal
👉 Bhilai Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl Ser...
👉 Bhilai Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl Ser...
vershagrag
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
nirzagarg
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
nishakur201
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
kumargunjan9515
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
Recently uploaded
(20)
Predictive Precipitation: Advanced Rain Forecasting Techniques
Predictive Precipitation: Advanced Rain Forecasting Techniques
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Lake Town / Independent Kolkata Call Girls Phone No 8005736733 Elite Escort S...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
Diamond Harbour \ Russian Call Girls Kolkata | Book 8005736733 Extreme Naught...
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
RESEARCH-FINAL-DEFENSE-PPT-TEMPLATE.pptx
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
Identify Customer Segments to Create Customer Offers for Each Segment - Appli...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
💞 Safe And Secure Call Girls Agra Call Girls Service Just Call 🍑👄6378878445 🍑...
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Nandurbar [ 7014168258 ] Call Me For Genuine Models...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Kalyani ? Call Girl in Kolkata | Service-oriented sexy call girls 8005736733 ...
Call Girls in G.T.B. Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in G.T.B. Nagar (delhi) call me [🔝9953056974🔝] escort service 24X7
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
👉 Bhilai Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl Ser...
👉 Bhilai Call Girls Service Just Call 🍑👄6378878445 🍑👄 Top Class Call Girl Ser...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Top profile Call Girls In Tumkur [ 7014168258 ] Call Me For Genuine Models We...
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Abortion pills in Doha {{ QATAR }} +966572737505) Get Cytotec
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
Call Girls In GOA North Goa +91-8588052666 Direct Cash Escorts Service
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Top Call Girls in Balaghat 9332606886Call Girls Advance Cash On Delivery Ser...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Using R for Social Media and Sports Analytics
1.
© 2013 Sqor,
Inc. Sqor Using R For Social Media and Sports Data Athletes SuccessData Noah Gift: CTO @ Sqor
2.
© 2013 Sqor,
Inc. What is Sqor? • Social Network hyper-focused on enhancing fan/athlete relationships. We only do Sports!: Now • Marketplace for athletes to build and market their digital brand: Now • Social Analytics and Prediction Engine as a Service: Q1 2015 • Micro-endorsement platform: Q1 2015 • Crowdfunding for athletes: Now • Game platform: First Homegrown game featuring Brett Favre: Now • Cross-Social Network Publishing Platform: Facebook, Twitter, Embeddable posts.: Now • Website, Android App, and iOS App:
3.
© 2013 Sqor,
Inc. Key Aspects of Data Pipeline • Multiple languages involved: Python, R, Erlang, C#, SQL and Javascript. • Multiple persistence options: SQL Server (RDS), Riak (No SQL), CSV Files, Mnesia (Distributed Soft Realtime DB) • RabbitMQ and Erlang handle messaging and job communication • Easy to debug: daily and nightly scripts, intermediate CSV files, deep storage in K/V store and reports live in RDS. • R is used exclusively for machine learning and statistics (Although recommendation engine v1 was written in Python. We are going to replace it with R/Erlang code though)
4.
© 2013 Sqor,
Inc. Things They Don’t Tell You Building A Data Pipeline From Scratch (Our you should have paid attention to) • Getting the data in the right format and making sure it is accurate is back breaking work. It truly is horrible. • Keeping track of model prediction accuracy over time: both with new data and new models is really important • Non-linear regression is non-trivial • Automation and debuggability of every step is very important. Think Unix Tools • Expensive, exotic solutions sometimes aren’t worth it at first…or maybe ever. Weird databases, etc. • Making predictions involving real money with limited data is scary and really hard. If your not scared about this, you should be.)
5.
© 2013 Sqor,
Inc. Predicting Top Athletic Performers in Social Media • Sqor finds influential athletes and collaborates with them using our prediction algorithms
6.
© 2013 Sqor,
Inc. Our Prediction Algorithms Appear To Work • Or we got really lucky….
7.
© 2013 Sqor,
Inc. Clustering • We use R clustering packages for classification, visualization of patterns and diagnostics for predictions
8.
© 2013 Sqor,
Inc. Clustering • We use kNN clustering for NBA and MLB Sports. Plan on expanding this further in the near future.
9.
© 2013 Sqor,
Inc. Erlang/R Bridge • Sqor is a heavy user of Erlang • We like Erlang because it has unique concurrency abilities and high uptime (and also because I had a lot of bosses who told me I couldn’t use). • ➜ ~ curl -v -X PUT -H 'content-type: application/json' http://127.0.0.1:8080/api/script/foo -d '{"script":"execute <- function (A) { A * 2 }", "docs":"this doubles stuff"}' • ➜ ~ curl -v http://127.0.0.1:8080/api/script/foo -X POST -H 'content-type: application/json' -d '[25]’ • Returns: [50.0] • We plan on open sourcing this in next 2 months: Run scripts, runs jobs, scales R
Download now