1) Carbine Studios wanted to identify which of its 125,000 beta testers were most likely to remain long-term players to combat customer churn.
2) Surveys and predictive modeling identified 5 unique gamer segments with different characteristics and spending behaviors.
3) Key insights included that 8% of beta testers were at risk to churn, females were twice as likely to churn, and passion for the game was more important than demographics in predicting loyalty.
Evolving analytics at ebay - 2012 Tableau Customer Conferencegdougan1
From Data to Knowledge: Evolving Analytics at ebay.
Gary Dougan's presentation at TCC 2012 (http://www.linkedin.com/in/garydougan)
Learn about eBay’s extensive analytics environment, and how eBay’s Business Intelligence platform team is enabling “visual analytics” across a complex ecosystem of platforms, technologies, and data enthusiasts, to synthesize information and derive insights from dynamic and complex data.
When writing this new paper, my main objective was to provide a clear understanding of where the term "Big Data" comes from, why is that term so popular now, what does it really mean and what can be its implication for businesses. Because the full power of Big Data can be revealed only by Analytics, i provided a description of a widely recognized and used analytical techniques to help you figure out how used in conjunction with Big Data, analytics can boost Business Performance.
i expected that by the end of this paper :
- you will smile the next time you read or hear at the terms big data, hadoop, or analytics :)
- you will understand the technologies that are behind the scene when one talks about "Big Data"
- you will know how to "make sense" of Big Data using Analytics
- you will get a basic idea of data mining techniques used in Business in general and in Big Data in particular
- you will be able to get every news about Big Data
Evolving analytics at ebay - 2012 Tableau Customer Conferencegdougan1
From Data to Knowledge: Evolving Analytics at ebay.
Gary Dougan's presentation at TCC 2012 (http://www.linkedin.com/in/garydougan)
Learn about eBay’s extensive analytics environment, and how eBay’s Business Intelligence platform team is enabling “visual analytics” across a complex ecosystem of platforms, technologies, and data enthusiasts, to synthesize information and derive insights from dynamic and complex data.
When writing this new paper, my main objective was to provide a clear understanding of where the term "Big Data" comes from, why is that term so popular now, what does it really mean and what can be its implication for businesses. Because the full power of Big Data can be revealed only by Analytics, i provided a description of a widely recognized and used analytical techniques to help you figure out how used in conjunction with Big Data, analytics can boost Business Performance.
i expected that by the end of this paper :
- you will smile the next time you read or hear at the terms big data, hadoop, or analytics :)
- you will understand the technologies that are behind the scene when one talks about "Big Data"
- you will know how to "make sense" of Big Data using Analytics
- you will get a basic idea of data mining techniques used in Business in general and in Big Data in particular
- you will be able to get every news about Big Data
Massively multiplayer data challenges in mobile game analyticsJak Marshall
I gave this talk at the Institute of Mathematics and its Applications Conference for Early Career Mathematicians at Warwick University on November 2016.
The talk details my experiences of working in the mobile games industry as a data scientist, drawing on my experience working at both Sega Hardlight and Exient Malta studios.
It was the final talk of the day and it was well received.
Massively multiplayer data challenges in mobile game analyticsJak Marshall
I gave this talk at the IMA Early Career Mathematicians' Conference at Warwick University on the 5th November 2016.
It was the final talk of the day and it was well received by the professional mathematicians and students that were in attendance.
The gaming industry is huge, and it can keep its audience consumed for hours, days and even weeks. Presentation shows how it all started, some best and worst practices and main principles of gamification.
Summary:
You know the game you want to make but how do you talk about it? Not with your peers and friends, but with investors and publishers. In this session, we’ll discover the art of pitching, why it’s useful, and how to structure it so you feel prepared in front of a publisher.
Presenter Bio:
Kumsal Obuz is a self-taught veteran web developer with more than 15 years of experience. After several years of preparation, he started his own game studio, Viroid Games, in August 2020. He also authored a game development book that covers Blender and Godot Engine for Packt Publishing.
Context:
This is a talk I often give at game development Meetups and conferences. I see that many indie developers suffer from the most basic aspects of pitching their creation to people who have the power to part way with money.
Why Big and Small Data Is Important by Google's Product ManagerProduct School
In this talk, Dan McClary, a Product Manager at Google, walked through the importance of using data to drive product decisions, as well as how to quickly pull together an architecture using free tools to help grow a product effort from market analysis to live data capture and data-driven product decisions. We also played a rousing game of Breakout.
The benefits of operating a free-to-play "game-as-a-service" are well known: elastic pricing, a direct relationship with your players, longer lifespan, and an opportunity to fine-tune after launch. But to fully realize these benefits, you need to plan your live operations strategy as carefully as you plan your game. This talk will show how you can build an effective LiveOps strategy using PlayFab.
Free2 play soft launch obtaining tangible results through action-oriented a...Mary Chan
This presentation is a crossroads between Business, Marketing, Data Analysis, and Production. It presents what is important from a free2play business perspective, how and what needs to be tracked, and how a company can make sure results are delivered based on the data gathered. A basic business awareness of the free2play market is necessary, but data analysts and production-oriented people can attend and get useful insights.
Data and Analytics are all the rage these days and the appeal to use data to inform comms strategy is coming from all sides. You're collecting data everywhere. In your CRM, in your offline database, from your direct mail programs, from your social media, your events, etc. Perhaps you're even doing data appends from outside sources. You’re swimming in data! However, data in and of itself is useless. How do you get to a place where you’re able to transform your data into something meaningful?
A Producer's Tale: Using Data-Driven Decisions to Impact the Production of Your Game
(White Nights Conference Prague 2017)
The official conference website — http://wnconf.com
Massively multiplayer data challenges in mobile game analyticsJak Marshall
I gave this talk at the Institute of Mathematics and its Applications Conference for Early Career Mathematicians at Warwick University on November 2016.
The talk details my experiences of working in the mobile games industry as a data scientist, drawing on my experience working at both Sega Hardlight and Exient Malta studios.
It was the final talk of the day and it was well received.
Massively multiplayer data challenges in mobile game analyticsJak Marshall
I gave this talk at the IMA Early Career Mathematicians' Conference at Warwick University on the 5th November 2016.
It was the final talk of the day and it was well received by the professional mathematicians and students that were in attendance.
The gaming industry is huge, and it can keep its audience consumed for hours, days and even weeks. Presentation shows how it all started, some best and worst practices and main principles of gamification.
Summary:
You know the game you want to make but how do you talk about it? Not with your peers and friends, but with investors and publishers. In this session, we’ll discover the art of pitching, why it’s useful, and how to structure it so you feel prepared in front of a publisher.
Presenter Bio:
Kumsal Obuz is a self-taught veteran web developer with more than 15 years of experience. After several years of preparation, he started his own game studio, Viroid Games, in August 2020. He also authored a game development book that covers Blender and Godot Engine for Packt Publishing.
Context:
This is a talk I often give at game development Meetups and conferences. I see that many indie developers suffer from the most basic aspects of pitching their creation to people who have the power to part way with money.
Why Big and Small Data Is Important by Google's Product ManagerProduct School
In this talk, Dan McClary, a Product Manager at Google, walked through the importance of using data to drive product decisions, as well as how to quickly pull together an architecture using free tools to help grow a product effort from market analysis to live data capture and data-driven product decisions. We also played a rousing game of Breakout.
The benefits of operating a free-to-play "game-as-a-service" are well known: elastic pricing, a direct relationship with your players, longer lifespan, and an opportunity to fine-tune after launch. But to fully realize these benefits, you need to plan your live operations strategy as carefully as you plan your game. This talk will show how you can build an effective LiveOps strategy using PlayFab.
Free2 play soft launch obtaining tangible results through action-oriented a...Mary Chan
This presentation is a crossroads between Business, Marketing, Data Analysis, and Production. It presents what is important from a free2play business perspective, how and what needs to be tracked, and how a company can make sure results are delivered based on the data gathered. A basic business awareness of the free2play market is necessary, but data analysts and production-oriented people can attend and get useful insights.
Data and Analytics are all the rage these days and the appeal to use data to inform comms strategy is coming from all sides. You're collecting data everywhere. In your CRM, in your offline database, from your direct mail programs, from your social media, your events, etc. Perhaps you're even doing data appends from outside sources. You’re swimming in data! However, data in and of itself is useless. How do you get to a place where you’re able to transform your data into something meaningful?
A Producer's Tale: Using Data-Driven Decisions to Impact the Production of Your Game
(White Nights Conference Prague 2017)
The official conference website — http://wnconf.com
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Techniques to optimize the pagerank algorithm usually fall in two categories. One is to try reducing the work per iteration, and the other is to try reducing the number of iterations. These goals are often at odds with one another. Skipping computation on vertices which have already converged has the potential to save iteration time. Skipping in-identical vertices, with the same in-links, helps reduce duplicate computations and thus could help reduce iteration time. Road networks often have chains which can be short-circuited before pagerank computation to improve performance. Final ranks of chain nodes can be easily calculated. This could reduce both the iteration time, and the number of iterations. If a graph has no dangling nodes, pagerank of each strongly connected component can be computed in topological order. This could help reduce the iteration time, no. of iterations, and also enable multi-iteration concurrency in pagerank computation. The combination of all of the above methods is the STICD algorithm. [sticd] For dynamic graphs, unchanged components whose ranks are unaffected can be skipped altogether.
2. To ensure the success of their new game,
Carbine Studios would need to combat
one of the biggest challenges in gaming:
customer churn.
THE BUSINESS CHALLENGE
3. To do that they would need to identify
which of their 125,000 beta testers were
most likely to remain long-term players.
This was one of 14 questions the company wanted answers to, as part of an overall learning agenda.
4. Our Approach:
Leverage the power of the 125,000
beta testers by deploying a survey
designed with specific questions
to uncover which customers were
most likely to churn.
In general, we recommend using primary data wherever possible, as syndicated data sources are becoming less and less useful.
5. We created a learning agenda to determine which data
attributes we’d need to build our predictive models.
No. Question
1 Who are the unique customer types?
2 What are the largest segments of customers in terms of demographics?
3 Which customer segments spend the most time playing MMOs?
4 Which customers are most eagerly anticipating a new MMO?
5 Which gaming devices do WildStar gamers use, e.g., PlayStation, mobile phone etc.?
6 What are the segment’s motivations, e.g., Enjoy_Storyline/lore, Enjoy_ArtStyle, etc. How do they differ?
7 How should different gamer types be treated, i.e., marketing tactics
8 What other games do WildStar players also play?
9 What kinds of people are excited about WildStar?
10 Why are people excited about WildStar?
11 Who is most likely to be a long term player, i.e., who is most interested in PvE content
12 What are the characteristics of gamers who beta test MMOs?
13 Which attributes of a game cause people to spend money on a game?
14 Who are the characteristics of potential “churners"?
The survey, including its questions and skip logic were specifically design to understand customer churn, not to simply collect a bunch of general information on
customers.
6. This resulted in 48 million individual data cells
that would need to be whipped into shape.
In general, we recommend using primary data wherever possible, as syndicated data sources are becoming less and less useful. The importance of the data preparatio
stage cannot be overstated
7. Using a decision tree algorithm, we tried to
predict which customers were most likely to
churn by modeling various attributes.
Decision Tree Result:
83% accuracy
ROC Curve Probability
Interpretation
.90-1 = excellent (A)
.80-.90 = good (B)
.70-.80 = fair (C)
.60-.70 = poor (D)
.50-.60 = fail (F)
ROC Area =.863
Output ResultsQ1: Who are the unique customer types?
Predictive Algorithms used: Decision Tree, Decision Stump, Decision Prunenet. ROC curves are a way of visualizing performance of a binary classifier, i.e., two different
output classes. ROC curve visualizes all possible thresholds vs. a misclassifcation rate is only an error rate for a single threshold.
8. A demographics-only approach to the model
yielded low accuracy scores, so we shifted to
clustering algorithms to identify churners.
Cluster EvaluationQ1: Who are the unique customer types?
Sum of Squared Error (SSE) is
most common measure.
For each point the error is the
distance to the nearest cluster.
To get SEE, we square these
errors and then sum.
x is a data point in cluster Ci and
mi us the representative point
for cluster Ci
Given the two clusterings, we
can choose the one with the
smallest error
Rather than use the decision tree algorithm to predict churners, we instead created a segment for them, i.e., grouped them together so we could see which attributes
they commonely expressed. K-means clustering found intersting groups of gamers with different spend levels.
9. Key Insights:
1. Just because gamers play a lot doesn’t
mean they spend a lot. And just
because they make a lot doesn’t mean
they spend a lot.
2. There is no single “silver bullet”
predictive attribute for customer churn.
3. 8% of beta testers are at risk for churn.
4.Females are twice as likely to churn.
High household incomes have nothing to do with loyalty. Passion for the game (and topic) are more important.
PvE content appeals to many different clusters. But Scientists and Explorers seem to gravitate toward PvE the most.
10. SEGMENT 1
(BIG SPENDERS)
SEGMENT 2
(BREAD AND BUTTER)
SEGMENT 3A
(STUDENTS)
SEGMENT 3B
(BIG SPENDER
STUDENTS)
SEGMENT 4
(FEMALES)
Male Male Male Male Female
37% have kids 20% have kids 6% have kids 13% have kids 38% have kids
HHI $80k+ HHI $20-80k HHI $<20k HHI $20-80k HHI $20-120k
Non-Student Non-Student Student Student
Both student and non-
student
Pre-ordered a console Pre-ordered a console Didn’t pre-order console Pre-ordered a console Didn’t pre-order console
11-30 hours/wk playing
MMOs
11-20 hours/wk playing
MMOs
11-20 hours/wk playing
MMOs
40+ hours/wk playing
MMOs
11-20 hours/wk playing
MMOs
Spent $ on 20+ games Spent $ on 6-20 games Spent $ on 0-5 games Spent $ on 20+ games Spent $ on 0-5 games
PvE Content, Character
Progression
PvE Content
PvE Content, Character
Progression
PvP Storyline and Lore
Explorer, Settler Scientist, Explorer Soldier, Explorer Explorer, Scientist Explorer, Settler, Scientist
Excitement Level: 9 Excitement Level: 10 Excitement Level: 10 Excitement Level: 10 Excitement Level: 10
DEMOGRAPHICSGAMERBEHAVIOR
What did we do with all the data?
We created a customer segmentation...
High household incomes have nothing to do with loyalty. Passion for the game (and topic) are more important.
PvE content appeals to many different clusters. But Scientists and Explorers seem to gravitate toward PvE the most.
11. …and a set of marketing recommendations
that mapped to individual segment behaviors.
SEGMENT 1
(BIG SPENDERS)
SEGMENT 2
(BREAD AND BUTTER)
SEGMENT 3A
(STUDENTS)
SEGMENT 3B
(BIG SPENDER
STUDENTS)
SEGMENT 4
(FEMALES)
Mobile Display Mobile Display Mobile Display Mobile Display Mobile Display
Promoted Facebook
posts
and YouTube ads
Promoted Facebook
posts
and YouTube ads
Promoted Facebook
posts
and YouTube ads
In-game advertising (Tablet)
In-game advertising
(Tablet)
In-game advertising
(Tablet)
In-game advertising
(Tablet)
In-game advertising
(Tablet)
Interactive Trailers Interactive Trailers Interactive Trailers Interactive Trailers Interactive Trailers
In-game exclusives
(Cool Gear)
In-game exclusives
(Cool Gear)
In-game exclusives
(Cool Gear)
In-game exclusives
(Cool Gear)
In-game exclusives
(Cool Gear)
Mobile/online Coupons Mobile/online Coupons Mobile/online Coupons
High-res stylized artwork High-res stylized artwork High-res stylized artwork High-res stylized artwork High-res stylized artwork
Email (optimized for mobile)
Email (optimized for
mobile)
Email (optimized for
mobile)
Email (optimized for
mobile)
Email (optimized for
mobile)
Community-created
content, e.g., parody
videos, etc.
Community-created
content, e.g., parody
videos, etc.
Community-created
content, e.g., parody
videos, etc.
Social Media Social Media Social Media Social Media Social Media
ACQUISITIONRETENTION
Marketing recommendations were mapped to the individual gamer behavioral attributes that make up each unique segment and broken out by acquisition vs. retention.
Retention is especially important, as we were dealing with a customer churn problem.