This document analyzes Pokemon data to answer several questions: which starter Pokemon is best, how Pokemon have changed across generations, and the strongest team of 6 Pokemon including and excluding legendaries. Key findings are that starter Pokemon get stronger over time, later generations have higher base stats on average, and the top 6 team includes legendary Pokemon known for their rare typing and immense size and weight.
2. Introduction
What is Pokemon?
What are our goals?
● Answer who is the best Starter Pokemon
● Analyze Generations I-VII to see how Pokemon has changed over the years.
● Pick the best 6 Pokemon lineup across ALL generations.
● Briefly analyze Legendary Pokemon
3. Check out my mixtape...or my Kaggle
https://www.kaggle.com/jeffsuth/pokemon-data-analysis
Feel free to open it up and take a look and fork your own copy to play around with!
4. Key Terms
Pokemon- Pocket Monsters. You catch, train, and compete with them!
Generation- New Pokemon Game or new Pokemon region.
Base Stats- Sum of all stats combined.
Type (Weakness)- Pokemon can be up to 2 out of 18 different types each with different
strengths and weaknesses to other types. (Like 18 way rock-paper-scissors)
Evolution (Mega Evolution)- When a Pokemon reaches a certain threshold, it transforms into
a more powerful form. Mega-evolutions goes even further beyond using a special item.
5. Variable Declaration
Define different stats to be used based
on the given dataset
Define Pokemon Generation based on
Pokedex Number
Legendaries as well
20. Generation Analysis
● Base Stats mean of all pokemon in each
generation.
● Mod Stats mean of all Pokemon in each
generation
Yes, Pokemon generally get
stronger after each generation.
25. The search for the top 6 non legendary Pokemon
● Sort pokemon by generation
● Filter out legendary pokemon
● Implement a weakness rating
system
● Calculate base stats and mod
stats
● Remove null entries
● Find the top 6 Pokemon
based on mod stats and
domain expertise
26. Looking at stats
Base stats: attack + defence + hp + special attack + special defence
Weakness rating (1 - 10): 1 = stronger - 10 = weaker
(weakness count - min weakness) * (10 - 1) / (max weakness - min weakness) + 1
Weakness count: All weaknesses added together
Mod base stats:
(attack * 1.5) + defence + hp + (special attack * 1.5) + special defence + (speed * 2)
True base stats: All of the above combined
27. Base Stats vs Mod Stats Example
Base stats Mod stats True base stats
39. Legendary Analysis
What are some key characteristics of a Legendary Pokemon? Their type? Their
height or weight? What are the qualities that will usually determine if a Pokemon is
a Legendary or not?
40.
41.
42.
43.
44. Conclusions!
Who is the strongest Starter Pokemon?
What is the best 6 Pokemon lineup?(w/ Legendaries and w/o Legendaries)
What is the trend of Pokemon by each Generation?
What can this information be used for?
Editor's Notes
Jeff
Describe dataset from kaggle! 801 entries
Jeff
Pokemon,is a popular japanese role-playing game developed by Gamefreak and published by Nintendo. The game consists of catching creatures called “Pokemon” and having them battle each other in a turn-based combat system. Each pokemon has unique stats, weaknesses and characteristics that distinguish them from other pokemon in combat. These stats, weaknesses and characteristics also dictate how a player forms their team of Pokemon. With the vast number of Pokemon available in the game, the curiosity of the best Pokemon team combination arises. Players that develop a powerful Pokemon team can gain a competitive edge and possibly compete in tournaments. Preparing and choosing the right Pokemon team is important. Our project will help inform players about which Pokemon would be best for combat, helping them form the best possible team. We aim to separate the wheat from chaff, and discover which pokemon are the most effective across various strategies and playstyles. With our huge dataset of 801 pokemon acquired from Kaggle. We will first analyze all the starter Pokemon from generation I to generation VII, to see who has the highest base stats in each region and is the strongest starter Pokemon overall. Then we will move on to analyzing the base stats of the entire pool of Pokemon to see what 6 Pokemon will make the strongest competitive team based on our modified stats and evaluation of weaknesses determined from our domain expertise. Lastly, we will cover generational analysis to see how Pokemon has changed over the years and a brief analysis of Legendary Pokemon.
Pull intro stuff off proposal
Basic background
Each section brief overview
Talk about why in each section
Show code, graphs and significant observations from that.
Jeff
Dalton
Before we go into our project we need to go over some important key terms for you guys to understand.
Jeff
Jeff
The age old question...Some pick based on what is the cutest or coolest, but we will tell you which to pick if you really want to do well in the game and beat your friends.
Jeff
This is the format for all the graphs focusing on analyzing the starter Pokemon. It will be all the same format, but each analyzes something different starting with base stats, then modded stats. Then offensive and defensive stats.
Also, I’m sure there is an easier way to color the bars in a bar graph but I never figured it out so I made a little for loop to color all the grass Pokemon green, all fire red...We honestly thought of just photoshopping and just choosing a new fill for the different bars which may have been just as easy.
Jeff
Jeff
Jeff
We read in all the against_...all the different weaknesses a Pokemon can have and append it to a list. We do the same thing but for getting the different numerical values of weaknesses a Pokemon can have. We then look at which weaknesses have the highest mean weaknesses against the pokemon we are looking at (in this case the 21 starters) and it tells us that flying is the strongest move set against all the starters. We then randomly assign colors to all the different weaknesses and print them out for all starter pokemon.
Jeff
We see that flying has a 4x damage multiplier and no 0x. So flying moves will always hit any of the starters and will super effective to almost half of the starter Pokemon.
Chris
So after we analyzed all the starter Pokemon, we turned into analyzing every Pokemon in the game. So we decided to look into Pokemon heights and weights just for fun. And then we look at their base stats as jeffrey explained
And we also looked at the pokemon’s primary and secondary types. and so as we can see on the chart on the right there is pretty much and even percentage ag of dual and single types of pokemon but there is slightly more dual types
Chris
These are just some graphs where we analyzed pokemon height and weight.
We see that the average height of all Pokemon is around 1.13m and the average weight of all pokemon is about 59kg. However, there are of course some extremes that throw these numbers off like Onix who is 8.8m tall and weighs 210.kg.
Chris
Here is a better representation using just a simple scatter plot. This Pokemon is cosmoem and completely throws off the data by being 0.01m tall and weighing over 999kg(cosmeom weight is actually imeasurable but 999kg is the highest the weight scale goes.
Dalton
My personal favorite graph that covers all 801 Pokemon and the different base stats total there can be. Ranges from 180 to 780. Spikes around 400 and 600.
Dalton
Dalton
Jeff
We analyzed and compared between each Pokemon generation to see if Pokemon get stronger after every generation
Chris
First we wanted to see the Pokemon sample size of each generation shown in these charts . So we see on the right that generation 5 contains the most pokemon followed by gen 1 and 3. And we have generation 6 with the least amount of pokemon
That is possible conclusion for the first 3 generations . We wanted to explore the question of whether or not people would have to keep buying new pokemon games to stay competitive.
When we look at the average of base stats on the left we can see a gradual rise from gen 1 to gen 4 but when we look at mod stats mean where we compensate for more powerful stats such as speed and special attack to better reflect combat power we can see that there is a huge increase of pokemon power from generation 1 to 3
But then there was a significant drop in overall pokemon power in generation 5. And this is interesting because generation 5 was when the Pokemon series swaped to a new console so it was pretty much a start to a whole new game series so there were alot of weaker pokemon that came out for all the players that just bought the game. And after gen 5 we see another big rise in overall pokemon power.
When we examine THE mod stats mean across all generation we could say that yes Pokemon do generally get stronger in the context that gen 4 and 5 was the beginning of a new pokemon series. So basically if you want to stay competitive with your friends buying all the new pokemon generations is recommended, and thats basically what the developers want so they can take of all of your money.
Fewer Legendaries
Wow so many Dragons!
Reduction in dragon hegemony
David
Marco BEGIN
Marco
Marco
Marco
Dalton
All pokemon (left) then Legendary (right)
All pokemon (left) then Legendary (right)
Dalton
(Left is all Pokemon, right is only Legendaries)
In general, Legendary Pokemon will have higher stats than all the other Pokemon. We see the average for their offensive and defensive stats are higher than those of all pokemon.
Dalton
There is 70 legendary Pokemon across all generations.
Conclusion For our first time coding a project in Python, overall we did a great job! We collaborated between 4 people for all this code (David's section is in R and was not able to be moved to Python in time but will be attached) and were able to coordinate the code so that it fit perfectly into one project. There is a lot of meaningful conclusions we pulled from examining all this data so let's dive into those going one section at a time.
Starter Pokemon
We see that Greninja is by far the strongest starter Pokemon with 640 total base stats, beating out even mega-evolved Pokemon!
Flying type moves are most effective overall against any of the starter Pokemon.
The most common secondary type for a starter Pokemon is fighting and only 18 out of the 21 Pokemon are Dual types (having both Primary and Secondary types)
All 801 Pokemon
There are more Dual type than Single type Pokemon.
The average height of a Pokemon is 1.13m and the average weight is 59kg.
Ice and Rock moves are the most effective against most Pokemon.
Most Pokemon are Water Primary Type and Flying Secondary Type.
Best 6 Pokemon Team
With Legendaries is (in power order) Mewtwo, Rayquaza, Kyogre, Groudon, Arceus, and Zygarde.
Without Legendaries is (in power order) Greninja, Slaking, Garchomp, Wishiwashi, Metagross, and Arcanine.
Generational Analysis
Most Pokemon come from generations I and V.
Generation III has the best competitive Pokemon (based on mod_stats) and Generation IV has the highest base stats Pokemon.
Legendary Analysis
Generation VII has the most Legendary Pokemon out of all the generations.
Most Legendary Pokemon have a base happiness of 0!
Most Legendary Pokemon are Psychic Primary type and Flying Secondary Type.
In conclusion, we learned a lot from doing EDA on this data and aquired an even deeper appreciation of Pokemon and desire to keep working with big data sets and explore what challenges Kaggle has to offer. With a limited background in Python, we achieved many goals that we set out to do and picked up tons of coding skills along the way.