SlideShare a Scribd company logo
1 of 5
Declaration on Plagiarism
Name: Tejal Vijay Nijai
Student Number: 19210412
Programme: MSc in Computing(Data Analytics)
Module Code: CA682
Assignment Title: Data Visualisation
Submission Date: 16 Dec 2019
Module Coordinator: Dr Suzanne Little
I declare that this material, which I now submit for assessment,is entirely my own work and has not been taken
from the work of others, save and to the extent that such work has been cited and acknowledged within the text
of my work. I understand that plagiarism, collusion, and copying are grave and serious offences in the university
and accept the penalties that would be imposed should I engage in plagiarism, collusion or copying. I have read
and understood the Assignment Regulations.Ihave identified and included the source ofall facts,ideas, opinions,
and viewpoints of others in the assignment references. Direct quotations from books, journal articles, internet
sources,module text, or any other source whatsoeverare acknowledged and the source cited are identified in the
assignment references. This assignment, or any part of it, has not been previously submitted by me or any other
person for assessment on this or any other course of study.
I have read and understood the referencing guidelines found at
http://www.dcu.ie/info/regulations/plagiarism.shtml, https://www4.dcu.ie/students/az/plagiarism and/or
recommended in the assignment guidelines
Name: Tejal Vijay Nijai Date: 16/12/2019
120 year of Olympic Games Data Visualization
The database contains information about 120 years of summer and winter games Olympic data with various
events and descriptions of male and female athletes. I'm curious to see how male and female athlete
participation has evolved over the years. I'm trying to find out if they have a similar turnout trend in
contribution. Furthermore, what are the popular events or sports? Iam also trying to find the inclination of
males and females in various events held in popular athletics sport over the years.
Through my graphs,there is some trend followed in the initial years of Olympic(1890-1980) and after 1980
same kind of pattern is followed for both genders.Athletics is the summer sport in case of sports popularity and
cross-country skiing is famous winter sport. The trend of contribution of male and female in athletics sport’s
various events such as Athletics Men's Shot Put, Athletics Women's Shot Put, Athletics Men's High Jump held
over the years.
1.Dataset Description:
The size of the dataset is 40MB. It has 271116 rows and 15 columns and each record corresponding to athlete
detail competing in the Olympic event. This is a historical dataset on the modern Olympic Games, including all
the Games from Athens 1896 to Rio 2016. The columns with the data types are as follows:
1. ID - Unique number for each athlete (Integer)
2. Name - Athlete's name (String)
3. Sex - M or F (Character)
4. Age – Integer (Integer)
5. Height - In centimetres (Integer)
6. Weight - In kilograms (Integer)
7. Team - Team name (String)
8. NOC - National Olympic Committee 3-letter code (String)
9. Games - Year and season (String)
10. Year – Integer
11. Season - Summer or Winter (String)
12. City - Host city (String)
13. Sport – Sport (String)
14. Event – Event (String)
15. Medal - Gold, Silver, Bronze, or NA (String)
As it has 2lakhs+ rows with 15 different descriptive attributes,my dataset comes under the volume dimension.
2. Data Exploration, Processing,Cleaning and/or Integration:
Dataset had incomplete information about the athletes' age, height, and weight. I cleaned the dataset by using
dropna function in python.I wanted to make a story around genderperformance or participation in different
sports or events in different regions if possible. Despite that,columns like age, height, weight, and medal have
been omitted. Because the dataset is about 120 years of players, which made me think of showing a pattern in
participation and also provides gender-specific data that we can compare over the years. Additionally, there was
data about sports orgames in which we could find out the favourite sport.
3. Visualisations
Graph 1:
The graph above shows the pattern of male and female participation in the Olympic Games for 120
years,i.e. 1896 to 2016. From this graph, we discovered that women started to contribute from 1920
Olympic Games. Male participation has grown rapidly from 1950-1980, although female participation
is slowly gaining momentum. In 1980, there was a sharp decrease in both genders' participation. In
1980, the United States led a boycott of the Summer Olympic Games in Moscow to protest the late
1979 Soviet invasion of Afghanistan. In total, 65 nations refused to participate in the games, whereas
80 countries sent athletes to compete[2].Since 1980, competing in both sexes ' Olympic games is quite
similar.
Line graphs are used for monitoring changes over short and long periods of time. These are also used
to compare improvements over the same time period for more than one category.I constructed this chart
in R and showed it in animation. I used red and blue colours respectively to represent the female and
male line. We might work out some points in my diagrams quickly by animating the line graph. Once
animation begins, we can discern immediately that for female graph startedat 1920. Going further there
is a sudden drop in 1980 and after 1980 the pattern is similar for both sexes. This helped examine and
compare the pattern of the phenomenon.
Graph 2:
The chart above shows common games / sports played by men and women of the season based on the number of
people who played the game. Using this graph, we discovered that athletics is the summer season's popular
game among males and females. Cross-country skiing is the most popular sport among males and females for
the winter season. We get to know that, irrespective of season same sports is adored by both genders.
Treemap graphs are used to view hierarchical data and relationships in part. It is automatically ordered in
descending order by the width of the rectangle which helps to classify the values from the highest to the lowest.
I created this graph and used common colours to represent blue and pink for male and female to realize that data
visualization is about the distribution of sex. I have used tableau interactive treemap feature to filter data season
wise.
Graph 3:
By referring second graph,we got to know that athletics sport is famous among men and women. To drill down
further how the athletics sport with its various events has men and women inclination. This graph not only tells
us count of athletics events held for male and female individually but also the total count over gender. Graph has
been created in tableau and generic colours for male and female have been used. Area chart is used to show
trend of population over time.
We got to know that athletics is popular among men and women by referring to the second graph.To further
drill down how men and women are drawn to the athletics sport with its various events,I created area chart. The
chart shows us not only the number of individually held athletic events for male and female, but also the overall
count by sex.
Graph was developed in the table and male and female standard colors were used.Area map is used to display
population pattern over time.
4. Conclusion:
I tried to plot animation of line graph in python,but it was taking longer time. Whereas in R , I could plot the
graph with a smaller number of lines of code with the help of R’s gganimate libraries. The data from treemap
graph could have been shown in bubble chart also. The con of bubble chart is it shows data in different size of
circles indicating the highest and lowest of data.To distinguish among these circles to get the largest one is
difficult or time consuming when count of parameter is nearby as the size would seemsame. In contradiction to
this , data is displayed in descending order automatically.
References:
1. Dataset retrieved from following Link:
https://www.kaggle.com/heesoo37/120-years-of-olympic-history-athletes-and-results
2. https://2001-2009.state.gov/r/pa/ho/time/qfp/104481.htm
3. https://rpubs.com/elifdemir/olympics_analysis_report (Referred this for the design of my Graph 1)

More Related Content

Similar to Details about visualization

Statistics
StatisticsStatistics
Statisticsdiereck
 
Introduction to Statistics information.pptx
Introduction to Statistics information.pptxIntroduction to Statistics information.pptx
Introduction to Statistics information.pptxHarshiHarshika1
 
Data science week_2_visualization
Data science week_2_visualizationData science week_2_visualization
Data science week_2_visualizationKeiko Ono
 
Statistics for Managers notes.pdf
Statistics for Managers notes.pdfStatistics for Managers notes.pdf
Statistics for Managers notes.pdfVelujv
 
Zontos_ep410_Report1_SSN
Zontos_ep410_Report1_SSNZontos_ep410_Report1_SSN
Zontos_ep410_Report1_SSNZoe Zontou
 
Graphs, Tables and Charts.ppt for learners
Graphs, Tables and Charts.ppt for learnersGraphs, Tables and Charts.ppt for learners
Graphs, Tables and Charts.ppt for learnersmadiha977567
 
General Statistics boa
General Statistics boaGeneral Statistics boa
General Statistics boaraileeanne
 
Enhancing Research with Statistics
Enhancing Research with StatisticsEnhancing Research with Statistics
Enhancing Research with Statisticskhornberger
 
Spotfire Recommendations in Action
Spotfire Recommendations in ActionSpotfire Recommendations in Action
Spotfire Recommendations in ActionStuart Blair
 
STATS 101 WK7 NOTE.pptx
STATS 101 WK7 NOTE.pptxSTATS 101 WK7 NOTE.pptx
STATS 101 WK7 NOTE.pptxMulbahKromah
 
KennethRosales_FinalBMPComparison_Seattle_WashingtonDC_Combined
KennethRosales_FinalBMPComparison_Seattle_WashingtonDC_CombinedKennethRosales_FinalBMPComparison_Seattle_WashingtonDC_Combined
KennethRosales_FinalBMPComparison_Seattle_WashingtonDC_CombinedKenneth Rosales
 
Organizing and presenting data
Organizing and presenting dataOrganizing and presenting data
Organizing and presenting dataGrace Robledo
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-newhomeworkping7
 
1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx
1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx
1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptxAayzazAhmad
 
Data Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbneData Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbneCam Taylor
 

Similar to Details about visualization (20)

Statistics
StatisticsStatistics
Statistics
 
Introduction to Statistics information.pptx
Introduction to Statistics information.pptxIntroduction to Statistics information.pptx
Introduction to Statistics information.pptx
 
Data science week_2_visualization
Data science week_2_visualizationData science week_2_visualization
Data science week_2_visualization
 
Statistics for Managers notes.pdf
Statistics for Managers notes.pdfStatistics for Managers notes.pdf
Statistics for Managers notes.pdf
 
Zontos_ep410_Report1_SSN
Zontos_ep410_Report1_SSNZontos_ep410_Report1_SSN
Zontos_ep410_Report1_SSN
 
Sight, Sound, Numbers & Us: Data Visualization + Data Sonification = Data Acc...
Sight, Sound, Numbers & Us: Data Visualization + Data Sonification = Data Acc...Sight, Sound, Numbers & Us: Data Visualization + Data Sonification = Data Acc...
Sight, Sound, Numbers & Us: Data Visualization + Data Sonification = Data Acc...
 
Graphs, Tables and Charts.ppt for learners
Graphs, Tables and Charts.ppt for learnersGraphs, Tables and Charts.ppt for learners
Graphs, Tables and Charts.ppt for learners
 
General Statistics boa
General Statistics boaGeneral Statistics boa
General Statistics boa
 
Enhancing Research with Statistics
Enhancing Research with StatisticsEnhancing Research with Statistics
Enhancing Research with Statistics
 
Visual Aids.pptx
Visual Aids.pptxVisual Aids.pptx
Visual Aids.pptx
 
Spotfire Recommendations in Action
Spotfire Recommendations in ActionSpotfire Recommendations in Action
Spotfire Recommendations in Action
 
STATS 101 WK7 NOTE.pptx
STATS 101 WK7 NOTE.pptxSTATS 101 WK7 NOTE.pptx
STATS 101 WK7 NOTE.pptx
 
Math Statistics Essay
Math Statistics EssayMath Statistics Essay
Math Statistics Essay
 
Presenting statistics in social media
Presenting statistics in social mediaPresenting statistics in social media
Presenting statistics in social media
 
KennethRosales_FinalBMPComparison_Seattle_WashingtonDC_Combined
KennethRosales_FinalBMPComparison_Seattle_WashingtonDC_CombinedKennethRosales_FinalBMPComparison_Seattle_WashingtonDC_Combined
KennethRosales_FinalBMPComparison_Seattle_WashingtonDC_Combined
 
Organizing and presenting data
Organizing and presenting dataOrganizing and presenting data
Organizing and presenting data
 
105575916 maths-edit-new
105575916 maths-edit-new105575916 maths-edit-new
105575916 maths-edit-new
 
1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx
1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx
1.AdvantagesandDisadvantagesofDotPlotsHistogramsandBoxPlotsLesson.pptx
 
Data Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbneData Visualisation Design Workshop #UXbne
Data Visualisation Design Workshop #UXbne
 
Year 9 Stats
Year 9 StatsYear 9 Stats
Year 9 Stats
 

Recently uploaded

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 

Recently uploaded (20)

CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 

Details about visualization

  • 1. Declaration on Plagiarism Name: Tejal Vijay Nijai Student Number: 19210412 Programme: MSc in Computing(Data Analytics) Module Code: CA682 Assignment Title: Data Visualisation Submission Date: 16 Dec 2019 Module Coordinator: Dr Suzanne Little I declare that this material, which I now submit for assessment,is entirely my own work and has not been taken from the work of others, save and to the extent that such work has been cited and acknowledged within the text of my work. I understand that plagiarism, collusion, and copying are grave and serious offences in the university and accept the penalties that would be imposed should I engage in plagiarism, collusion or copying. I have read and understood the Assignment Regulations.Ihave identified and included the source ofall facts,ideas, opinions, and viewpoints of others in the assignment references. Direct quotations from books, journal articles, internet sources,module text, or any other source whatsoeverare acknowledged and the source cited are identified in the assignment references. This assignment, or any part of it, has not been previously submitted by me or any other person for assessment on this or any other course of study. I have read and understood the referencing guidelines found at http://www.dcu.ie/info/regulations/plagiarism.shtml, https://www4.dcu.ie/students/az/plagiarism and/or recommended in the assignment guidelines Name: Tejal Vijay Nijai Date: 16/12/2019
  • 2. 120 year of Olympic Games Data Visualization The database contains information about 120 years of summer and winter games Olympic data with various events and descriptions of male and female athletes. I'm curious to see how male and female athlete participation has evolved over the years. I'm trying to find out if they have a similar turnout trend in contribution. Furthermore, what are the popular events or sports? Iam also trying to find the inclination of males and females in various events held in popular athletics sport over the years. Through my graphs,there is some trend followed in the initial years of Olympic(1890-1980) and after 1980 same kind of pattern is followed for both genders.Athletics is the summer sport in case of sports popularity and cross-country skiing is famous winter sport. The trend of contribution of male and female in athletics sport’s various events such as Athletics Men's Shot Put, Athletics Women's Shot Put, Athletics Men's High Jump held over the years. 1.Dataset Description: The size of the dataset is 40MB. It has 271116 rows and 15 columns and each record corresponding to athlete detail competing in the Olympic event. This is a historical dataset on the modern Olympic Games, including all the Games from Athens 1896 to Rio 2016. The columns with the data types are as follows: 1. ID - Unique number for each athlete (Integer) 2. Name - Athlete's name (String) 3. Sex - M or F (Character) 4. Age – Integer (Integer) 5. Height - In centimetres (Integer) 6. Weight - In kilograms (Integer) 7. Team - Team name (String) 8. NOC - National Olympic Committee 3-letter code (String) 9. Games - Year and season (String) 10. Year – Integer 11. Season - Summer or Winter (String) 12. City - Host city (String) 13. Sport – Sport (String) 14. Event – Event (String) 15. Medal - Gold, Silver, Bronze, or NA (String) As it has 2lakhs+ rows with 15 different descriptive attributes,my dataset comes under the volume dimension. 2. Data Exploration, Processing,Cleaning and/or Integration: Dataset had incomplete information about the athletes' age, height, and weight. I cleaned the dataset by using dropna function in python.I wanted to make a story around genderperformance or participation in different sports or events in different regions if possible. Despite that,columns like age, height, weight, and medal have been omitted. Because the dataset is about 120 years of players, which made me think of showing a pattern in participation and also provides gender-specific data that we can compare over the years. Additionally, there was data about sports orgames in which we could find out the favourite sport.
  • 3. 3. Visualisations Graph 1: The graph above shows the pattern of male and female participation in the Olympic Games for 120 years,i.e. 1896 to 2016. From this graph, we discovered that women started to contribute from 1920 Olympic Games. Male participation has grown rapidly from 1950-1980, although female participation is slowly gaining momentum. In 1980, there was a sharp decrease in both genders' participation. In 1980, the United States led a boycott of the Summer Olympic Games in Moscow to protest the late 1979 Soviet invasion of Afghanistan. In total, 65 nations refused to participate in the games, whereas 80 countries sent athletes to compete[2].Since 1980, competing in both sexes ' Olympic games is quite similar. Line graphs are used for monitoring changes over short and long periods of time. These are also used to compare improvements over the same time period for more than one category.I constructed this chart in R and showed it in animation. I used red and blue colours respectively to represent the female and male line. We might work out some points in my diagrams quickly by animating the line graph. Once animation begins, we can discern immediately that for female graph startedat 1920. Going further there is a sudden drop in 1980 and after 1980 the pattern is similar for both sexes. This helped examine and compare the pattern of the phenomenon.
  • 4. Graph 2: The chart above shows common games / sports played by men and women of the season based on the number of people who played the game. Using this graph, we discovered that athletics is the summer season's popular game among males and females. Cross-country skiing is the most popular sport among males and females for the winter season. We get to know that, irrespective of season same sports is adored by both genders. Treemap graphs are used to view hierarchical data and relationships in part. It is automatically ordered in descending order by the width of the rectangle which helps to classify the values from the highest to the lowest. I created this graph and used common colours to represent blue and pink for male and female to realize that data visualization is about the distribution of sex. I have used tableau interactive treemap feature to filter data season wise.
  • 5. Graph 3: By referring second graph,we got to know that athletics sport is famous among men and women. To drill down further how the athletics sport with its various events has men and women inclination. This graph not only tells us count of athletics events held for male and female individually but also the total count over gender. Graph has been created in tableau and generic colours for male and female have been used. Area chart is used to show trend of population over time. We got to know that athletics is popular among men and women by referring to the second graph.To further drill down how men and women are drawn to the athletics sport with its various events,I created area chart. The chart shows us not only the number of individually held athletic events for male and female, but also the overall count by sex. Graph was developed in the table and male and female standard colors were used.Area map is used to display population pattern over time. 4. Conclusion: I tried to plot animation of line graph in python,but it was taking longer time. Whereas in R , I could plot the graph with a smaller number of lines of code with the help of R’s gganimate libraries. The data from treemap graph could have been shown in bubble chart also. The con of bubble chart is it shows data in different size of circles indicating the highest and lowest of data.To distinguish among these circles to get the largest one is difficult or time consuming when count of parameter is nearby as the size would seemsame. In contradiction to this , data is displayed in descending order automatically. References: 1. Dataset retrieved from following Link: https://www.kaggle.com/heesoo37/120-years-of-olympic-history-athletes-and-results 2. https://2001-2009.state.gov/r/pa/ho/time/qfp/104481.htm 3. https://rpubs.com/elifdemir/olympics_analysis_report (Referred this for the design of my Graph 1)