SlideShare a Scribd company logo
1 of 9
Cleansing Big
Data
Final Presentation
Agenda
• Project Overview
• Tools used
• Use Cases
• Demonstration
• Conclusion
The Problem
Our task is to scrape a Wikipedia page
containing notable deaths for the last 40
years, then create meaningful insights
from our analysis.
Why Clean
Data?
COMPANIES ARE BECOMING MORE
AND MORE DATA DRIVEN, BUT THE
DATA NEEDS TO BE ACCESSED
SOMEHOW
TOOLS LIKE PYTHON AND R CAN
CAPTURE THIS DATA
TOOLS SUCH AS PYTHON DO NOT
ALWAYS MEET DEMONSTRATED
BUSINESS NEED
Web Scraping
(Python)
• Used to scrape the Wikipedia Page
• Import Data into arrays
• Append Arrays together
• Print the data to a CSV
Data Cleansing
(Excel)
• Clean the Data
• Fill in additional info (dates, nationalities)
• Perform further analysis
• Chart the similarities in the “Ground Truth”
Our Solution Caters to the
Ultimate Business Need
• Excel is Popular with businesses
• Allows for customization
• Can be used for different datasets, especially ones
with dates
• Not as costly, as most businesses have access to
Excel
Demonstration
Conclusion
• What we accomplished
• Scrape all webpages
• Import the data into excel
• Clean/modify the data in excel using a macro
• Business Use Rationale
• Excel is widely accepted within the business
community
• This will make our project more viable for a
business to use

More Related Content

What's hot

Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixTableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixBlake Irvine
 
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth SciencesValues & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth Sciencesterradue
 
M02 - Micrisoft Azure DB - Azure Open Group
M02 - Micrisoft Azure DB - Azure Open GroupM02 - Micrisoft Azure DB - Azure Open Group
M02 - Micrisoft Azure DB - Azure Open GroupPaulo Renato
 
Riding the big data wave with Excel and Power BI - SMBNation 2015
Riding the big data wave with Excel and Power BI - SMBNation 2015Riding the big data wave with Excel and Power BI - SMBNation 2015
Riding the big data wave with Excel and Power BI - SMBNation 2015Robert Crane
 
Keeping a Common Address Database “Common” Using FME
Keeping a Common Address Database “Common” Using FMEKeeping a Common Address Database “Common” Using FME
Keeping a Common Address Database “Common” Using FMESafe Software
 
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven EnterpriseMike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven EnterpriseTalend
 
Store, Extract, Transform, Load, Visualize. Untagged Conference
Store, Extract, Transform, Load, Visualize. Untagged ConferenceStore, Extract, Transform, Load, Visualize. Untagged Conference
Store, Extract, Transform, Load, Visualize. Untagged ConferenceAni Lopez
 
Data Culture Series - Keynote - 16th September 2014
Data Culture Series - Keynote - 16th September 2014Data Culture Series - Keynote - 16th September 2014
Data Culture Series - Keynote - 16th September 2014Jonathan Woodward
 
Achieving Agility and Scale for Your Data Lake - Talend
Achieving Agility and Scale for Your Data Lake - TalendAchieving Agility and Scale for Your Data Lake - Talend
Achieving Agility and Scale for Your Data Lake - TalendTalend
 
Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance Talend
 
Dynamic Data Visualization with Tableau
Dynamic Data Visualization with TableauDynamic Data Visualization with Tableau
Dynamic Data Visualization with TableauAkashBorse2
 
Lessons from Digital Natives: How Retailers Power their Businesses with DataOps
Lessons from Digital Natives: How Retailers Power their Businesses with DataOpsLessons from Digital Natives: How Retailers Power their Businesses with DataOps
Lessons from Digital Natives: How Retailers Power their Businesses with DataOpsNexla
 
Using FME to Transform Survey 123 Data
Using FME to Transform Survey 123 DataUsing FME to Transform Survey 123 Data
Using FME to Transform Survey 123 DataSafe Software
 
OgH Data Visualization Special Part I
OgH Data Visualization Special Part IOgH Data Visualization Special Part I
OgH Data Visualization Special Part ILuc Bors
 
Multidimensional models with Analysis Services 2014
Multidimensional models with Analysis Services 2014Multidimensional models with Analysis Services 2014
Multidimensional models with Analysis Services 2014Alan Koo
 
Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...
Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...
Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...KieferConsulting
 
Talend 6.1 - What's New in Talend?
Talend 6.1 - What's New in Talend?Talend 6.1 - What's New in Talend?
Talend 6.1 - What's New in Talend?Talend
 
How stlrda does data
How stlrda does dataHow stlrda does data
How stlrda does dataAdam Doyle
 

What's hot (20)

Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at NetflixTableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
Tableau Conference 2018: Binging on Data - Enabling Analytics at Netflix
 
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth SciencesValues & Vision - Cloud Sandboxes for BIG Earth Sciences
Values & Vision - Cloud Sandboxes for BIG Earth Sciences
 
M02 - Micrisoft Azure DB - Azure Open Group
M02 - Micrisoft Azure DB - Azure Open GroupM02 - Micrisoft Azure DB - Azure Open Group
M02 - Micrisoft Azure DB - Azure Open Group
 
Riding the big data wave with Excel and Power BI - SMBNation 2015
Riding the big data wave with Excel and Power BI - SMBNation 2015Riding the big data wave with Excel and Power BI - SMBNation 2015
Riding the big data wave with Excel and Power BI - SMBNation 2015
 
Keeping a Common Address Database “Common” Using FME
Keeping a Common Address Database “Common” Using FMEKeeping a Common Address Database “Common” Using FME
Keeping a Common Address Database “Common” Using FME
 
Data migration services
Data migration servicesData migration services
Data migration services
 
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven EnterpriseMike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
Mike Tuche, CEO of Talend: Enabling the Data Driven Enterprise
 
Store, Extract, Transform, Load, Visualize. Untagged Conference
Store, Extract, Transform, Load, Visualize. Untagged ConferenceStore, Extract, Transform, Load, Visualize. Untagged Conference
Store, Extract, Transform, Load, Visualize. Untagged Conference
 
Data Culture Series - Keynote - 16th September 2014
Data Culture Series - Keynote - 16th September 2014Data Culture Series - Keynote - 16th September 2014
Data Culture Series - Keynote - 16th September 2014
 
Achieving Agility and Scale for Your Data Lake - Talend
Achieving Agility and Scale for Your Data Lake - TalendAchieving Agility and Scale for Your Data Lake - Talend
Achieving Agility and Scale for Your Data Lake - Talend
 
Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance Embracing Cloud Agility to Maximize Flexibility & Performance
Embracing Cloud Agility to Maximize Flexibility & Performance
 
Dynamic Data Visualization with Tableau
Dynamic Data Visualization with TableauDynamic Data Visualization with Tableau
Dynamic Data Visualization with Tableau
 
Lessons from Digital Natives: How Retailers Power their Businesses with DataOps
Lessons from Digital Natives: How Retailers Power their Businesses with DataOpsLessons from Digital Natives: How Retailers Power their Businesses with DataOps
Lessons from Digital Natives: How Retailers Power their Businesses with DataOps
 
Using FME to Transform Survey 123 Data
Using FME to Transform Survey 123 DataUsing FME to Transform Survey 123 Data
Using FME to Transform Survey 123 Data
 
OgH Data Visualization Special Part I
OgH Data Visualization Special Part IOgH Data Visualization Special Part I
OgH Data Visualization Special Part I
 
Multidimensional models with Analysis Services 2014
Multidimensional models with Analysis Services 2014Multidimensional models with Analysis Services 2014
Multidimensional models with Analysis Services 2014
 
Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...
Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...
Kiefer Business Intelligence Presentation | SharePoint Users Group Breakfast ...
 
A2B Data™ Brochure
A2B Data™ BrochureA2B Data™ Brochure
A2B Data™ Brochure
 
Talend 6.1 - What's New in Talend?
Talend 6.1 - What's New in Talend?Talend 6.1 - What's New in Talend?
Talend 6.1 - What's New in Talend?
 
How stlrda does data
How stlrda does dataHow stlrda does data
How stlrda does data
 

Similar to Final presentation updated

Branson - Self-Service Business Intelligence for On-Prem Organizations
Branson - Self-Service Business Intelligence for On-Prem OrganizationsBranson - Self-Service Business Intelligence for On-Prem Organizations
Branson - Self-Service Business Intelligence for On-Prem OrganizationsTheresa Lubelski
 
Data Detectives - Presentation
Data Detectives - PresentationData Detectives - Presentation
Data Detectives - PresentationClint Campbell
 
Self-Service Business Intelligence in Excel
Self-Service Business Intelligence in ExcelSelf-Service Business Intelligence in Excel
Self-Service Business Intelligence in ExcelInnoTech
 
Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...
Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...
Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...Srinath Reddy
 
Develop a Custom Data Solution Architecture with NorthBay
Develop a Custom Data Solution Architecture with NorthBayDevelop a Custom Data Solution Architecture with NorthBay
Develop a Custom Data Solution Architecture with NorthBayAmazon Web Services
 
Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Arun K
 
SPTechCon Austin - The Slippery Slope of SharePoint Migrations
SPTechCon Austin - The Slippery Slope of SharePoint MigrationsSPTechCon Austin - The Slippery Slope of SharePoint Migrations
SPTechCon Austin - The Slippery Slope of SharePoint MigrationsJill Hannemann
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeCaserta
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02email2jl
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitecturePerficient, Inc.
 
Data Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from OktaData Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from OktaTin Ho
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteCaserta
 
Tableau Seattle BI Event How Tableau Changed My Life
Tableau Seattle BI Event How Tableau Changed My LifeTableau Seattle BI Event How Tableau Changed My Life
Tableau Seattle BI Event How Tableau Changed My LifeRussell Spangler
 
Big data in the enterprise: When to use what?
Big data in the enterprise: When to use what?Big data in the enterprise: When to use what?
Big data in the enterprise: When to use what?Jesus Rodriguez
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationPerficient, Inc.
 
Thinking outside the box enterprsie architecture ias10 online
Thinking outside the box  enterprsie architecture ias10 onlineThinking outside the box  enterprsie architecture ias10 online
Thinking outside the box enterprsie architecture ias10 onlineRen Pope
 
Building Better Analytics Workflows (Strata-Hadoop World 2013)
Building Better Analytics Workflows (Strata-Hadoop World 2013)Building Better Analytics Workflows (Strata-Hadoop World 2013)
Building Better Analytics Workflows (Strata-Hadoop World 2013)Wes McKinney
 
Power BI for Office 365: Using SharePoint to Deliver Self-Service
Power BI for Office 365: Using SharePoint to Deliver Self-ServicePower BI for Office 365: Using SharePoint to Deliver Self-Service
Power BI for Office 365: Using SharePoint to Deliver Self-ServicePerficient, Inc.
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigManish Chopra
 
9 Months of Fun with SharePoint in Azure and Office 365
9 Months of Fun with SharePoint in Azure and Office 3659 Months of Fun with SharePoint in Azure and Office 365
9 Months of Fun with SharePoint in Azure and Office 365Colin Phillips
 

Similar to Final presentation updated (20)

Branson - Self-Service Business Intelligence for On-Prem Organizations
Branson - Self-Service Business Intelligence for On-Prem OrganizationsBranson - Self-Service Business Intelligence for On-Prem Organizations
Branson - Self-Service Business Intelligence for On-Prem Organizations
 
Data Detectives - Presentation
Data Detectives - PresentationData Detectives - Presentation
Data Detectives - Presentation
 
Self-Service Business Intelligence in Excel
Self-Service Business Intelligence in ExcelSelf-Service Business Intelligence in Excel
Self-Service Business Intelligence in Excel
 
Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...
Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...
Tableau - Learning Objectives for Data, Graphs, Filters, Dashboards and Advan...
 
Develop a Custom Data Solution Architecture with NorthBay
Develop a Custom Data Solution Architecture with NorthBayDevelop a Custom Data Solution Architecture with NorthBay
Develop a Custom Data Solution Architecture with NorthBay
 
Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2Tableau Visual analytics complete deck 2
Tableau Visual analytics complete deck 2
 
SPTechCon Austin - The Slippery Slope of SharePoint Migrations
SPTechCon Austin - The Slippery Slope of SharePoint MigrationsSPTechCon Austin - The Slippery Slope of SharePoint Migrations
SPTechCon Austin - The Slippery Slope of SharePoint Migrations
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
 
Data Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from OktaData Foundation for Analytics Excellence by Tanimura, cathy from Okta
Data Foundation for Analytics Excellence by Tanimura, cathy from Okta
 
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing KeynoteArchitecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
Architecting Data For The Modern Enterprise - Data Summit 2017, Closing Keynote
 
Tableau Seattle BI Event How Tableau Changed My Life
Tableau Seattle BI Event How Tableau Changed My LifeTableau Seattle BI Event How Tableau Changed My Life
Tableau Seattle BI Event How Tableau Changed My Life
 
Big data in the enterprise: When to use what?
Big data in the enterprise: When to use what?Big data in the enterprise: When to use what?
Big data in the enterprise: When to use what?
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data Visualization
 
Thinking outside the box enterprsie architecture ias10 online
Thinking outside the box  enterprsie architecture ias10 onlineThinking outside the box  enterprsie architecture ias10 online
Thinking outside the box enterprsie architecture ias10 online
 
Building Better Analytics Workflows (Strata-Hadoop World 2013)
Building Better Analytics Workflows (Strata-Hadoop World 2013)Building Better Analytics Workflows (Strata-Hadoop World 2013)
Building Better Analytics Workflows (Strata-Hadoop World 2013)
 
Power BI for Office 365: Using SharePoint to Deliver Self-Service
Power BI for Office 365: Using SharePoint to Deliver Self-ServicePower BI for Office 365: Using SharePoint to Deliver Self-Service
Power BI for Office 365: Using SharePoint to Deliver Self-Service
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
9 Months of Fun with SharePoint in Azure and Office 365
9 Months of Fun with SharePoint in Azure and Office 3659 Months of Fun with SharePoint in Azure and Office 365
9 Months of Fun with SharePoint in Azure and Office 365
 

Recently uploaded

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.arsicmarija21
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 

Recently uploaded (20)

Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.AmericanHighSchoolsprezentacijaoskolama.
AmericanHighSchoolsprezentacijaoskolama.
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 

Final presentation updated

  • 2. Agenda • Project Overview • Tools used • Use Cases • Demonstration • Conclusion
  • 3. The Problem Our task is to scrape a Wikipedia page containing notable deaths for the last 40 years, then create meaningful insights from our analysis.
  • 4. Why Clean Data? COMPANIES ARE BECOMING MORE AND MORE DATA DRIVEN, BUT THE DATA NEEDS TO BE ACCESSED SOMEHOW TOOLS LIKE PYTHON AND R CAN CAPTURE THIS DATA TOOLS SUCH AS PYTHON DO NOT ALWAYS MEET DEMONSTRATED BUSINESS NEED
  • 5. Web Scraping (Python) • Used to scrape the Wikipedia Page • Import Data into arrays • Append Arrays together • Print the data to a CSV
  • 6. Data Cleansing (Excel) • Clean the Data • Fill in additional info (dates, nationalities) • Perform further analysis • Chart the similarities in the “Ground Truth”
  • 7. Our Solution Caters to the Ultimate Business Need • Excel is Popular with businesses • Allows for customization • Can be used for different datasets, especially ones with dates • Not as costly, as most businesses have access to Excel
  • 9. Conclusion • What we accomplished • Scrape all webpages • Import the data into excel • Clean/modify the data in excel using a macro • Business Use Rationale • Excel is widely accepted within the business community • This will make our project more viable for a business to use