SlideShare a Scribd company logo
1 of 21
A-ONEConsultants
Jessica Morris & Partap Singh
Current Assignment
DATA ANALYSIS
Data Mining Goals
Analyze QVC airtime and sales history to determine the best times to sell
certain products on air
Determine which states make the most purchases in order to better
geographically target QVCs sales
Determine which brands and products sell the best
Data Provided
Clean the Data
In order to get the data in a format readable by HDFS file types, the data
needed to be cleaned
We used a combination of Excel and Powershell to do this
Quotes needed to be removed and dates needed to be formatted as YYYY-
MM-DD not MM/DD/YYYY.
Process the Data
A mixture of the Hadoop tools Hive and Impala were used
We ran a combination of queries on the tables including joins and distinct
queries to get an idea of the data we were working with
These queries generated the Excel files that we further analyzed in Tableau
In a real world situation, one would not limit themselves to one tool
Example of Hive/Impala Queries
Example of Generated Data
Airtime for each product generated by Impala
Example of Generated Data (cont.)
Hive was used to generate the excel file here. The chart was created in excel and
shows the top 25 sales dates...
Example of Generated Data (Cont)
...As well as how many orders contained what products on these dates
Visualization
of Data
in Tableau
Visualization - Tableau
Visualization - Tableau
Visualization - Tableau
Visualization - Tableau
Visualization - Tableau
Visualization - Tableau (map)
Visualization - Tableau (map)
Visualization - Tableau (map)
Visualization - Tableau (map)
Thank You!

More Related Content

Viewers also liked

Ceplac inicia curso de jovem empreendedor rural
Ceplac inicia curso de jovem empreendedor ruralCeplac inicia curso de jovem empreendedor rural
Ceplac inicia curso de jovem empreendedor ruralRoberto Rabat Chame
 
SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)
SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)
SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)Kai Brand-Jacobsen
 
Trasabilitatea calitatii laptelui si cadrul legal de sustinere
Trasabilitatea calitatii laptelui si cadrul legal de sustinereTrasabilitatea calitatii laptelui si cadrul legal de sustinere
Trasabilitatea calitatii laptelui si cadrul legal de sustinereGabriela Maria Grama
 
Families and Friends of Murder Victims November 2016 Newsletter
Families and Friends of Murder Victims November 2016 NewsletterFamilies and Friends of Murder Victims November 2016 Newsletter
Families and Friends of Murder Victims November 2016 Newsletterffmv
 

Viewers also liked (8)

Ceplac inicia curso de jovem empreendedor rural
Ceplac inicia curso de jovem empreendedor ruralCeplac inicia curso de jovem empreendedor rural
Ceplac inicia curso de jovem empreendedor rural
 
Memberships
MembershipsMemberships
Memberships
 
SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)
SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)
SHORT BIOGRAPHY - Kai Brand-Jacobsen (2015)
 
TALIC BROCHURE
TALIC BROCHURETALIC BROCHURE
TALIC BROCHURE
 
Trasabilitatea calitatii laptelui si cadrul legal de sustinere
Trasabilitatea calitatii laptelui si cadrul legal de sustinereTrasabilitatea calitatii laptelui si cadrul legal de sustinere
Trasabilitatea calitatii laptelui si cadrul legal de sustinere
 
Families and Friends of Murder Victims November 2016 Newsletter
Families and Friends of Murder Victims November 2016 NewsletterFamilies and Friends of Murder Victims November 2016 Newsletter
Families and Friends of Murder Victims November 2016 Newsletter
 
نموذج ميد ... توازن النمو على المدى الطويل
نموذج ميد ... توازن النمو على المدى الطويلنموذج ميد ... توازن النمو على المدى الطويل
نموذج ميد ... توازن النمو على المدى الطويل
 
Quiz 2 ambiental
Quiz 2 ambientalQuiz 2 ambiental
Quiz 2 ambiental
 

Similar to QVC Data Analysis Report

Dw Concepts
Dw ConceptsDw Concepts
Dw Conceptsdataware
 
Cloud computing major project
Cloud computing major projectCloud computing major project
Cloud computing major projectayk115
 
Testing Big Data: Automated ETL Testing of Hadoop
Testing Big Data: Automated ETL Testing of HadoopTesting Big Data: Automated ETL Testing of Hadoop
Testing Big Data: Automated ETL Testing of HadoopRTTS
 
Hadoop - An Introduction
Hadoop - An IntroductionHadoop - An Introduction
Hadoop - An IntroductionShankar R
 
2017-01-08-scaling tribalknowledge
2017-01-08-scaling tribalknowledge2017-01-08-scaling tribalknowledge
2017-01-08-scaling tribalknowledgeChristopher Williams
 
DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]vasanth kumar C
 
Build Data Lakes and Analytics on AWS
Build Data Lakes and Analytics on AWS Build Data Lakes and Analytics on AWS
Build Data Lakes and Analytics on AWS Amazon Web Services
 
Build data warehouse for retail using Hadoop
Build data warehouse for retail using HadoopBuild data warehouse for retail using Hadoop
Build data warehouse for retail using HadoopAlex Nguyen
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsCloudera, Inc.
 
The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?J Langley
 
American family hadoop journey, uw ebc sig meeting, april 2015
American family hadoop journey, uw ebc sig meeting, april 2015American family hadoop journey, uw ebc sig meeting, april 2015
American family hadoop journey, uw ebc sig meeting, april 2015Craig Jordan
 
Big data or big deal
Big data or big dealBig data or big deal
Big data or big dealeduarderwee
 
Case Study: Big Data Analytics
Case Study: Big Data AnalyticsCase Study: Big Data Analytics
Case Study: Big Data AnalyticsAbhinav Das
 
Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...
Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...
Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...Principled Technologies
 
Haddop in Business Intelligence
Haddop in Business IntelligenceHaddop in Business Intelligence
Haddop in Business IntelligenceHGanesh
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overviewvhrocca
 
Analysis of historical movie data by BHADRA
Analysis of historical movie data by BHADRAAnalysis of historical movie data by BHADRA
Analysis of historical movie data by BHADRABhadra Gowdra
 
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...Impetus Technologies
 
Testing Big Data: Automated Testing of Hadoop with QuerySurge
Testing Big Data: Automated  Testing of Hadoop with QuerySurgeTesting Big Data: Automated  Testing of Hadoop with QuerySurge
Testing Big Data: Automated Testing of Hadoop with QuerySurgeRTTS
 

Similar to QVC Data Analysis Report (20)

Dw Concepts
Dw ConceptsDw Concepts
Dw Concepts
 
Cloud computing major project
Cloud computing major projectCloud computing major project
Cloud computing major project
 
Datalake Architecture
Datalake ArchitectureDatalake Architecture
Datalake Architecture
 
Testing Big Data: Automated ETL Testing of Hadoop
Testing Big Data: Automated ETL Testing of HadoopTesting Big Data: Automated ETL Testing of Hadoop
Testing Big Data: Automated ETL Testing of Hadoop
 
Hadoop - An Introduction
Hadoop - An IntroductionHadoop - An Introduction
Hadoop - An Introduction
 
2017-01-08-scaling tribalknowledge
2017-01-08-scaling tribalknowledge2017-01-08-scaling tribalknowledge
2017-01-08-scaling tribalknowledge
 
DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]DWH_PROJECT [Compatibility Mode]
DWH_PROJECT [Compatibility Mode]
 
Build Data Lakes and Analytics on AWS
Build Data Lakes and Analytics on AWS Build Data Lakes and Analytics on AWS
Build Data Lakes and Analytics on AWS
 
Build data warehouse for retail using Hadoop
Build data warehouse for retail using HadoopBuild data warehouse for retail using Hadoop
Build data warehouse for retail using Hadoop
 
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop ProfessionalsBest Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
Best Practices for the Hadoop Data Warehouse: EDW 101 for Hadoop Professionals
 
The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?The Big Data Puzzle, Where Does the Eclipse Piece Fit?
The Big Data Puzzle, Where Does the Eclipse Piece Fit?
 
American family hadoop journey, uw ebc sig meeting, april 2015
American family hadoop journey, uw ebc sig meeting, april 2015American family hadoop journey, uw ebc sig meeting, april 2015
American family hadoop journey, uw ebc sig meeting, april 2015
 
Big data or big deal
Big data or big dealBig data or big deal
Big data or big deal
 
Case Study: Big Data Analytics
Case Study: Big Data AnalyticsCase Study: Big Data Analytics
Case Study: Big Data Analytics
 
Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...
Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...
Dell APEX outperformed comparable Amazon EC2 instances on a decision-support ...
 
Haddop in Business Intelligence
Haddop in Business IntelligenceHaddop in Business Intelligence
Haddop in Business Intelligence
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overview
 
Analysis of historical movie data by BHADRA
Analysis of historical movie data by BHADRAAnalysis of historical movie data by BHADRA
Analysis of historical movie data by BHADRA
 
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
Planning your Next-Gen Change Data Capture (CDC) Architecture in 2019 - Strea...
 
Testing Big Data: Automated Testing of Hadoop with QuerySurge
Testing Big Data: Automated  Testing of Hadoop with QuerySurgeTesting Big Data: Automated  Testing of Hadoop with QuerySurge
Testing Big Data: Automated Testing of Hadoop with QuerySurge
 

QVC Data Analysis Report