2. Dataset Summary
Ecommerce Data Analysis And Visualization
Datamatics Ltd sells unique all-occasion gifts and other items to customers. Many customers of
the company are wholesalers. This is a data set from a UK-based company. This data set
contains all the transactions occurring between 01/12/2010 and 09/12/2011.
The goal of this hackathon is to visualize the sales of this company. Below are the details of the dataset.
VariableName - Description
InvoiceNo: Unique ID to identify each Invoice
StockCode: Unique ID for each item in stock
Description: A short description for each item
Quantity: Number of items bought
UnitPrice: The price of each item
Customer ID: Unique ID for each customer
Country The country were the costumer live
3. 1. Find out the total number of customers in every country so that they can increase sales of
the items in that country. List the country where both least and highest items are purchased.
Country that has highest number of Customers : United Kingdom
Country that has Least number of Customers : Saudi Arabia
0
50000
100000
150000
200000
250000
300000
350000
400000
Total Number of Customer Country wise
PROBLEM STATEMENTS
4. 2. Display top 10 orders of customers.
ASSORTED COLOUR BIRD ORNAMENT
JUMBO BAG RED RETROSPOT
LUNCH BAG BLACK SKULL.
LUNCH BAG RED RETROSPOT
PACK OF 72 RETROSPOT CAKE CASES
PARTY BUNTING
POSTAGE
REGENCY CAKESTAND 3 TIER
SET OF 3 CAKE TINS PANTRY DESIGN
WHITE HANGING HEART T-LIGHT HOLDER 0 500 1000 1500 2000 2500
ASSORTED COLOUR BIRD ORNAMENT
JUMBO BAG RED RETROSPOT
LUNCH BAG BLACK SKULL.
LUNCH BAG RED RETROSPOT
PACK OF 72 RETROSPOT CAKE CASES
PARTY BUNTING
POSTAGE
REGENCY CAKESTAND 3 TIER
SET OF 3 CAKE TINS PANTRY DESIGN
WHITE HANGING HEART T-LIGHT HOLDER
Top 10 Orders of Customers
5. 3. Find out how the price is varying over 2 years.
0
50000
100000
150000
200000
250000
300000
350000
Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
2010 2011
Price Varying according to Month over 2 Years
Total
-> There is sharp increase in the
month of November in UnitPrice i.e
327149.85 .
->The steep difference between two
years 2010 &2011 is 1977762
-> The sharp increase in November
month is highest Price from Dec
2010 to Dec 2011
6. 4. To get a clear picture about sales of products, find out how many unique stock
codes and
what are the total number of customers.
* We have 3959 unique values for
Stock Code
* We have 4373 unique Customers in
the given dataset
3959
4372
UNIQUE STOCK CODE AND CUSTOMER
Unique Stock Code
Customer
7. 5. Check in which month the highest number of invoices are issued.
We have highest number of invoices in the
month of November 2011 i.e. 60742 invoices
0 10000 20000 30000 40000 50000 60000 70000 80000 90000
(blank)
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
<12/
1/20
1020102011
Highest Invoice Issued
8. 6. How many unique stock codes are present in the dataset to know which
stock are highest and least.
0
500
1000
1500
2000
2500
85123A
22728
22557
22652
22212
21535
84949
22690
22571
22623
22672
22422
23371
21544
22664
22332
22967
22673
23380
22853
21358
21063
21395
84519A
22204
23432
20616
22788
21683
71270
23598
85016
84976
72128
22883
20794
21738
21547
21319
84766
23143
21651
90162B
23632
gift_0001_30
85067
90001D
90195B
90026B
23635
79323W
90060B
79323LP
21806
Unique Stock Code (Highest to Lowest) Unique Stock Codes 3958
Highest Stock Code 2380
Least Stock Code 1
9. 7. Come out with the item which is most frequently purchased by customers
so that company
can increase production of that product
Most frequently purchased item is ‘ WHITE
HANGING HEART T-LIGHT HOLDER ’ followed by
Regency Cakestand 3 Tier ,
Jumbo Bag Red Retrospot ,
Party Bunting ,
Lunch Bag Red Retrospot .
0
500
1000
1500
2000
2500
WHITE HANGING
HEART T-LIGHT
HOLDER
REGENCY
CAKESTAND 3 TIER
JUMBO BAG RED
RETROSPOT
PARTY BUNTING LUNCH BAG RED
RETROSPOT
Frequently Purchased Items
Total
10. 8. How much money is spent by the
customers?
0
200000
400000
600000
800000
1000000
1200000
1400000
1600000
(blank) 14646 18102 17450 14911 12415 14156 17511 16684 13694
1447682.12
279489.02256438.49
187482.17
132572.62123725.45113384.1488125.3865892.0862653.1
Money spent by Customers
Total
0 10000 20000 30000 40000 50000 60000 70000 80000 90000
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
201
02011
Monthly Wise Distribution
Total
9. Show monthly wise distribution of
number of orders of customers ?
11. 10. What are the total number of orders for each country?
We have maximum numbers of
orders from ‘United Kingdom’ ,
Which is more than 90% of total
orders placed.
Total
0
500000
Total Numbers of Orders Country Wise
12. 11. How much money is spent by each country?
United Kingdom has spent highest money i.e.
8187806.364 among all the countries followed by 284661.54
12. In 2010 what was the total sales of the
United Kingdom and France?
0
1000000
2000000
3000000
4000000
5000000
6000000
7000000
8000000
9000000
Money Spent By Country
Total
0
5000
10000
15000
20000
25000
30000
35000
40000
45000
United Kingdom France
Total Sales of United Kingdom and France
COUNTRY SALES
United Kingdom 40125
France 439
13. 13. In 2011, what is the total amount of money spent by
different countries?
14. In November and December 2011
how price was distributed in the United
Kingdom?
0
1000000
2000000
3000000
4000000
5000000
6000000
7000000
8000000
Australia
Bahrain
Brazil
ChannelIslands
CzechRepublic
EIRE
Finland
Germany
HongKong
Israel
Japan
Malta
Norway
Portugal
SaudiArabia
Spain
Switzerland
UnitedKingdom
USA
2011
2011 Total Money Spent
Dec Nov Dec
2010 2011
251922.9
301640.98
128259.67
676742.62
1282805.78
388735.43
Price Distributed In Nov & Dec In United Kingdom
Sum of UnitPrice
Sum of Money spent
14. 15. Find insights apart from the above insights for better understanding of products and its
sale.
Count of InvoiceNo
Sum of Quantity
0
1000000
2000000
3000000
4000000
5000000
TOP 10 POTENTIAL WORKING COUNTRY
Count of InvoiceNo
Count of
CustomerID
Sum of Quantity
Insights drawn from the sales is are as follows:
* European countries have highest sales.
* Czech Republic in central Europe is one country who is in
bottom 5 with respect to sales .
* All other states are contributing minimun to the profit of Ecommerce.
0 100 200 300 400 500 600 700
Bahrain
Brazil
Czech Republic
Lithuania
Saudi Arabia
(blank)
TOP 10 AVERAGE WORKING COUNTRIES
Sum of Quantity
Count of CustomerID
Count of InvoiceNo
15. Conclusions:
More than 90% of sales is done by United Kingdom
Most frequent ordered and purchased item is ’WHITE HANGING HEART T-LIGHT HOLDER’ ,so we can
increase the production of this item.
The product like ‘ASSORTED COLOUR SILK GLASSES CASE’ is least sold product. Should get the customer
reviews and construct accordingly
When there is profit happening from few particular countries the production of items should be
increased. That means we need to satisfy the needs of the customers from countries like United Kingdom,
Netherlands etc.
The countries or states where there is less or no sales ,the strategies and well advertisement should be
done to increase the sales .Countries like Bahrain, Lithuania, Brazil ,Saudi Arabia.
Should try make such strategies and planning it should reach every customer.
I conclude my presentation
Thank you