SlideShare a Scribd company logo
Belgian Scanner Data Project
Methodology and results
Rome
1&2 October
François Valenduc & Ken Van Loon
Statistics Belgium
http://statbel.fgov.be
Contents
 Overview of the project
 Combining scanner data and classical price collection
– Link to COICOP
– Stratification model
 Methodology for calculating scanner data indices
 Results
– Unit value versus observed price
– Experimental indices
 Future developments
http://statbel.fgov.be
Scanner data: overview of the project
 We receive scanner data from the 3 largest supermarket chains in
Belgium: Colruyt, Carrefour and Delhaize (around 75% of the market)
 Negotiations started early 2012
 We received our first scanner datasets by the end of 2013
– Historical data for Colruyt and Carrefour starting from January 2012
– Historical data were received from Delhaize starting from January 2013
 Each week we receive the data of the previous week via SFTP:
– On Tuesday for Colruyt and Carrefour (+ 2 days)
– The following weekend for Delhaize (+ 7 days)
 The data we receive can be split into 2 parts
– One dataset containing the product info
– One dataset containing the internal classification of the retail chain
 Currently, scanner data are used in 9 COICOP 5 groups in the CPI,
implementation in the HICP/CPI starting in 2016 for at least COICOP 1
http://statbel.fgov.be
Scanner data: link to COICOP
 First step: link the internal classification to COICOP 5 digits
 Second step: create detailed groups at COICOP 6 digits
(consumption segments)
– Not the same segments as we use for classical price collection
– Turnover information from scanner data shows that the current sample is
not always representative
– Groups at COICOP 6 digits are harmonised as much as possible between
the different retail chains to allow for comparison
– Around 450 COICOP 6 digits groups for COICOP 01 instead of 173 for
classical price collection
 Maximal use of internal classification of retail chains
– Each chain has around 3500 internal groups
• Often too detailed to use this directly (e.g. one product per group !)
• Sometimes, internal groups are quite heterogeneous
• Linking at product level is inevitable in order to create groups at COICOP 6 level :
around 10% of the products are individually linked
– Currently examining whether machine learning (using supervised learning
models) can be used.
http://statbel.fgov.be
 To combine scanner data and classical price collection a
stratification model is used. Indices are always combined at the
COICOP 5 digits level.
Aggregation based on
expenditures by retail
channel (HBS)
Speciality stores –
COICOP 5
Discounter – COICOP 5Supermarkets– COICOP 5
Aggregation based on
turnover
Supermarkets and discounters –
COICOP 5
Global index – COICOP 5
Scanner data and classical price
collection: stratification
Scanner data and classical price
collection: stratification
http://statbel.fgov.be
Scanner
data
COICOP 6
group 1
Jevons index
Scanner
data
COICOP 6
group 2
Scanner
data
COICOP 6
group 3
Scanner
data
…
Retail chain 1 –
COICOP 5
Aggregation based on
turnover from scanner
data
Retail chain 2 –
COICOP 5
Retail chain 3 –
COICOP 5
Supermarkets– COICOP 5
Aggregation based on
turnover from annual
accounts
http://statbel.fgov.be
Scanner data and classical price
collection: stratification
 Groups where prices of retail chains, discounters and speciality stores
are combined (based on data from HBS 2012):
 For other COICOP groups in COICOP 1 (where market share is below
15% for speciality stores), limitation to scanner data and discounters.
– Our own research shows a high correlation between supermarkets and speciality
stores for those groups
COICOP Supermarkets & discounters Speciality stores
01.1.1.3 Bread 54% 46%
01.1.2.8 Other meat preparations 74% 26%
01.1.2.1 Beaf and veal 76% 24%
01.1.2.5 Other meat 76% 24%
01.1.2.2 Pork 78% 22%
01.1.2.7 Dried, salted or smoked meat 79% 21%
01.1.1.4 Other bakery products 80% 20%
01.1.2.3 Lamb and goat 81% 19%
01.1.2.4 Poultry 84% 16%
http://statbel.fgov.be
 Based on methods of the Netherlands and Switzerland
– use of monthly chained Jevons index at COICOP 6 digits level
– sampling: product included in the sample if
𝑠 𝑚+𝑠 𝑚−1
2
>
1
1.25 × 𝑛
with n = number of products and sm (or sm-1) = market shares of each
product
– price imputation for temporary missing products or out of sample products
(to satisfy the identity and transitivity test)
– dumping and outlier filters
– prices that are excluded by dumping and outliers filter are also imputed
– unit value price for each internal code is used (normally first 3 weeks of a
month)
 Calculation is done in SAS
 Products falling out of the sample are verified against products
entering the sample to check if product relaunches occur:
– Direct comparison (if necessary with volume adjustment) in the case of
product relaunches
– Same system for volume discounts: most of the time, products with volume
discounts (6 + 2 free) have different internal codes
Scanner data: methodology
http://statbel.fgov.be
Scanner data: linking (relaunches)
 Each month two lists are generated for every COICOP 6 digits group of
every retail chain:
– One list contains the “new” products in the sample
– Another lists contains the products that have disappeared from the sample in the
current month and the previous 3 months (also the turnover figures are listed)
 In order to avoid a possible bias in the index level the old product and
new product are linked if they refer to the same product (if necessary,
volume adjustment is done)
 Around 80 linkings per month for COICOP 1
 Currently this linking is done manually in MS Access .
We are doing research on text mining (fuzzy matching) to see if this
can be done more efficiently
Month COICOP Group Old product New product Coeff.
June-14 01.1.1.4.02
Chocolate
biscuits
LU PRINCE - 6 + 2 gratis -
Mini Stars B & W
LU PRINCE - 225 g - Mini
Stars Black & White
0.75
October-14 01.1.4.5.10 Cream cheese
KRAFT PHILADELPHIA - 200 g
- fromage light
KRAFT PHILADELPHIA -
300 g - fromage light
1.50
http://statbel.fgov.be
Scanner data: impact of relaunches
98.0
98.5
99.0
99.5
100.0
100.5
101.0
101.5
12/2013
01/2014
02/2014
03/2014
04/2014
05/2014
06/2014
07/2014
08/2014
09/2014
10/2014
11/2014
12/2014
01/2015
02/2015
03/2015
04/2015
05/2015
06/2015
07/2015
08/2015
12/2013=100
01.1.1.4: Other bakery products
Without linking With linking
http://statbel.fgov.be
Scanner data – Use of internal codes
 Use of internal code to match products instead of GTIN (or EAN)
– The internal code can also be used to verify the product on the website
of the retail chain (useful for making a link between new and old
products or for purchasing power parities)
• The product description we receive is usually also the one used on
the website, since this description is the most detailed.
– Fresh products such as meat, fruits and vegetables are sold in different
quantities, everything is aggregated with one internal code to get for
example the average price per kilo of “chateaubriand steak”
– GTIN are often reused for totally different product for in-store EANS:
– GTIN codes are only used to verify the consistency in the link to the
COICOP classification between the different retail chains
Date GTIN Internal code Description
1/01/2013 2020380000000F1988111800338400000 Melon | Charentais | Philibon
2/03/2015 2020380000000F1986053100318010000 Green pepper | Sold loose
http://statbel.fgov.be
Scanner data – example multiple GTIN codes
 Example of multiple GTIN-codes for
the same product
 For instance, Ferrero Rocher is sold
under two GTIN codes at one chain:
– 8000500032237
– 8000500167113
– But only one internal code S2001062000072760000
 When it’s available under the 2 codes, the unit value is identical
(5,99 €):
Week GTIN Internal code Product description Unit Sales unit Turnover Unit value
07Oct20138000500032237 S2001062000072760000 Rocher | 30 pièces | Ferrero 0.375K 475 2845.25 5.99
07Oct20138000500167113 S2001062000072760000 Rocher | 30 pièces | Ferrero 0.375K 732 4381.69 5.99
http://statbel.fgov.be
05238265 is the internal code
Scanner data – Use of internal codes
http://statbel.fgov.be
Unit value vs observed price
http://statbel.fgov.be
Unit value vs observed price
 Web scraping for COICOP 05.6.1.1 compared to scanner data
http://statbel.fgov.be
 Web scraping for COICOP 01.1.1.4 compared to scanner data
Unit value vs observed price
http://statbel.fgov.be
Examples
 Some examples: scanner data indices compared to traditional price
collection for the period 12/2013 - 07/2015
 Traditional price collection limited to the 3 chains for which we receive
scanner data
 Number of products used in the calculation (scanner data):
COICOP Group Number of products
01.1.1.6 Pasta products and couscous 689
01.1.4.1 Whole milk 47
01.1.4.2 Low fat milk 124
01.1.4.5 Cheese 1440
http://statbel.fgov.be
Examples
94
95
96
97
98
99
100
101
102
103
12/2013
01/2014
02/2014
03/2014
04/2014
05/2014
06/2014
07/2014
08/2014
09/2014
10/2014
11/2014
12/2014
01/2015
02/2015
03/2015
04/2015
05/2015
06/2015
07/2015
12/2013=100
01.1.1.6: Pasta products and couscous
Scanner data Traditional
http://statbel.fgov.be
Examples
86
88
90
92
94
96
98
100
102
104
12/2013
01/2014
02/2014
03/2014
04/2014
05/2014
06/2014
07/2014
08/2014
09/2014
10/2014
11/2014
12/2014
01/2015
02/2015
03/2015
04/2015
05/2015
06/2015
07/2015
12/2013=100
01.1.4.1: Whole milk
Scanner data Traditional
http://statbel.fgov.be
Examples
86
88
90
92
94
96
98
100
102
104
12/2013
01/2014
02/2014
03/2014
04/2014
05/2014
06/2014
07/2014
08/2014
09/2014
10/2014
11/2014
12/2014
01/2015
02/2015
03/2015
04/2015
05/2015
06/2015
07/2015
12/2013=100
01.1.4.2: Low fat milk
Scanner data Traditional
http://statbel.fgov.be
Examples
98.0
98.5
99.0
99.5
100.0
100.5
101.0
101.5
102.0
12/2013
01/2014
02/2014
03/2014
04/2014
05/2014
06/2014
07/2014
08/2014
09/2014
10/2014
11/2014
12/2014
01/2015
02/2015
03/2015
04/2015
05/2015
06/2015
07/2015
12/2013=100
01.1.4.5: Cheese
Scanner data Traditional price collection
http://statbel.fgov.be
– Compilation of HICP CT using scanner data: excise taxes depend on
volume and alcoholic grade: some assumptions or simplification may be
needed (also done at the moment for manual price collection).
– Negotiations with Lidl (currently in progress) and Aldi (around 16% of
market share together)
– Negotiations with consumer electronics or clothing stores
– Research on machine learning for link to COICOP
– Research on text mining to detect product relaunches
Future developments
Thanks for your attention
Questions ?
http://statbel.fgov.be

More Related Content

Similar to Belgian scanner data project: methodology and results - Ken Van Loon, François Valenduc

Simulean_Analytics
Simulean_AnalyticsSimulean_Analytics
Simulean_AnalyticsRobert Lemay
 
The Swedish experience with scanner data from sampling to index calculation -...
The Swedish experience with scanner data from sampling to index calculation -...The Swedish experience with scanner data from sampling to index calculation -...
The Swedish experience with scanner data from sampling to index calculation -...
Istituto nazionale di statistica
 
Promotion Analytics in Consumer Electronics - Module 1: Data
Promotion Analytics in Consumer Electronics - Module 1: DataPromotion Analytics in Consumer Electronics - Module 1: Data
Promotion Analytics in Consumer Electronics - Module 1: Data
Minha Hwang
 
Presentation on the state of play for a recommendation on processing scanner ...
Presentation on the state of play for a recommendation on processing scanner ...Presentation on the state of play for a recommendation on processing scanner ...
Presentation on the state of play for a recommendation on processing scanner ...
Istituto nazionale di statistica
 
Yulia ladigina (danone). sergey syamichev (lenta)
Yulia ladigina (danone). sergey syamichev (lenta)Yulia ladigina (danone). sergey syamichev (lenta)
Yulia ladigina (danone). sergey syamichev (lenta)ECR Community
 
презентация Ecr англ
презентация Ecr англпрезентация Ecr англ
презентация Ecr англECR Community
 
Roadshow Presentation - 3-Months Key Sales Figures 2019/20
Roadshow Presentation - 3-Months Key Sales Figures 2019/20Roadshow Presentation - 3-Months Key Sales Figures 2019/20
Roadshow Presentation - 3-Months Key Sales Figures 2019/20
Barry Callebaut
 
Ekomilk AMP system for real time on farm milk quality and animal health control
Ekomilk AMP system for real time on farm milk quality and animal health controlEkomilk AMP system for real time on farm milk quality and animal health control
Ekomilk AMP system for real time on farm milk quality and animal health control
Ekomilk
 
20131030 OECD - Competition and value distribution along the food chain. Span...
20131030 OECD - Competition and value distribution along the food chain. Span...20131030 OECD - Competition and value distribution along the food chain. Span...
20131030 OECD - Competition and value distribution along the food chain. Span...
FIAB
 
Ch07 cma
Ch07  cmaCh07  cma
Ch07 cma
Ibrahim Jimalle
 
Evolution of the retail trade market and the role of modern distribution - Hé...
Evolution of the retail trade market and the role of modern distribution - Hé...Evolution of the retail trade market and the role of modern distribution - Hé...
Evolution of the retail trade market and the role of modern distribution - Hé...
Istituto nazionale di statistica
 
The impact of spatial aggregation on price indices with scanner data - Ingol...
The impact of spatial aggregation on price indices  with scanner data - Ingol...The impact of spatial aggregation on price indices  with scanner data - Ingol...
The impact of spatial aggregation on price indices with scanner data - Ingol...
Istituto nazionale di statistica
 
Business Intelligence: Multidimensional Analysis
Business Intelligence: Multidimensional AnalysisBusiness Intelligence: Multidimensional Analysis
Business Intelligence: Multidimensional Analysis
Michael Lamont
 
Presentatie stefan herold pim in the 360 degree view ronde 2
Presentatie stefan herold pim in the 360 degree view ronde 2Presentatie stefan herold pim in the 360 degree view ronde 2
Presentatie stefan herold pim in the 360 degree view ronde 2
e-Channel Netwerk
 
CAGE Conference 2016
CAGE Conference 2016CAGE Conference 2016
CAGE Conference 2016
Barry Callebaut
 
Session 8 c subnational pp ps discussion of biggeri paper
Session 8 c subnational pp ps   discussion of biggeri paperSession 8 c subnational pp ps   discussion of biggeri paper
Session 8 c subnational pp ps discussion of biggeri paper
IARIW 2014
 
BVDW Online sales study of the Circle of Online Marketers (OVK)
BVDW Online sales study of the Circle of Online Marketers (OVK)BVDW Online sales study of the Circle of Online Marketers (OVK)
BVDW Online sales study of the Circle of Online Marketers (OVK)
IAB Europe
 
Financial Analysis Coursework April 2016 - Ocado
Financial Analysis Coursework April 2016 - Ocado Financial Analysis Coursework April 2016 - Ocado
Financial Analysis Coursework April 2016 - Ocado Oliver Goodwin - MCSI
 

Similar to Belgian scanner data project: methodology and results - Ken Van Loon, François Valenduc (20)

Simulean_Analytics
Simulean_AnalyticsSimulean_Analytics
Simulean_Analytics
 
The Swedish experience with scanner data from sampling to index calculation -...
The Swedish experience with scanner data from sampling to index calculation -...The Swedish experience with scanner data from sampling to index calculation -...
The Swedish experience with scanner data from sampling to index calculation -...
 
Promotion Analytics in Consumer Electronics - Module 1: Data
Promotion Analytics in Consumer Electronics - Module 1: DataPromotion Analytics in Consumer Electronics - Module 1: Data
Promotion Analytics in Consumer Electronics - Module 1: Data
 
Competition Law
Competition LawCompetition Law
Competition Law
 
Presentation on the state of play for a recommendation on processing scanner ...
Presentation on the state of play for a recommendation on processing scanner ...Presentation on the state of play for a recommendation on processing scanner ...
Presentation on the state of play for a recommendation on processing scanner ...
 
Yulia ladigina (danone). sergey syamichev (lenta)
Yulia ladigina (danone). sergey syamichev (lenta)Yulia ladigina (danone). sergey syamichev (lenta)
Yulia ladigina (danone). sergey syamichev (lenta)
 
презентация Ecr англ
презентация Ecr англпрезентация Ecr англ
презентация Ecr англ
 
Roadshow Presentation - 3-Months Key Sales Figures 2019/20
Roadshow Presentation - 3-Months Key Sales Figures 2019/20Roadshow Presentation - 3-Months Key Sales Figures 2019/20
Roadshow Presentation - 3-Months Key Sales Figures 2019/20
 
Ekomilk AMP system for real time on farm milk quality and animal health control
Ekomilk AMP system for real time on farm milk quality and animal health controlEkomilk AMP system for real time on farm milk quality and animal health control
Ekomilk AMP system for real time on farm milk quality and animal health control
 
Strategy Pack AW15 Pans
Strategy Pack AW15 PansStrategy Pack AW15 Pans
Strategy Pack AW15 Pans
 
20131030 OECD - Competition and value distribution along the food chain. Span...
20131030 OECD - Competition and value distribution along the food chain. Span...20131030 OECD - Competition and value distribution along the food chain. Span...
20131030 OECD - Competition and value distribution along the food chain. Span...
 
Ch07 cma
Ch07  cmaCh07  cma
Ch07 cma
 
Evolution of the retail trade market and the role of modern distribution - Hé...
Evolution of the retail trade market and the role of modern distribution - Hé...Evolution of the retail trade market and the role of modern distribution - Hé...
Evolution of the retail trade market and the role of modern distribution - Hé...
 
The impact of spatial aggregation on price indices with scanner data - Ingol...
The impact of spatial aggregation on price indices  with scanner data - Ingol...The impact of spatial aggregation on price indices  with scanner data - Ingol...
The impact of spatial aggregation on price indices with scanner data - Ingol...
 
Business Intelligence: Multidimensional Analysis
Business Intelligence: Multidimensional AnalysisBusiness Intelligence: Multidimensional Analysis
Business Intelligence: Multidimensional Analysis
 
Presentatie stefan herold pim in the 360 degree view ronde 2
Presentatie stefan herold pim in the 360 degree view ronde 2Presentatie stefan herold pim in the 360 degree view ronde 2
Presentatie stefan herold pim in the 360 degree view ronde 2
 
CAGE Conference 2016
CAGE Conference 2016CAGE Conference 2016
CAGE Conference 2016
 
Session 8 c subnational pp ps discussion of biggeri paper
Session 8 c subnational pp ps   discussion of biggeri paperSession 8 c subnational pp ps   discussion of biggeri paper
Session 8 c subnational pp ps discussion of biggeri paper
 
BVDW Online sales study of the Circle of Online Marketers (OVK)
BVDW Online sales study of the Circle of Online Marketers (OVK)BVDW Online sales study of the Circle of Online Marketers (OVK)
BVDW Online sales study of the Circle of Online Marketers (OVK)
 
Financial Analysis Coursework April 2016 - Ocado
Financial Analysis Coursework April 2016 - Ocado Financial Analysis Coursework April 2016 - Ocado
Financial Analysis Coursework April 2016 - Ocado
 

More from Istituto nazionale di statistica

Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
Istituto nazionale di statistica
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
Istituto nazionale di statistica
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
Istituto nazionale di statistica
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
Istituto nazionale di statistica
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
Istituto nazionale di statistica
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
Istituto nazionale di statistica
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
Istituto nazionale di statistica
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
Istituto nazionale di statistica
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
Istituto nazionale di statistica
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica1414a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica14
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
Istituto nazionale di statistica
 

More from Istituto nazionale di statistica (20)

Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profitCensimenti Permanenti Istituzioni non profit
Censimenti Permanenti Istituzioni non profit
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
Censimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni PubblicheCensimento Permanente Istituzioni Pubbliche
Censimento Permanente Istituzioni Pubbliche
 
14a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica1414a Conferenza Nazionale di Statisticacnstatistica14
14a Conferenza Nazionale di Statisticacnstatistica14
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 
14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica14a Conferenza Nazionale di Statistica
14a Conferenza Nazionale di Statistica
 

Recently uploaded

Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
Jean Carlos Nunes Paixão
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
TechSoup
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
kimdan468
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 

Recently uploaded (20)

Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Lapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdfLapbook sobre os Regimes Totalitários.pdf
Lapbook sobre os Regimes Totalitários.pdf
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Introduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp NetworkIntroduction to AI for Nonprofits with Tapp Network
Introduction to AI for Nonprofits with Tapp Network
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBCSTRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
STRAND 3 HYGIENIC PRACTICES.pptx GRADE 7 CBC
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 

Belgian scanner data project: methodology and results - Ken Van Loon, François Valenduc

  • 1. Belgian Scanner Data Project Methodology and results Rome 1&2 October François Valenduc & Ken Van Loon Statistics Belgium
  • 2. http://statbel.fgov.be Contents  Overview of the project  Combining scanner data and classical price collection – Link to COICOP – Stratification model  Methodology for calculating scanner data indices  Results – Unit value versus observed price – Experimental indices  Future developments
  • 3. http://statbel.fgov.be Scanner data: overview of the project  We receive scanner data from the 3 largest supermarket chains in Belgium: Colruyt, Carrefour and Delhaize (around 75% of the market)  Negotiations started early 2012  We received our first scanner datasets by the end of 2013 – Historical data for Colruyt and Carrefour starting from January 2012 – Historical data were received from Delhaize starting from January 2013  Each week we receive the data of the previous week via SFTP: – On Tuesday for Colruyt and Carrefour (+ 2 days) – The following weekend for Delhaize (+ 7 days)  The data we receive can be split into 2 parts – One dataset containing the product info – One dataset containing the internal classification of the retail chain  Currently, scanner data are used in 9 COICOP 5 groups in the CPI, implementation in the HICP/CPI starting in 2016 for at least COICOP 1
  • 4. http://statbel.fgov.be Scanner data: link to COICOP  First step: link the internal classification to COICOP 5 digits  Second step: create detailed groups at COICOP 6 digits (consumption segments) – Not the same segments as we use for classical price collection – Turnover information from scanner data shows that the current sample is not always representative – Groups at COICOP 6 digits are harmonised as much as possible between the different retail chains to allow for comparison – Around 450 COICOP 6 digits groups for COICOP 01 instead of 173 for classical price collection  Maximal use of internal classification of retail chains – Each chain has around 3500 internal groups • Often too detailed to use this directly (e.g. one product per group !) • Sometimes, internal groups are quite heterogeneous • Linking at product level is inevitable in order to create groups at COICOP 6 level : around 10% of the products are individually linked – Currently examining whether machine learning (using supervised learning models) can be used.
  • 5. http://statbel.fgov.be  To combine scanner data and classical price collection a stratification model is used. Indices are always combined at the COICOP 5 digits level. Aggregation based on expenditures by retail channel (HBS) Speciality stores – COICOP 5 Discounter – COICOP 5Supermarkets– COICOP 5 Aggregation based on turnover Supermarkets and discounters – COICOP 5 Global index – COICOP 5 Scanner data and classical price collection: stratification
  • 6. Scanner data and classical price collection: stratification http://statbel.fgov.be Scanner data COICOP 6 group 1 Jevons index Scanner data COICOP 6 group 2 Scanner data COICOP 6 group 3 Scanner data … Retail chain 1 – COICOP 5 Aggregation based on turnover from scanner data Retail chain 2 – COICOP 5 Retail chain 3 – COICOP 5 Supermarkets– COICOP 5 Aggregation based on turnover from annual accounts
  • 7. http://statbel.fgov.be Scanner data and classical price collection: stratification  Groups where prices of retail chains, discounters and speciality stores are combined (based on data from HBS 2012):  For other COICOP groups in COICOP 1 (where market share is below 15% for speciality stores), limitation to scanner data and discounters. – Our own research shows a high correlation between supermarkets and speciality stores for those groups COICOP Supermarkets & discounters Speciality stores 01.1.1.3 Bread 54% 46% 01.1.2.8 Other meat preparations 74% 26% 01.1.2.1 Beaf and veal 76% 24% 01.1.2.5 Other meat 76% 24% 01.1.2.2 Pork 78% 22% 01.1.2.7 Dried, salted or smoked meat 79% 21% 01.1.1.4 Other bakery products 80% 20% 01.1.2.3 Lamb and goat 81% 19% 01.1.2.4 Poultry 84% 16%
  • 8. http://statbel.fgov.be  Based on methods of the Netherlands and Switzerland – use of monthly chained Jevons index at COICOP 6 digits level – sampling: product included in the sample if 𝑠 𝑚+𝑠 𝑚−1 2 > 1 1.25 × 𝑛 with n = number of products and sm (or sm-1) = market shares of each product – price imputation for temporary missing products or out of sample products (to satisfy the identity and transitivity test) – dumping and outlier filters – prices that are excluded by dumping and outliers filter are also imputed – unit value price for each internal code is used (normally first 3 weeks of a month)  Calculation is done in SAS  Products falling out of the sample are verified against products entering the sample to check if product relaunches occur: – Direct comparison (if necessary with volume adjustment) in the case of product relaunches – Same system for volume discounts: most of the time, products with volume discounts (6 + 2 free) have different internal codes Scanner data: methodology
  • 9. http://statbel.fgov.be Scanner data: linking (relaunches)  Each month two lists are generated for every COICOP 6 digits group of every retail chain: – One list contains the “new” products in the sample – Another lists contains the products that have disappeared from the sample in the current month and the previous 3 months (also the turnover figures are listed)  In order to avoid a possible bias in the index level the old product and new product are linked if they refer to the same product (if necessary, volume adjustment is done)  Around 80 linkings per month for COICOP 1  Currently this linking is done manually in MS Access . We are doing research on text mining (fuzzy matching) to see if this can be done more efficiently Month COICOP Group Old product New product Coeff. June-14 01.1.1.4.02 Chocolate biscuits LU PRINCE - 6 + 2 gratis - Mini Stars B & W LU PRINCE - 225 g - Mini Stars Black & White 0.75 October-14 01.1.4.5.10 Cream cheese KRAFT PHILADELPHIA - 200 g - fromage light KRAFT PHILADELPHIA - 300 g - fromage light 1.50
  • 10. http://statbel.fgov.be Scanner data: impact of relaunches 98.0 98.5 99.0 99.5 100.0 100.5 101.0 101.5 12/2013 01/2014 02/2014 03/2014 04/2014 05/2014 06/2014 07/2014 08/2014 09/2014 10/2014 11/2014 12/2014 01/2015 02/2015 03/2015 04/2015 05/2015 06/2015 07/2015 08/2015 12/2013=100 01.1.1.4: Other bakery products Without linking With linking
  • 11. http://statbel.fgov.be Scanner data – Use of internal codes  Use of internal code to match products instead of GTIN (or EAN) – The internal code can also be used to verify the product on the website of the retail chain (useful for making a link between new and old products or for purchasing power parities) • The product description we receive is usually also the one used on the website, since this description is the most detailed. – Fresh products such as meat, fruits and vegetables are sold in different quantities, everything is aggregated with one internal code to get for example the average price per kilo of “chateaubriand steak” – GTIN are often reused for totally different product for in-store EANS: – GTIN codes are only used to verify the consistency in the link to the COICOP classification between the different retail chains Date GTIN Internal code Description 1/01/2013 2020380000000F1988111800338400000 Melon | Charentais | Philibon 2/03/2015 2020380000000F1986053100318010000 Green pepper | Sold loose
  • 12. http://statbel.fgov.be Scanner data – example multiple GTIN codes  Example of multiple GTIN-codes for the same product  For instance, Ferrero Rocher is sold under two GTIN codes at one chain: – 8000500032237 – 8000500167113 – But only one internal code S2001062000072760000  When it’s available under the 2 codes, the unit value is identical (5,99 €): Week GTIN Internal code Product description Unit Sales unit Turnover Unit value 07Oct20138000500032237 S2001062000072760000 Rocher | 30 pièces | Ferrero 0.375K 475 2845.25 5.99 07Oct20138000500167113 S2001062000072760000 Rocher | 30 pièces | Ferrero 0.375K 732 4381.69 5.99
  • 13. http://statbel.fgov.be 05238265 is the internal code Scanner data – Use of internal codes
  • 15. http://statbel.fgov.be Unit value vs observed price  Web scraping for COICOP 05.6.1.1 compared to scanner data
  • 16. http://statbel.fgov.be  Web scraping for COICOP 01.1.1.4 compared to scanner data Unit value vs observed price
  • 17. http://statbel.fgov.be Examples  Some examples: scanner data indices compared to traditional price collection for the period 12/2013 - 07/2015  Traditional price collection limited to the 3 chains for which we receive scanner data  Number of products used in the calculation (scanner data): COICOP Group Number of products 01.1.1.6 Pasta products and couscous 689 01.1.4.1 Whole milk 47 01.1.4.2 Low fat milk 124 01.1.4.5 Cheese 1440
  • 22. http://statbel.fgov.be – Compilation of HICP CT using scanner data: excise taxes depend on volume and alcoholic grade: some assumptions or simplification may be needed (also done at the moment for manual price collection). – Negotiations with Lidl (currently in progress) and Aldi (around 16% of market share together) – Negotiations with consumer electronics or clothing stores – Research on machine learning for link to COICOP – Research on text mining to detect product relaunches Future developments
  • 23. Thanks for your attention Questions ? http://statbel.fgov.be