SlideShare a Scribd company logo
1 of 26
Forecasting Intraday Call
Arrivals
Modelling Special Days
Devon K. Barrow
Nikolaos Kourentzes
27th European Conference on Operational Research
12-15 July 2015
University of Strathclyde
1. Research Questions
2. Call Arrival Data and
3. Experimental Design
4. Results
5. Conclusions and future work
Outline
Forecasting Intraday Call Arrivals Outline 2
• How do different forecasting methods
perform when forecasting (high frequency)
call centre arrivals?
• What is the impact of coding or not coding
of (functional) outlying periods on
forecasting performance?
Research Questions
Research Questions 3Forecasting Intraday Call Arrivals
1. Research Questions
2. Call Arrival Data and
3. Experimental Design
4. Results
5. Conclusions and future work
Outline
Forecasting Intraday Call Arrivals Call Centre Data 4
• High dimensional and sampled at a high
frequency (Kourentzes and Crone, 2010).
• Complex seasonal patterns
– Intraday
– Intraweek
– interyear dependencies(De Livera et al., 2011)
Call Arrival Data and Challenges
Challenges
5Call Centre DataForecasting Intraday Call Arrivals
Call Arrival Data and Challenges
Some call centre data
6
• Two series from a large UK service provider call
centre
• A leading entertainment company in Europe
• Data is sampled at half-hourly intervals
• Consists of 103 weeks and 3 days from 29 June 2012
to 23 June 2014 inclusive including bank holidays
and weekends.
Call Centre DataForecasting Intraday Call Arrivals
Call Arrival Data and Challenges
Complex seasonal patterns
7
Intraday Intraweek
Mean
Median
Monday
Intrayear
Call Centre DataForecasting Intraday Call Arrivals
• Development of new methods
– Double seasonal exponential smoothing (Taylor, 2008)
– Multiplicative double seasonal ARMA model (Taylor, 2003)
– Exponential weighting (Taylor, 2008, 2010)
– Regression (Tych et al., 2002; Taylor, 2010)
– Singular vector decomposition (Shen and Huang, 2005, 2008;
Shen, 2009)
– Gaussian linear mixed-effects models (Aldor-Noiman et al.,
2009; Ibrahim and L’Ecuyer, 2013)
• Intraweek seasonal moving average performs well
at medium to long horizons (Tandberg et al. 1995; Taylor, 2008; Taylor,
2010; Ibrahim and L’Ecuyer, 2013)
Call Arrival Data and Challenges
Handling high frequency complex seasonality
8Call Centre DataForecasting Intraday Call Arrivals
!
• Data is context sensitive
– Effects of holidays, special events and
promotional activities
• Prone to quite sizeable unexplained
variations (outliers)
– E.g. due to system failures and data
processing
Call Arrival Data and Challenges
Challenges
9Call Centre DataForecasting Intraday Call Arrivals
Call Arrival Data and Challenges
Anomalies (outliers)
10
Series 1
Series 1I
Call Centre DataForecasting Intraday Call Arrivals
Call centre data
Seasonal profile
12
4 8 12 16 20
0
100
200
300
400
Calls
Monday
4 8 12 16 20
Tuesday
4 8 12 16 20
Wednesday
4 8 12 16 20
Hour
Thursday
4 8 12 16 20
Friday
4 8 12 16 20
Saturday
4 8 12 16 20
Sunday
Median
25-75%
10-90%
05-95%
4 8 12 16 20
0
10
20
30
40
Calls
Monday
4 8 12 16 20
Tuesday
4 8 12 16 20
Wednesday
4 8 12 16 20
Hour
Thursday
4 8 12 16 20
Friday
4 8 12 16 20
Saturday
4 8 12 16 20
Sunday
Median
25-75%
10-90%
05-95%
Series 1
Series 1ILarge variation from the middle
Call Centre DataForecasting Intraday Call Arrivals
Call centre data
Outliers vs. median pattern
Call Centre Data 13
4 8 12 16 20
0
100
200
300
400
Monday
Calls
4 8 12 16 20
Tuesday
4 8 12 16 20
Wednesday
4 8 12 16 20
Thursday
Hour
4 8 12 16 20
Friday
4 8 12 16 20
Saturday
4 8 12 16 20
Sunday
Profile
Outliers
4 8 12 16 20
0
10
20
30
40
Monday
Calls
4 8 12 16 20
Tuesday
4 8 12 16 20
Wednesday
4 8 12 16 20
Thursday
Hour
4 8 12 16 20
Friday
4 8 12 16 20
Saturday
4 8 12 16 20
Sunday
Profile
Outliers
Series 1
Series 1IRegular profile very different from outliers
Forecasting Intraday Call Arrivals
• Approaches to handling ‘special days’:
– Information is either available and/or data is pre-
cleansed
– The forecaster has an external methodology for
tackling ‘special days’ (Jongbloed and Koole, 2001; Avramidis et al.,
2004; Taylor, 2008a; Pacheco et al., 2009; Taylor, 2010b)
– Removing such days altogether (Taylor et al. 2006)
– Singular vector decomposition for automatic outlier
detection (Shen and Huang, 2005)
• Kourentzes (2011) demonstrates that there are
substantial accuracy benefits to be had from
modelling irregular load patterns.
Call Arrival Data and Challenges
Handling ‘special days’
14Call Centre DataForecasting Intraday Call Arrivals
!
1. Research Questions
2. Call Arrival Data and
3. Experimental Design
4. Results
5. Conclusions and future work
Outline
Forecasting Intraday Call Arrivals Experimental Design 15
Experimental Design
Functional outlier modelling
16
• We evaluate seven alternative methodologies for
modelling functional outliers
• These are based on extensions of conventional
outlier modelling and novel approaches
• We assume that the data generating process of
normal observations is captured adequately and
the outliers are already labelled
Control
Single Binary
Dummy
Multiple Binary
Dummy
Single Integer
Profile Dummy
Trigonometric
Dummy
Model
Separately
Experimental DesignForecasting Intraday Call Arrivals
Experimental Design
Functional outlier modelling
17
• Control/benchmark
– A set of autoregressive lagged inputs (past values) are
identified using stepwise regression
– Outliers are not modelled
• Single Binary Dummy Variable
– Indicator variable s.t. one = outlier; zero otherwise
• Multiple Binary Dummy Variable
– S is the seasonal length (S=48)
– 48 (S) dummy variables to code each observation
– 47 (S – 1) dummy variables to code each observation
– Stepwise selection, s = {1,…,S=48} s.t. s is significantly
different from normal observations
Experimental DesignForecasting Intraday Call Arrivals
Experimental Design
Functional outlier modelling
18
• Single Integer Dummy Variable
– Monotonically increasing variable from 1 to 48 if outlier; zero
otherwise
• Profile Dummy Variable
– This variable is equal to the profile if there is an outlier; zero
otherwise
• Trigonometric Dummy Variables
– Sine and cosine if there is an outlier; zero otherwise
• Model Separately
– Create a new series containing only outliers
– Replace outliers in original series with normal observations
Experimental DesignForecasting Intraday Call Arrivals
Experimental Design
Experimental setup
Experimental design 19
• Neural network setup
– Inputs based on backwards regression + dummies +
seasonal dummies.
– Mode ensemble of 50 networks trained by scaled
conjugate gradient descent
– Hidden nodes identified experimentally for each set of
inputs
• Forecast creation
– Forecast horizon is set to 1 day ahead (1-48 half hourly
steps ahead)
– Test set of 100 days (4800 data points x 48 forecasted
horizons)
• Forecast evaluation
– Mean Absolute Error (MAE)
– Relative Mean Absolute Error (RMAE) i.e. MAE_{Method} /
MAE_{Control}
AE = |Yt - Ft|
Forecasting Intraday Call Arrivals
1. Research Questions
2. Call Arrival Data and
3. Experimental Design
4. Results
5. Conclusions and future work
Outline
Forecasting Intraday Call Arrivals Results 20
Results
Neural networks versus benchmarks
21
Time Series 1 Overall Outlier Normal
Naïve 1 3.930 2.711 4.269
Naïve Day 1.417 1.211 1.475
Naïve Week 1.333 1.057 1.410
MA Day 1.238 1.278 1.226
MA Week 1.233 0.845 1.341
ETS Day 1.887 1.438 2.013
ETS Week 1.529 1.165 1.630
ETS Double 1.086 0.857 1.149
MAPA Day 1.522 1.362 1.566
MAPA Week 1.484 1.274 1.542
NN Control 1.000 1.000 1.000
ResultsForecasting Intraday Call Arrivals
Results
Neural networks versus benchmarks
Time Series II Overall Outlier Normal
Naïve 1 2.236 1.770 2.500
Naïve Day 1.415 1.119 1.582
Naïve Week 1.433 1.157 1.590
MA Day 1.415 1.119 1.582
MA Week 1.129 0.973 1.217
ETS Day 1.600 1.368 1.731
ETS Week 1.479 1.288 1.587
ETS Double 1.151 0.990 1.242
MAPA Day 1.347 1.242 1.406
MAPA Week 1.280 1.201 1.325
NN Control 1.000 1.000 1.000
22ResultsForecasting Intraday Call Arrivals
Results
Outlier modelling versus control
Time Series 1 Overall Outlier Normal
NN Control 1.000 1.000 1.000
NN Bin1 0.996 0.933 1.013
NN BinS 0.884 0.822 0.901
NN BinS-1 0.873 0.831 0.885
NN Bin Step 0.897 0.858 0.908
NN Bin Back 0.895 0.850 0.907
NN Int 0.935 0.913 0.941
NN SinCos 0.980 0.865 1.012
NN Profile 0.986 0.953 0.995
NN Replace 1.002 1.019 0.997
23ResultsForecasting Intraday Call Arrivals
Results
Outlier modelling versus control
Time Series II Overall Outlier Normal
NN Control 1.000 1.000 1.000
NN Bin1 0.956 0.930 0.970
NN BinS 0.922 0.866 0.954
NN BinS-1 0.923 0.870 0.954
NN Bin Step 0.929 0.876 0.958
NN Bin Back 0.931 0.875 0.963
NN Int 0.957 0.940 0.966
NN SinCos 0.951 0.912 0.972
NN Profile 0.963 0.936 0.979
NN Replace 1.030 1.090 0.996
Results 24Forecasting Intraday Call Arrivals
1. Research Questions
2. Call Arrival Data and
3. Experimental Design
4. Results
5. Conclusions and future work
Outline
Forecasting Intraday Call Arrivals Conclusion and future work 25
Conclusion and future work
Conclusion and future work 26
• Conclusion
– Neural networks are good for call centre
data for two reasons:
• They can do complex structures
• They can do complex outliers with relatively
simple modelling
• Future work
– Automatic functional outlier detection
and modelling for call centre arrival data
Forecasting Intraday Call Arrivals
Devon K. Barrow
Coventry Business School
Coventry University, Priory Street, Coventry, CV1
5FB
Direct line: + 44 024 7765 7413
Skype: devon.k.barrow
Email: devon.barrow@coventry.ac.uk

More Related Content

Similar to Forecasting Intraday Arrivals at a Call Centre using Neural Networks: Forecasting Anomalous Days

Monte Carlo and Schedule Risk Analysis
Monte Carlo and Schedule Risk AnalysisMonte Carlo and Schedule Risk Analysis
Monte Carlo and Schedule Risk AnalysisIntaver Insititute
 
OSMC 2023 | Know your data: The stats behind your alerts by Dave McAllister
OSMC 2023 | Know your data: The stats behind your alerts by Dave McAllisterOSMC 2023 | Know your data: The stats behind your alerts by Dave McAllister
OSMC 2023 | Know your data: The stats behind your alerts by Dave McAllisterNETWAYS
 
ICIS - Power price prediction with neural networks
ICIS - Power price prediction with neural networksICIS - Power price prediction with neural networks
ICIS - Power price prediction with neural networksICIS
 
Mixed Effects Models - Fixed Effects
Mixed Effects Models - Fixed EffectsMixed Effects Models - Fixed Effects
Mixed Effects Models - Fixed EffectsScott Fraundorf
 
Thesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risksThesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risksTuanNguyen1697
 
Engineering Data Analysis-ProfCharlton
Engineering Data  Analysis-ProfCharltonEngineering Data  Analysis-ProfCharlton
Engineering Data Analysis-ProfCharltonCharltonInao1
 
Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecastingDevon Barrow
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering odsc
 
A Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap NetworkA Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap NetworkHa Phuong
 
Semantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media PostsSemantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media PostsGiulio Carducci
 
Reliability.pdf
Reliability.pdfReliability.pdf
Reliability.pdfChiWaiYiu
 
Benchmarking search relevance in industry vs academia
Benchmarking search relevance in industry vs academiaBenchmarking search relevance in industry vs academia
Benchmarking search relevance in industry vs academiaNick Craswell
 
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DMFrom Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DMMichał Łopuszyński
 
Leveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataLeveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataMRS
 
Keynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingKeynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingLionel Briand
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?Anubhav Jain
 

Similar to Forecasting Intraday Arrivals at a Call Centre using Neural Networks: Forecasting Anomalous Days (20)

1650 track2 anderberg
1650 track2 anderberg1650 track2 anderberg
1650 track2 anderberg
 
Monte Carlo and Schedule Risk Analysis
Monte Carlo and Schedule Risk AnalysisMonte Carlo and Schedule Risk Analysis
Monte Carlo and Schedule Risk Analysis
 
OSMC 2023 | Know your data: The stats behind your alerts by Dave McAllister
OSMC 2023 | Know your data: The stats behind your alerts by Dave McAllisterOSMC 2023 | Know your data: The stats behind your alerts by Dave McAllister
OSMC 2023 | Know your data: The stats behind your alerts by Dave McAllister
 
ICIS - Power price prediction with neural networks
ICIS - Power price prediction with neural networksICIS - Power price prediction with neural networks
ICIS - Power price prediction with neural networks
 
Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2Machine Learning Impact on IoT - Part 2
Machine Learning Impact on IoT - Part 2
 
Mixed Effects Models - Fixed Effects
Mixed Effects Models - Fixed EffectsMixed Effects Models - Fixed Effects
Mixed Effects Models - Fixed Effects
 
Thesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risksThesis presentation: Applications of machine learning in predicting supply risks
Thesis presentation: Applications of machine learning in predicting supply risks
 
Engineering Data Analysis-ProfCharlton
Engineering Data  Analysis-ProfCharltonEngineering Data  Analysis-ProfCharlton
Engineering Data Analysis-ProfCharlton
 
Cross-validation aggregation for forecasting
Cross-validation aggregation for forecastingCross-validation aggregation for forecasting
Cross-validation aggregation for forecasting
 
Feature Engineering
Feature Engineering Feature Engineering
Feature Engineering
 
A Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap NetworkA Study on Privacy Level in Publishing Data of Smart Tap Network
A Study on Privacy Level in Publishing Data of Smart Tap Network
 
Semantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media PostsSemantic Analysis to Compute Personality Traits from Social Media Posts
Semantic Analysis to Compute Personality Traits from Social Media Posts
 
timeseries1.ppt
timeseries1.ppttimeseries1.ppt
timeseries1.ppt
 
Reliability.pdf
Reliability.pdfReliability.pdf
Reliability.pdf
 
Benchmarking search relevance in industry vs academia
Benchmarking search relevance in industry vs academiaBenchmarking search relevance in industry vs academia
Benchmarking search relevance in industry vs academia
 
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DMFrom Raw Data to Deployed Product. Fast & Agile with CRISP-DM
From Raw Data to Deployed Product. Fast & Agile with CRISP-DM
 
Resume_Voice
Resume_VoiceResume_Voice
Resume_Voice
 
Leveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic dataLeveragin research, behavioural and demeographic data
Leveragin research, behavioural and demeographic data
 
Keynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based TestingKeynote SBST 2014 - Search-Based Testing
Keynote SBST 2014 - Search-Based Testing
 
How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?How might machine learning help advance solar PV research?
How might machine learning help advance solar PV research?
 

Recently uploaded

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Milind Agarwal
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 

Recently uploaded (20)

Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
Unveiling the Role of Social Media Suspect Investigators in Preventing Online...
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 

Forecasting Intraday Arrivals at a Call Centre using Neural Networks: Forecasting Anomalous Days

  • 1. Forecasting Intraday Call Arrivals Modelling Special Days Devon K. Barrow Nikolaos Kourentzes 27th European Conference on Operational Research 12-15 July 2015 University of Strathclyde
  • 2. 1. Research Questions 2. Call Arrival Data and 3. Experimental Design 4. Results 5. Conclusions and future work Outline Forecasting Intraday Call Arrivals Outline 2
  • 3. • How do different forecasting methods perform when forecasting (high frequency) call centre arrivals? • What is the impact of coding or not coding of (functional) outlying periods on forecasting performance? Research Questions Research Questions 3Forecasting Intraday Call Arrivals
  • 4. 1. Research Questions 2. Call Arrival Data and 3. Experimental Design 4. Results 5. Conclusions and future work Outline Forecasting Intraday Call Arrivals Call Centre Data 4
  • 5. • High dimensional and sampled at a high frequency (Kourentzes and Crone, 2010). • Complex seasonal patterns – Intraday – Intraweek – interyear dependencies(De Livera et al., 2011) Call Arrival Data and Challenges Challenges 5Call Centre DataForecasting Intraday Call Arrivals
  • 6. Call Arrival Data and Challenges Some call centre data 6 • Two series from a large UK service provider call centre • A leading entertainment company in Europe • Data is sampled at half-hourly intervals • Consists of 103 weeks and 3 days from 29 June 2012 to 23 June 2014 inclusive including bank holidays and weekends. Call Centre DataForecasting Intraday Call Arrivals
  • 7. Call Arrival Data and Challenges Complex seasonal patterns 7 Intraday Intraweek Mean Median Monday Intrayear Call Centre DataForecasting Intraday Call Arrivals
  • 8. • Development of new methods – Double seasonal exponential smoothing (Taylor, 2008) – Multiplicative double seasonal ARMA model (Taylor, 2003) – Exponential weighting (Taylor, 2008, 2010) – Regression (Tych et al., 2002; Taylor, 2010) – Singular vector decomposition (Shen and Huang, 2005, 2008; Shen, 2009) – Gaussian linear mixed-effects models (Aldor-Noiman et al., 2009; Ibrahim and L’Ecuyer, 2013) • Intraweek seasonal moving average performs well at medium to long horizons (Tandberg et al. 1995; Taylor, 2008; Taylor, 2010; Ibrahim and L’Ecuyer, 2013) Call Arrival Data and Challenges Handling high frequency complex seasonality 8Call Centre DataForecasting Intraday Call Arrivals !
  • 9. • Data is context sensitive – Effects of holidays, special events and promotional activities • Prone to quite sizeable unexplained variations (outliers) – E.g. due to system failures and data processing Call Arrival Data and Challenges Challenges 9Call Centre DataForecasting Intraday Call Arrivals
  • 10. Call Arrival Data and Challenges Anomalies (outliers) 10 Series 1 Series 1I Call Centre DataForecasting Intraday Call Arrivals
  • 11. Call centre data Seasonal profile 12 4 8 12 16 20 0 100 200 300 400 Calls Monday 4 8 12 16 20 Tuesday 4 8 12 16 20 Wednesday 4 8 12 16 20 Hour Thursday 4 8 12 16 20 Friday 4 8 12 16 20 Saturday 4 8 12 16 20 Sunday Median 25-75% 10-90% 05-95% 4 8 12 16 20 0 10 20 30 40 Calls Monday 4 8 12 16 20 Tuesday 4 8 12 16 20 Wednesday 4 8 12 16 20 Hour Thursday 4 8 12 16 20 Friday 4 8 12 16 20 Saturday 4 8 12 16 20 Sunday Median 25-75% 10-90% 05-95% Series 1 Series 1ILarge variation from the middle Call Centre DataForecasting Intraday Call Arrivals
  • 12. Call centre data Outliers vs. median pattern Call Centre Data 13 4 8 12 16 20 0 100 200 300 400 Monday Calls 4 8 12 16 20 Tuesday 4 8 12 16 20 Wednesday 4 8 12 16 20 Thursday Hour 4 8 12 16 20 Friday 4 8 12 16 20 Saturday 4 8 12 16 20 Sunday Profile Outliers 4 8 12 16 20 0 10 20 30 40 Monday Calls 4 8 12 16 20 Tuesday 4 8 12 16 20 Wednesday 4 8 12 16 20 Thursday Hour 4 8 12 16 20 Friday 4 8 12 16 20 Saturday 4 8 12 16 20 Sunday Profile Outliers Series 1 Series 1IRegular profile very different from outliers Forecasting Intraday Call Arrivals
  • 13. • Approaches to handling ‘special days’: – Information is either available and/or data is pre- cleansed – The forecaster has an external methodology for tackling ‘special days’ (Jongbloed and Koole, 2001; Avramidis et al., 2004; Taylor, 2008a; Pacheco et al., 2009; Taylor, 2010b) – Removing such days altogether (Taylor et al. 2006) – Singular vector decomposition for automatic outlier detection (Shen and Huang, 2005) • Kourentzes (2011) demonstrates that there are substantial accuracy benefits to be had from modelling irregular load patterns. Call Arrival Data and Challenges Handling ‘special days’ 14Call Centre DataForecasting Intraday Call Arrivals !
  • 14. 1. Research Questions 2. Call Arrival Data and 3. Experimental Design 4. Results 5. Conclusions and future work Outline Forecasting Intraday Call Arrivals Experimental Design 15
  • 15. Experimental Design Functional outlier modelling 16 • We evaluate seven alternative methodologies for modelling functional outliers • These are based on extensions of conventional outlier modelling and novel approaches • We assume that the data generating process of normal observations is captured adequately and the outliers are already labelled Control Single Binary Dummy Multiple Binary Dummy Single Integer Profile Dummy Trigonometric Dummy Model Separately Experimental DesignForecasting Intraday Call Arrivals
  • 16. Experimental Design Functional outlier modelling 17 • Control/benchmark – A set of autoregressive lagged inputs (past values) are identified using stepwise regression – Outliers are not modelled • Single Binary Dummy Variable – Indicator variable s.t. one = outlier; zero otherwise • Multiple Binary Dummy Variable – S is the seasonal length (S=48) – 48 (S) dummy variables to code each observation – 47 (S – 1) dummy variables to code each observation – Stepwise selection, s = {1,…,S=48} s.t. s is significantly different from normal observations Experimental DesignForecasting Intraday Call Arrivals
  • 17. Experimental Design Functional outlier modelling 18 • Single Integer Dummy Variable – Monotonically increasing variable from 1 to 48 if outlier; zero otherwise • Profile Dummy Variable – This variable is equal to the profile if there is an outlier; zero otherwise • Trigonometric Dummy Variables – Sine and cosine if there is an outlier; zero otherwise • Model Separately – Create a new series containing only outliers – Replace outliers in original series with normal observations Experimental DesignForecasting Intraday Call Arrivals
  • 18. Experimental Design Experimental setup Experimental design 19 • Neural network setup – Inputs based on backwards regression + dummies + seasonal dummies. – Mode ensemble of 50 networks trained by scaled conjugate gradient descent – Hidden nodes identified experimentally for each set of inputs • Forecast creation – Forecast horizon is set to 1 day ahead (1-48 half hourly steps ahead) – Test set of 100 days (4800 data points x 48 forecasted horizons) • Forecast evaluation – Mean Absolute Error (MAE) – Relative Mean Absolute Error (RMAE) i.e. MAE_{Method} / MAE_{Control} AE = |Yt - Ft| Forecasting Intraday Call Arrivals
  • 19. 1. Research Questions 2. Call Arrival Data and 3. Experimental Design 4. Results 5. Conclusions and future work Outline Forecasting Intraday Call Arrivals Results 20
  • 20. Results Neural networks versus benchmarks 21 Time Series 1 Overall Outlier Normal Naïve 1 3.930 2.711 4.269 Naïve Day 1.417 1.211 1.475 Naïve Week 1.333 1.057 1.410 MA Day 1.238 1.278 1.226 MA Week 1.233 0.845 1.341 ETS Day 1.887 1.438 2.013 ETS Week 1.529 1.165 1.630 ETS Double 1.086 0.857 1.149 MAPA Day 1.522 1.362 1.566 MAPA Week 1.484 1.274 1.542 NN Control 1.000 1.000 1.000 ResultsForecasting Intraday Call Arrivals
  • 21. Results Neural networks versus benchmarks Time Series II Overall Outlier Normal Naïve 1 2.236 1.770 2.500 Naïve Day 1.415 1.119 1.582 Naïve Week 1.433 1.157 1.590 MA Day 1.415 1.119 1.582 MA Week 1.129 0.973 1.217 ETS Day 1.600 1.368 1.731 ETS Week 1.479 1.288 1.587 ETS Double 1.151 0.990 1.242 MAPA Day 1.347 1.242 1.406 MAPA Week 1.280 1.201 1.325 NN Control 1.000 1.000 1.000 22ResultsForecasting Intraday Call Arrivals
  • 22. Results Outlier modelling versus control Time Series 1 Overall Outlier Normal NN Control 1.000 1.000 1.000 NN Bin1 0.996 0.933 1.013 NN BinS 0.884 0.822 0.901 NN BinS-1 0.873 0.831 0.885 NN Bin Step 0.897 0.858 0.908 NN Bin Back 0.895 0.850 0.907 NN Int 0.935 0.913 0.941 NN SinCos 0.980 0.865 1.012 NN Profile 0.986 0.953 0.995 NN Replace 1.002 1.019 0.997 23ResultsForecasting Intraday Call Arrivals
  • 23. Results Outlier modelling versus control Time Series II Overall Outlier Normal NN Control 1.000 1.000 1.000 NN Bin1 0.956 0.930 0.970 NN BinS 0.922 0.866 0.954 NN BinS-1 0.923 0.870 0.954 NN Bin Step 0.929 0.876 0.958 NN Bin Back 0.931 0.875 0.963 NN Int 0.957 0.940 0.966 NN SinCos 0.951 0.912 0.972 NN Profile 0.963 0.936 0.979 NN Replace 1.030 1.090 0.996 Results 24Forecasting Intraday Call Arrivals
  • 24. 1. Research Questions 2. Call Arrival Data and 3. Experimental Design 4. Results 5. Conclusions and future work Outline Forecasting Intraday Call Arrivals Conclusion and future work 25
  • 25. Conclusion and future work Conclusion and future work 26 • Conclusion – Neural networks are good for call centre data for two reasons: • They can do complex structures • They can do complex outliers with relatively simple modelling • Future work – Automatic functional outlier detection and modelling for call centre arrival data Forecasting Intraday Call Arrivals
  • 26. Devon K. Barrow Coventry Business School Coventry University, Priory Street, Coventry, CV1 5FB Direct line: + 44 024 7765 7413 Skype: devon.k.barrow Email: devon.barrow@coventry.ac.uk