SlideShare a Scribd company logo
1 of 22
CONTRIBUTION
ISHITA MATHUR – CODING IN R
MANSI MARWAHA – ANALYSIS FOR PPT
RIYA SEHGAL – MODEL BUILDING AND ANALYSIS FOR PPT
MEGHNA BAID – ANALYSIS FOR PPT
WHAT IS A TIME SERIES ?
• ANY METRIC THAT IS MEASURED OVER REGULAR TIME INTERVALS FORMS A
TIME SERIES. ANALYSIS OF TIME SERIES IS COMMERCIALLY IMPORTANCE
BECAUSE OF INDUSTRIAL NEED AND RELEVANCE ESPECIALLY W.R.T
FORECASTING (DEMAND, SALES, SUPPLY ETC).
• EACH DATA POINT AT TIME T IN A TIME SERIES CAN BE EXPRESSED AS
EITHER A SUM OR A PRODUCT OF 3 COMPONENTS, NAMELY, SEASONALITY
(ST), TREND (TT) AND ERROR (ET) (A.K.A WHITE NOISE).
AIM OF THE ANALYSIS
• TO ANALYSE TREND AND SEASONALITY OF THE RIDERS IN PORTLAND
OREGAN
• TO DEPICT INSIGHTS USING R
• TO FORECAST THE VALUES FOR THE FUTURE
UNDERSTANDING THE DATASET
• IT IS A MONTHLY COUNT OF RIDERS FOR THE PORTLAND PUBLIC
TRANSPORTATION SYSTEM. THE WEBSITE STATES THAT IT IS FROM JANUARY
1960 THROUGH JUNE 1969
• IT CONTAINS 2 COLUMNS ONE REPRESENTS THE MONTH AND OTHER
REPRESENTS THE TOTAL NUMBER OF RIDERS FOR THE PORTLAND PUBLIC
TRANSPORTATION SYSTEM
• FOLLOWING IS A LINK TO THE CASE STUDY
• HTTPS://WWW.KAGGLE.COM/HSANKESARA/PORTLAND-OREGON-AVG-RIDER-
MONTHLY-DATA/DATA#PORTLAND-OREGON-AVERAGE-MONTHLY-.CSV
METHODOLOGY
• EXPLORATORY DATA ANALYSIS
• TIME SERIES AND DECOMPOSING PHASE
• HYPOTHESIS FOR STATIONARY AND NON STATIONARY SERIES
• CHECKING ACCURACY
• FORECASTING FOR THE FUTURE
CONVERTING THE DATA SET INTO TIME
SERIES
WHILE WORKING WITH A TIME SERIES ANALYSIS IT IS VERY IMPORTANT TO
UNDERSTAND THE TRENDS, SEASONALITY AND ERROR IN THE DATA SET.
FOR THAT WE NEED TO CHANGE THE DATA SET INTO TIME SERIES SO AS TO
APPLY FURTHER FUNCTIONS
HERE WE HAVE USED TS () FUNCTION TO CONVERT THE DATA INTO TIME
SERIES
EXPLORATORY DATA ANALYSIS
• IT IS USED TO CHECK THE OVERALL STRUCTURE AND SUMMARY OF DATA TO
UNDERSTAND AN OVERALL PICTURE ABOUT THE DATA.
• IT IS USEFUL TO FIND BASIC VALUES AND OUTLIERS
• HERE WE FOUND OUT
THE STRUCTURE OF DATA SET USING STR() FUNCTION
FIRST 6 ROWS OF DATA USING HEAD() FUNCTION
NAMES OF THE COLUMNS USING NAMES() FUNCTION
NA VALUES IN THE DATA USING IS.NA() FUNCTION
RANGE: 613 – 1558
AVERAGE COUNT OF PEOPLE IS 1120
VISUAL REPRESENTATION OF TIME SERIES
• VISUAL REPRESENTATION IS OFTEN
CONSIDERED A BETTER WAY TO
UNDERSTAND THE DATA
• HERE WE HAVE USED THE FUNCTION
PLOT() TO CREATE A PLOT TO
VISUALLY IDENTIFY THE TRENDS
AND SEASONALITY
OUTLIER DETECTIONBox-Plot shows that there is no outlier in
dataset i.e. monthly count of riders for the
Portland public transportation system.
In case there are outliers in the dataset, we
have two methods to treat them.
IQR Test
This method detects an outlier where the Demand
value is greater than (or equal to) Q3+1.5*(Q3-Q1)
DECOMPOSITION OF TIME SERIES COMPONENTS
• SEASONAL TREND
DECOMPOSITION USING (STL) IS
AN ALGORITHM THAT WAS
DEVELOPED TO HELP TO DIVIDE UP
A TIME SERIES INTO THREE
COMPONENTS NAMELY:
1.TREND
2.SEASONALITY
3.REMAINDER
PROPERTIES OF RESIDUAL
• RESIDUAL MUST BE STATIONARY (STATISTICAL PROPERTY MUST BE CONSTANT)
• STATIONARITY MEANS MEAN IS CONSTANT , VARIANCE IS CONSTANT AND
AUTOCOVARIANCE IS CONSTANT FOR SAME ORDER
• AUGMENTED DICKEY FULLER TEST
• H0 : SERIES IS NOT STATIONARY
• H1: SERIES IS STATIONARY
• FUNCTION USED- ADF.TEST(RESIDUAL)
• HERE, P VALUE = 0.01 < 0.05 , THEREFORE HO GETS REJECTED
• THEREFORE, RESIDUALS ARE STATIONARY.
• IN CASE, RESIDUALS ARE NOT STATIONARY WE TAKE THE LAG VALUES BY
DIFFERENCING.
ARIMA MODEL
• AUTO REGRESSIVE INTEGRATED MOVING AVERAGE’ IS A CLASS OF MODELS THAT
‘EXPLAINS’ A GIVEN TIME SERIES BASED ON ITS OWN PAST VALUES, THAT IS, ITS OWN
LAGS AND THE LAGGED FORECAST ERRORS, SO THAT EQUATION CAN BE USED TO
FORECAST FUTURE VALUES. AN ARIMA MODEL IS CHARACTERIZED BY 3 TERMS: P, D, Q
• WHERE,
P IS THE ORDER OF THE AR TERM
Q IS THE ORDER OF THE MA TERM
D IS THE NUMBER OF DIFFERENCING REQUIRED TO MAKE THE TIME SERIES STATIONARY
PLOT OF PACF CHART
Partial
autocorrelation
can be imagined
as the correlation
between the series
and its lag, after
excluding the
contributions from
the intermediate
lags
PLOT OF ACF CHART
ACF is a plot of
total correlation
between different
lag functions.
FINDING ORDER OF THE ARMA MODEL
• THE REQUIRED NUMBER OF AR TERMS BY INSPECTING THE PARTIAL
AUTOCORRELATION (PACF) PLOT. (HERE, P=1.5)
• THE RIGHT ORDER OF DIFFERENCING IS THE MINIMUM DIFFERENCING
REQUIRED TO GET A NEAR-STATIONARY SERIES (HERE, D=0)
• THE ACF TELLS HOW MANY MA TERMS ARE REQUIRED TO REMOVE ANY
AUTOCORRELATION IN THE SERIES. (HERE, Q=1)
MODEL BUILDING AND ACCURACY
AS NOW RESIDUALS ARE STATIONARY AND WE FOUND OUT THE ORDER OF
ARIMA MODEL.
MODEL IS BUILT BY - RESID_MODEL<-ARIMA(PORTLANDTS,ORDER =
C(1.5,0,1))
ACCURACY
AS MEAN ABSOLUTE PERCENTAGE ERROR (MAPE) IS 4.14 WHICH IS LESS
THAN 7 , THE MODEL IS GOOD.
3 MONTHS FORECAST OF MONTHLY COUNT OF RIDERS FOR
THE PORTLAND PUBLIC TRANSPORTATION SYSTEM.
VALIDATIONS
SYMMETRIC
HISTOGRAM
QQ PLOT IS
APPROXIMATEY
A STRAIGHT
LINE , GOOD
MODEL
LJUNG BOX STATISTIC
•H0: NO AUTOCORRELATION
H1: AUTOCORRELATION
•HERE P-VALUE IS 0.9 WHICH IS MORE THAN 0.05
THEREFORE, NULL IS ACCEPTED, THERE IS NO AUTOCORRELATION
•THIS IS A GOOD MODEL
THANKYOU

More Related Content

What's hot

Intro to Forecasting in R - Part 4
Intro to Forecasting in R - Part 4Intro to Forecasting in R - Part 4
Intro to Forecasting in R - Part 4egoodwintx
 
Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...
Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...
Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...Simplilearn
 
X Bar And S Charts Mini Tutorial
X Bar And S Charts Mini TutorialX Bar And S Charts Mini Tutorial
X Bar And S Charts Mini Tutorialahmad bassiouny
 
On Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in NigeriaOn Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in NigeriaScientific Review SR
 
Methods of data presentation
Methods of data presentation Methods of data presentation
Methods of data presentation Dr Athar Khan
 
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?Smarten Augmented Analytics
 

What's hot (7)

Intro to Forecasting in R - Part 4
Intro to Forecasting in R - Part 4Intro to Forecasting in R - Part 4
Intro to Forecasting in R - Part 4
 
Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...
Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...
Time Series Analysis - 2 | Time Series in R | ARIMA Model Forecasting | Data ...
 
X Bar And S Charts Mini Tutorial
X Bar And S Charts Mini TutorialX Bar And S Charts Mini Tutorial
X Bar And S Charts Mini Tutorial
 
On Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in NigeriaOn Modeling Murder Crimes in Nigeria
On Modeling Murder Crimes in Nigeria
 
2007 02t
2007 02t2007 02t
2007 02t
 
Methods of data presentation
Methods of data presentation Methods of data presentation
Methods of data presentation
 
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
 

Similar to Portland oregon riders monthly data Using R

timeseries cheat sheet with example code for R
timeseries cheat sheet with example code for Rtimeseries cheat sheet with example code for R
timeseries cheat sheet with example code for Rderekjohnson549253
 
Air Passenger Prediction Using ARIMA Model
Air Passenger Prediction Using ARIMA Model Air Passenger Prediction Using ARIMA Model
Air Passenger Prediction Using ARIMA Model AkarshAvinash
 
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...IRJET Journal
 
Ecm time series forecast
Ecm time series forecastEcm time series forecast
Ecm time series forecastAyapparaj SKS
 
AESA Airborne Radar Theory and Operations Technical Training Course Sampler
AESA Airborne Radar Theory and Operations Technical Training Course SamplerAESA Airborne Radar Theory and Operations Technical Training Course Sampler
AESA Airborne Radar Theory and Operations Technical Training Course SamplerJim Jenkins
 
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...Editor IJCATR
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?Smarten Augmented Analytics
 
ANN ARIMA Hybrid Models for Time Series Prediction
ANN ARIMA Hybrid Models for Time Series PredictionANN ARIMA Hybrid Models for Time Series Prediction
ANN ARIMA Hybrid Models for Time Series PredictionM Baddar
 
Time series modelling arima-arch
Time series modelling  arima-archTime series modelling  arima-arch
Time series modelling arima-archjeevan solaskar
 
Weather forecasting model.pptx
Weather forecasting model.pptxWeather forecasting model.pptx
Weather forecasting model.pptxVisheshYadav12
 
Forecasting time series powerful and simple
Forecasting time series powerful and simpleForecasting time series powerful and simple
Forecasting time series powerful and simpleIvo Andreev
 
Probability Measurement of Setup And Hold Time With Statistical Static Timing...
Probability Measurement of Setup And Hold Time With Statistical Static Timing...Probability Measurement of Setup And Hold Time With Statistical Static Timing...
Probability Measurement of Setup And Hold Time With Statistical Static Timing...IJERA Editor
 
Different Models Used In Time Series - InsideAIML
Different Models Used In Time Series - InsideAIMLDifferent Models Used In Time Series - InsideAIML
Different Models Used In Time Series - InsideAIMLVijaySharma802
 
Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...
Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...
Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...BRNSS Publication Hub
 

Similar to Portland oregon riders monthly data Using R (20)

timeseries cheat sheet with example code for R
timeseries cheat sheet with example code for Rtimeseries cheat sheet with example code for R
timeseries cheat sheet with example code for R
 
ARIMA.pptx
ARIMA.pptxARIMA.pptx
ARIMA.pptx
 
Time series analysis
Time series analysisTime series analysis
Time series analysis
 
Air Passenger Prediction Using ARIMA Model
Air Passenger Prediction Using ARIMA Model Air Passenger Prediction Using ARIMA Model
Air Passenger Prediction Using ARIMA Model
 
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
IRJET- Performance Analysis of a Synchronized Receiver over Noiseless and Fad...
 
Ecm time series forecast
Ecm time series forecastEcm time series forecast
Ecm time series forecast
 
AESA Airborne Radar Theory and Operations Technical Training Course Sampler
AESA Airborne Radar Theory and Operations Technical Training Course SamplerAESA Airborne Radar Theory and Operations Technical Training Course Sampler
AESA Airborne Radar Theory and Operations Technical Training Course Sampler
 
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
Analysis of Various Periodicity Detection Algorithms in Time Series Data with...
 
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
What is ARIMAX Forecasting and How is it Used for Enterprise Analysis?
 
ANN ARIMA Hybrid Models for Time Series Prediction
ANN ARIMA Hybrid Models for Time Series PredictionANN ARIMA Hybrid Models for Time Series Prediction
ANN ARIMA Hybrid Models for Time Series Prediction
 
Time_Series_Assignment
Time_Series_AssignmentTime_Series_Assignment
Time_Series_Assignment
 
Time series modelling arima-arch
Time series modelling  arima-archTime series modelling  arima-arch
Time series modelling arima-arch
 
Weather forecasting model.pptx
Weather forecasting model.pptxWeather forecasting model.pptx
Weather forecasting model.pptx
 
Unit 1 dsuc
Unit 1 dsucUnit 1 dsuc
Unit 1 dsuc
 
Forecasting time series powerful and simple
Forecasting time series powerful and simpleForecasting time series powerful and simple
Forecasting time series powerful and simple
 
Probability Measurement of Setup And Hold Time With Statistical Static Timing...
Probability Measurement of Setup And Hold Time With Statistical Static Timing...Probability Measurement of Setup And Hold Time With Statistical Static Timing...
Probability Measurement of Setup And Hold Time With Statistical Static Timing...
 
Different Models Used In Time Series - InsideAIML
Different Models Used In Time Series - InsideAIMLDifferent Models Used In Time Series - InsideAIML
Different Models Used In Time Series - InsideAIML
 
Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...
Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...
Investigation of Parameter Behaviors in Stationarity of Autoregressive and Mo...
 
04_AJMS_288_20.pdf
04_AJMS_288_20.pdf04_AJMS_288_20.pdf
04_AJMS_288_20.pdf
 
Seasonal ARIMA
Seasonal ARIMASeasonal ARIMA
Seasonal ARIMA
 

More from Meghna Baid

Sustained Business Reforms and Ease of Doing Business in India
Sustained Business Reforms and Ease of Doing Business in IndiaSustained Business Reforms and Ease of Doing Business in India
Sustained Business Reforms and Ease of Doing Business in IndiaMeghna Baid
 
Marketing Research Project on T test
Marketing Research Project on T test Marketing Research Project on T test
Marketing Research Project on T test Meghna Baid
 
Trade Operations in Union Budget Analysis
Trade Operations in Union Budget Analysis Trade Operations in Union Budget Analysis
Trade Operations in Union Budget Analysis Meghna Baid
 
Cvs Annual Magazine 2017-18, College Of Vocational Studies, University Of Delhu
Cvs Annual Magazine 2017-18, College Of Vocational Studies, University Of DelhuCvs Annual Magazine 2017-18, College Of Vocational Studies, University Of Delhu
Cvs Annual Magazine 2017-18, College Of Vocational Studies, University Of DelhuMeghna Baid
 
Msme Growth Reforms Difference between INDIA and CHINA
Msme Growth Reforms Difference between INDIA and CHINAMsme Growth Reforms Difference between INDIA and CHINA
Msme Growth Reforms Difference between INDIA and CHINAMeghna Baid
 
Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...
Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...
Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...Meghna Baid
 
International business finance Foreign Exchange Markets
International business finance Foreign Exchange MarketsInternational business finance Foreign Exchange Markets
International business finance Foreign Exchange MarketsMeghna Baid
 

More from Meghna Baid (7)

Sustained Business Reforms and Ease of Doing Business in India
Sustained Business Reforms and Ease of Doing Business in IndiaSustained Business Reforms and Ease of Doing Business in India
Sustained Business Reforms and Ease of Doing Business in India
 
Marketing Research Project on T test
Marketing Research Project on T test Marketing Research Project on T test
Marketing Research Project on T test
 
Trade Operations in Union Budget Analysis
Trade Operations in Union Budget Analysis Trade Operations in Union Budget Analysis
Trade Operations in Union Budget Analysis
 
Cvs Annual Magazine 2017-18, College Of Vocational Studies, University Of Delhu
Cvs Annual Magazine 2017-18, College Of Vocational Studies, University Of DelhuCvs Annual Magazine 2017-18, College Of Vocational Studies, University Of Delhu
Cvs Annual Magazine 2017-18, College Of Vocational Studies, University Of Delhu
 
Msme Growth Reforms Difference between INDIA and CHINA
Msme Growth Reforms Difference between INDIA and CHINAMsme Growth Reforms Difference between INDIA and CHINA
Msme Growth Reforms Difference between INDIA and CHINA
 
Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...
Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...
Wallmart Depth Analysis with Overview and Swot Analysis, Business Trends By M...
 
International business finance Foreign Exchange Markets
International business finance Foreign Exchange MarketsInternational business finance Foreign Exchange Markets
International business finance Foreign Exchange Markets
 

Recently uploaded

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityNeo4j
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)OPEN KNOWLEDGE GmbH
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio, Inc.
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEOrtus Solutions, Corp
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideChristina Lin
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdfWave PLM
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningVitsRangannavar
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantAxelRicardoTrocheRiq
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVshikhaohhpro
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number SystemsJheuzeDellosa
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfjoe51371421
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...Christina Lin
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationkaushalgiri8080
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...stazi3110
 

Recently uploaded (20)

The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
EY_Graph Database Powered Sustainability
EY_Graph Database Powered SustainabilityEY_Graph Database Powered Sustainability
EY_Graph Database Powered Sustainability
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)Der Spagat zwischen BIAS und FAIRNESS (2024)
Der Spagat zwischen BIAS und FAIRNESS (2024)
 
HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed DataAlluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
Alluxio Monthly Webinar | Cloud-Native Model Training on Distributed Data
 
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASEBATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
BATTLEFIELD ORM: TIPS, TACTICS AND STRATEGIES FOR CONQUERING YOUR DATABASE
 
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop SlideBuilding Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
Building Real-Time Data Pipelines: Stream & Batch Processing workshop Slide
 
5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf5 Signs You Need a Fashion PLM Software.pdf
5 Signs You Need a Fashion PLM Software.pdf
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
cybersecurity notes for mca students for learning
cybersecurity notes for mca students for learningcybersecurity notes for mca students for learning
cybersecurity notes for mca students for learning
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
What is Binary Language? Computer Number Systems
What is Binary Language?  Computer Number SystemsWhat is Binary Language?  Computer Number Systems
What is Binary Language? Computer Number Systems
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
ODSC - Batch to Stream workshop - integration of Apache Spark, Cassandra, Pos...
 
Project Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanationProject Based Learning (A.I).pptx detail explanation
Project Based Learning (A.I).pptx detail explanation
 
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
Building a General PDE Solving Framework with Symbolic-Numeric Scientific Mac...
 

Portland oregon riders monthly data Using R

  • 1.
  • 2. CONTRIBUTION ISHITA MATHUR – CODING IN R MANSI MARWAHA – ANALYSIS FOR PPT RIYA SEHGAL – MODEL BUILDING AND ANALYSIS FOR PPT MEGHNA BAID – ANALYSIS FOR PPT
  • 3. WHAT IS A TIME SERIES ? • ANY METRIC THAT IS MEASURED OVER REGULAR TIME INTERVALS FORMS A TIME SERIES. ANALYSIS OF TIME SERIES IS COMMERCIALLY IMPORTANCE BECAUSE OF INDUSTRIAL NEED AND RELEVANCE ESPECIALLY W.R.T FORECASTING (DEMAND, SALES, SUPPLY ETC). • EACH DATA POINT AT TIME T IN A TIME SERIES CAN BE EXPRESSED AS EITHER A SUM OR A PRODUCT OF 3 COMPONENTS, NAMELY, SEASONALITY (ST), TREND (TT) AND ERROR (ET) (A.K.A WHITE NOISE).
  • 4. AIM OF THE ANALYSIS • TO ANALYSE TREND AND SEASONALITY OF THE RIDERS IN PORTLAND OREGAN • TO DEPICT INSIGHTS USING R • TO FORECAST THE VALUES FOR THE FUTURE
  • 5. UNDERSTANDING THE DATASET • IT IS A MONTHLY COUNT OF RIDERS FOR THE PORTLAND PUBLIC TRANSPORTATION SYSTEM. THE WEBSITE STATES THAT IT IS FROM JANUARY 1960 THROUGH JUNE 1969 • IT CONTAINS 2 COLUMNS ONE REPRESENTS THE MONTH AND OTHER REPRESENTS THE TOTAL NUMBER OF RIDERS FOR THE PORTLAND PUBLIC TRANSPORTATION SYSTEM • FOLLOWING IS A LINK TO THE CASE STUDY • HTTPS://WWW.KAGGLE.COM/HSANKESARA/PORTLAND-OREGON-AVG-RIDER- MONTHLY-DATA/DATA#PORTLAND-OREGON-AVERAGE-MONTHLY-.CSV
  • 6. METHODOLOGY • EXPLORATORY DATA ANALYSIS • TIME SERIES AND DECOMPOSING PHASE • HYPOTHESIS FOR STATIONARY AND NON STATIONARY SERIES • CHECKING ACCURACY • FORECASTING FOR THE FUTURE
  • 7. CONVERTING THE DATA SET INTO TIME SERIES WHILE WORKING WITH A TIME SERIES ANALYSIS IT IS VERY IMPORTANT TO UNDERSTAND THE TRENDS, SEASONALITY AND ERROR IN THE DATA SET. FOR THAT WE NEED TO CHANGE THE DATA SET INTO TIME SERIES SO AS TO APPLY FURTHER FUNCTIONS HERE WE HAVE USED TS () FUNCTION TO CONVERT THE DATA INTO TIME SERIES
  • 8. EXPLORATORY DATA ANALYSIS • IT IS USED TO CHECK THE OVERALL STRUCTURE AND SUMMARY OF DATA TO UNDERSTAND AN OVERALL PICTURE ABOUT THE DATA. • IT IS USEFUL TO FIND BASIC VALUES AND OUTLIERS • HERE WE FOUND OUT THE STRUCTURE OF DATA SET USING STR() FUNCTION FIRST 6 ROWS OF DATA USING HEAD() FUNCTION NAMES OF THE COLUMNS USING NAMES() FUNCTION NA VALUES IN THE DATA USING IS.NA() FUNCTION RANGE: 613 – 1558 AVERAGE COUNT OF PEOPLE IS 1120
  • 9. VISUAL REPRESENTATION OF TIME SERIES • VISUAL REPRESENTATION IS OFTEN CONSIDERED A BETTER WAY TO UNDERSTAND THE DATA • HERE WE HAVE USED THE FUNCTION PLOT() TO CREATE A PLOT TO VISUALLY IDENTIFY THE TRENDS AND SEASONALITY
  • 10. OUTLIER DETECTIONBox-Plot shows that there is no outlier in dataset i.e. monthly count of riders for the Portland public transportation system. In case there are outliers in the dataset, we have two methods to treat them. IQR Test This method detects an outlier where the Demand value is greater than (or equal to) Q3+1.5*(Q3-Q1)
  • 11. DECOMPOSITION OF TIME SERIES COMPONENTS • SEASONAL TREND DECOMPOSITION USING (STL) IS AN ALGORITHM THAT WAS DEVELOPED TO HELP TO DIVIDE UP A TIME SERIES INTO THREE COMPONENTS NAMELY: 1.TREND 2.SEASONALITY 3.REMAINDER
  • 12. PROPERTIES OF RESIDUAL • RESIDUAL MUST BE STATIONARY (STATISTICAL PROPERTY MUST BE CONSTANT) • STATIONARITY MEANS MEAN IS CONSTANT , VARIANCE IS CONSTANT AND AUTOCOVARIANCE IS CONSTANT FOR SAME ORDER • AUGMENTED DICKEY FULLER TEST • H0 : SERIES IS NOT STATIONARY • H1: SERIES IS STATIONARY • FUNCTION USED- ADF.TEST(RESIDUAL) • HERE, P VALUE = 0.01 < 0.05 , THEREFORE HO GETS REJECTED • THEREFORE, RESIDUALS ARE STATIONARY. • IN CASE, RESIDUALS ARE NOT STATIONARY WE TAKE THE LAG VALUES BY DIFFERENCING.
  • 13. ARIMA MODEL • AUTO REGRESSIVE INTEGRATED MOVING AVERAGE’ IS A CLASS OF MODELS THAT ‘EXPLAINS’ A GIVEN TIME SERIES BASED ON ITS OWN PAST VALUES, THAT IS, ITS OWN LAGS AND THE LAGGED FORECAST ERRORS, SO THAT EQUATION CAN BE USED TO FORECAST FUTURE VALUES. AN ARIMA MODEL IS CHARACTERIZED BY 3 TERMS: P, D, Q • WHERE, P IS THE ORDER OF THE AR TERM Q IS THE ORDER OF THE MA TERM D IS THE NUMBER OF DIFFERENCING REQUIRED TO MAKE THE TIME SERIES STATIONARY
  • 14. PLOT OF PACF CHART Partial autocorrelation can be imagined as the correlation between the series and its lag, after excluding the contributions from the intermediate lags
  • 15. PLOT OF ACF CHART ACF is a plot of total correlation between different lag functions.
  • 16. FINDING ORDER OF THE ARMA MODEL • THE REQUIRED NUMBER OF AR TERMS BY INSPECTING THE PARTIAL AUTOCORRELATION (PACF) PLOT. (HERE, P=1.5) • THE RIGHT ORDER OF DIFFERENCING IS THE MINIMUM DIFFERENCING REQUIRED TO GET A NEAR-STATIONARY SERIES (HERE, D=0) • THE ACF TELLS HOW MANY MA TERMS ARE REQUIRED TO REMOVE ANY AUTOCORRELATION IN THE SERIES. (HERE, Q=1)
  • 17. MODEL BUILDING AND ACCURACY AS NOW RESIDUALS ARE STATIONARY AND WE FOUND OUT THE ORDER OF ARIMA MODEL. MODEL IS BUILT BY - RESID_MODEL<-ARIMA(PORTLANDTS,ORDER = C(1.5,0,1)) ACCURACY AS MEAN ABSOLUTE PERCENTAGE ERROR (MAPE) IS 4.14 WHICH IS LESS THAN 7 , THE MODEL IS GOOD.
  • 18. 3 MONTHS FORECAST OF MONTHLY COUNT OF RIDERS FOR THE PORTLAND PUBLIC TRANSPORTATION SYSTEM.
  • 20. QQ PLOT IS APPROXIMATEY A STRAIGHT LINE , GOOD MODEL
  • 21. LJUNG BOX STATISTIC •H0: NO AUTOCORRELATION H1: AUTOCORRELATION •HERE P-VALUE IS 0.9 WHICH IS MORE THAN 0.05 THEREFORE, NULL IS ACCEPTED, THERE IS NO AUTOCORRELATION •THIS IS A GOOD MODEL