SlideShare a Scribd company logo
1 of 31
Download to read offline
Making Susanne
THE WORLD RECESSION SUSCEPTIBILITY ANALYSIS WEBTOOL
Aditya Gupta (2011009)
1
Overview
 Objective and Motive
 Process
 Data
 Data Collection
 Data Cleanup
 Dependent Variable
 Analysis
 Regression, Classification
 Clustering and Results
 Further Work
 Presentation
 Drawbacks
2
Objective
Key Questions
 To study what economic
composition makes economies
more susceptible to
Global Recession
 By how much? How significantly?
 Can we predict recession impact?
 Studying similarity of economies
 How to quantify
 Recession impact
 Susceptibility
Key Goals
 Empirically ascertain significance
and impact of certain economic
traits vis-à-vis expert opinions
 Develop a powerful model to
predict recession susceptibility
 Present a global and intuitive view
across parameters and their
significance, i.e., Susanne.
3
Why? 4
 Recessions Hurt
 2008 – 2010
 Unemployment and layoffs
 Austerity
 Bearish Markets
 Inflation
 Bank Runs
Data
A BIG PROBLEM IN ITSELF
5
Data Collection
 60+ economic variables linked to Recession
 Allegedly, as per sources like Economist, Forbes, World Bank, WWW
 Or as per our suspicion
 Preferable country-specific ratios (Normalized and Structural Information)
 13 years, starting from 2000 to 2013
 210 Countries
 2500 rows of 60+ columns, each row is a country-year identified
 Source:
 World Bank (OECD National Account File): http://data.worldbank.org/
 United Nations Comtrade Database, International Monetary Fund, Direction
of Trade Database, Balance of Payments Database, and more.
6
Simple Enough?
 No, its blistering gunk.
 Only 86 out of ~2500 rows have
complete data (no NA’s)
 Non-normalized values
 What now?
7
Data Cleanup
 Manually add values for nearly complete columns
 Compress and remove years 2008-2010
 Observed Class Variables – not Causal
 Remove countries with almost no data (Afghanistan and 30 others)
 Down to 2100 rows
 Drop columns if:
 Significant, and have very little data available
 Year values for super-specific variables like
“Merchandise Exports to Scandinavian Countries as % of exports”
 We surely don’t have this for most countries, especially those like Albania
 Insignificant
 Determined from Regressional Analysis (MLR) p-values
8
Still too many missing values
 What to do?
 Drop row (done)
 Weighted expansion of row (did not consider)
 Infer a value
 Average value for parameter for country
over 13 years
 Result: Fixed 5000 out of 70K cells
 Still only 86 full rows
 Why?
 No value for property exists for a country at
all…
 Solution: Global Average – the sacrilege!
 Awful. Pull towards the mean, misclassify.
 But no directional bias.
 Trade-off: Unlocks a world of data
9
For a specific country…
Year V1 Pred V2 Pred
2001 15 15 15 15
2002 17 17 17 17
2003 15 15 19 19
2004 16 16 21 21
2005 ? 16 ? 18
Avg 16 18
Dependent Variable
 Goal: Capture Recession Impact between 2008-10
 Technical Definition: Absolute Growth Rate
 Problem: 15% to 1%, still not recession
 Average Growth Rate drop between 2008 – 2010?
 Problem: 1% drop for UK (at 0.5%) vs. AFG (at 15%)
 Percentage Drop in Growth Rate
 Problem: 0.1% to 3%, 3000% change!
 Solution?
 Drop.SD: Drop in Growth Rate in number of Standard
Deviations over 2008 – 2010.
Variance in Growth rate over last 20 years.
 Variance: Lot of manual data collection
10
Distribution and Discretization of
Drop.SD
 Corresponded well with web information about
countries that “avoided recession” and those
“hit worst”
 21% of the countries labelled as unsusceptible
 Less than -0.25 Drop in GDP during 2007-2010
 Middle 36% labelled as relatively unaffected
 between -0.25 to 0.75 SD GDP Drop in Recession
 Highest 43% adversely affected
 >0.75 SD drop in GDP
 How good was this division?
 lm R-squared rose from 41.58% to 44.14% (no loss in
predictive power, i.e. reasonable classification)
 Good split. Most countries were affected horribly.
11
Other Options
• Equal Density Split
• Maximise Classification Accuracy
• Purely Contextual
Analysis
AFTER WE FINALLY HAVE 2099 COMPLETE ROWS OF DATA
12
Classification: Can we predict
Recession Susceptibility?
 Assumption (non trivial)
 Drop.SD correctly represents Recession Impact in 08-10
 Recession Impact in 08-10 correctly represents Recession Susceptibility
 Can’t do better but guess a few things.
 SVM has 92% accuracy. Seems like it.
 Caveat:
 Bootstrap Analysis: Training Data = Test Data
 Workaround (can’t generate new countries or years):
 5-Fold Cross Validation
13
Important Variables and their
Impact
 Using Multilinear Regression (MLR),
 for each Economic Variable, we get
 degree
 and direction
of impact on Drop.SD
14
Clustering
 Motivation:
“Are we brute-engineering a predictor, or is there an actual underlying
economic structural pattern of recession-susceptibility?”
 70% accuracy (consistent clusters)with k-means, using multiple k
values.
 You decide.
15
Aside: Some nifty R snippets
 Populating Missing Values smartly:
 d7.agg <- aggregate(d7.norm[,5:48], by
= list(Country.Name =
d7.norm$Country.Name, Time =
d7.norm$Time), FUN = function(x) {
if(anyNA(x)) { mean(na.omit(x)) } else
{ mean(x) } })
 Class labelling:
 Rec.Affect <- ifelse
(CompleteDataFinal$Drop.SD. >= 1, 1,
ifelse(CompleteDataFinal$Drop.SD. >= -
0.25, 0, -1)).
 SVM (predict Recession Susceptibility)
16
> model <- svm(as.factor(Rec.Affect) ~
., data = CompleteDataFinal[,
c(6:46)])
> predictions <- predict(model,
CompleteDataFinal[,c(6:46)])
> table(pred = predictions, true =
Rec.Affect)
true
pred -1 0 1
-1 383 8 7
0 33 675 41
1 15 61 876
> t <- table(pred = predictions, true
= Rec.Affect)
> (t[1,1] + t[2,2] + t[3,3])/sum(t)
[1] 0.9213911
 Allowing similarity checks between
Economies
 Overall
 Over Economic Categories
 Using a Semantic Web compliant
Cardinality Checks and Ontology
 Classify Economic Variables into
one or more of:
 Central Government
 Economic Structure
 Net Exports
 Banking
 Manufacturing
 GDP
 Discretize them into , and over:
 Value
 And Impact
 Using middle 80 percentile cut-offs
Further Work - Accessibility 17
Presentation
THE WEBTOOL, I.E, SUSANNE
18
A Knowledge Map (Ontology) 19
Ontology (2) 20
• About 3500 Nodes
• And 7000 Edges
Backend 21
Frontend 22
 Takeaway:
 Single Page Architecture
 Global State:
 (a) Country
 (a) Year
 (a) Property
 Multiple Interesting Views:
 Map
 Country
 Property
 And 2-way state
integration and state-
update
 thanks, angular
Search (Tutorial) 23
Search (2) 24
Search (3) 25
Search (4) 26
Search (5) 27
Search (6) 28
Search (7) – and the last one,
breathe.
29
Demo
http://susanne.bitballoon.com/
30
Thank You
Sincerely
31

More Related Content

Viewers also liked

Endocrine system 1
Endocrine system 1Endocrine system 1
Endocrine system 1
stanralph
 
Sonicwall
Sonicwall Sonicwall
Sonicwall
emarin13
 
e-bulletin-November2010
e-bulletin-November2010e-bulletin-November2010
e-bulletin-November2010
Vivek kaser
 

Viewers also liked (16)

Endocrine system 1
Endocrine system 1Endocrine system 1
Endocrine system 1
 
Infographie passage au cloud
Infographie passage au cloudInfographie passage au cloud
Infographie passage au cloud
 
Woody woodchuck
Woody woodchuckWoody woodchuck
Woody woodchuck
 
Arnie aardvark
Arnie  aardvarkArnie  aardvark
Arnie aardvark
 
Old joe crow
Old joe crowOld joe crow
Old joe crow
 
Sonicwall
Sonicwall Sonicwall
Sonicwall
 
China air pollution season kicks off with a cough and a wheeze as coal plants...
China air pollution season kicks off with a cough and a wheeze as coal plants...China air pollution season kicks off with a cough and a wheeze as coal plants...
China air pollution season kicks off with a cough and a wheeze as coal plants...
 
第14屆法雅客24小時攝影紀錄
第14屆法雅客24小時攝影紀錄第14屆法雅客24小時攝影紀錄
第14屆法雅客24小時攝影紀錄
 
Cognitive dissonance is a Good thing
Cognitive dissonance is a Good thingCognitive dissonance is a Good thing
Cognitive dissonance is a Good thing
 
e-bulletin-November2010
e-bulletin-November2010e-bulletin-November2010
e-bulletin-November2010
 
2014法雅客攝影馬拉松記錄
2014法雅客攝影馬拉松記錄2014法雅客攝影馬拉松記錄
2014法雅客攝影馬拉松記錄
 
Holiday shopping - When and How?
Holiday shopping - When and How?Holiday shopping - When and How?
Holiday shopping - When and How?
 
Rip Mix Feed- ETEC 540
Rip Mix Feed- ETEC 540Rip Mix Feed- ETEC 540
Rip Mix Feed- ETEC 540
 
Daisy dragon
Daisy dragonDaisy dragon
Daisy dragon
 
Team building
Team buildingTeam building
Team building
 
Holland vs Argentina Predictions
Holland vs Argentina PredictionsHolland vs Argentina Predictions
Holland vs Argentina Predictions
 

Similar to Making susanne

Intro to quant_analysis_students
Intro to quant_analysis_studentsIntro to quant_analysis_students
Intro to quant_analysis_students
mstegman
 

Similar to Making susanne (20)

Isabelle Roland - The Aggregate E ects of Credit Market Frictions: Evidence f...
Isabelle Roland - The Aggregate Eects of Credit Market Frictions: Evidence f...Isabelle Roland - The Aggregate Eects of Credit Market Frictions: Evidence f...
Isabelle Roland - The Aggregate E ects of Credit Market Frictions: Evidence f...
 
Costs of sovereign default: restructuring strategies, bank distress and the c...
Costs of sovereign default: restructuring strategies, bank distress and the c...Costs of sovereign default: restructuring strategies, bank distress and the c...
Costs of sovereign default: restructuring strategies, bank distress and the c...
 
HLEG thematic workshop on measuring economic, social and environmental resili...
HLEG thematic workshop on measuring economic, social and environmental resili...HLEG thematic workshop on measuring economic, social and environmental resili...
HLEG thematic workshop on measuring economic, social and environmental resili...
 
Exchange rate pass-through: What has changed since the crisis?
Exchange rate pass-through: What has changed since the crisis?Exchange rate pass-through: What has changed since the crisis?
Exchange rate pass-through: What has changed since the crisis?
 
Intro to quant_analysis_students
Intro to quant_analysis_studentsIntro to quant_analysis_students
Intro to quant_analysis_students
 
Andrea Linarello - Allocative Efficiency and Finance
Andrea Linarello - Allocative Efficiency and FinanceAndrea Linarello - Allocative Efficiency and Finance
Andrea Linarello - Allocative Efficiency and Finance
 
Jaana Remes
Jaana RemesJaana Remes
Jaana Remes
 
The UK Management and Expectations Survey: First Results
The UK Management and Expectations Survey: First Results The UK Management and Expectations Survey: First Results
The UK Management and Expectations Survey: First Results
 
Project Risk Assessment PowerPoint Presentation Slides
Project Risk Assessment PowerPoint Presentation SlidesProject Risk Assessment PowerPoint Presentation Slides
Project Risk Assessment PowerPoint Presentation Slides
 
On the economic effect of Western sanctions on Russia
On the economic effect of Western sanctions on RussiaOn the economic effect of Western sanctions on Russia
On the economic effect of Western sanctions on Russia
 
Monetary policy, factor allocation and growth
Monetary policy, factor allocation and growthMonetary policy, factor allocation and growth
Monetary policy, factor allocation and growth
 
Financial Heterogeneity and Monetary Union
Financial Heterogeneity and Monetary Union Financial Heterogeneity and Monetary Union
Financial Heterogeneity and Monetary Union
 
Project Risk Assessment Powerpoint Presentation Slides
Project Risk Assessment Powerpoint Presentation SlidesProject Risk Assessment Powerpoint Presentation Slides
Project Risk Assessment Powerpoint Presentation Slides
 
Type of data @ web mining discussion
Type of data @ web mining discussionType of data @ web mining discussion
Type of data @ web mining discussion
 
Type of data @ Web Mining Discussion
Type of data @ Web Mining DiscussionType of data @ Web Mining Discussion
Type of data @ Web Mining Discussion
 
Operative Risk Analysis of Industrial Small and medium Enterprises
Operative Risk Analysis of Industrial Small and medium EnterprisesOperative Risk Analysis of Industrial Small and medium Enterprises
Operative Risk Analysis of Industrial Small and medium Enterprises
 
Demand forecasting methods 1 gp
Demand forecasting methods 1 gpDemand forecasting methods 1 gp
Demand forecasting methods 1 gp
 
Data Driven Dashboards For Economic and Policy Analysis
Data Driven Dashboards For Economic and Policy AnalysisData Driven Dashboards For Economic and Policy Analysis
Data Driven Dashboards For Economic and Policy Analysis
 
Can you Deep Learn the Stock Market?
Can you Deep Learn the Stock Market?Can you Deep Learn the Stock Market?
Can you Deep Learn the Stock Market?
 
Costs of sovereign default
Costs of sovereign defaultCosts of sovereign default
Costs of sovereign default
 

More from Aditya Gupta

More from Aditya Gupta (7)

Career Advice for Engineering Students in India
Career Advice for Engineering Students in IndiaCareer Advice for Engineering Students in India
Career Advice for Engineering Students in India
 
Meetups - A realtime distributed monitoring application (Android)
Meetups - A realtime distributed monitoring application (Android) Meetups - A realtime distributed monitoring application (Android)
Meetups - A realtime distributed monitoring application (Android)
 
Multi Agent Systems presentation
Multi Agent Systems presentationMulti Agent Systems presentation
Multi Agent Systems presentation
 
Equity research project, 2014 (banking)
Equity research project, 2014 (banking)Equity research project, 2014 (banking)
Equity research project, 2014 (banking)
 
Food Corporation of India - HR Policy Translation
Food Corporation of India - HR Policy TranslationFood Corporation of India - HR Policy Translation
Food Corporation of India - HR Policy Translation
 
India Energy Security Scenarios Calculator - BTech Project
India Energy Security Scenarios Calculator - BTech ProjectIndia Energy Security Scenarios Calculator - BTech Project
India Energy Security Scenarios Calculator - BTech Project
 
Responses
ResponsesResponses
Responses
 

Recently uploaded

[[Nerul]] MNavi Mumbai Honoreble Call Girls Number-9833754194-Panvel Best Es...
[[Nerul]] MNavi Mumbai Honoreble  Call Girls Number-9833754194-Panvel Best Es...[[Nerul]] MNavi Mumbai Honoreble  Call Girls Number-9833754194-Panvel Best Es...
[[Nerul]] MNavi Mumbai Honoreble Call Girls Number-9833754194-Panvel Best Es...
priyasharma62062
 
Call Girls in Yamuna Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in  Yamuna Vihar  (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in  Yamuna Vihar  (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in Yamuna Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Recently uploaded (20)

Call Girls In Kolkata-📞7033799463-Independent Escorts Services In Dam Dam Air...
Call Girls In Kolkata-📞7033799463-Independent Escorts Services In Dam Dam Air...Call Girls In Kolkata-📞7033799463-Independent Escorts Services In Dam Dam Air...
Call Girls In Kolkata-📞7033799463-Independent Escorts Services In Dam Dam Air...
 
[[Nerul]] MNavi Mumbai Honoreble Call Girls Number-9833754194-Panvel Best Es...
[[Nerul]] MNavi Mumbai Honoreble  Call Girls Number-9833754194-Panvel Best Es...[[Nerul]] MNavi Mumbai Honoreble  Call Girls Number-9833754194-Panvel Best Es...
[[Nerul]] MNavi Mumbai Honoreble Call Girls Number-9833754194-Panvel Best Es...
 
Vip Call Girls Bhubaneswar😉 Bhubaneswar 9777949614 Housewife Call Girls Serv...
Vip Call Girls Bhubaneswar😉  Bhubaneswar 9777949614 Housewife Call Girls Serv...Vip Call Girls Bhubaneswar😉  Bhubaneswar 9777949614 Housewife Call Girls Serv...
Vip Call Girls Bhubaneswar😉 Bhubaneswar 9777949614 Housewife Call Girls Serv...
 
GIFT City Overview India's Gateway to Global Finance
GIFT City Overview  India's Gateway to Global FinanceGIFT City Overview  India's Gateway to Global Finance
GIFT City Overview India's Gateway to Global Finance
 
Thane Call Girls , 07506202331 Kalyan Call Girls
Thane Call Girls , 07506202331 Kalyan Call GirlsThane Call Girls , 07506202331 Kalyan Call Girls
Thane Call Girls , 07506202331 Kalyan Call Girls
 
Call Girls Howrah ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Howrah ( 8250092165 ) Cheap rates call girls | Get low budgetCall Girls Howrah ( 8250092165 ) Cheap rates call girls | Get low budget
Call Girls Howrah ( 8250092165 ) Cheap rates call girls | Get low budget
 
cost-volume-profit analysis.ppt(managerial accounting).pptx
cost-volume-profit analysis.ppt(managerial accounting).pptxcost-volume-profit analysis.ppt(managerial accounting).pptx
cost-volume-profit analysis.ppt(managerial accounting).pptx
 
Test bank for advanced assessment interpreting findings and formulating diffe...
Test bank for advanced assessment interpreting findings and formulating diffe...Test bank for advanced assessment interpreting findings and formulating diffe...
Test bank for advanced assessment interpreting findings and formulating diffe...
 
Kopar Khairane Cheapest Call Girls✔✔✔9833754194 Nerul Premium Call Girls-Navi...
Kopar Khairane Cheapest Call Girls✔✔✔9833754194 Nerul Premium Call Girls-Navi...Kopar Khairane Cheapest Call Girls✔✔✔9833754194 Nerul Premium Call Girls-Navi...
Kopar Khairane Cheapest Call Girls✔✔✔9833754194 Nerul Premium Call Girls-Navi...
 
logistics industry development power point ppt.pdf
logistics industry development power point ppt.pdflogistics industry development power point ppt.pdf
logistics industry development power point ppt.pdf
 
fundamentals of corporate finance 11th canadian edition test bank.docx
fundamentals of corporate finance 11th canadian edition test bank.docxfundamentals of corporate finance 11th canadian edition test bank.docx
fundamentals of corporate finance 11th canadian edition test bank.docx
 
Strategic Resources May 2024 Corporate Presentation
Strategic Resources May 2024 Corporate PresentationStrategic Resources May 2024 Corporate Presentation
Strategic Resources May 2024 Corporate Presentation
 
✂️ 👅 Independent Lucknow Escorts U.P Call Girls With Room Lucknow Call Girls ...
✂️ 👅 Independent Lucknow Escorts U.P Call Girls With Room Lucknow Call Girls ...✂️ 👅 Independent Lucknow Escorts U.P Call Girls With Room Lucknow Call Girls ...
✂️ 👅 Independent Lucknow Escorts U.P Call Girls With Room Lucknow Call Girls ...
 
Collecting banker, Capacity of collecting Banker, conditions under section 13...
Collecting banker, Capacity of collecting Banker, conditions under section 13...Collecting banker, Capacity of collecting Banker, conditions under section 13...
Collecting banker, Capacity of collecting Banker, conditions under section 13...
 
Mahendragarh Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Mahendragarh Escorts 🥰 8617370543 Call Girls Offer VIP Hot GirlsMahendragarh Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
Mahendragarh Escorts 🥰 8617370543 Call Girls Offer VIP Hot Girls
 
falcon-invoice-discounting-unlocking-prime-investment-opportunities
falcon-invoice-discounting-unlocking-prime-investment-opportunitiesfalcon-invoice-discounting-unlocking-prime-investment-opportunities
falcon-invoice-discounting-unlocking-prime-investment-opportunities
 
✂️ 👅 Independent Bhubaneswar Escorts Odisha Call Girls With Room Bhubaneswar ...
✂️ 👅 Independent Bhubaneswar Escorts Odisha Call Girls With Room Bhubaneswar ...✂️ 👅 Independent Bhubaneswar Escorts Odisha Call Girls With Room Bhubaneswar ...
✂️ 👅 Independent Bhubaneswar Escorts Odisha Call Girls With Room Bhubaneswar ...
 
Seeman_Fiintouch_LLP_Newsletter_May-2024.pdf
Seeman_Fiintouch_LLP_Newsletter_May-2024.pdfSeeman_Fiintouch_LLP_Newsletter_May-2024.pdf
Seeman_Fiintouch_LLP_Newsletter_May-2024.pdf
 
Technology industry / Finnish economic outlook
Technology industry / Finnish economic outlookTechnology industry / Finnish economic outlook
Technology industry / Finnish economic outlook
 
Call Girls in Yamuna Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in  Yamuna Vihar  (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in  Yamuna Vihar  (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in Yamuna Vihar (delhi) call me [🔝9953056974🔝] escort service 24X7
 

Making susanne

  • 1. Making Susanne THE WORLD RECESSION SUSCEPTIBILITY ANALYSIS WEBTOOL Aditya Gupta (2011009) 1
  • 2. Overview  Objective and Motive  Process  Data  Data Collection  Data Cleanup  Dependent Variable  Analysis  Regression, Classification  Clustering and Results  Further Work  Presentation  Drawbacks 2
  • 3. Objective Key Questions  To study what economic composition makes economies more susceptible to Global Recession  By how much? How significantly?  Can we predict recession impact?  Studying similarity of economies  How to quantify  Recession impact  Susceptibility Key Goals  Empirically ascertain significance and impact of certain economic traits vis-à-vis expert opinions  Develop a powerful model to predict recession susceptibility  Present a global and intuitive view across parameters and their significance, i.e., Susanne. 3
  • 4. Why? 4  Recessions Hurt  2008 – 2010  Unemployment and layoffs  Austerity  Bearish Markets  Inflation  Bank Runs
  • 5. Data A BIG PROBLEM IN ITSELF 5
  • 6. Data Collection  60+ economic variables linked to Recession  Allegedly, as per sources like Economist, Forbes, World Bank, WWW  Or as per our suspicion  Preferable country-specific ratios (Normalized and Structural Information)  13 years, starting from 2000 to 2013  210 Countries  2500 rows of 60+ columns, each row is a country-year identified  Source:  World Bank (OECD National Account File): http://data.worldbank.org/  United Nations Comtrade Database, International Monetary Fund, Direction of Trade Database, Balance of Payments Database, and more. 6
  • 7. Simple Enough?  No, its blistering gunk.  Only 86 out of ~2500 rows have complete data (no NA’s)  Non-normalized values  What now? 7
  • 8. Data Cleanup  Manually add values for nearly complete columns  Compress and remove years 2008-2010  Observed Class Variables – not Causal  Remove countries with almost no data (Afghanistan and 30 others)  Down to 2100 rows  Drop columns if:  Significant, and have very little data available  Year values for super-specific variables like “Merchandise Exports to Scandinavian Countries as % of exports”  We surely don’t have this for most countries, especially those like Albania  Insignificant  Determined from Regressional Analysis (MLR) p-values 8
  • 9. Still too many missing values  What to do?  Drop row (done)  Weighted expansion of row (did not consider)  Infer a value  Average value for parameter for country over 13 years  Result: Fixed 5000 out of 70K cells  Still only 86 full rows  Why?  No value for property exists for a country at all…  Solution: Global Average – the sacrilege!  Awful. Pull towards the mean, misclassify.  But no directional bias.  Trade-off: Unlocks a world of data 9 For a specific country… Year V1 Pred V2 Pred 2001 15 15 15 15 2002 17 17 17 17 2003 15 15 19 19 2004 16 16 21 21 2005 ? 16 ? 18 Avg 16 18
  • 10. Dependent Variable  Goal: Capture Recession Impact between 2008-10  Technical Definition: Absolute Growth Rate  Problem: 15% to 1%, still not recession  Average Growth Rate drop between 2008 – 2010?  Problem: 1% drop for UK (at 0.5%) vs. AFG (at 15%)  Percentage Drop in Growth Rate  Problem: 0.1% to 3%, 3000% change!  Solution?  Drop.SD: Drop in Growth Rate in number of Standard Deviations over 2008 – 2010. Variance in Growth rate over last 20 years.  Variance: Lot of manual data collection 10
  • 11. Distribution and Discretization of Drop.SD  Corresponded well with web information about countries that “avoided recession” and those “hit worst”  21% of the countries labelled as unsusceptible  Less than -0.25 Drop in GDP during 2007-2010  Middle 36% labelled as relatively unaffected  between -0.25 to 0.75 SD GDP Drop in Recession  Highest 43% adversely affected  >0.75 SD drop in GDP  How good was this division?  lm R-squared rose from 41.58% to 44.14% (no loss in predictive power, i.e. reasonable classification)  Good split. Most countries were affected horribly. 11 Other Options • Equal Density Split • Maximise Classification Accuracy • Purely Contextual
  • 12. Analysis AFTER WE FINALLY HAVE 2099 COMPLETE ROWS OF DATA 12
  • 13. Classification: Can we predict Recession Susceptibility?  Assumption (non trivial)  Drop.SD correctly represents Recession Impact in 08-10  Recession Impact in 08-10 correctly represents Recession Susceptibility  Can’t do better but guess a few things.  SVM has 92% accuracy. Seems like it.  Caveat:  Bootstrap Analysis: Training Data = Test Data  Workaround (can’t generate new countries or years):  5-Fold Cross Validation 13
  • 14. Important Variables and their Impact  Using Multilinear Regression (MLR),  for each Economic Variable, we get  degree  and direction of impact on Drop.SD 14
  • 15. Clustering  Motivation: “Are we brute-engineering a predictor, or is there an actual underlying economic structural pattern of recession-susceptibility?”  70% accuracy (consistent clusters)with k-means, using multiple k values.  You decide. 15
  • 16. Aside: Some nifty R snippets  Populating Missing Values smartly:  d7.agg <- aggregate(d7.norm[,5:48], by = list(Country.Name = d7.norm$Country.Name, Time = d7.norm$Time), FUN = function(x) { if(anyNA(x)) { mean(na.omit(x)) } else { mean(x) } })  Class labelling:  Rec.Affect <- ifelse (CompleteDataFinal$Drop.SD. >= 1, 1, ifelse(CompleteDataFinal$Drop.SD. >= - 0.25, 0, -1)).  SVM (predict Recession Susceptibility) 16 > model <- svm(as.factor(Rec.Affect) ~ ., data = CompleteDataFinal[, c(6:46)]) > predictions <- predict(model, CompleteDataFinal[,c(6:46)]) > table(pred = predictions, true = Rec.Affect) true pred -1 0 1 -1 383 8 7 0 33 675 41 1 15 61 876 > t <- table(pred = predictions, true = Rec.Affect) > (t[1,1] + t[2,2] + t[3,3])/sum(t) [1] 0.9213911
  • 17.  Allowing similarity checks between Economies  Overall  Over Economic Categories  Using a Semantic Web compliant Cardinality Checks and Ontology  Classify Economic Variables into one or more of:  Central Government  Economic Structure  Net Exports  Banking  Manufacturing  GDP  Discretize them into , and over:  Value  And Impact  Using middle 80 percentile cut-offs Further Work - Accessibility 17
  • 19. A Knowledge Map (Ontology) 19
  • 20. Ontology (2) 20 • About 3500 Nodes • And 7000 Edges
  • 22. Frontend 22  Takeaway:  Single Page Architecture  Global State:  (a) Country  (a) Year  (a) Property  Multiple Interesting Views:  Map  Country  Property  And 2-way state integration and state- update  thanks, angular
  • 29. Search (7) – and the last one, breathe. 29