All Models Are Wrong, But Some Are Useful: 6 Lessons for Making Predictive Analytics Work

1. All Models Are Wrong, But Some Are Useful:  6 Lessons For Making Predictive Analytics Work Dr. Brian Mac Namee brian.macnamee@ucd.ie @brianmacnamee

2. machine learning ar,ﬁcial intelligence data science cogni,ve compu,ng big data Inspired by Brendan Tierney h:p://www.oraly,cs.com/2012/06/data-science-is-mul,disciplinary.html deep learning

3. ar#ﬁcial intelligence data science cogni#ve compu#ng big data deep learning Inspired by Brendan Tierney h:p://www.oraly#cs.com/2012/06/data-science-is-mul#disciplinary.html machine learning

6. if LOAN-SALARY RATIO < 1.5 then OUTCOME=’repay’ else if LOAN-SALARY RATIO > 4 then OUTCOME=’default’ else if AGE < 40 and OCCUPATION =’industrial’then OUTCOME=’default’ else OUTCOME=’repay’ end if Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

7. Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

10. Better data usually beats bigger models Prediction is a lot of things1 2 There is no such thing as a free lunch 3 Look for Goldilocks 4 Choose your evaluation carefully5 6 Remember Occam’s Razor

11. Prediction Is A   Lot Of Things 1

13. Predicting the value of an unknown variable at a time in the future

14. Forecast

15. 0 27.5 55 82.5 110 July September November January March May

16. 0 27.5 55 82.5 110 July September November January March May

17. Predict the value of an unknown variable associated with an object

18. Label

19. Image Set

20. Image Set Containing Nerves Not Containing Nerves

21. Predicting the propensity of somebody to take an action at a time in the future

22. Rank

23. Population

24. Population Least Likely   To Respond Most Likely   To Respond

25. "In data analytics a prediction is an assignment of a value to an unknown variable." Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

26. Predictions means a lot of different things, which means we can apply predictive modelling to many different problems. Think carefully about what type of decision you want to make (label, rank, or forecast), and then design a predictive modelling solution to best help with that. Lesson

27. 27 There Is No Such Thing As A   Free Lunch 2

28. www.rapidminer.com

29. 29 www.rapidminer.com

30. "We have dubbed the associated results No Free Lunch theorems because they demonstrate that if an algorithm performs well on a certain class of problems then it necessarily pays for that with degraded performance on the set of all remaining problems." Wolpert & Macready "No Free Lunch Theorems for Optimization", David H. Wolpert and William G. Macready, IEEE Transactions On Evolutionary Computation, vol. 1, no. 1, 1997 http://ti.arc.nasa.gov/m/profile/dhw/papers/78.pdf

32. Tree Model Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

33. Nearest Neighbour Model Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

34. Linear Model Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

36. Tree Model Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

37. Nearest Neighbour Model Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

38. Linear Model Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

39. There are a huge number of different predictive modelling algorithms. You need to experiment with lots of different ones. Lesson random forest decision tree istonic regression neural network nearest neighbour naive Bayes support vector machine logistic regression Bayesian network ensemble gradient boosting linear model winnow

40. Look For Goldilocks 3

41. ● ● ● ● ● 0 20 40 60 80 100 20000400006000080000 Age Income Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

45. 0 50 100 150 200 0.10.20.30.40.5 Training Iteration MisclassificationRate Performance on Training Set Performance on Validation Set

46. 0 50 100 150 200 0.10.20.30.40.5 Training Iteration MisclassificationRate Performance on Training Set Performance on Validation Set Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

55. Always tune your models, but be very careful of overfitting. A validation dataset is crucial here. Lesson

56. 56 Better Data Usually Beats Bigger Models 4

57. Digital Image Processing, Gonzalez & Woods, 2002

58. Digital Image Processing, Gonzalez & Woods, 2002 Denoised image 100 200 300 400 500 600 50 100 150 200 250 300 350 400 450

62. Raw Activity

63. Normalised Activity

64. Wake Aligned Activity

65. Cumulative Wake Aligned Activity

66. Activity

67. Activity Peak activity (day) Variation in activity (day) Total activity (day) Peak activity (1st hour) Variation in activity (1st hour) Total activity (1st hour) Area under cumulative activity curve …

68. Choose An Algorithm Generate Data Tune Model Parameters

69. Choose An Algorithm Generate Data Tune Model Parameters

70. Developing new, richer features is often a better way to improve model performance than using more sophisticated modelling techniques. Lesson

71. An Aside On Deep Learning

72. Deep Learning Google Trends: http://www.google.com/trends/ 2005 2007 2009 2011 2013 2015

73. Deep-learning methods are representaUon-learning methods with mulple levels of representaon, obtained by composing simple but non-linear modules that each transform the representaon at one level (starng with the raw input) into a representaon at a higher, slightly more abstract level. [LeCun et al, 2014] Deep Learning Yann LeCun, Yoshua Bengio & Geoffrey Hinton http://www.nature.com/nature/journal/v521/n7553/full/nature14539.html

74. 0 1 2 3 4 5 6 7 8 9

75. Convoluonal neural networks seem to brilliantly address the selecUvity-invariance dilemma that is fundamental to all eﬀorts to learn to classify objects: they produce representaons that are selecve to the aspects of the image that are important for discriminaon, but that are invariant to irrelevant aspects Convoluonal networks hold records for problems in image recogniUon, speech recogniUon, and text classiﬁcaUon amongst other areas

77. On Welsh Corgis, Computer Vision, and the Power of Deep Learning, Microsoft Research, 2014 http://research.microsoft.com/en-us/news/features/dnnvision-071414.aspx Rise of the machines, The Economist, 2015 http://www.economist.com/news/briefing/21650526-artificial-intelligence-scares-peopleexcessively-so-rise-machines

78. Hardware Data Algorithms Applica4ons

79. 79 Choose Your Evaluation Carefully 5

81. A marketing company working for a charity has developed two different models that predict the likelihood that donors will respond to a mail- shot asking them to make a special extra donation. Two models have been built and an evaluation experiment had been performed. Now we must decide which model to use.

82. Prediction TRUE FALSE Target TRUE 2355 337 FALSE 329 1714 Classification Accuracy: 85.93% Model 1

83. Prediction TRUE FALSE Target TRUE 2198 494 FALSE 471 1572 Classification Accuracy: 79.62% Model 2

84. Model 1 Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

85. Model 2 Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com

86. There are many different performance measures that we can use to evaluate the performance of a model. You need to pick the one that best matches the decisions you are trying to make. Lesson

87. 87 Remember Occam’s Razor 6

92. Timeline Followers Following Tweets + Metadata Profile

93. Tweets + Metadata Profile

94. Tweets + Metadata Profile

95. http://www.cso.ie/en/releasesandpublications/er/ibn/irishbabiesnames2014/

96. Always start with simple solutions first. Only add complexity if required. Lesson Frustra fit per plura quod potest fieri per pauciora (It is futile to do with more things that which can be done with fewer)

97. Better data usually beats bigger models Prediction is a lot of things1 2 There is no such thing as a free lunch 3 Look for Goldilocks 4 Choose your evaluation carefully5 6 Remember Occam’s Razor

98. Fundamentals of Machine Learning for Predictive Data Analytics John Kelleher, Brian Mac Namee, and Aoife D'Arcy www.machinelearningbook.com Thank You Questions? Training Course: Fundamentals of Machine  Learning for Predictive Data Analytics Dublin, March 21st - 23rd www.theanalyticsstore.ie/training/ brian.macnamee@ucd.ie @brianmacnamee

All Models Are Wrong, But Some Are Useful: 6 Lessons for Making Predictive Analytics Work

Recommended

Recommended

More Related Content

Similar to All Models Are Wrong, But Some Are Useful: 6 Lessons for Making Predictive Analytics Work

Similar to All Models Are Wrong, But Some Are Useful: 6 Lessons for Making Predictive Analytics Work (20)

Recently uploaded

Recently uploaded (20)

All Models Are Wrong, But Some Are Useful: 6 Lessons for Making Predictive Analytics Work