Machine Learning Introduction

Slides presented at the ACCA event in Cyprus during the panel discussion "Machine Learning - More Science than Fiction".

Published in: Technology

  1. Machine Learning Panel: Opening Comments. Demetris Trihinas, Department of Computer Science, AILab @ University of Nicosia, trihinas.d@unic.ac.cy. ACCA ML Panel | Nicosia, May 2019 (5/22/19).
  2. Demetris Trihinas: Full-Time Faculty Member, University of Nicosia. “Designing and developing scalable and self-adaptive tools for data management, exploration and visualization.” @dtrihinas, http://dtrihinas.info, https://ailab.unic.ac.cy/, https://www.slideshare.net/DemetrisTrihinas
  3. What is NOT Machine Learning
     • Any question you can ask and get an immediate and concrete answer to (e.g., from a database or spreadsheet):
       • How many sofa models are currently in stock?
       • How many sofas did we sell in Germany last month?
       • Which of our customers bought a sofa worth more than 500 euros this quarter?
  4. The Machine Learning Process
     • Training: Data and Labels → Feature Engineering → ML Algorithm → Statistical Model
     • Labeled examples after feature engineering (annotated feature: wheels):
       Bike <2, 170, 35, 169, 51, 38, …>
       Bike <2, 119, 28, 210, 52, 02, …>
       Car <4, 13, 157, 90, 178, 145, …>
       Car <4, 12, 170, 82, 193, 145, …>
     • Testing: Data → Feature Engineering → Statistical Model → Inferencing
       <4, 18, 200, 64, 170, 141, …> → It’s a… Car
     • Finding “patterns” from features: y = f(x, θ)
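The training and inferencing steps on the slide above can be sketched in a few lines of Python. This is only an illustration: the slide does not name an algorithm, so a nearest-centroid classifier is assumed here, the function names `train` and `predict` are hypothetical, and the feature vectors are truncated to the six values visible on the slide.

```python
import math

# Labeled training vectors from the slide, truncated to the six visible values.
TRAINING = [
    ("Bike", [2, 170, 35, 169, 51, 38]),
    ("Bike", [2, 119, 28, 210, 52, 2]),
    ("Car", [4, 13, 157, 90, 178, 145]),
    ("Car", [4, 12, 170, 82, 193, 145]),
]


def train(examples):
    """Training: build one centroid (mean feature vector) per label."""
    grouped = {}
    for label, features in examples:
        grouped.setdefault(label, []).append(features)
    return {
        label: [sum(column) / len(rows) for column in zip(*rows)]
        for label, rows in grouped.items()
    }


def predict(model, features):
    """Inferencing: return the label whose centroid is closest (Euclidean)."""
    return min(model, key=lambda label: math.dist(model[label], features))


model = train(TRAINING)
prediction = predict(model, [4, 18, 200, 64, 170, 141])
print("It's a...", prediction)
```

The "statistical model" in this sketch is nothing more than the dictionary of centroids; real algorithms learn richer parameters, but the train-then-infer split is the same.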
  5. ML Modeling is an Approximation...
     • Training: Data and Labels → Feature Engineering → ML Algorithm → Statistical Model
     • Testing: Data → Feature Engineering → Statistical Model → Inferencing
       <4, 18, 200, 64, 170, 141, …> → “It’s a Car, I’m 0.88 sure”
       <3, 22, 23, 31, 101, 205, …> → “It’s a Car, I’m 0.71 sure”
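One way a model can attach an "I'm X sure" score to its answer is to turn distances into normalized weights. The sketch below is an assumption, not the method behind the slide: the centroids are the per-class means of the six visible feature values, `SCALE` is an arbitrary softness parameter, and the resulting scores are not meant to reproduce the slide's 0.88 and 0.71.

```python
import math

# Assumed class centroids (means of the six visible feature values per class).
CENTROIDS = {
    "Bike": [2, 144.5, 31.5, 189.5, 51.5, 20],
    "Car": [4, 12.5, 163.5, 86, 185.5, 145],
}
SCALE = 100.0  # hypothetical softness parameter; larger values flatten the scores


def predict_with_confidence(features):
    """Return (label, confidence), where confidence is a softmax over negative distances."""
    weights = {
        label: math.exp(-math.dist(centroid, features) / SCALE)
        for label, centroid in CENTROIDS.items()
    }
    total = sum(weights.values())
    label = max(weights, key=weights.get)
    return label, weights[label] / total


first = predict_with_confidence([4, 18, 200, 64, 170, 141])
second = predict_with_confidence([3, 22, 23, 31, 101, 205])
print(first, second)
```

Both vectors come out as "Car", but the second one sits further from the car centroid, so its score is lower: the model is approximating, not looking up a fact.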
  6. It’s Also Called “Learning” Because…
     • Training: Data and Labels → Feature Engineering → ML Algorithm → Statistical Model
     • Testing: Data → Feature Engineering → Statistical Model → Inferencing
     • Feedback loop: Evaluate and Retrain
       <4, 18, 200, 64, 170, 141, …> → It’s a… Car, 0.88
       <3, 22, 23, 31, 101, 205, …> → It’s a… Car, 0.71
       <3, 22, 23, 31, 101, 205, …> → It’s a… tricycle, 0.93
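The evaluate-and-retrain loop on the slide above can be sketched as follows. The nearest-centroid classifier and the names `train` and `predict` are assumptions for illustration, and the labeled tricycle example added during retraining is hypothetical data, not taken from the slides.

```python
import math


def train(examples):
    """Build one centroid (mean feature vector) per label."""
    grouped = {}
    for label, features in examples:
        grouped.setdefault(label, []).append(features)
    return {
        label: [sum(column) / len(rows) for column in zip(*rows)]
        for label, rows in grouped.items()
    }


def predict(model, features):
    """Return the label whose centroid is closest to the feature vector."""
    return min(model, key=lambda label: math.dist(model[label], features))


# Slide vectors, truncated to the six visible values.
training = [
    ("Bike", [2, 170, 35, 169, 51, 38]),
    ("Bike", [2, 119, 28, 210, 52, 2]),
    ("Car", [4, 13, 157, 90, 178, 145]),
    ("Car", [4, 12, 170, 82, 193, 145]),
]
three_wheeler = [3, 22, 23, 31, 101, 205]

# First pass: the model has never seen a tricycle, so it picks the closest known class.
before = predict(train(training), three_wheeler)

# Evaluate: a human flags the mistake and supplies a labeled example (hypothetical values).
training.append(("Tricycle", [3, 20, 25, 30, 100, 200]))

# Retrain: the same vector is now assigned to the new class.
after = predict(train(training), three_wheeler)
print(before, "->", after)
```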
  7. How Do Robots See Us
  8. Training is of Utmost Importance
     • More training data is generally good, but fitting it too closely leads to overfitting (irrelevant details are modeled).
     • Algorithms are not racist, do not hold prejudice or apply stereotypes… yes, but what happened to Amazon?
     Memorizing the answers is NOT Learning
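The point that memorizing the answers is not learning can be made concrete with a small comparison. In this sketch, which is an illustration under stated assumptions rather than anything shown in the slides, `memorizer` is a lookup table of exact training vectors and `nearest_neighbour` is a simple model that generalizes by similarity. The vectors are the six visible values from the earlier slides, and the unseen example is the earlier test vector that the slides identify as a car.

```python
import math

TRAINING = [
    ("Bike", [2, 170, 35, 169, 51, 38]),
    ("Bike", [2, 119, 28, 210, 52, 2]),
    ("Car", [4, 13, 157, 90, 178, 145]),
    ("Car", [4, 12, 170, 82, 193, 145]),
]
UNSEEN = [("Car", [4, 18, 200, 64, 170, 141])]


def memorizer(features):
    """Extreme overfitting: answer only if this exact vector was in the training data."""
    table = {tuple(f): label for label, f in TRAINING}
    return table.get(tuple(features))  # None for anything not seen before


def nearest_neighbour(features):
    """Generalizing model: answer with the label of the most similar training vector."""
    label, _ = min(TRAINING, key=lambda example: math.dist(example[1], features))
    return label


def accuracy(model, examples):
    """Fraction of examples for which the model returns the correct label."""
    return sum(model(f) == label for label, f in examples) / len(examples)


print("memorizer:", accuracy(memorizer, TRAINING), accuracy(memorizer, UNSEEN))
print("nearest neighbour:", accuracy(nearest_neighbour, TRAINING), accuracy(nearest_neighbour, UNSEEN))
```

Both models are perfect on the training data; only the one that generalizes is of any use on data it has not seen, which is why results must be assessed on held-out examples.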
  9. Beware of the “Trainer”… https://www.businessinsider.com/amazon-built-ai-to-hire-people-discriminated-against-women-2018-10
  10. Data Preprocessing
     • Preprocessing significantly increases ML performance and result quality.
     • Is 1% more error tolerable if the computation “promises” to run for 10 min instead of 5 hours?
     • Data Reduction: remove insignificant model dimensions.
     • Data Cleaning: remove incomplete and “dirty” data.
     Do not “influence” the result
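The two preprocessing steps named on the slide above can be sketched as follows. The rows are hypothetical data, `None` is assumed to mark a missing value, and "insignificant dimension" is interpreted here as a column with (near) zero variance, which is only one of several possible reduction criteria.

```python
from statistics import pvariance

# Hypothetical raw rows; None marks a missing value.
RAW = [
    [2, 170, 7, 35],
    [2, None, 7, 28],
    [4, 13, 7, 157],
    [4, 12, 7, 170],
]


def clean(rows):
    """Data Cleaning: drop rows that contain a missing value."""
    return [row for row in rows if None not in row]


def reduce_dimensions(rows, min_variance=1e-9):
    """Data Reduction: keep only the columns whose variance exceeds min_variance."""
    columns = list(zip(*rows))
    keep = [i for i, column in enumerate(columns) if pvariance(column) > min_variance]
    return keep, [[row[i] for i in keep] for row in rows]


cleaned = clean(RAW)
kept, reduced = reduce_dimensions(cleaned)
print("kept columns:", kept)
print("reduced rows:", reduced)
```

The third column holds the same value in every row, so it cannot help tell classes apart and is dropped; the incomplete second row is removed before any statistics are computed.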
  11. What is YOUR Role?
     • Ask good questions: a model is based on a hypothesis.
     • Provide training data. BEWARE: training can lead to bias.
     • Assess the quality of results: retrain and let the model “learn”.
     • TEST, TEST and TEST again.
     [Figure: The Ability Matrix]
     Insights are generated by humans, not machines!
  12. Questions? Demetris Trihinas, Department of Computer Science, AILab @ University of Nicosia, trihinas.d@unic.ac.cy
