SlideShare a Scribd company logo
A General Overview of Machine
Learning
Boise Data Science Meetup -- September 18, 2018
Ashish Sharma
➔ Software Systems Engineer -- HomeCU, LLC. (2017 - present)
➔ Founder -- AI Developers, Boise
➔ City Ambassador -- AI Saturdays (global initiative of nurture.ai)
➔ Alumnus -- Boise State University (MS in Computer Science, 2015-2017)
1
Overview
● AI and Applications
● Intro to Machine Learning
● Types of Machine Learning
● Which algorithm should I use?
● Effective Machine Learning
Image Source: Cousins of Artificial Intelligence -- Towards Datascience 2
AI Resurgence
➔ Computational Power (GPUs, cloud computing, distributed systems)
➔ Availability of large amount of Data (eg. Imagenet)
➔ Better theoretical understanding of the underlying techniques/algorithms
➔ Open and easily accessible research culture in academia and industry
(NIPS, ICML, archiv.org)
3
AI Resurgence (contd..)
➔ Netflix Challenge (2009) $1 Million Prize (User ratings for films)
➔ Kaggle (2010) (over more than a million users today)
➔ Fei-Fei Li and team at Stanford open sourced ImageNet (2008-2010)
◆ Imagenet Large Scale Visual Recognition Challenge (ILSVRC)
➔ Geoffrey Hinton’s Deep Learning Team wins ImageNet 2012 (Alexnet)
4
Common Applications
➔ Speech recognition (virtual assistants)
➔ Advanced machine translation and natural language intelligence
➔ Strategic gaming algorithms (AlphaGo, chess)
➔ Computer Vision (image classification and object detection)
➔ Autonomous Vehicles
➔ Manufacturing Companies (landing.ai)
➔ Healthcare (Google’s research on diabetic retinopathy -- with F-score of
0.95, surpassing the accuracy of 8 expert ophthalmologists)
5
Machine Learning
➔ Form of applied statistics with emphasis
on the use of computers to learn
complex mathematical functions.
➔ More formally, “A computer program is
said to learn from experience E with
respect to some class of tasks T and
performance measure P, if its
performance at tasks in T, as measured
by P, improves with experience E.”
Image Source: xkcd
6
Types of Machine Learning
➔ Supervised Learning
➔ Unsupervised Learning
➔ Reinforcement Learning
7
Supervised Learning
Terminologies:
➔ Input variable(s)
◆ independent variable(s)
◆ feature(s)/characteristic(s) of a single input object
◆ Numerical -- continuous ( height, area of house) , discrete (grades, age)
◆ Categorical (race, sex) -- nominal, ordinal
➔ Target variable(s)
◆ Dependent variable(s), number/vector (eg. price of house, patient is diabetic, etc.)
8
Supervised Learning
➔ Function approximation
◆ Mathematically: solve for coefficient(s) of a function
◆ Search for a best performing model from a hypothesis space.
◆ Make predictions based on historical (labeled) data
➔ Regression (predict continuous target variable)
◆ Univariate Regression (1 input variable, 1 output variable)
◆ Multiple Regression (>=2 input variables, 1 output variable)
◆ Multivariate Regression (>=2 output variable)
➔ Classification (predict discrete/categorical target variable)
◆ Email: Spam or not?
◆ Is this image a dog or cat?
9
Unsupervised Learning
➔ Unsupervised Learning
◆ Find hidden patterns and draw inference from (unlabeled) data
◆ Essential for preliminary data analysis and visualization
➔ Clustering (grouping of similar data points)
◆ K-Means, DBSCAN
➔ Dimensionality Reduction
◆ Principal Components Analysis
◆ Autoencoders
10
Reinforcement Learning
➔ AI, Animal Psychology, Control Theory
➔ Agents, Actions, Environment, Change in State, Reward/Punishment
➔ Eg. Deep Attari:
◆ Input: Snapshots of Attari board images (State and Actions)
◆ Algorithm: Convolutional NNs with no pooling
◆ Output layer: tailored for regression score (Maximize Reward)
11
Beginner’s Question!
➔ (Q)* Which Algorithm Should I Use?
➔ (A) The answer varies depending on many factors, including:
◆ The size, quality, and nature of data ;
◆ The available computational time;
◆ The urgency of the task; and
◆ What you want to do with the data(the problem).
* towardsdatascience.com
12
Which algorithm should I use?
◆ No one algorithm works best for every problem (Yes, not even neural networks!)
13
Important Concepts
➔ Model Selection:
◆ K-crossfold validation
◆ Train/Test/Evaluation Dataset
➔ Loss functions
➔ Convex Optimization
➔ Gradient Descent
➔ Model Complexity, Overfitting and Underfitting
➔ Regularization
➔ Training and Generalization Errors
14
Questions to ask when working on ML project!
➔ How much data do I have? What type/nature of data?
➔ How skilled and knowledgeable am I in this domain?
◆ Will I be able to create more useful features from what I already have?
➔ How good am I in error analysis?
15
Questions to ask when working on ML project!
➔ Assumptions, Limitations and Adoption (ALA rule) of the algorithm.
◆ Linear Regression (linear relationship, no or little multicollinearity, etc.)
◆ Why does this particular loss function make sense?
➔ How good am I in debugging the chosen learning algorithm?
16
Effective Machine Learning
➔ Reduce time spent in programming (more experiments in short time)
◆ Use off the shelf tools
➔ Customize and Scale Products
◆ Start simple, scale as needed (again, choice of relevant toolsets)
➔ Think like a Scientist
◆ Use statistics, not logic, to make decisions from the real world observations
* Slide content referred from Google’s Machine Learning Crash Course
17
Thank You
Ashish Sharma
Email: accssharma@gmail.com
/in/accssharma
@accssharma
AI Developers, Boise: https://github.com/aidevelopersboise/ai6-boise-materials
HomeCU is hiring Software Engineers and Mobile Developers.
https://www.homecu.net/company-jobs.html
18
Visual Demonstrations
➔ K nearest neighbor: http://vision.stanford.edu/teaching/cs231n-demos/knn/
➔ CIFAR 10 Image Classification:
https://cs.stanford.edu/people/karpathy/convnetjs/demo/cifar10.html
19

More Related Content

What's hot

Kaggle Days Milan - March 2019
Kaggle Days Milan - March 2019Kaggle Days Milan - March 2019
Kaggle Days Milan - March 2019
Alberto Danese
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Dr. Radhey Shyam
 
Data science as career
Data science as careerData science as career
Data science as career
Manjunath Sindagi
 
Learning for Big Data-林軒田
Learning for Big Data-林軒田Learning for Big Data-林軒田
Learning for Big Data-林軒田
台灣資料科學年會
 
Dynamic Optimization without Markov Assumptions: application to power systems
Dynamic Optimization without Markov Assumptions: application to power systemsDynamic Optimization without Markov Assumptions: application to power systems
Dynamic Optimization without Markov Assumptions: application to power systems
Olivier Teytaud
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
Sangath babu
 
Career in Analytics- Introduction to decision trees
Career in Analytics- Introduction to decision treesCareer in Analytics- Introduction to decision trees
Career in Analytics- Introduction to decision trees
Ivy Pro School
 
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell ReboNYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
Maryam Farooq
 
Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science
Frank Kienle
 
Data Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analyticsData Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analytics
Akin Osman Kazakci
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
Girish Gore
 
Machine learning
Machine learning Machine learning
Machine learning
Saurabh Agrawal
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Edureka!
 

What's hot (13)

Kaggle Days Milan - March 2019
Kaggle Days Milan - March 2019Kaggle Days Milan - March 2019
Kaggle Days Milan - March 2019
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Data science as career
Data science as careerData science as career
Data science as career
 
Learning for Big Data-林軒田
Learning for Big Data-林軒田Learning for Big Data-林軒田
Learning for Big Data-林軒田
 
Dynamic Optimization without Markov Assumptions: application to power systems
Dynamic Optimization without Markov Assumptions: application to power systemsDynamic Optimization without Markov Assumptions: application to power systems
Dynamic Optimization without Markov Assumptions: application to power systems
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Career in Analytics- Introduction to decision trees
Career in Analytics- Introduction to decision treesCareer in Analytics- Introduction to decision trees
Career in Analytics- Introduction to decision trees
 
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell ReboNYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
NYAI #25: Evolution Strategies: An Alternative Approach to AI w/ Maxwell Rebo
 
Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science Machine Learning part1 - Introduction to Data Science
Machine Learning part1 - Introduction to Data Science
 
Data Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analyticsData Science for Business Managers - An intro to ROI for predictive analytics
Data Science for Business Managers - An intro to ROI for predictive analytics
 
Introduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regressionIntroduction to machine learning and model building using linear regression
Introduction to machine learning and model building using linear regression
 
Machine learning
Machine learning Machine learning
Machine learning
 
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
Scikit Learn Tutorial | Machine Learning with Python | Python for Data Scienc...
 

Similar to A General Overview of Machine Learning

Workshop on Machine Learning
Workshop on Machine LearningWorkshop on Machine Learning
Workshop on Machine Learning
Harshad Saykhedkar
 
Webpage Personalization and User Profiling
Webpage Personalization and User ProfilingWebpage Personalization and User Profiling
Webpage Personalization and User Profiling
yingfeng
 
ML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdfML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdf
Shiwani Gupta
 
[系列活動] 機器學習速遊
[系列活動] 機器學習速遊[系列活動] 機器學習速遊
[系列活動] 機器學習速遊
台灣資料科學年會
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DS
Roopesh Kohad
 
L15.pptx
L15.pptxL15.pptx
L15.pptx
ImonBennett
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
Pramit Choudhary
 
Machine learning introduction to unit 1.ppt
Machine learning introduction to unit 1.pptMachine learning introduction to unit 1.ppt
Machine learning introduction to unit 1.ppt
ShivaShiva783981
 
Week_1 Machine Learning introduction.pptx
Week_1 Machine Learning introduction.pptxWeek_1 Machine Learning introduction.pptx
Week_1 Machine Learning introduction.pptx
muhammadsamroz
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with Azure
Ivo Andreev
 
Overview of machine learning
Overview of machine learning Overview of machine learning
Overview of machine learning
SolivarLabs
 
H2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User GroupH2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User Group
Sri Ambati
 
Lecture on AI and Machine Learning
Lecture on AI and Machine LearningLecture on AI and Machine Learning
Lecture on AI and Machine Learning
Xiaonan Wang
 
機器學習速遊
機器學習速遊機器學習速遊
機器學習速遊
台灣資料科學年會
 
Predicting Tweet Sentiment
Predicting Tweet SentimentPredicting Tweet Sentiment
Predicting Tweet Sentiment
Lucinda Linde
 
1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac
Rising Media, Inc.
 
Machine Learning for (JVM) Developers
Machine Learning for (JVM) DevelopersMachine Learning for (JVM) Developers
Machine Learning for (JVM) Developers
Mateusz Dymczyk
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Ali Alkan
 
Artificial intelligence and IoT
Artificial intelligence and IoTArtificial intelligence and IoT
Artificial intelligence and IoT
Veselin Pizurica
 
Machine learning
 Machine learning Machine learning
Machine learning
Siddharth Kar
 

Similar to A General Overview of Machine Learning (20)

Workshop on Machine Learning
Workshop on Machine LearningWorkshop on Machine Learning
Workshop on Machine Learning
 
Webpage Personalization and User Profiling
Webpage Personalization and User ProfilingWebpage Personalization and User Profiling
Webpage Personalization and User Profiling
 
ML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdfML MODULE 1_slideshare.pdf
ML MODULE 1_slideshare.pdf
 
[系列活動] 機器學習速遊
[系列活動] 機器學習速遊[系列活動] 機器學習速遊
[系列活動] 機器學習速遊
 
General introduction to AI ML DL DS
General introduction to AI ML DL DSGeneral introduction to AI ML DL DS
General introduction to AI ML DL DS
 
L15.pptx
L15.pptxL15.pptx
L15.pptx
 
Human in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AIHuman in the loop: Bayesian Rules Enabling Explainable AI
Human in the loop: Bayesian Rules Enabling Explainable AI
 
Machine learning introduction to unit 1.ppt
Machine learning introduction to unit 1.pptMachine learning introduction to unit 1.ppt
Machine learning introduction to unit 1.ppt
 
Week_1 Machine Learning introduction.pptx
Week_1 Machine Learning introduction.pptxWeek_1 Machine Learning introduction.pptx
Week_1 Machine Learning introduction.pptx
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with Azure
 
Overview of machine learning
Overview of machine learning Overview of machine learning
Overview of machine learning
 
H2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User GroupH2O with Erin LeDell at Portland R User Group
H2O with Erin LeDell at Portland R User Group
 
Lecture on AI and Machine Learning
Lecture on AI and Machine LearningLecture on AI and Machine Learning
Lecture on AI and Machine Learning
 
機器學習速遊
機器學習速遊機器學習速遊
機器學習速遊
 
Predicting Tweet Sentiment
Predicting Tweet SentimentPredicting Tweet Sentiment
Predicting Tweet Sentiment
 
1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac
 
Machine Learning for (JVM) Developers
Machine Learning for (JVM) DevelopersMachine Learning for (JVM) Developers
Machine Learning for (JVM) Developers
 
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
Makine Öğrenmesi, Yapay Zeka ve Veri Bilimi Süreçlerinin Otomatikleştirilmesi...
 
Artificial intelligence and IoT
Artificial intelligence and IoTArtificial intelligence and IoT
Artificial intelligence and IoT
 
Machine learning
 Machine learning Machine learning
Machine learning
 

Recently uploaded

HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
Pablo Gómez Abajo
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
saastr
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
Safe Software
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
DianaGray10
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
Edge AI and Vision Alliance
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
BibashShahi
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
Zilliz
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
Antonios Katsarakis
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
Zilliz
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
Ivo Velitchkov
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
ScyllaDB
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 

Recently uploaded (20)

HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 
Mutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented ChatbotsMutation Testing for Task-Oriented Chatbots
Mutation Testing for Task-Oriented Chatbots
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
Overcoming the PLG Trap: Lessons from Canva's Head of Sales & Head of EMEA Da...
 
Essentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation ParametersEssentials of Automations: Exploring Attributes & Automation Parameters
Essentials of Automations: Exploring Attributes & Automation Parameters
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
What is an RPA CoE? Session 1 – CoE Vision
What is an RPA CoE?  Session 1 – CoE VisionWhat is an RPA CoE?  Session 1 – CoE Vision
What is an RPA CoE? Session 1 – CoE Vision
 
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
“Temporal Event Neural Networks: A More Efficient Alternative to the Transfor...
 
Principle of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptxPrinciple of conventional tomography-Bibash Shahi ppt..pptx
Principle of conventional tomography-Bibash Shahi ppt..pptx
 
Generating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and MilvusGenerating privacy-protected synthetic data using Secludy and Milvus
Generating privacy-protected synthetic data using Secludy and Milvus
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Dandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity serverDandelion Hashtable: beyond billion requests per second on a commodity server
Dandelion Hashtable: beyond billion requests per second on a commodity server
 
Programming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup SlidesProgramming Foundation Models with DSPy - Meetup Slides
Programming Foundation Models with DSPy - Meetup Slides
 
Apps Break Data
Apps Break DataApps Break Data
Apps Break Data
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-EfficiencyFreshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
Freshworks Rethinks NoSQL for Rapid Scaling & Cost-Efficiency
 
JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 

A General Overview of Machine Learning

  • 1. A General Overview of Machine Learning Boise Data Science Meetup -- September 18, 2018 Ashish Sharma ➔ Software Systems Engineer -- HomeCU, LLC. (2017 - present) ➔ Founder -- AI Developers, Boise ➔ City Ambassador -- AI Saturdays (global initiative of nurture.ai) ➔ Alumnus -- Boise State University (MS in Computer Science, 2015-2017) 1
  • 2. Overview ● AI and Applications ● Intro to Machine Learning ● Types of Machine Learning ● Which algorithm should I use? ● Effective Machine Learning Image Source: Cousins of Artificial Intelligence -- Towards Datascience 2
  • 3. AI Resurgence ➔ Computational Power (GPUs, cloud computing, distributed systems) ➔ Availability of large amount of Data (eg. Imagenet) ➔ Better theoretical understanding of the underlying techniques/algorithms ➔ Open and easily accessible research culture in academia and industry (NIPS, ICML, archiv.org) 3
  • 4. AI Resurgence (contd..) ➔ Netflix Challenge (2009) $1 Million Prize (User ratings for films) ➔ Kaggle (2010) (over more than a million users today) ➔ Fei-Fei Li and team at Stanford open sourced ImageNet (2008-2010) ◆ Imagenet Large Scale Visual Recognition Challenge (ILSVRC) ➔ Geoffrey Hinton’s Deep Learning Team wins ImageNet 2012 (Alexnet) 4
  • 5. Common Applications ➔ Speech recognition (virtual assistants) ➔ Advanced machine translation and natural language intelligence ➔ Strategic gaming algorithms (AlphaGo, chess) ➔ Computer Vision (image classification and object detection) ➔ Autonomous Vehicles ➔ Manufacturing Companies (landing.ai) ➔ Healthcare (Google’s research on diabetic retinopathy -- with F-score of 0.95, surpassing the accuracy of 8 expert ophthalmologists) 5
  • 6. Machine Learning ➔ Form of applied statistics with emphasis on the use of computers to learn complex mathematical functions. ➔ More formally, “A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P, if its performance at tasks in T, as measured by P, improves with experience E.” Image Source: xkcd 6
  • 7. Types of Machine Learning ➔ Supervised Learning ➔ Unsupervised Learning ➔ Reinforcement Learning 7
  • 8. Supervised Learning Terminologies: ➔ Input variable(s) ◆ independent variable(s) ◆ feature(s)/characteristic(s) of a single input object ◆ Numerical -- continuous ( height, area of house) , discrete (grades, age) ◆ Categorical (race, sex) -- nominal, ordinal ➔ Target variable(s) ◆ Dependent variable(s), number/vector (eg. price of house, patient is diabetic, etc.) 8
  • 9. Supervised Learning ➔ Function approximation ◆ Mathematically: solve for coefficient(s) of a function ◆ Search for a best performing model from a hypothesis space. ◆ Make predictions based on historical (labeled) data ➔ Regression (predict continuous target variable) ◆ Univariate Regression (1 input variable, 1 output variable) ◆ Multiple Regression (>=2 input variables, 1 output variable) ◆ Multivariate Regression (>=2 output variable) ➔ Classification (predict discrete/categorical target variable) ◆ Email: Spam or not? ◆ Is this image a dog or cat? 9
  • 10. Unsupervised Learning ➔ Unsupervised Learning ◆ Find hidden patterns and draw inference from (unlabeled) data ◆ Essential for preliminary data analysis and visualization ➔ Clustering (grouping of similar data points) ◆ K-Means, DBSCAN ➔ Dimensionality Reduction ◆ Principal Components Analysis ◆ Autoencoders 10
  • 11. Reinforcement Learning ➔ AI, Animal Psychology, Control Theory ➔ Agents, Actions, Environment, Change in State, Reward/Punishment ➔ Eg. Deep Attari: ◆ Input: Snapshots of Attari board images (State and Actions) ◆ Algorithm: Convolutional NNs with no pooling ◆ Output layer: tailored for regression score (Maximize Reward) 11
  • 12. Beginner’s Question! ➔ (Q)* Which Algorithm Should I Use? ➔ (A) The answer varies depending on many factors, including: ◆ The size, quality, and nature of data ; ◆ The available computational time; ◆ The urgency of the task; and ◆ What you want to do with the data(the problem). * towardsdatascience.com 12
  • 13. Which algorithm should I use? ◆ No one algorithm works best for every problem (Yes, not even neural networks!) 13
  • 14. Important Concepts ➔ Model Selection: ◆ K-crossfold validation ◆ Train/Test/Evaluation Dataset ➔ Loss functions ➔ Convex Optimization ➔ Gradient Descent ➔ Model Complexity, Overfitting and Underfitting ➔ Regularization ➔ Training and Generalization Errors 14
  • 15. Questions to ask when working on ML project! ➔ How much data do I have? What type/nature of data? ➔ How skilled and knowledgeable am I in this domain? ◆ Will I be able to create more useful features from what I already have? ➔ How good am I in error analysis? 15
  • 16. Questions to ask when working on ML project! ➔ Assumptions, Limitations and Adoption (ALA rule) of the algorithm. ◆ Linear Regression (linear relationship, no or little multicollinearity, etc.) ◆ Why does this particular loss function make sense? ➔ How good am I in debugging the chosen learning algorithm? 16
  • 17. Effective Machine Learning ➔ Reduce time spent in programming (more experiments in short time) ◆ Use off the shelf tools ➔ Customize and Scale Products ◆ Start simple, scale as needed (again, choice of relevant toolsets) ➔ Think like a Scientist ◆ Use statistics, not logic, to make decisions from the real world observations * Slide content referred from Google’s Machine Learning Crash Course 17
  • 18. Thank You Ashish Sharma Email: accssharma@gmail.com /in/accssharma @accssharma AI Developers, Boise: https://github.com/aidevelopersboise/ai6-boise-materials HomeCU is hiring Software Engineers and Mobile Developers. https://www.homecu.net/company-jobs.html 18
  • 19. Visual Demonstrations ➔ K nearest neighbor: http://vision.stanford.edu/teaching/cs231n-demos/knn/ ➔ CIFAR 10 Image Classification: https://cs.stanford.edu/people/karpathy/convnetjs/demo/cifar10.html 19