This presentation was created by Ajay, Machine Learning Scientist at MachinePulse, to present at a Meetup on Jan. 30, 2015. These slides provide an overview of widely used machine learning algorithms. The slides conclude with examples of real world applications.
Ajay Ramaseshan is a Machine Learning Scientist at MachinePulse. He holds a Bachelor's degree in Computer Science from NITK, Surathkal and a Master's in Machine Learning and Data Mining from Aalto University School of Science, Finland. He has extensive experience in the machine learning domain and has dealt with various real world problems.
Machine Learning and Real-World Applications
4. The volume of data collected is growing day by day.
Data production will be 44 times greater in 2020 than in 2009.
Every day, 2.5 quintillion bytes of data are created, with 90 percent of the world's data created in the past two years.
Very little data will ever be looked at by a human.
Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce.
Knowledge discovery is needed to make sense and use of data.
Machine learning is a technique in which computers learn from data to obtain insight and help in knowledge discovery.
Why Machine Learning?
5. Supervised learning – the class labels (target variable) are known.
Overview of Machine Learning Methods
6. Unsupervised learning – no class labels are provided; the task is to detect clusters of similar items in the data.
Overview (Contd)
7. Parametric models:
Assume a probability distribution for the data and learn its parameters from the data.
E.g. Naïve Bayes classifier, linear regression etc.
Non-parametric models:
No fixed number of parameters; model complexity can grow with the amount of data.
E.g. K-NN, histograms etc.
Parametric Vs Nonparametric
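As a rough illustration of the distinction above, the sketch below estimates a density in two ways: a Gaussian fit, which summarises the data with two parameters, and a histogram, whose number of bins depends on the data and the chosen bin width. The sample values and function names are made up for illustration and are not from the slides.

```python
import math
from collections import Counter

def fit_gaussian(xs):
    """Parametric: summarise the data with two numbers (mean, std)."""
    n = len(xs)
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n  # maximum-likelihood variance
    return mean, math.sqrt(var)

def gaussian_pdf(x, mean, std):
    """Density of the fitted Gaussian at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2 * math.pi))

def fit_histogram(xs, bin_width):
    """Non-parametric: density estimate built from bin counts."""
    counts = Counter(math.floor(x / bin_width) for x in xs)
    n = len(xs)
    return {b: c / (n * bin_width) for b, c in counts.items()}

def histogram_pdf(x, hist, bin_width):
    """Density of the histogram estimate at x (0 for empty bins)."""
    return hist.get(math.floor(x / bin_width), 0.0)

data = [1.0, 2.0, 2.0, 3.0, 3.0, 3.0, 4.0, 4.0, 5.0]  # hypothetical sample
mean, std = fit_gaussian(data)
hist = fit_histogram(data, bin_width=1.0)
```

The Gaussian always has two parameters however much data arrives, while the histogram gains bins as the data spreads out or the bin width shrinks.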
8. Generative model – learns a model for generating the data, given some hidden parameters.
Learns the joint probability distribution p(x,y).
E.g. HMM, GMM, Naïve Bayes etc.
Discriminative model – learns the dependence of the unobserved variable y on the observed variable x.
Tries to model the separation between classes.
Learns the conditional probability distribution p(y|x).
E.g. Logistic Regression, SVM, Neural networks etc.
Generative Vs Discriminative
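The two views are linked by Bayes' rule. A generative model that has learned p(x,y) can recover the conditional used for classification, whereas a discriminative model estimates p(y|x) directly and never models p(x). For a discrete class variable y, the relation can be written as:

```latex
p(y \mid x) = \frac{p(x, y)}{p(x)}
            = \frac{p(x \mid y)\, p(y)}{\sum_{y'} p(x \mid y')\, p(y')}
```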
9. Classification – Supervised learning.
Commonly used Methods for Classification –
Naïve Bayes
Decision tree
K nearest neighbors
Neural Networks
Support Vector Machines.
Classification
10. Regression – predicting an output variable given input variables.
Algorithms used –
Ordinary least squares
Partial least squares
Logistic Regression (despite the name, normally used for classification)
Stepwise Regression
Support Vector Regression
Neural Networks
Regression
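A minimal sketch of the first method listed, ordinary least squares, for a single input variable. It uses the closed-form solution for a straight line; the data points and the function name `ols_fit` are illustrative assumptions, not taken from the slides.

```python
def ols_fit(xs, ys):
    """Return (slope, intercept) minimising squared error for y ~ slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]  # hypothetical points lying exactly on y = 2x + 1
slope, intercept = ols_fit(xs, ys)
prediction = slope * 4.0 + intercept  # predict the output for a new input
```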
11. Clustering:
Group data into clusters using similarity measures.
Algorithms:
K-means clustering
Density-based clustering
EM algorithm
Hierarchical clustering.
Spectral Clustering
Clustering
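K-means, the first algorithm listed, can be sketched in a few lines: assign each point to its nearest centroid, move each centroid to the mean of its points, and repeat until nothing changes. The points and starting centroids below are hypothetical, and the starting centroids are fixed by hand so that the run is deterministic; practical implementations usually pick them at random.

```python
def kmeans(points, centroids, iterations=10):
    """Plain k-means with squared Euclidean distance as the similarity measure."""
    clusters = []
    for _ in range(iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            distances = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            clusters[distances.index(min(distances))].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new_centroids = []
        for cluster, old in zip(clusters, centroids):
            if cluster:
                new_centroids.append(tuple(sum(dim) / len(cluster) for dim in zip(*cluster)))
            else:
                new_centroids.append(old)  # keep an empty cluster's centroid in place
        if new_centroids == centroids:
            break  # converged
        centroids = new_centroids
    return centroids, clusters

points = [(1.0, 1.0), (1.0, 2.0), (2.0, 1.0), (8.0, 8.0), (8.0, 9.0), (9.0, 8.0)]
centroids, clusters = kmeans(points, centroids=[(1.0, 1.0), (9.0, 8.0)])
```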
13. K-nearest neighbors:
Classifies a data point by majority vote among the classes of its k nearest neighbors.
The value of k is chosen using cross validation.
K-Nearest Neighbors
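The sketch below shows one way this might look: a majority-vote classifier plus leave-one-out cross validation to compare candidate values of k. The toy points, labels and function names are assumptions made for illustration.

```python
import math
from collections import Counter

def knn_predict(train, query, k):
    """train is a list of (features, label); return the majority label of the k nearest."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

def loo_accuracy(train, k):
    """Leave-one-out cross validation: predict each point from all the others."""
    hits = 0
    for i, (features, label) in enumerate(train):
        rest = train[:i] + train[i + 1:]
        hits += knn_predict(rest, features, k) == label
    return hits / len(train)

train = [
    ((1.0, 1.0), "a"), ((1.0, 2.0), "a"), ((2.0, 1.0), "a"),
    ((8.0, 8.0), "b"), ((8.0, 9.0), "b"), ((9.0, 8.0), "b"),
]  # hypothetical labelled points
scores = {k: loo_accuracy(train, k) for k in (1, 3, 5)}
best_k = max(scores, key=scores.get)
label = knn_predict(train, (2.0, 2.0), best_k)
```

On this toy set k = 5 scores worst, because once a point is held out its own class is outnumbered among the five remaining neighbors. That is the kind of effect cross validation is meant to expose.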
14. Leaves indicate classes.
Non-terminal nodes – decisions on attribute values
Algorithms used for decision tree learning
C4.5
ID3
CART.
Decision trees
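The decision at each non-terminal node is chosen by scoring the candidate attributes. ID3 scores them by information gain (C4.5 uses a gain ratio and CART typically uses Gini impurity). The sketch below computes information gain on a made-up four-row table; the attribute names and values are purely illustrative.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(rows, labels, attribute):
    """Reduction in entropy obtained by splitting the rows on one attribute."""
    n = len(labels)
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[attribute], []).append(label)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

rows = [
    {"outlook": "sunny", "windy": False},
    {"outlook": "sunny", "windy": True},
    {"outlook": "rain", "windy": False},
    {"outlook": "rain", "windy": True},
]  # hypothetical training table
labels = ["no", "no", "yes", "yes"]

gains = {a: information_gain(rows, labels, a) for a in ("outlook", "windy")}
root_attribute = max(gains, key=gains.get)  # attribute tested at the root node
```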
15. Artificial Neural Networks
Modeled after the human brain.
Consist of an input layer, one or more hidden layers, and an output layer.
Types include Multi-Layer Perceptrons, Radial Basis Function networks, Kohonen Networks etc.
Neural Networks
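To make the layer structure concrete, here is a small sketch of the forward pass of a multi-layer perceptron with two inputs, one hidden layer of two units, and one output unit. The weights are hand-picked (not learned) so that the network approximates XOR; they and the function names are assumptions for illustration only.

```python
import math

def sigmoid(z):
    """Logistic activation, squashing any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

def dense(inputs, weights, biases):
    """One fully connected layer: weighted sum plus bias, then activation, per unit."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]

def forward(x, layers):
    """Pass the input through each layer in turn and return the output layer's values."""
    for weights, biases in layers:
        x = dense(x, weights, biases)
    return x

# Hand-picked weights: hidden unit 1 acts like OR, hidden unit 2 like NAND,
# and the output unit like AND of the two, which together give XOR.
layers = [
    ([[20.0, 20.0], [-20.0, -20.0]], [-10.0, 30.0]),  # hidden layer
    ([[20.0, 20.0]], [-30.0]),                        # output layer
]
outputs = {xy: forward(list(xy), layers)[0] for xy in [(0, 0), (0, 1), (1, 0), (1, 1)]}
```

In practice the weights would be learned from data, for example by backpropagation, rather than set by hand.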
16. Validation methods
Cross validation techniques
K-fold cross validation
Leave one out Cross Validation
ROC curve (for binary classifier)
Confusion Matrix
Evaluation of Machine Learning Methods
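Two of the items above can be sketched briefly: generating the train/test index splits for k-fold cross validation (leave-one-out is the special case where k equals the number of instances), and tallying a confusion matrix for a binary classifier. The label vectors are invented for illustration, and the folds are contiguous with no shuffling, which is a simplification.

```python
def k_fold_indices(n, k):
    """Yield (train_indices, test_indices) for k contiguous folds over n instances."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n) if i < start or i >= start + size]
        yield train, test
        start += size

def confusion_matrix(actual, predicted, positive):
    """Count true/false positives and negatives for a binary classifier."""
    counts = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["tp" if a == positive else "fp"] += 1
        else:
            counts["fn" if a == positive else "tn"] += 1
    return counts

folds = list(k_fold_indices(5, 2))

actual = [1, 1, 0, 0, 1]     # hypothetical true labels
predicted = [1, 0, 0, 1, 1]  # hypothetical classifier output
cm = confusion_matrix(actual, predicted, positive=1)
sensitivity = cm["tp"] / (cm["tp"] + cm["fn"])  # true positive rate
specificity = cm["tn"] / (cm["tn"] + cm["fp"])  # true negative rate
```

An ROC curve plots the true positive rate against the false positive rate (1 − specificity) as the classifier's decision threshold is varied.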
17. Some real life applications of machine learning:
Recommender systems – suggesting similar people on Facebook/LinkedIn, similar movies, books etc. on Amazon.
Business applications – customer segmentation, customer retention, targeted marketing etc.
Medical applications – disease diagnosis.
Banking – credit card issuance, fraud detection etc.
Language translation, text to speech or vice versa.
Real life applications
18. Wisconsin Breast Cancer dataset:
Instances : 569
Columns : 32 (an ID, the diagnosis label, and 30 numeric features)
Class variable : malignant, or benign.
Steps to classify using k-NN
Load data into R
Data normalization
Split into training and test datasets
Train model
Evaluate model performance
Breast Cancer Dataset and k-NN
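The steps above can be sketched end to end. The slides carry them out in R on the Wisconsin data; the version below is a Python stand-in that uses a handful of invented two-feature rows instead of the real dataset, so the numbers mean nothing beyond showing the flow. Min-max scaling, the every-fourth-row split and all function names are assumptions.

```python
import math
from collections import Counter

# Step 1: "load" the data. These rows are made up, NOT the Wisconsin dataset.
rows = [
    ((10.0, 200.0), "B"), ((11.0, 220.0), "B"), ((12.0, 210.0), "B"), ((10.5, 230.0), "B"),
    ((20.0, 900.0), "M"), ((21.0, 950.0), "M"), ((19.0, 880.0), "M"), ((22.0, 1000.0), "M"),
]

def min_max_normalise(rows):
    """Step 2: rescale every feature column to [0, 1] (assumes no constant column)."""
    columns = list(zip(*(features for features, _ in rows)))
    lows = [min(col) for col in columns]
    highs = [max(col) for col in columns]
    return [
        (tuple((v - lo) / (hi - lo) for v, lo, hi in zip(features, lows, highs)), label)
        for features, label in rows
    ]

def split(rows, every=4):
    """Step 3: hold out every `every`-th row as test data (deterministic for the demo)."""
    test = [r for i, r in enumerate(rows) if i % every == 0]
    train = [r for i, r in enumerate(rows) if i % every != 0]
    return train, test

def knn_predict(train, query, k):
    """Step 4: k-NN has no fitting phase; 'training' is just storing the rows."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def accuracy(train, test, k):
    """Step 5: fraction of held-out rows whose predicted label matches the true one."""
    hits = sum(knn_predict(train, features, k) == label for features, label in test)
    return hits / len(test)

normalised = min_max_normalise(rows)
train, test = split(normalised)
acc = accuracy(train, test, k=3)
```

Normalisation matters for k-NN because the distance would otherwise be dominated by the feature with the largest numeric range. This sketch follows the slide's order (normalise, then split); computing the scaling constants from the training rows only is a common refinement that keeps test data out of preprocessing.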