Installing and Using Python
Basic I/O
Variables and Expressions
Conditional Code
Functions
Loops and Iteration
Python Data Structures
Errors and Exceptions
Object Oriented with Python
Multithreaded Programming with Python
Install/Create and Using Python Library
Compile Python Script
Resources
===========================
and 7 Quizzes
Collecting Uncertain Data the Reactive WayJeff Smith
Related Resources
----------------------
Video (starts at 25 minutes): https://skillsmatter.com/skillscasts/7038-lightning-talks-2
From Scala Exchange 2015: https://skillsmatter.com/conferences/1948-scala-exchange-2014
Reactive Machine Learning: http://www.reactivemachinelearning.com/
Data Engineering blogging: https://medium.com/data-engineering
Talk Summary
-----------------
Before you can ever get started building large-scale data analytic systems, you need to start with one crucial element: data. Collecting data, especially collecting lots of data, is harder than it seems. Data ingested with the wrong data model can be worse than no data at all. A data collection system that is too slow can bring an entire platform grinding to a halt.
Don't panic! Scalable, non-destructive data collection is possible. This talk will focus on strategies for data collection based on real world experience building large scale machine learning systems. It will introduce ideas from the emerging paradigm of reactive machine learning that are based on older ideas about immutable facts and pervasive, intrinsic uncertainty.
This presentation was given online in July 2017 and will be given at the NY Java SIG later this year. It progressively builds on Java 8 concepts using puzzles and coding to give students confidence in their Java 8 stream/lambda skills. Handouts and code in https://github.com/boyarsky/java-8-streams-by-puzzles
Installing and Using Python
Basic I/O
Variables and Expressions
Conditional Code
Functions
Loops and Iteration
Python Data Structures
Errors and Exceptions
Object Oriented with Python
Multithreaded Programming with Python
Install/Create and Using Python Library
Compile Python Script
Resources
===========================
and 7 Quizzes
Collecting Uncertain Data the Reactive WayJeff Smith
Related Resources
----------------------
Video (starts at 25 minutes): https://skillsmatter.com/skillscasts/7038-lightning-talks-2
From Scala Exchange 2015: https://skillsmatter.com/conferences/1948-scala-exchange-2014
Reactive Machine Learning: http://www.reactivemachinelearning.com/
Data Engineering blogging: https://medium.com/data-engineering
Talk Summary
-----------------
Before you can ever get started building large-scale data analytic systems, you need to start with one crucial element: data. Collecting data, especially collecting lots of data, is harder than it seems. Data ingested with the wrong data model can be worse than no data at all. A data collection system that is too slow can bring an entire platform grinding to a halt.
Don't panic! Scalable, non-destructive data collection is possible. This talk will focus on strategies for data collection based on real world experience building large scale machine learning systems. It will introduce ideas from the emerging paradigm of reactive machine learning that are based on older ideas about immutable facts and pervasive, intrinsic uncertainty.
This presentation was given online in July 2017 and will be given at the NY Java SIG later this year. It progressively builds on Java 8 concepts using puzzles and coding to give students confidence in their Java 8 stream/lambda skills. Handouts and code in https://github.com/boyarsky/java-8-streams-by-puzzles
Inspired by Josh Bloch's Java Puzzlers, we put together our own Python Puzzlers. This slide deck brings you a set of 10 python puzzlers, that are fun and educational. Each puzzler will show you a piece of python code. Your task if to figure out what happens when the code is run. Whether you're a python beginner or a passionate python veteran, we hope that there's something to learn for everybody.
This slide deck was first presented at shopkick. Nandan Sawant and Ryan Rueth are engineers at shopkick. Keeping the audience in mind, most of the puzzlers are based on python 2.x.
This presentation covers Python most important data structures like Lists, Dictionaries, Sets and Tuples. Exception Handling and Random number generation using simple python module "random" also covered. Added simple python programs at the end of the presentation
The basics of Python are rather straightforward. In a few minutes you can learn most of the syntax. There are some gotchas along the way that might appear tricky. This talk is meant to bring programmers up to speed with Python. They should be able to read and write Python.
How to Become a Tree Hugger: Random Forests and Predictive Modeling for Devel...Matt Harrison
Python makes data science easy. In this deck we walk through a complete example of creating and evaluating a predictive model using Decision Trees and Random Forests. All of the code is included in the slides.
Analysis of Fatal Utah Avalanches with Python. From Scraping, Analysis, to In...Matt Harrison
I gave this presentation at Code Camp. As a data scientist and backcountry skier, I was interested in looking at fatal avalanche data. This covers scraping the data, analysis with Python, pandas and IPython Notebook. The final result is an infographic
These are the slides of the second part of this multi-part series, from Learn Python Den Haag meetup group. It covers List comprehensions, Dictionary comprehensions and functions.
Introduction to the Python programming language (version 2.x)
Ambient intelligence: technology and design
http://bit.ly/polito-ami
Politecnico di Torino, 2015
Prepares the students for (and is a prerequisite for) the more advanced material students will encounter in later courses. Data structures organize data Þ more efficient programs.
Benchy: Lightweight framework for Performance Benchmarks Marcel Caraciolo
Benchy: Lightweight framework for Performance Benchmarks on Python Scripts.
Presented at XXVI Pernambuco Python User Group Meeting at Recife, Pernambuco, Brazil on 06.04.2013
Inspired by Josh Bloch's Java Puzzlers, we put together our own Python Puzzlers. This slide deck brings you a set of 10 python puzzlers, that are fun and educational. Each puzzler will show you a piece of python code. Your task if to figure out what happens when the code is run. Whether you're a python beginner or a passionate python veteran, we hope that there's something to learn for everybody.
This slide deck was first presented at shopkick. Nandan Sawant and Ryan Rueth are engineers at shopkick. Keeping the audience in mind, most of the puzzlers are based on python 2.x.
This presentation covers Python most important data structures like Lists, Dictionaries, Sets and Tuples. Exception Handling and Random number generation using simple python module "random" also covered. Added simple python programs at the end of the presentation
The basics of Python are rather straightforward. In a few minutes you can learn most of the syntax. There are some gotchas along the way that might appear tricky. This talk is meant to bring programmers up to speed with Python. They should be able to read and write Python.
How to Become a Tree Hugger: Random Forests and Predictive Modeling for Devel...Matt Harrison
Python makes data science easy. In this deck we walk through a complete example of creating and evaluating a predictive model using Decision Trees and Random Forests. All of the code is included in the slides.
Analysis of Fatal Utah Avalanches with Python. From Scraping, Analysis, to In...Matt Harrison
I gave this presentation at Code Camp. As a data scientist and backcountry skier, I was interested in looking at fatal avalanche data. This covers scraping the data, analysis with Python, pandas and IPython Notebook. The final result is an infographic
These are the slides of the second part of this multi-part series, from Learn Python Den Haag meetup group. It covers List comprehensions, Dictionary comprehensions and functions.
Introduction to the Python programming language (version 2.x)
Ambient intelligence: technology and design
http://bit.ly/polito-ami
Politecnico di Torino, 2015
Prepares the students for (and is a prerequisite for) the more advanced material students will encounter in later courses. Data structures organize data Þ more efficient programs.
Benchy: Lightweight framework for Performance Benchmarks Marcel Caraciolo
Benchy: Lightweight framework for Performance Benchmarks on Python Scripts.
Presented at XXVI Pernambuco Python User Group Meeting at Recife, Pernambuco, Brazil on 06.04.2013
BSidesLV 2013 - Using Machine Learning to Support Information SecurityAlex Pinto
Big Data, Data Science, Machine Learning and Analytics are a few of the new buzzwords that have invaded out industry of late. Again we are being sold a unicorn-laden, silver-bullet panacea by heavy handed marketing folks, evoking an expected pushback from the most enlightened members of our community. However, as was the case before, there might just be enough technical meat in there to help out with our security challenges and the overwhelming odds we face everyday. And if so, what do we as a community have to know about these technologies in order to be better professionals? Can we really use the data we have been collecting to help automate our security decision making? Is a robot going to steal my job?
If you are interested in what is behind this marketing buzz and are not scared of a little math, this talk would like to address some insights into applying Machine Learning techniques to data any of us have easy access to, and try to bring home the point that if all of this technology can be used to show us “better” ads in social media and track our behavior online (and a bit more than that) it can also be used to defend our networks as well.
Building a performing Machine Learning model from A to ZCharles Vestur
A 1-hour read to become highly knowledgeable about Machine learning and the machinery underneath, from scratch!
A presentation introducing to all fundamental concepts of Machine Learning step by step, following a classical approach to build a performing model. Simple examples and illustrations are used all along the presentation to make the concepts easier to grasp.
Slides used during the virtual conference, NetCoreConf on April 04, 2020. The session was a introduction to Machine Learning for .Net developers, using ML.Net as the main framework.
Machine learning from a software engineer's perspective - Marijn van Zelst - ...Codemotion
Lot's of software engineers seem to avoid the field of machine learning because it seems hard. In this talk I want to give developers an intuition of what machine learning is using visual examples and without using mathematical formulas. I want to show that machine learning will make things possible that cannot be achieved using traditional procedural programming. I will identify high level components of a supervised machine learning algorithm: vectors, feature spaces, neural networks and labels.
2019 12 19 Mississauga .Net User Group - Machine Learning.Net and Auto MLBruno Capuano
Slides used during the "Machine Learning Galore" session, on 2019 December 19 at the Microsoft offices. Event hosted by the Mississauga .Net User Group and my session was around Machine Learning.Net and Auto ML
2018 Global Azure Bootcamp Azure Machine Learning for neural networksSetu Chokshi
This was the introduction session done for the 2018 Global Azure Bootcamp to get the users started with neural networks on Azure Machine Learning Studio. This gives them the initial introduction on how to develop and write the neural networks. We started with writing LeNet architecture on Azure Machine Learning studio to identify handwritten digits and then moved on to cats and dogs.
This was also the presented in the first workshop of my meetup
Microsoft Ai, ML Community which can be reached here
https://www.meetup.com/Microsoft-AI-ML-Community/
Startup Jungle Cambodia | How to Build your First Machine Learning ApplicationSlash
The objective of the introductory workshop is to show you how Machine Learning (ML) is get started with building your first ML application. In the process we hope to give you a glimpse of how ML is going to revolutionize the world.
This workshop was given in front of 120 Cambodian engineers on 6 December 2017, at the inaugural Startup Jungle session in Phnom Penh. Startup Jungle is a tech startup learning community run by pioneering companies in the Phnom Penh ecosystem, to train for free the next generation of startupers and tech leaders.
### Part 1:
Introduction into AI and how to introduce it in your business. Examples of Slash AI projects. Target audience: business and technical :)
### Part 2:
Build your first ML application. Target audience: technical :)
- Hands-on technical workshop for developers, to show how to build your own neural net step by step. We will apply it to several examples.
- Brief introduction into advanced techniques to optimize ML algorithms, training & data sets, classifiers and parsers.
Feature Engineering - Getting most out of data for predictive models - TDC 2017Gabriel Moreira
How should data be preprocessed for use in machine learning algorithms? How to identify the most predictive attributes of a dataset? What features can generate to improve the accuracy of a model?
Feature Engineering is the process of extracting and selecting, from raw data, features that can be used effectively in predictive models. As the quality of the features greatly influences the quality of the results, knowing the main techniques and pitfalls will help you to succeed in the use of machine learning in your projects.
In this talk, we will present methods and techniques that allow us to extract the maximum potential of the features of a dataset, increasing flexibility, simplicity and accuracy of the models. The analysis of the distribution of features and their correlations, the transformation of numeric attributes (such as scaling, normalization, log-based transformation, binning), categorical attributes (such as one-hot encoding, feature hashing, Temporal (date / time), and free-text attributes (text vectorization, topic modeling).
Python, Python, Scikit-learn, and Spark SQL examples will be presented and how to use domain knowledge and intuition to select and generate features relevant to predictive models.
Apprentissage statistique et analyse prédictive en Python avec scikit-learn p...La Cuisine du Web
Avec plus de 300 000 utilisateurs réguliers, scikit-learn (http://scikit-learn.org) est la librairie de référence pour le machine learning en Python. scikit-learn couvre l’apprentissage supervisé (régression, classification) et non-supervisé (clustering, détection d’anomalie, réduction de dimension) scikit-learn est construit sur l’écosystème du Python scientifique Numpy, Scipy et Cython.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Enhancing Performance with Globus and the Science DMZGlobus
ESnet has led the way in helping national facilities—and many other institutions in the research community—configure Science DMZs and troubleshoot network issues to maximize data transfer performance. In this talk we will present a summary of approaches and tips for getting the most out of your network infrastructure using Globus Connect Server.
A tale of scale & speed: How the US Navy is enabling software delivery from l...sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATO’s (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Welcome to the first live UiPath Community Day Dubai! Join us for this unique occasion to meet our local and global UiPath Community and leaders. You will get a full view of the MEA region's automation landscape and the AI Powered automation technology capabilities of UiPath. Also, hosted by our local partners Marc Ellis, you will enjoy a half-day packed with industry insights and automation peers networking.
📕 Curious on our agenda? Wait no more!
10:00 Welcome note - UiPath Community in Dubai
Lovely Sinha, UiPath Community Chapter Leader, UiPath MVPx3, Hyper-automation Consultant, First Abu Dhabi Bank
10:20 A UiPath cross-region MEA overview
Ashraf El Zarka, VP and Managing Director MEA, UiPath
10:35: Customer Success Journey
Deepthi Deepak, Head of Intelligent Automation CoE, First Abu Dhabi Bank
11:15 The UiPath approach to GenAI with our three principles: improve accuracy, supercharge productivity, and automate more
Boris Krumrey, Global VP, Automation Innovation, UiPath
12:15 To discover how Marc Ellis leverages tech-driven solutions in recruitment and managed services.
Brendan Lingam, Director of Sales and Business Development, Marc Ellis
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
2. Agenda
• What is Machine Learning?
• How is Machine Learning different from traditional
programming?
• How does it work?
• How to use it?
• Tools
• Resources
3. What is Machine Learning?
“Field of study that gives computers the ability to learn
without being explicitly programmed”
- Arthur Samuel, 1959
4. What is Machine Learning?
Learning
Algorithm
"A computer program is said to learn from experience E with respect to
some class of tasks T and performance measure P if its performance at
tasks in T, as measured by P, improves with experience E."
- Tom M. Mitchell
Cat / Not Cat
this is a cat
E
T
Cat -> Cat : 80%
P
5. How is Machine Learning different from traditional
programming?
6. How is Machine Learning different from traditional
programming?
if (eyes == 2) & (legs == 4) & (tail == 1)… then
print “Cat”
“ Cat ”
Traditional programming
Input
Program
Computer
Output
- eyes
- legs
- tail
- …..
- …..
7. How is Machine Learning different from traditional
programming?
if (eyes == 2) & (legs == 4) & (tail == 1)… then
print “Cat”
“ Cat ”
Traditional programming
Input
Program
Computer
Output
- eyes
- legs
- tail
- …..
- …..
8. How is Machine Learning different from traditional
programming?
“ Cat ”
Machine Learning
Input
Program
Computer
Output
Cat Recognition
- eyes
- legs
- tail
- …..
- …..
9. How does it work?
Feature
Extraction
Learning
Algorithm
Evaluation
Optimization
Data
Model
Error
Predicted Value = f(X)
We need f that Predicted Value close to True Value
(X)
(f)
Learning Process
Parameters
10. f( )=“ Cat ”
f( )=“ Cat ”
What is f()?
How does it work?
13. Feature
Extraction
Good BMI Bad BMI
Weight
(X1)
Height
(X2)
Good BMI
Bad BMI
Height Weight good/bad
160 55 good
170 55 bad
175 60 good
.. .. ..
Features Space
How does it work?
23. How does it work?
Learning
Algorithm
Car for achieve the goal!
Framework for learning the target function
24. How does it work?
Learning
Algorithm
Type of Learning
25. How does it work?
Learning
Algorithm
Supervised Learning > Classification
( , “Cat”) Learning
Algorithm
Cat Model “Cat”
Is cat or not?
26. How does it work?
Learning
Algorithm
Supervised Learning > Regression
( , 4.5) Learning
Algorithm
House Price
Model
4.3 M.
What is the price of this house?
27. How does it work?
Learning
Algorithm
Unsupervised Learning
( ) Learning
Algorithm
House
Grouping
Model
What is the group of this house?
Group 1
28. How does it work?
Learning
Algorithm
Unsupervised Learning
Group1
Group2
Group3
29. How does it work?
Learning
Algorithm
Reinforcement Learning
( ) Learning
Algorithm
Flappy Bird
Model
How should I move?
“ Down ”
Reward
(+,-)
30. How does it work?
Learning
Algorithm
Machine Learning Algorithms
https://en.wikipedia.org/wiki/List_of_machine_learning_concepts
31. How does it work?
Learning
Algorithm
Tree Based
32. How does it work?
Learning
Algorithm
Tree Based : Random Forest
33. How does it work?
Learning
Algorithm
Support Vector Machine
3D Features2D Features
34. How does it work?
Learning
Algorithm
Neural Network
35. How does it work?
Learning
Algorithm
Neural Network : Deep Learning
Deep Neural Network (DNN)
36. How does it work?
Learning
Algorithm
Neural Network : Deep Learning
Convolutional Neural Network (CNN)
37. How does it work?
Learning
Algorithm
Neural Network : Deep Learning
Recurrent Neural Network(RNN)
38. How does it work?
Evaluation
Learning
Algorithm
Cat / Not Cat
Model
this is a cat
Cat -> Cat : 200
Cat -> Not Cat : 10
How many is loss?
39. How does it work?
Optimization
Tuning the car’s engine
40. How does it work?
Optimization
Parameters Space (W1,W2)
f(X) = X1W1 + X2W2 +b
J is the cost function of Learning Algorithm
41. How does it work?
Optimization
http://playground.tensorflow.org/
Find the best parameters for the function
bad fair good very good
43. How to use it?
Feature matrix and Label vector
1 0.2 0.32 -0.93 0.32
2 -0.17 0.32 0.52 -0.93
3 -0.93 -0.17 0.52 -0.17
4 0.32 0.52 -0.17 -0.93
1
0
0
1
X
Y
}Input Output
44. How to use it?
Feature Extraction : Number/Categorical
no x1 x2 x3 x4
1 0.2 0.32 -0.93 0.32
2 -0.17 0.32 0.52 -0.93
3 -0.93 -0.17 0.52 -0.17
X
}
Age Height Weigh
t
Gender Abnormal
19 160 55 male No
25 170 55 male No
30 120 60 femal Yes
0
0
1
Y
Normal(0)/
Abnormal(1)
Covert categorical to number
and normalize data
45. How to use it?
Feature Extraction : Text -> Binary Vectors
doc1 : You are so cool
doc2 : You are very bad
doc3 : You are very good
word index
you 0
are 1
so 2
cool 3
very 4
bad 5
good 6
corpus dictionary
doc1 : 0 1 2 3
doc2 : 0 1 4 5
doc3 : 0 1 4 6
Feature index
no x0 x1 x2 x3 x4 x5 x6
1 1 1 1 1 0 0 0
2 1 1 0 0 1 1 0
3 1 1 0 0 1 0 1
Binary Feature Vectors
46. How to use it?
Feature Extraction : Text -> TF-IDF Vectors
doc1 : You are so cool
doc2 : You are very bad
doc3 : You are very good
word index
you 0
are 1
so 2
cool 3
very 4
bad 5
good 6
corpus dictionary
doc1 : 0 1 2 3
doc2 : 0 1 4 5
doc3 : 0 1 4 6
Feature index
no x0 x1 x2 x3 x4 x5 x6
1 0.1 0.1 0.3 0.3 0 0 0
2 0.1 0.1 0 0 0.2 0.3 0
3 0.1 0.1 0 0 0.2 0 0.3
TF-IDF Feature Vectors
word index
you 0.1
are 0.1
so 0.3
cool 0.3
very 0.2
bad 0.3
good 0.3
corpus TF-IDF
https://en.wikipedia.org/wiki/Tf%E2%80%93idf
47. How to use it?
Feature Extraction : Text -> Word2Vec
word2Vec model
48. How to use it?
Feature Extraction : Text -> Word2Vec -> Visualize
49. How to use it?
Feature Extraction : Image -> Pixel Vector
gray scale image (MxN) normalized image matrix (MxN)
[0,0,0,..,0.6,0.5,0,0,0,0,0,0,0,0,0……..0]
pixel1
pixelMxN
Pixel Vector
50. How to use it?
Feature Extraction : Image -> Pixel Vector
[0,0,0,..,0.6,0.5,0,0,0,0,0,0,0,0,0……..0]image1
Deep Neural Network
51. How to use it?
Feature Extraction : Image -> raw / gray scale image
raw image
1
2
3
4
5
Convolutional Neural Network
http://cs.stanford.edu/people/karpathy/convnetjs/
52. How to use it?
Model Evaluation
Confusion Matrix
53. How to use it?
Model Evaluation
Confusion Matrix Example
Normal Abnormal
Predict Normal 90 10
Predict Abnormal 10 90
Precision = 90/(90 + 10) = 0.9
Recall = 90/(90 + 10) = 0.9
Acc = (90+90)/(90+10+90+10) = 0.9
54. How to use it?
Model Evaluation : Underfitting and Overfitting
Good on training set but
bad on test set
Bad on training set
and test set
Good on training set
and test set
55. How to use it?
cluster
number
Unsupervised Learning Workflow
56. How to use it?
Evaluation
Sum of squared erros