This document introduces support vector machines (SVMs), including their use of hyperplanes to create classifiers with maximal marginal widths. It discusses how SVMs solve convex optimization problems to find the optimal hyperplane. Kernels are introduced to project data into higher dimensional spaces to allow for nonlinear classification. The document concludes by applying an SVM to a gender classification problem based on mobile app usage, using a custom kernel to account for apps and app categories.
Welcome to the Supervised Machine Learning and Data Sciences.
Algorithms for building models. Support Vector Machines.
Classification algorithm explanation and code in Python ( SVM ) .
In machine learning, support vector machines (SVMs, also support vector networks[1]) are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis. The basic SVM takes a set of input data and predicts, for each given input, which of two possible classes forms the output, making it a non-probabilistic binary linear classifier.
In this presentation, we approach a two-class classification problem. We try to find a plane that separates the class in the feature space, also called a hyperplane. If we can't find a hyperplane, then we can be creative in two ways: 1) We soften what we mean by separate, and 2) We enrich and enlarge the featured space so that separation is possible.
A BA-based algorithm for parameter optimization of support vector machineAboul Ella Hassanien
Presentation at the workshop on Intelligent systems and application, held at faculty of computer and information, Cairo University on Saturday 3 Dec. 2016
In machine learning, support vector machines (SVMs, also support-vector networks) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis.
Binary Class and Multi Class Strategies for Machine LearningPaxcel Technologies
This presentation discusses the following -
Possible strategies to follow when working on a new machine learning problem.
The common problems with classifiers (how to detect them and eliminate them).
Popular approaches on how to use binary classifiers to problems with multi class classification.
A Support Vector Machine (SVM) is a discriminative classifier formally defined by a separating hyperplane. In other words, given labeled training data (supervised learning), the algorithm outputs an optimal hyperplane which categorizes new examples. In two dimentional space this hyperplane is a line dividing a plane in two parts where in each class lay in either side.
Data Science - Part XVI - Fourier AnalysisDerek Kane
This lecture provides an overview of the Fourier Analysis and the Fourier Transform as applied in Machine Learning. We will go through some methods of calibration and diagnostics and then apply the technique on a time series prediction of Manufacturing Order Volumes utilizing Fourier Analysis and Neural Networks.
Data Science - Part VII - Cluster AnalysisDerek Kane
This lecture provides an overview of clustering techniques, including K-Means, Hierarchical Clustering, and Gaussian Mixed Models. We will go through some methods of calibration and diagnostics and then apply the technique on a recognizable dataset.
Welcome to the Supervised Machine Learning and Data Sciences.
Algorithms for building models. Support Vector Machines.
Classification algorithm explanation and code in Python ( SVM ) .
In machine learning, support vector machines (SVMs, also support vector networks[1]) are supervised learning models with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis. The basic SVM takes a set of input data and predicts, for each given input, which of two possible classes forms the output, making it a non-probabilistic binary linear classifier.
In this presentation, we approach a two-class classification problem. We try to find a plane that separates the class in the feature space, also called a hyperplane. If we can't find a hyperplane, then we can be creative in two ways: 1) We soften what we mean by separate, and 2) We enrich and enlarge the featured space so that separation is possible.
A BA-based algorithm for parameter optimization of support vector machineAboul Ella Hassanien
Presentation at the workshop on Intelligent systems and application, held at faculty of computer and information, Cairo University on Saturday 3 Dec. 2016
In machine learning, support vector machines (SVMs, also support-vector networks) are supervised learning models with associated learning algorithms that analyze data used for classification and regression analysis.
Binary Class and Multi Class Strategies for Machine LearningPaxcel Technologies
This presentation discusses the following -
Possible strategies to follow when working on a new machine learning problem.
The common problems with classifiers (how to detect them and eliminate them).
Popular approaches on how to use binary classifiers to problems with multi class classification.
A Support Vector Machine (SVM) is a discriminative classifier formally defined by a separating hyperplane. In other words, given labeled training data (supervised learning), the algorithm outputs an optimal hyperplane which categorizes new examples. In two dimentional space this hyperplane is a line dividing a plane in two parts where in each class lay in either side.
Data Science - Part XVI - Fourier AnalysisDerek Kane
This lecture provides an overview of the Fourier Analysis and the Fourier Transform as applied in Machine Learning. We will go through some methods of calibration and diagnostics and then apply the technique on a time series prediction of Manufacturing Order Volumes utilizing Fourier Analysis and Neural Networks.
Data Science - Part VII - Cluster AnalysisDerek Kane
This lecture provides an overview of clustering techniques, including K-Means, Hierarchical Clustering, and Gaussian Mixed Models. We will go through some methods of calibration and diagnostics and then apply the technique on a recognizable dataset.
Data Science - Part VIII - Artifical Neural NetworkDerek Kane
This lecture provides an overview of biological based learning in the brain and how to simulate this approach through the use of feed-forward artificial neural networks with back propagation. We will go through some methods of calibration and diagnostics and then apply the technique on three different data mining tasks: binary prediction, classification, and time series prediction.
Data Science - Part XIV - Genetic AlgorithmsDerek Kane
This lecture provides an overview on biological evolution and genetic algorithms in a machine learning context. We will start off by going through a broad overview of the biological evolutionary process and then explore how genetic algorithms can be developed that mimic these processes. We will dive into the types of problems that can be solved with genetic algorithms and then we will conclude with a series of practical examples in R which highlights the techniques: The Knapsack Problem, Feature Selection and OLS regression, and constrained optimizations.
Data Science - Part X - Time Series ForecastingDerek Kane
This lecture provides an overview of Time Series forecasting techniques and the process of creating effective forecasts. We will go through some of the popular statistical methods including time series decomposition, exponential smoothing, Holt-Winters, ARIMA, and GLM Models. These topics will be discussed in detail and we will go through the calibration and diagnostics effective time series models on a number of diverse datasets.
Data Science - Part XIII - Hidden Markov ModelsDerek Kane
This lecture provides an overview on Markov processes and Hidden Markov Models. We will start off by going through a basic conceptual example and then explore the types of problems that can be solved with HMM's. The underlying algorithms will be discussed in detail with a quantitative focus and then we will conclude with a practical example concerning stock market prediction which highlights the techniques.
Data Science - Part IX - Support Vector MachineDerek Kane
This lecture provides an overview of Support Vector Machines in a more relatable and accessible manner. We will go through some methods of calibration and diagnostics of SVM and then apply the technique to accurately detect breast cancer within a dataset.
Data Science - Part XV - MARS, Logistic Regression, & Survival AnalysisDerek Kane
This lecture provides an overview on extending the regression concepts brought forth in previous lectures. We will start off by going through a broad overview of the Multivariate Adaptive Regression Splines Algorithm, Logistic Regression, and then explore the Survival Analysis. The presentation will culminate with a real world example on how these techniques can be used in the US criminal justice system.
Data Science - Part XVII - Deep Learning & Image ProcessingDerek Kane
This lecture provides an overview of Image Processing and Deep Learning for the applications of data science and machine learning. We will go through examples of image processing techniques using a couple of different R packages. Afterwards, we will shift our focus and dive into the topics of Deep Neural Networks and Deep Learning. We will discuss topics including Deep Boltzmann Machines, Deep Belief Networks, & Convolutional Neural Networks and finish the presentation with a practical exercise in hand writing recognition technique.
Kernel based models for geo- and environmental sciences- Alexei Pozdnoukhov –...Beniamino Murgante
Kernel based models for geo- and environmental sciences- Alexei Pozdnoukhov – National Centre for Geocomputation, National University of Ireland , Maynooth (Ireland)
Intelligent Analysis of Environmental Data (S4 ENVISA Workshop 2009)
Support Vector Machine (SVM) is a popular supervised machine learning algorithm used for classification and regression tasks. It works by finding a hyperplane in a high-dimensional space that best separates data points of different classes. SVM aims to maximize the margin between the classes, where the margin is defined as the distance between the hyperplane and the nearest data points from each class. The data points that are closest to the hyperplane are called support vectors.
Here are some key concepts associated with SVM:
Hyperplane: In a two-dimensional space, a hyperplane is a line that separates the data points of different classes. In higher dimensions, it becomes a hyperplane. SVM tries to find the hyperplane with the maximum margin between classes.
Margin: The margin is the distance between the hyperplane and the nearest data points of each class. SVM seeks to maximize this margin.
Support Vectors: These are the data points that are closest to the hyperplane and have the most influence on determining its position. These points "support" the placement of the hyperplane.
Kernel Trick: SVM can be extended to non-linearly separable data using the kernel trick. A kernel function takes the original feature space and maps it to a higher-dimensional space where the data might be linearly separable. Common kernel functions include the linear kernel, polynomial kernel, and radial basis function (RBF) kernel.
C parameter: In SVM, the C parameter is a regularization parameter that balances the trade-off between maximizing the margin and minimizing the classification error. A small C value allows for a larger margin but may lead to more misclassifications, while a large C value prioritizes correct classification over margin maximization.
SVM can be used for both classification and regression tasks:
Classification: In classification, SVM tries to find a hyperplane that separates data points into different classes. New data points can then be classified based on which side of the hyperplane they fall.
Regression: In regression, SVM is used to find a hyperplane that best fits the data points. The goal is to minimize the error between the actual and predicted values.
SVMs have been widely used in various fields such as image classification, text categorization, bioinformatics, and more. However, they can be sensitive to the choice of hyperparameters and might not perform well on extremely noisy or overlapping data.
It's important to note that while SVMs are a powerful and versatile algorithm, newer algorithms like deep learning models have gained popularity due to their ability to automatically learn complex features and patterns from data.
Support Vector Machine (SVM) is a powerful machine learning algorithm for classification and regression. It finds a hyperplane that best separates data into classes, aiming to maximize the margin between them. Support vectors, the closest data points to the hyperplane, influence its position.
In this lecture, you will learn two of the most popular methods for classifying data points into a finite set of categories. Both methods are based on representing a classifier via its decision boundary which is a hyperplane. The parameters of the hyperplane are learned from training data by minimizing a particular loss function.
zkStudyClub - Reef: Fast Succinct Non-Interactive Zero-Knowledge Regex ProofsAlex Pruden
This paper presents Reef, a system for generating publicly verifiable succinct non-interactive zero-knowledge proofs that a committed document matches or does not match a regular expression. We describe applications such as proving the strength of passwords, the provenance of email despite redactions, the validity of oblivious DNS queries, and the existence of mutations in DNA. Reef supports the Perl Compatible Regular Expression syntax, including wildcards, alternation, ranges, capture groups, Kleene star, negations, and lookarounds. Reef introduces a new type of automata, Skipping Alternating Finite Automata (SAFA), that skips irrelevant parts of a document when producing proofs without undermining soundness, and instantiates SAFA with a lookup argument. Our experimental evaluation confirms that Reef can generate proofs for documents with 32M characters; the proofs are small and cheap to verify (under a second).
Paper: https://eprint.iacr.org/2023/1886
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
The Metaverse and AI: how can decision-makers harness the Metaverse for their...Jen Stirrup
The Metaverse is popularized in science fiction, and now it is becoming closer to being a part of our daily lives through the use of social media and shopping companies. How can businesses survive in a world where Artificial Intelligence is becoming the present as well as the future of technology, and how does the Metaverse fit into business strategy when futurist ideas are developing into reality at accelerated rates? How do we do this when our data isn't up to scratch? How can we move towards success with our data so we are set up for the Metaverse when it arrives?
How can you help your company evolve, adapt, and succeed using Artificial Intelligence and the Metaverse to stay ahead of the competition? What are the potential issues, complications, and benefits that these technologies could bring to us and our organizations? In this session, Jen Stirrup will explain how to start thinking about these technologies as an organisation.
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Welcome to the first live UiPath Community Day Dubai! Join us for this unique occasion to meet our local and global UiPath Community and leaders. You will get a full view of the MEA region's automation landscape and the AI Powered automation technology capabilities of UiPath. Also, hosted by our local partners Marc Ellis, you will enjoy a half-day packed with industry insights and automation peers networking.
📕 Curious on our agenda? Wait no more!
10:00 Welcome note - UiPath Community in Dubai
Lovely Sinha, UiPath Community Chapter Leader, UiPath MVPx3, Hyper-automation Consultant, First Abu Dhabi Bank
10:20 A UiPath cross-region MEA overview
Ashraf El Zarka, VP and Managing Director MEA, UiPath
10:35: Customer Success Journey
Deepthi Deepak, Head of Intelligent Automation CoE, First Abu Dhabi Bank
11:15 The UiPath approach to GenAI with our three principles: improve accuracy, supercharge productivity, and automate more
Boris Krumrey, Global VP, Automation Innovation, UiPath
12:15 To discover how Marc Ellis leverages tech-driven solutions in recruitment and managed services.
Brendan Lingam, Director of Sales and Business Development, Marc Ellis
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Support Vector Machine
1. Introduction to Support Vector Machine
Lucas Xu
September 4, 2012
Lucas Xu Introduction to Support Vector Machine September 4, 2012 1 / 20
2. 1 Classifier
2 Hyper-Plane
3 Convex Optimization
4 Kernel
5 Application
Lucas Xu Introduction to Support Vector Machine September 4, 2012 2 / 20
3. Classifier
Attributes and Class Labels
Training Data
S = (x(1) , y (1) ), · · · , (x(m) , y (m) ) , x(i) ∈ Rd , y (i) ∈ {−1, 1}
Lucas Xu Introduction to Support Vector Machine September 4, 2012 3 / 20
4. Classifier
Umeng Gender Classification Data
user app1 app2 ··· appd gender
user1 1 0 ··· 0 male
user2 0 1 ··· 1 f emale
.
. .
. .
. .. .
. .
.
. . . . . .
usern 1 1 ··· 1 f emale
Each App belongs to one category, ≈ 20 categories.
Categories are mutual exclusive.
Lucas Xu Introduction to Support Vector Machine September 4, 2012 4 / 20
5. Classifier
Umeng Gender Classification Data
S = (x(1) , y (1) ), · · · , (x(m) , y (m) ) , x(i) ∈ Rd , y (i) ∈ {−1, 1}
(i)
xk ∈ {0, 1}, 0 means not installed, 1 means installed on the device
1 ≤ k ≤ d, d 30, 000, about 30,000 apps
y (i) ∈ {male, f emale}
Lucas Xu Introduction to Support Vector Machine September 4, 2012 5 / 20
6. Hyper-Plane
Figure : Hyper Plane
The hyper-plane: wT x + b = 0
Classification function: hw,b (x) = g(wT x + b)
1 if z ≥ 0
g(z) =
−1 otherwise
Lucas Xu Introduction to Support Vector Machine September 4, 2012 6 / 20
7. Hyper-Plane
Functional Margin:
γ (i) = y (i) (wT x(i) + b)
ˆ
Scaling: set constraint normalization condition : w = 1
Geometric Margin:
w T b
γ (i) = y (i) x(i) +
w w
γ (i) should be a large positive number to increase the prediction
confidence.
Lucas Xu Introduction to Support Vector Machine September 4, 2012 7 / 20
8. Hyper-Plane
Definition
The geometry margin of (w, b) with respect to training dataset S:
γ = min γ (i)
i=1,...,m
Lucas Xu Introduction to Support Vector Machine September 4, 2012 8 / 20
9. Hyper-Plane
The optimal margin classifier: (Intuitive)
find a decision boundary that maximizes the margin.
maxγ,w,b γ
s.t. y (i) (wT x(i) + b) ≥ γ, i = 1, ..., m
w = 1.
Lucas Xu Introduction to Support Vector Machine September 4, 2012 9 / 20
10. Hyper-Plane
Normalization Constraint: let function margin γ = 1
ˆ
⇓
1
maxγ,w,b
w
s.t. y (i) (wT x(i) + b) ≥ γ, i = 1, ..., m
⇓
1
maxw,b w 2
2
s.t. y (i) (wT x(i) + b) ≥ 1, i = 1, ..., m
Lucas Xu Introduction to Support Vector Machine September 4, 2012 10 / 20
11. Hyper-Plane
Convex function
Lucas Xu Introduction to Support Vector Machine September 4, 2012 11 / 20
12. Hyper-Plane
Convex function
Convex set
Lucas Xu Introduction to Support Vector Machine September 4, 2012 11 / 20
13. Hyper-Plane
Convex function
Convex set
So-called Quadratic Programming. Their are many software
packages to solve the problem.
Lucas Xu Introduction to Support Vector Machine September 4, 2012 11 / 20
14. Hyper-Plane
Convex function
Convex set
So-called Quadratic Programming. Their are many software
packages to solve the problem.
Basic Ideas for Support Vector Machine DONE !
Lucas Xu Introduction to Support Vector Machine September 4, 2012 11 / 20
15. Hyper-Plane
Convex function
Convex set
So-called Quadratic Programming. Their are many software
packages to solve the problem.
Basic Ideas for Support Vector Machine DONE !
More efficient solution ?
Lucas Xu Introduction to Support Vector Machine September 4, 2012 11 / 20
16. Convex Optimization
Primal Problem:
1
maxw,b w 2
2
s.t. y (i) (wT x(i) + b) ≥ 1, i = 1, ..., m
Lucas Xu Introduction to Support Vector Machine September 4, 2012 12 / 20
17. Convex Optimization
Lagrangian for the original problem:
m
1 2
min max L(w, b, α) = w − αi y (i) (wT x(i) + b) − 1
w,b α:αi ≥0 2
i=1
⇓
Under K.K.T condition, transforms to its Dual problem:
m m
1
max W (α) = αi − y (i) y (j) αi αj x(i) , x(j)
α 2
i=1 i,j=1
s.t. αi ≥ 0, i = 1, ..., m
m
αi y (i) = 0
i=1
Lucas Xu Introduction to Support Vector Machine September 4, 2012 13 / 20
18. Convex Optimization
Solutions:
m
∗
w = αi y (i) x(i)
i=1
maxi:y(i) =−1 w∗T x(i) + mini:y(i) =1 w∗T x(i)
b∗ = −
2
Predict:
g(x) = wT x + b
m T
= αi y (i) x(i) x+b
i=1
m
= αi y (i) x(i) , x + b
i=1
Lucas Xu Introduction to Support Vector Machine September 4, 2012 14 / 20
19. Kernel
For most of αi , αi = 0.
For those αi > 0, (x(i) , y (i) ) are called support vectors
Only needs to compute x(i) , x
(i) (i) (i)
if we can map feature space (x1 , x2 , ...xk ) to another high
(i) (i) (i)
dimension space (z1 , z2 , ...zl ), z = φ(x)
i.e. φ(x(i) , φ(x)
we can easily compute z (i) , z = K(φ( x(i) , x ))
Use a slightly different notation:
K(x, y) = φ(x), φ(y)
Intuitive Explanation: Measure of Similarities
Lucas Xu Introduction to Support Vector Machine September 4, 2012 15 / 20
20. Kernel
Definition
Mercer Kernel: K is positive semi-definite
Lucas Xu Introduction to Support Vector Machine September 4, 2012 16 / 20
21. Kernel
Primitive x, y
Lucas Xu Introduction to Support Vector Machine September 4, 2012 17 / 20
22. Kernel
Primitive x, y
Polynomial ( x, y + 1)d
Lucas Xu Introduction to Support Vector Machine September 4, 2012 17 / 20
23. Kernel
Primitive x, y
Polynomial ( x, y + 1)d
RBF exp(−γ||x − y||2 )
Lucas Xu Introduction to Support Vector Machine September 4, 2012 17 / 20
24. Kernel
Primitive x, y
Polynomial ( x, y + 1)d
RBF exp(−γ||x − y||2 )
Sigmoid tanh(κ x, y + c).
Lucas Xu Introduction to Support Vector Machine September 4, 2012 17 / 20
25. Kernel
Primitive x, y
Polynomial ( x, y + 1)d
RBF exp(−γ||x − y||2 )
String
Lucas Xu Introduction to Support Vector Machine September 4, 2012 17 / 20
26. Kernel
Primitive x, y
Polynomial ( x, y + 1)d
RBF exp(−γ||x − y||2 )
String
Tree
Lucas Xu Introduction to Support Vector Machine September 4, 2012 17 / 20
27. Apply to Umeng Gender Classification
Problem Description
Classify the gender of a user based on apps (s)he installed and
categories of apps.
Kernel Design
m
K(x, y) = φ(xi , yj )
i,j=0
(1 + w)xi yj if i = j
φ(xi , yj ) = xi yj if i = j but the same category
0 if not the same category
w ≥ 0 , the extra weight if two users have installed the same app.
default to 1.0
Experiment Result
Lucas Xu Introduction to Support Vector Machine September 4, 2012 18 / 20
28. Apply to Umeng Gender Classification
x1
x2
.
.
.
xm
⇓
w · x1
w · x2
.
.
.
w · xm
c1
c2
.
. .
c20
ci counts the number of apps belonging to category i
Lucas Xu Introduction to Support Vector Machine September 4, 2012 19 / 20
29. references
Book: Christopher Bishop – PRML Chapter 7: Section 7.1
Slides: Andrew Moore – Support Vector Machines
Video: Bernhard Scholkopf – Kernel Methods
Video: Liva Ralaivola – Introduction to Kernel Methods
Video: Colin Campbell – Introduction to Support Vector Machines
Video: Alex Smola – Kernel Methods and Support Vector
Machines
Video: Partha Niyogi – Introduction to Kernel Methods
Many more videos on kernel-related topics here
http://www.seas.harvard.edu/courses/cs281/
Lucas Xu Introduction to Support Vector Machine September 4, 2012 20 / 20