Feature Engineering
Feature Engineering
Feature engineering is a machine
learning technique that leverages data to
create new variables that aren't in the
training set.
Math, Statistics and Domain Knowledge
Important area in the field of machine learning and data
analysis
Feature engineering techniques,
1) Outlier detection and removal
2) One hot encoding
3) Log transform
4) Dimensionality reduction using principal component
analysis (a.k.a. PCA)
5) Handling missing values
6) Scaling
Outliers
Outliers are unusual data points that differ
significantly from rest of the samples.
They can occur due to an error in data
collection process or they are just legitimate
data points and represent natural variation.
Eg: age more than 100
Percentile
Percentiles are used in statistics to give you a
number that describes the value that a given
percent of the values are lower than.
Outliers

Feature Engineering for data science.pptx

  • 1.
  • 2.
    Feature Engineering Feature engineeringis a machine learning technique that leverages data to create new variables that aren't in the training set. Math, Statistics and Domain Knowledge
  • 3.
    Important area inthe field of machine learning and data analysis Feature engineering techniques, 1) Outlier detection and removal 2) One hot encoding 3) Log transform 4) Dimensionality reduction using principal component analysis (a.k.a. PCA) 5) Handling missing values 6) Scaling
  • 6.
    Outliers Outliers are unusualdata points that differ significantly from rest of the samples. They can occur due to an error in data collection process or they are just legitimate data points and represent natural variation. Eg: age more than 100
  • 7.
    Percentile Percentiles are usedin statistics to give you a number that describes the value that a given percent of the values are lower than.
  • 8.