Machine Learning
Data
Information
Types
Numerical
Categorical
Training data
validation data
testing data
Structured data
Un Structured data
Time Series data
Data sources
create your own data
Roadmap to Membership of RICS - Pathways and Routes
Types of data in Machine Learning day 2
1. Day-2
Types of Data
Two Weeks online
Live Industrial Training
ON
HANDS ON: MACHINE LEARNING
Day-2
THE NATIONAL SMALL INDUSTRIES CORPORATION In Association with
(A Government of India Enterprise)
. Kamalanagar, Kushaiguda, Hyderabad-500062
Date & Time
18-08-2020
2.30pm-3.30pm
3. Day-2
Data
facts and statistics collected together for
reference or analysis.
Raw facts / Observations
the quantities, characters, or symbols on
which operations are performed by a
computer, which may be stored and
transmitted in the form of electrical
signals and recorded on magnetic, optical,
or mechanical recording media.
Information
what is conveyed or represented by a particular
arrangement or sequence of things.
Processed Data
7. Day-2
Data
• Numerical
• Exact Numbers-Height
• Discrete Data
• Numerical-Students? - No Half Student
• Continuous Data
• Numerical-3.265
• Categorical Data
• Yes/No, Gender, Race – red-1, green-2 – can take average
• Ordinal Data
• Mix of Numerical and Categorical data
• Scale-Movie Ratings -1-5 starts
12. Day-2
Time Series
• Time series data is a sequence of numbers collected at regular
intervals over some period of time.
• Date & Time
• Finance
• For example, we might measure the average number of home sales
for many years.
13. Day-2
Data Sets
• Training data
• First stage
• Adjusting Parameters - Amitabh
• Simply, you can say training data sets are used to train the model with data used in
real-life that gathered as machine learning training data.
• Validation data
• Second Stage
• evaluating the model predictions and learn from mistakes before validating the data
sets.
• Test Data
• Third Stage
• final evaluation that a model need to go through after the training stage in model
development