SlideShare a Scribd company logo
1 of 15
Data Science
with
R & Python
Get Trained .Get Employed.
1. Introduction to Data Science
• What is Data Science? Why Data
Science?
• Need for Data Scientist in
Industries?
• Role of a Data Scientist in
Industries.
• How to become a Data
Scientist?
www.allyedu.in
2. Business Statistics
o Univariate Analysis
• Measures of central tendencies (Mean, Median & Mode)
• Measures of dispersion (Range, Quartiles, Deciles,
Percentiles, Standard deviation, Variance, Mean/Median/Mode
Deviation)
• Measures of shape (Normal distribution, Central Limit Theorem,
Skewness (Left & Right), Kurtosis (Platy, Meso & Lepto))
• Tables (Counts, Frequency tables, Class intervals)
• Charts (Histograms, Polygonal Charts, Ogive Charts)
o Data Types
• Qualitative & Quantitative data
o Measurement & Scaling
• Nominal, Ordinal, Interval & Ratio
o Bi-variate Analysis
• Cross tabs, Correlations
o Hypothesis Testing
• Null, Alternate hypothesis, level of significance, Type-I Error, Type-II
Error)
• Parametric Tests (Z-tests, T-tests, Chi-sq tests, ANOVA (one-way,
two- way))
• Non-Parametric Tests (Wilcoxon Sign Rank test, Wilcoxon Sum Rank
test, Kruskal Wallis test, Friedmann Rank test)
www.allyedu.in
Probability
• Definitions (Probability, Events, Non-events, Mutually
exclusive,
Independent, Dependent, Exhaustive)
• Distributions (Gaussian, Binomial, Bernoulli)
• Variable Types (Continuous & Random)
o Multivariate Analysis
o Modeling Methodology
• Setting the working directory, Importing the data,
Splitting of the data, Data preprocessing (Missing value
treatment, Outlier treatment), Multicollinearity, variable
importance, Model development, Model validation using
various diagnostic techniques
o Supervised
• Regression (Simple Linear, Multiple Linear & Binary
Logistic), SVM, CART, Decision Tree, Random Forest, Naïve
Bayes, KNN, ANN
o Unsupervised
• Clustering, Apriori
o NLP
• Text Mining
o Forecasting
• Regular, Seasonal, Cyclic, Irregular trends, Moving
averages, Weighted moving averages, ACF, PACF, ARIMA
www.allyedu.in
3. R
o Introduction
• D Introduction, History, Various versions,
Installation, Terminologies, Advantages &
Disadvantages
o Packages
• Definition, Objective, Installation, CRAN mirror
o Help
• Help & search functions
o Working Directory
• Objective, Setting the working directory, getting the
working directory, file.choose
o Importing & exporting
• CSV, Excel & Text files
o Data Types
• Introduction to various data types - Vectors, Lists, Matrices,
Arrays, Data
Frames & Factors & corresponding functions
o Data Creation
o Data Conversions
Converting from one data type to another data type
www.allyedu.in
o Understanding the data
• Type, structure, dimension, # of rows & columns, nature of the
data
o Slicing & extraction of data
• Subsetting the data by variables & records based on logic
o Date & Time functions
• Date & time formats, converting from one format to other
format
o Joins
• Left, Right, Inner & Outer
o Data Merging
• Merging the data by rows, columns
www.allyedu.in
o Functions
• Numeric, String, Arithmetic
o Family Functions
• Apply, Normal distribution
o Missing values
• Identification, position, # of missing values, removing the
missing values, removing the records with missing values
o Conditions
• Ifelse, For loop, While loop
o Plots
• Bar, Pie, Histograms, Line & additional functions
o Statistics
• Univariate, Bivariate & Multivariate (Supervised & Unsupervised)
analysis
www.allyedu.in
4. Python
o Introduction
• Introduction, Advantages, Installation of Anaconda
o Packages
• Introduction to various packages like Pandas, Numpy,
Sklearn, Scipy, Matplotlib & Tensorflow
o Data Types
• Strings, Tuples, Dictionaries, Sets, Lists & Arrays
o Data Conversions
• Converting from one data type to another data type
o Understanding the data
• Type, structure, dimension, # of rows & columns, nature of the
data
www.allyedu.in
Slicing & extraction of data
• Subsetting the data by variables & records based on logic
o Working Directory
• Objective, Setting the working directory, getting the working
directory, file.choose
o Errors & Exceptions
• Differences between errors, exceptions, handling exceptions
o Missing values
• Identification, position, # of missing values, removing the missing
values, removing the records with missing values
o Conditions
• Ifelse, For loop, While loop
o Statistics
• Univariate, Bivariate
o Multivariate Analysis
• Regression (Simple Linear, Multiple Linear & Binary Logistic)
• Machine Learning (SVM, Decision Tree, Random Forest, KNN, ANN,
Boosting Techniques)
• Deep Learning (Image processing using CNN)
• NLP (Text Mining)www.allyedu.in
5. Tableau
o Introduction
• Introduction to Visualization, Importance, Various tools,
Tableau
variants, Application of various charts based on data
o Basics
• Installation of trial version, Data importing, Live & extract
connections, Dimensions, Measures, Parameters, Filters
o Types of variables
• Character, Numeric, Geographical, Hierarchical, Calculated
variables
o Functions
• Arithmetic, Numeric, Character, Logical & case based
functions
o Panes & Legends
• Various panes in Tableau, Filters, Color legends, Filter legends
o Filters
• Context, Local, Global filters etc
o Charts
• Bar, Line, Pie, Area, Circle, Bubble, Bullet etc
o Advanced
• Data blending, data extracts, packaged versions, Dashboards
(Static
& Dynamic), Publishing the dashboards, Geographical maps (at
Zipcode level, County level, State level) & Map layers
www.allyedu.in
6. Excel
o Introduction
• Importance, benefits of excel, Menu bar, Cell, Formula bar, # of
rows & columns in excel, Difference b/w Excel & CSV
o Basics
• Writing functions, Arithmetic, Random number creation, saving the
excel files, Sorting, Data filtration, Removal of duplicates,
Inserting rows & columns, Deleting rows & coulumns,
Toggling from left to right & top to bottom, Selecting rows &
columns
o Advanced
• Pivot tables, Conditional formatting, Countif, Countifs, Sumif,
Sumifs, Formattings for report, Vlookup, Hlookup, Generating
the report
o Statistics
• Univariate, Bivariate Analysis
o Multivariate Analysis
• Linear Regression
www.allyedu.in
7. SQL
o Introduction
• Introduction, various SQL languages, databases, importance of
SQL
o Basics
• Various SQL functions like Select, Create, Insert into, from,
top 100
o Advanced
• Joins (Left, Right, Full Outer & Inner), Views (Importance,
Creation)
o Conditions
• Where, Having, Order by, Group by
o Operators
• And, or, logical
o Functions
• Arithmetic, Numeric, Character & Logical
www.allyedu.in
8.Artificial Intelligence and Machine Learning with Python
Linear Models
Understand linear approximation and modelling of problems and develop linear models
Dimensionality Reduction
Use ideas from linear algebra to transform dimensions and warp space providing additional
flexibility and functionality to linear models.
SVM
Develop and implement kernel based methods to develop nonlinear models to solve few
complex tasks.
Nearest Neighbours, K-means, and Gaussian Mixture Models
Review pattern recognition ideas with distance and cluster based models to understand
similarity measures and grouping criteria.
Naive Bayes and Decision Trees
 Dive into applications of bayes theorem and the use of decision criteria when learning
from data.
•
 Search
 Look at search from the perspective of graphs, trees and heuristic based optimizations.
•
 Logic and Planning
 Discover ways to encode logic and develop agents that plan actions in an environment.
•
 Reinforcement Learning and Hidden Markov Models
 Engineering agents that learn from a sequence of actions using rewards and penalties.
•
 Q-Learning and Policy gradient
 Operate in a stateful world over value and policy approximations tasks
www.allyedu.in
Data Preprocessing
Regression Techniques
Simple Linear Regression
Multiple Linear Regression
Polynomial Linear Regression
Support Vector Regression
Decision Tree Regression
Random Forest Regression
Evaluating Regression Model Performance
Classification Techniques
K-Nearest Neighbors (KNN)
Support Vector Machine (SVM)
Kernel SVM
Nave Bayes Classification
Decision Tree Classification
Random Forest Classification
Evaluating Classification Model Performance
Natural Language Processing (NLP)
Basic of NLP
Language preprocessing Techniques
Auto summarizing the given text document
Clustering Techniques
K-Means Clustering
K-mini Batch Clustering
Hierarchical Clustering
www.allyedu.in
Elbow Method
Curve Smoothening Techniques
Association Rule Learning
Reinforcement Learning
Basics of Numpy and panda
Deep Learning
Basics/what is Deep Learning
Artificial Neural Networks
Dimension Reduction Techniques
Principal Component Analysis (PCA)
Linear Discriminant Analysis (LDA)
Statistics Basics
Standard Deviation
Variance
Co-Variance
T-distribution
Pearson Correlation Coefficient (PCC)/ Correlation Coefficient
www.allyedu.in

More Related Content

Similar to Data science

Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data ScienceMaloy Manna, PMP®
 
Data mining techniques unit 2
Data mining techniques unit 2Data mining techniques unit 2
Data mining techniques unit 2malathieswaran29
 
Machine Learning from Statistical Point of View
Machine Learning from Statistical Point of ViewMachine Learning from Statistical Point of View
Machine Learning from Statistical Point of ViewYury Gubman
 
APSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxAPSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxMariaMalikAwan
 
Data science training in Hyderabad
Data science training in HyderabadData science training in Hyderabad
Data science training in HyderabadRajitha D
 
Exploring Data (1).pptx
Exploring Data (1).pptxExploring Data (1).pptx
Exploring Data (1).pptxgina458018
 
Creativity and Curiosity - The Trial and Error of Data Science
Creativity and Curiosity - The Trial and Error of Data ScienceCreativity and Curiosity - The Trial and Error of Data Science
Creativity and Curiosity - The Trial and Error of Data ScienceDamianMingle
 
Big data analytics bhawani nandan prasad
Big data analytics   bhawani nandan prasadBig data analytics   bhawani nandan prasad
Big data analytics bhawani nandan prasadBhawani N Prasad
 
Big Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATA
Big Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATABig Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATA
Big Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATAMatt Stubbs
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.pptchatbot9
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep LearningExperfy
 
Data Science Training in Chandigarh h
Data Science Training in Chandigarh    hData Science Training in Chandigarh    h
Data Science Training in Chandigarh hasmeerana605
 
All python data_analyst_r_course
All python data_analyst_r_courseAll python data_analyst_r_course
All python data_analyst_r_courseKamal A
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureIvo Andreev
 

Similar to Data science (20)

Data Visualization in Data Science
Data Visualization in Data ScienceData Visualization in Data Science
Data Visualization in Data Science
 
Data mining techniques unit 2
Data mining techniques unit 2Data mining techniques unit 2
Data mining techniques unit 2
 
Learning from data
Learning from dataLearning from data
Learning from data
 
Machine Learning from Statistical Point of View
Machine Learning from Statistical Point of ViewMachine Learning from Statistical Point of View
Machine Learning from Statistical Point of View
 
APSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptxAPSY3206 Lecture 1.pptx
APSY3206 Lecture 1.pptx
 
Data science training in Hyderabad
Data science training in HyderabadData science training in Hyderabad
Data science training in Hyderabad
 
Datascience Training in Hyderabad
Datascience Training in HyderabadDatascience Training in Hyderabad
Datascience Training in Hyderabad
 
RM UNIT 6.pptx
RM UNIT 6.pptxRM UNIT 6.pptx
RM UNIT 6.pptx
 
Exploring Data (1).pptx
Exploring Data (1).pptxExploring Data (1).pptx
Exploring Data (1).pptx
 
Creativity and Curiosity - The Trial and Error of Data Science
Creativity and Curiosity - The Trial and Error of Data ScienceCreativity and Curiosity - The Trial and Error of Data Science
Creativity and Curiosity - The Trial and Error of Data Science
 
Big data analytics bhawani nandan prasad
Big data analytics   bhawani nandan prasadBig data analytics   bhawani nandan prasad
Big data analytics bhawani nandan prasad
 
Big Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATA
Big Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATABig Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATA
Big Data LDN 2018: TIPS AND TRICKS TO WRANGLE BIG, DIRTY DATA
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Graph Models for Deep Learning
Graph Models for Deep LearningGraph Models for Deep Learning
Graph Models for Deep Learning
 
Data Science Training in Chandigarh h
Data Science Training in Chandigarh    hData Science Training in Chandigarh    h
Data Science Training in Chandigarh h
 
All python data_analyst_r_course
All python data_analyst_r_courseAll python data_analyst_r_course
All python data_analyst_r_course
 
The Machine Learning Workflow with Azure
The Machine Learning Workflow with AzureThe Machine Learning Workflow with Azure
The Machine Learning Workflow with Azure
 

Recently uploaded

Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfUmakantAnnand
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxRoyAbrique
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppCeline George
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptxPoojaSen20
 

Recently uploaded (20)

Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Concept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.CompdfConcept of Vouching. B.Com(Hons) /B.Compdf
Concept of Vouching. B.Com(Hons) /B.Compdf
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptxContemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
Contemporary philippine arts from the regions_PPT_Module_12 [Autosaved] (1).pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
URLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website AppURLs and Routing in the Odoo 17 Website App
URLs and Routing in the Odoo 17 Website App
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
PSYCHIATRIC History collection FORMAT.pptx
PSYCHIATRIC   History collection FORMAT.pptxPSYCHIATRIC   History collection FORMAT.pptx
PSYCHIATRIC History collection FORMAT.pptx
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 

Data science

  • 1. Data Science with R & Python Get Trained .Get Employed.
  • 2. 1. Introduction to Data Science • What is Data Science? Why Data Science? • Need for Data Scientist in Industries? • Role of a Data Scientist in Industries. • How to become a Data Scientist? www.allyedu.in
  • 3. 2. Business Statistics o Univariate Analysis • Measures of central tendencies (Mean, Median & Mode) • Measures of dispersion (Range, Quartiles, Deciles, Percentiles, Standard deviation, Variance, Mean/Median/Mode Deviation) • Measures of shape (Normal distribution, Central Limit Theorem, Skewness (Left & Right), Kurtosis (Platy, Meso & Lepto)) • Tables (Counts, Frequency tables, Class intervals) • Charts (Histograms, Polygonal Charts, Ogive Charts) o Data Types • Qualitative & Quantitative data o Measurement & Scaling • Nominal, Ordinal, Interval & Ratio o Bi-variate Analysis • Cross tabs, Correlations o Hypothesis Testing • Null, Alternate hypothesis, level of significance, Type-I Error, Type-II Error) • Parametric Tests (Z-tests, T-tests, Chi-sq tests, ANOVA (one-way, two- way)) • Non-Parametric Tests (Wilcoxon Sign Rank test, Wilcoxon Sum Rank test, Kruskal Wallis test, Friedmann Rank test) www.allyedu.in
  • 4. Probability • Definitions (Probability, Events, Non-events, Mutually exclusive, Independent, Dependent, Exhaustive) • Distributions (Gaussian, Binomial, Bernoulli) • Variable Types (Continuous & Random) o Multivariate Analysis o Modeling Methodology • Setting the working directory, Importing the data, Splitting of the data, Data preprocessing (Missing value treatment, Outlier treatment), Multicollinearity, variable importance, Model development, Model validation using various diagnostic techniques o Supervised • Regression (Simple Linear, Multiple Linear & Binary Logistic), SVM, CART, Decision Tree, Random Forest, Naïve Bayes, KNN, ANN o Unsupervised • Clustering, Apriori o NLP • Text Mining o Forecasting • Regular, Seasonal, Cyclic, Irregular trends, Moving averages, Weighted moving averages, ACF, PACF, ARIMA www.allyedu.in
  • 5. 3. R o Introduction • D Introduction, History, Various versions, Installation, Terminologies, Advantages & Disadvantages o Packages • Definition, Objective, Installation, CRAN mirror o Help • Help & search functions o Working Directory • Objective, Setting the working directory, getting the working directory, file.choose o Importing & exporting • CSV, Excel & Text files o Data Types • Introduction to various data types - Vectors, Lists, Matrices, Arrays, Data Frames & Factors & corresponding functions o Data Creation o Data Conversions Converting from one data type to another data type www.allyedu.in
  • 6. o Understanding the data • Type, structure, dimension, # of rows & columns, nature of the data o Slicing & extraction of data • Subsetting the data by variables & records based on logic o Date & Time functions • Date & time formats, converting from one format to other format o Joins • Left, Right, Inner & Outer o Data Merging • Merging the data by rows, columns www.allyedu.in
  • 7. o Functions • Numeric, String, Arithmetic o Family Functions • Apply, Normal distribution o Missing values • Identification, position, # of missing values, removing the missing values, removing the records with missing values o Conditions • Ifelse, For loop, While loop o Plots • Bar, Pie, Histograms, Line & additional functions o Statistics • Univariate, Bivariate & Multivariate (Supervised & Unsupervised) analysis www.allyedu.in
  • 8. 4. Python o Introduction • Introduction, Advantages, Installation of Anaconda o Packages • Introduction to various packages like Pandas, Numpy, Sklearn, Scipy, Matplotlib & Tensorflow o Data Types • Strings, Tuples, Dictionaries, Sets, Lists & Arrays o Data Conversions • Converting from one data type to another data type o Understanding the data • Type, structure, dimension, # of rows & columns, nature of the data www.allyedu.in
  • 9. Slicing & extraction of data • Subsetting the data by variables & records based on logic o Working Directory • Objective, Setting the working directory, getting the working directory, file.choose o Errors & Exceptions • Differences between errors, exceptions, handling exceptions o Missing values • Identification, position, # of missing values, removing the missing values, removing the records with missing values o Conditions • Ifelse, For loop, While loop o Statistics • Univariate, Bivariate o Multivariate Analysis • Regression (Simple Linear, Multiple Linear & Binary Logistic) • Machine Learning (SVM, Decision Tree, Random Forest, KNN, ANN, Boosting Techniques) • Deep Learning (Image processing using CNN) • NLP (Text Mining)www.allyedu.in
  • 10. 5. Tableau o Introduction • Introduction to Visualization, Importance, Various tools, Tableau variants, Application of various charts based on data o Basics • Installation of trial version, Data importing, Live & extract connections, Dimensions, Measures, Parameters, Filters o Types of variables • Character, Numeric, Geographical, Hierarchical, Calculated variables o Functions • Arithmetic, Numeric, Character, Logical & case based functions o Panes & Legends • Various panes in Tableau, Filters, Color legends, Filter legends o Filters • Context, Local, Global filters etc o Charts • Bar, Line, Pie, Area, Circle, Bubble, Bullet etc o Advanced • Data blending, data extracts, packaged versions, Dashboards (Static & Dynamic), Publishing the dashboards, Geographical maps (at Zipcode level, County level, State level) & Map layers www.allyedu.in
  • 11. 6. Excel o Introduction • Importance, benefits of excel, Menu bar, Cell, Formula bar, # of rows & columns in excel, Difference b/w Excel & CSV o Basics • Writing functions, Arithmetic, Random number creation, saving the excel files, Sorting, Data filtration, Removal of duplicates, Inserting rows & columns, Deleting rows & coulumns, Toggling from left to right & top to bottom, Selecting rows & columns o Advanced • Pivot tables, Conditional formatting, Countif, Countifs, Sumif, Sumifs, Formattings for report, Vlookup, Hlookup, Generating the report o Statistics • Univariate, Bivariate Analysis o Multivariate Analysis • Linear Regression www.allyedu.in
  • 12. 7. SQL o Introduction • Introduction, various SQL languages, databases, importance of SQL o Basics • Various SQL functions like Select, Create, Insert into, from, top 100 o Advanced • Joins (Left, Right, Full Outer & Inner), Views (Importance, Creation) o Conditions • Where, Having, Order by, Group by o Operators • And, or, logical o Functions • Arithmetic, Numeric, Character & Logical www.allyedu.in
  • 13. 8.Artificial Intelligence and Machine Learning with Python Linear Models Understand linear approximation and modelling of problems and develop linear models Dimensionality Reduction Use ideas from linear algebra to transform dimensions and warp space providing additional flexibility and functionality to linear models. SVM Develop and implement kernel based methods to develop nonlinear models to solve few complex tasks. Nearest Neighbours, K-means, and Gaussian Mixture Models Review pattern recognition ideas with distance and cluster based models to understand similarity measures and grouping criteria. Naive Bayes and Decision Trees  Dive into applications of bayes theorem and the use of decision criteria when learning from data. •  Search  Look at search from the perspective of graphs, trees and heuristic based optimizations. •  Logic and Planning  Discover ways to encode logic and develop agents that plan actions in an environment. •  Reinforcement Learning and Hidden Markov Models  Engineering agents that learn from a sequence of actions using rewards and penalties. •  Q-Learning and Policy gradient  Operate in a stateful world over value and policy approximations tasks www.allyedu.in
  • 14. Data Preprocessing Regression Techniques Simple Linear Regression Multiple Linear Regression Polynomial Linear Regression Support Vector Regression Decision Tree Regression Random Forest Regression Evaluating Regression Model Performance Classification Techniques K-Nearest Neighbors (KNN) Support Vector Machine (SVM) Kernel SVM Nave Bayes Classification Decision Tree Classification Random Forest Classification Evaluating Classification Model Performance Natural Language Processing (NLP) Basic of NLP Language preprocessing Techniques Auto summarizing the given text document Clustering Techniques K-Means Clustering K-mini Batch Clustering Hierarchical Clustering www.allyedu.in
  • 15. Elbow Method Curve Smoothening Techniques Association Rule Learning Reinforcement Learning Basics of Numpy and panda Deep Learning Basics/what is Deep Learning Artificial Neural Networks Dimension Reduction Techniques Principal Component Analysis (PCA) Linear Discriminant Analysis (LDA) Statistics Basics Standard Deviation Variance Co-Variance T-distribution Pearson Correlation Coefficient (PCC)/ Correlation Coefficient www.allyedu.in