SlideShare a Scribd company logo
1 of 36
DEEP CODERS
ECKOVATION MACHINE LEARNING
members
Nitin Khatkar :01711503116
Sourav Tiwari :03011503116
Gulshan :01211503116
Shrey Achreja :41311503116
What cuisine is this recipe??
Picture yourself strolling through
your local, open-air market... What
do you see? What do you smell?
What will you make for dinner
tonight?
We want to thank Yummly for providing this unique dataset.
2
Data Description
▫ In the dataset, we include the
recipe id, the type of cuisine,
and the list of ingredients of
each recipe (of variable length).
The data is stored in JSON
format.
▫ An example of a recipe node in
train.json is given aside:
3
“We would predict the cuisine for
each recipe in the test case.”
4
5
STEPS FOLLOWED TO SOLVE THE GIVEN PROBLEM
STEP 3
At last, we will apply
the suitable algorithm
to it and find the best
suitable algorithm to
it.
STEP 1
First we will perform
EDA and will remove
all the redundant data
from given dataset.
STEP 2
Then , we will form our
feature matrix as well
target metrix.
PRE-PROCESSING
TOP 10 INGREDIENTS ACCORDING TO THE CUSINE
ALGORTITHM FOR FINDING TOP 10 CUISINE GIVEN ON NEXT SLIDE ->
7
ALGORITHM FOR FINDING TOP 10 CUISINE
▫ First make a dictionary with keys as different cuisine’s
and ingredients present in it as values.(dic)
▫ Then, with the help of above dictionary make a new
dictionary containing the counts of the ingredients
present in it.(count_dictionary)
▫ At last, make the pie chart of top 10 ingredients with
the help of above two dictionaries.(code given on next
slide)
8
CODE FOR PLOTTING TOP 10 INGREDIENTS
9
APPLYING
ML TO IT
FIRST GENERATING X AND Y FOR FURTHER
APPLYING ANY ALGORITHM TO IT.
10
11
GENERATING X AND Y
▫ Create an empty list
y,total_ingredients.
▫ Append all the unique ingredients in
the list total_ingredients.
▫ Create a zero matrix using numpy and
name it as x.(number of rows equal to
y and columns equal to
total_ingredients)
▫ For every ingredient in y replace with
1.
▫ Our feature matrix x and target y is
ready.
DIFFERENT ALGORITHM USED
12
FOURTH
FINALLY WE WENT FOR FINSL TEST OF DEEP
LEARNING BUT DUE COULD NOT PERFORM ON
FULL DATA DUE TO LOW END SOECIFIACTIONS
THIRD
OUT OF CUROSITY, WE ALSO
APPLIED NAÏVE BAYES TO IT, BUT
AGAIN GOT A VERY LOW SCORE OF
0.36
SECOND
THEN ,WE TESTED FOR RANDOM
FOREST AND GOT A SATISFACTORY
SCORE OF 0.72
FIRST
WE STARTED WITH DECISION TREE.
OUTCOME WAS NOT AT ALL FRUITFUL
AS IT ACHIEVED SCORE OF 0.30
13
ALGORITHM 1
▫ APPLYING DECISION
TREE TO IT.
▫ GOT A SCORE OF 0.26
14
ALGORITHM 2
▫ APPLYING RANDOM
FOREST TO IT.
▫ GOT A SCORE OF 0.72
15
ALGORITHM 3
▫ APPLYING NAÏVE NAYES
TO IT
▫ GOT A SCORE OF 0.36
16
ALGORITHM 4
▫ APPLYING DEEP
LEARNING TO IT.
▫ GOT A SCORE OF 0.64 ON
ONLY 10000 DATA
SHORTCOMINGS
Deep learning could not be
applied on whole dataset due to
low end specifications and SVM
could also not be applied due to
memory error.
RESULT
Conclusion
Random Forest is best algo ,
But if deep learning performed
on full dataset then , conclusion
may differ.
17
DATA COMPARISON
SCORE
DECISION TREE 0.30
NAÏVE BAYES 0.36
RANDOM FOREST 0.72
DEEP LEARNING 0.64
18
0.72
Final score achieved
Highest is 0.82
19
20
Forest Cover Type Prediction
▫ The study area includes four wilderness areas located in the
Roosevelt National Forest of northern Colorado. Each observation
is a 30m x 30m patch. We are asked to predict an integer
classification for the forest cover type.
21
Data Description
▫ The seven types are:
▫ 1 - Spruce/Fir
2 - Lodgepole Pine
3 - Ponderosa Pine
4 - Cottonwood/Willow
5 - Aspen
6 - Douglas-fir
7 - Krummholz
▫ The training set (15120 observations)
contains both features and the Cover_Type.
The test set contains only the features. You
must predict the Cover_Type for every
row in the test set (565892 observations).
“We would predict the forest-cover
type based upon the given value of
parameters.”
22
23
STEPS FOLLOWED TO SOLVE THE GIVEN PROBLEM
23
STEP 3
At last, we will apply
the suitable algorithm
to it and find the best
suitable algorithm to
it.
STEP 1
First we will perform
EDA and will remove
all the redundant data
from given dataset.
STEP 2
Then , we will form our
feature matrix as well
target metrix.
24
PRE-PROCESSING
PLOTTING PARAMETER’S WITH RESPECT TO FOREST-COVER TYPE
25
SAMPLE CODE FOR POTTING GRAPH
26
27
APPLYING
ML TO IT
FIRST GENERATING X AND Y FOR FURTHER
APPLYING ANY ALGORITHM TO IT.
27
28
GENERATING X AND Y
▫ Assigning the value’s in
cover_type column in target
matrix(y).
▫ Then, after removing the
cover_type from data-frame
assigning the value to x.
▫ Removing the redundant columns.
29
DIFFERENT ALGORITHM USED
FOURTH
Finally we went ahead for deep learning and got
score 0.84 as the Random forest.
THIRD
OUT OF CUROSITY, WE ALSO
APPLIED NAÏVE BAYES and SVM TO
IT, BUT AGAIN GOT A VERY LOW
SCORE OF 0.58 and 0.14 respectively.
SECOND
THEN ,WE TESTED FOR RANDOM
FOREST AND GOT A SATISFACTORY
SCORE OF 0.84
FIRST
WE STARTED WITH DECISION TREE.
OUTCOME WAS NOT AT ALL FRUITFUL
AS IT ACHIEVED SCORE OF 0.60
30
ALGORITHM 1
▫ Applying decision tree to
our problem
▫ We got a score of 0.30
31
ALGORITHM 2
▫ Applying random forest to
our problem we get.
▫ We got a score of 0.84
32
ALGORITHM 3
▫ Applied naïve-bayes and
SVM to it.
▫ But did not get fruitful
result because it’s not a
probability problem
33
ALGORITHM 4
▫ At last applying deep
learning to our problem
▫ We got a score of 0.80
34
DATA COMPARISON
SCORE
DECISION TREE 0.60
NAÏVE BAYES 0.58
RANDOM FOREST 0.84
SVM 0.14
DEEP LEARNING 0.80
34
35
0.84
Final score achieved
35
36
THANKS!

More Related Content

What's hot (19)

Solution of equations for methods iterativos
Solution of equations for methods iterativosSolution of equations for methods iterativos
Solution of equations for methods iterativos
 
n-squared_sorts
n-squared_sortsn-squared_sorts
n-squared_sorts
 
Me202 engineering mechanics l3
Me202 engineering mechanics l3Me202 engineering mechanics l3
Me202 engineering mechanics l3
 
Absolute inequalities
Absolute inequalitiesAbsolute inequalities
Absolute inequalities
 
Linear Systems Gauss Seidel
Linear Systems   Gauss SeidelLinear Systems   Gauss Seidel
Linear Systems Gauss Seidel
 
LYAPUNOV STABILITY PROBLEM SOLUTION
LYAPUNOV STABILITY PROBLEM SOLUTIONLYAPUNOV STABILITY PROBLEM SOLUTION
LYAPUNOV STABILITY PROBLEM SOLUTION
 
Lyapunov stability
Lyapunov stability Lyapunov stability
Lyapunov stability
 
Directs Methods
Directs MethodsDirects Methods
Directs Methods
 
Chapter 5
Chapter 5Chapter 5
Chapter 5
 
Insertion sort algorithm power point presentation
Insertion  sort algorithm power point presentation Insertion  sort algorithm power point presentation
Insertion sort algorithm power point presentation
 
Ib maths sl chain rule
Ib maths sl chain ruleIb maths sl chain rule
Ib maths sl chain rule
 
SQL: Unique IDs, Primary Keys and Archiving Inactive Rows Without Violating C...
SQL: Unique IDs, Primary Keys and Archiving Inactive Rows Without Violating C...SQL: Unique IDs, Primary Keys and Archiving Inactive Rows Without Violating C...
SQL: Unique IDs, Primary Keys and Archiving Inactive Rows Without Violating C...
 
Selection sort
Selection sortSelection sort
Selection sort
 
Me202 engineering mechanics l2
Me202 engineering mechanics l2Me202 engineering mechanics l2
Me202 engineering mechanics l2
 
CSS
CSSCSS
CSS
 
Interactives Methods
Interactives MethodsInteractives Methods
Interactives Methods
 
Iterative methods
Iterative methodsIterative methods
Iterative methods
 
ME202 Engineering Mechanics L6
ME202 Engineering Mechanics L6ME202 Engineering Mechanics L6
ME202 Engineering Mechanics L6
 
Ib maths sl product and quotient rules
Ib maths sl product and quotient rulesIb maths sl product and quotient rules
Ib maths sl product and quotient rules
 

Similar to deep_coders(sourav,nitin)

House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Predictionsriram30691
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural NetworkDessy Amirudin
 
Principal component analysis and lda
Principal component analysis and ldaPrincipal component analysis and lda
Principal component analysis and ldaSuresh Pokharel
 
Predicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesPredicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesVarad Meru
 
ALGORITHMS - SHORT NOTES
ALGORITHMS - SHORT NOTESALGORITHMS - SHORT NOTES
ALGORITHMS - SHORT NOTESsuthi
 
Workshop 4
Workshop 4Workshop 4
Workshop 4eeetq
 
Machine Learning Notes for beginners ,Step by step
Machine Learning Notes for beginners ,Step by stepMachine Learning Notes for beginners ,Step by step
Machine Learning Notes for beginners ,Step by stepSanjanaSaxena17
 
08 neural networks
08 neural networks08 neural networks
08 neural networksankit_ppt
 
Data Science Job Required Skill Analysis
Data Science Job Required Skill AnalysisData Science Job Required Skill Analysis
Data Science Job Required Skill AnalysisHarsh Kevadia
 
Numerical Techniques
Numerical TechniquesNumerical Techniques
Numerical TechniquesYasir Mahdi
 
ADA Unit — 2 Greedy Strategy and Examples | RGPV De Bunkers
ADA Unit — 2 Greedy Strategy and Examples | RGPV De BunkersADA Unit — 2 Greedy Strategy and Examples | RGPV De Bunkers
ADA Unit — 2 Greedy Strategy and Examples | RGPV De BunkersRGPV De Bunkers
 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingHsing-chuan Hsieh
 
Algorithm & data structures lec1
Algorithm & data structures lec1Algorithm & data structures lec1
Algorithm & data structures lec1s_raza
 
Sienna 1 intro
Sienna 1 introSienna 1 intro
Sienna 1 introchidabdu
 
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...WILIAMMAURICIOCAHUAT1
 
Bubble Sort algorithm in Assembly Language
Bubble Sort algorithm in Assembly LanguageBubble Sort algorithm in Assembly Language
Bubble Sort algorithm in Assembly LanguageAriel Tonatiuh Espindola
 
Calculator-Techniques for engineering.pptx
Calculator-Techniques for engineering.pptxCalculator-Techniques for engineering.pptx
Calculator-Techniques for engineering.pptxSoleil50
 
CS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of AlgorithmsCS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of AlgorithmsKrishnan MuthuManickam
 

Similar to deep_coders(sourav,nitin) (20)

House Sale Price Prediction
House Sale Price PredictionHouse Sale Price Prediction
House Sale Price Prediction
 
Artificial Neural Network
Artificial Neural NetworkArtificial Neural Network
Artificial Neural Network
 
Principal component analysis and lda
Principal component analysis and ldaPrincipal component analysis and lda
Principal component analysis and lda
 
Week 4
Week 4Week 4
Week 4
 
Predicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensemblesPredicting rainfall using ensemble of ensembles
Predicting rainfall using ensemble of ensembles
 
ALGORITHMS - SHORT NOTES
ALGORITHMS - SHORT NOTESALGORITHMS - SHORT NOTES
ALGORITHMS - SHORT NOTES
 
Workshop 4
Workshop 4Workshop 4
Workshop 4
 
Machine Learning Notes for beginners ,Step by step
Machine Learning Notes for beginners ,Step by stepMachine Learning Notes for beginners ,Step by step
Machine Learning Notes for beginners ,Step by step
 
08 neural networks
08 neural networks08 neural networks
08 neural networks
 
Ma3bfet par 10.6 5 aug 2014
Ma3bfet par 10.6 5 aug 2014Ma3bfet par 10.6 5 aug 2014
Ma3bfet par 10.6 5 aug 2014
 
Data Science Job Required Skill Analysis
Data Science Job Required Skill AnalysisData Science Job Required Skill Analysis
Data Science Job Required Skill Analysis
 
Numerical Techniques
Numerical TechniquesNumerical Techniques
Numerical Techniques
 
ADA Unit — 2 Greedy Strategy and Examples | RGPV De Bunkers
ADA Unit — 2 Greedy Strategy and Examples | RGPV De BunkersADA Unit — 2 Greedy Strategy and Examples | RGPV De Bunkers
ADA Unit — 2 Greedy Strategy and Examples | RGPV De Bunkers
 
Efficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketchingEfficient anomaly detection via matrix sketching
Efficient anomaly detection via matrix sketching
 
Algorithm & data structures lec1
Algorithm & data structures lec1Algorithm & data structures lec1
Algorithm & data structures lec1
 
Sienna 1 intro
Sienna 1 introSienna 1 intro
Sienna 1 intro
 
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
APLICACIONES DE LA DERIVADA EN LA CARRERA DE (Mecánica, Electrónica, Telecomu...
 
Bubble Sort algorithm in Assembly Language
Bubble Sort algorithm in Assembly LanguageBubble Sort algorithm in Assembly Language
Bubble Sort algorithm in Assembly Language
 
Calculator-Techniques for engineering.pptx
Calculator-Techniques for engineering.pptxCalculator-Techniques for engineering.pptx
Calculator-Techniques for engineering.pptx
 
CS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of AlgorithmsCS8451 - Design and Analysis of Algorithms
CS8451 - Design and Analysis of Algorithms
 

Recently uploaded

IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 

Recently uploaded (20)

Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 

deep_coders(sourav,nitin)

  • 1. DEEP CODERS ECKOVATION MACHINE LEARNING members Nitin Khatkar :01711503116 Sourav Tiwari :03011503116 Gulshan :01211503116 Shrey Achreja :41311503116
  • 2. What cuisine is this recipe?? Picture yourself strolling through your local, open-air market... What do you see? What do you smell? What will you make for dinner tonight? We want to thank Yummly for providing this unique dataset. 2
  • 3. Data Description ▫ In the dataset, we include the recipe id, the type of cuisine, and the list of ingredients of each recipe (of variable length). The data is stored in JSON format. ▫ An example of a recipe node in train.json is given aside: 3
  • 4. “We would predict the cuisine for each recipe in the test case.” 4
  • 5. 5 STEPS FOLLOWED TO SOLVE THE GIVEN PROBLEM STEP 3 At last, we will apply the suitable algorithm to it and find the best suitable algorithm to it. STEP 1 First we will perform EDA and will remove all the redundant data from given dataset. STEP 2 Then , we will form our feature matrix as well target metrix.
  • 6. PRE-PROCESSING TOP 10 INGREDIENTS ACCORDING TO THE CUSINE
  • 7. ALGORTITHM FOR FINDING TOP 10 CUISINE GIVEN ON NEXT SLIDE -> 7
  • 8. ALGORITHM FOR FINDING TOP 10 CUISINE ▫ First make a dictionary with keys as different cuisine’s and ingredients present in it as values.(dic) ▫ Then, with the help of above dictionary make a new dictionary containing the counts of the ingredients present in it.(count_dictionary) ▫ At last, make the pie chart of top 10 ingredients with the help of above two dictionaries.(code given on next slide) 8
  • 9. CODE FOR PLOTTING TOP 10 INGREDIENTS 9
  • 10. APPLYING ML TO IT FIRST GENERATING X AND Y FOR FURTHER APPLYING ANY ALGORITHM TO IT. 10
  • 11. 11 GENERATING X AND Y ▫ Create an empty list y,total_ingredients. ▫ Append all the unique ingredients in the list total_ingredients. ▫ Create a zero matrix using numpy and name it as x.(number of rows equal to y and columns equal to total_ingredients) ▫ For every ingredient in y replace with 1. ▫ Our feature matrix x and target y is ready.
  • 12. DIFFERENT ALGORITHM USED 12 FOURTH FINALLY WE WENT FOR FINSL TEST OF DEEP LEARNING BUT DUE COULD NOT PERFORM ON FULL DATA DUE TO LOW END SOECIFIACTIONS THIRD OUT OF CUROSITY, WE ALSO APPLIED NAÏVE BAYES TO IT, BUT AGAIN GOT A VERY LOW SCORE OF 0.36 SECOND THEN ,WE TESTED FOR RANDOM FOREST AND GOT A SATISFACTORY SCORE OF 0.72 FIRST WE STARTED WITH DECISION TREE. OUTCOME WAS NOT AT ALL FRUITFUL AS IT ACHIEVED SCORE OF 0.30
  • 13. 13 ALGORITHM 1 ▫ APPLYING DECISION TREE TO IT. ▫ GOT A SCORE OF 0.26
  • 14. 14 ALGORITHM 2 ▫ APPLYING RANDOM FOREST TO IT. ▫ GOT A SCORE OF 0.72
  • 15. 15 ALGORITHM 3 ▫ APPLYING NAÏVE NAYES TO IT ▫ GOT A SCORE OF 0.36
  • 16. 16 ALGORITHM 4 ▫ APPLYING DEEP LEARNING TO IT. ▫ GOT A SCORE OF 0.64 ON ONLY 10000 DATA
  • 17. SHORTCOMINGS Deep learning could not be applied on whole dataset due to low end specifications and SVM could also not be applied due to memory error. RESULT Conclusion Random Forest is best algo , But if deep learning performed on full dataset then , conclusion may differ. 17
  • 18. DATA COMPARISON SCORE DECISION TREE 0.30 NAÏVE BAYES 0.36 RANDOM FOREST 0.72 DEEP LEARNING 0.64 18
  • 20. 20 Forest Cover Type Prediction ▫ The study area includes four wilderness areas located in the Roosevelt National Forest of northern Colorado. Each observation is a 30m x 30m patch. We are asked to predict an integer classification for the forest cover type.
  • 21. 21 Data Description ▫ The seven types are: ▫ 1 - Spruce/Fir 2 - Lodgepole Pine 3 - Ponderosa Pine 4 - Cottonwood/Willow 5 - Aspen 6 - Douglas-fir 7 - Krummholz ▫ The training set (15120 observations) contains both features and the Cover_Type. The test set contains only the features. You must predict the Cover_Type for every row in the test set (565892 observations).
  • 22. “We would predict the forest-cover type based upon the given value of parameters.” 22
  • 23. 23 STEPS FOLLOWED TO SOLVE THE GIVEN PROBLEM 23 STEP 3 At last, we will apply the suitable algorithm to it and find the best suitable algorithm to it. STEP 1 First we will perform EDA and will remove all the redundant data from given dataset. STEP 2 Then , we will form our feature matrix as well target metrix.
  • 24. 24 PRE-PROCESSING PLOTTING PARAMETER’S WITH RESPECT TO FOREST-COVER TYPE
  • 25. 25
  • 26. SAMPLE CODE FOR POTTING GRAPH 26
  • 27. 27 APPLYING ML TO IT FIRST GENERATING X AND Y FOR FURTHER APPLYING ANY ALGORITHM TO IT. 27
  • 28. 28 GENERATING X AND Y ▫ Assigning the value’s in cover_type column in target matrix(y). ▫ Then, after removing the cover_type from data-frame assigning the value to x. ▫ Removing the redundant columns.
  • 29. 29 DIFFERENT ALGORITHM USED FOURTH Finally we went ahead for deep learning and got score 0.84 as the Random forest. THIRD OUT OF CUROSITY, WE ALSO APPLIED NAÏVE BAYES and SVM TO IT, BUT AGAIN GOT A VERY LOW SCORE OF 0.58 and 0.14 respectively. SECOND THEN ,WE TESTED FOR RANDOM FOREST AND GOT A SATISFACTORY SCORE OF 0.84 FIRST WE STARTED WITH DECISION TREE. OUTCOME WAS NOT AT ALL FRUITFUL AS IT ACHIEVED SCORE OF 0.60
  • 30. 30 ALGORITHM 1 ▫ Applying decision tree to our problem ▫ We got a score of 0.30
  • 31. 31 ALGORITHM 2 ▫ Applying random forest to our problem we get. ▫ We got a score of 0.84
  • 32. 32 ALGORITHM 3 ▫ Applied naïve-bayes and SVM to it. ▫ But did not get fruitful result because it’s not a probability problem
  • 33. 33 ALGORITHM 4 ▫ At last applying deep learning to our problem ▫ We got a score of 0.80
  • 34. 34 DATA COMPARISON SCORE DECISION TREE 0.60 NAÏVE BAYES 0.58 RANDOM FOREST 0.84 SVM 0.14 DEEP LEARNING 0.80 34