SlideShare a Scribd company logo
1 of 2
Download to read offline
PARMANAND SAHU
Dallas,TX | parmanand.sahu@utdallas.edu | 972-730-3967 | linkedin.com/in/parmanandsahu/ | https://parmanandsahu.com/
EDUCATION
THE UNIVERSITY OF TEXAS AT DALLAS, Richardson, TX Aug 2019 - May 2021
Master of Science in Computer Science 3.66/4
NATIONAL INSTITUTE OF TECHNOLOGY, Raipur, IN Jul 2009 - Jul 2013
Bachelor of Technology (Hons.) in Metallurgical Engineering 8.35/10
TECHNICAL SKILLS
Languages Python, C, Java, Scala, Matlab, Node.js
Databases MongoDB, MySQL, DynamoDB, Neptune, ElasticSearch, Neo4j
Libraries Numpy, Pandas, Matplotlib, Plotly, NLTK, Gensim, Sklearn, Spacy, Scipy, Tensorflow, PyTorch
ML Algorithms Logistic Regression, Dimensional Reduction, SVM, Clustering, Tree Based Algorithms, Ensemble Techniques
DL Algorithms RNN, LSTM, CNN, Attention Mechanism, Word Embeddings
NLP task Sequence-to-Sequence, Sequence tagging and Classification, Named Entity Recognition, Question Answering
Big Data Tools Hadoop, Spark, PySpark, MLlib, Hive, Impala, GraphX
Technologies Linux, Git, Django, Rest-API, Flask, Docker, Kubernetes, ML-flow, AWS, ECS, EKR, ECR, Nextflow
WORK EXPERIENCE
Lantern Pharma : Clinical stage pharmaceutical company Jan 2021 - Apr 2021
Data Scientist and Platform Development Intern Dallas,TX
Data Pipeline for Cancer Drug exploration platform
– Develop module to ingest genomics data (TCGA) of 50k+ samples and 3 billion+ data points into cloud infrastructure
– Designed and implemented microservice in AWS for ingesting genomics data
Capital One: Bank Holding Company Jun 2020 - Aug 2020
Data Science Intern McLean,VA
Neural Network Model to predict Mortgage Based Security Prepayment Rate
– Performed EDA and preprocessesed a specific category of Mortgage Based Securities(MBS) in investment portfolio.
– Implemented parallel processing for tuning hyperparameters of model to predict prepayment rate with extensive logging.
– Built and analyzed automated performance report (PDP/ICE, SHAP, and S-Curves) for comparing models.
VHSS Lab: Research Lab at UTD Sep 2019 - May 2020
Machine Learning Specialist Richardson,TX
NSF funded Conversational Emotive Virtual Reality patient project
– Researched and trained transformer-based model for virtual patient interacting with medical students using Pytorch.
Huddl.ai : Video Communications Provider Apr 2018 - Jun 2019
Artificial Intelligence Engineer Hyderabad, India
Named Entity Recognition for Voice Assistant
– Supervised data preparation team and retrained custom spacy model for Named-Entity-Recognition..
– Built module using Levenshtein Distance and Phonetic similarity to fix incorrect transcription for recognized entities.
– Developed micro-service using Node.js and DynamoDB to use as gazetteer in NER.
– Designed module using regex for extracting entities like Time and Date from voice commands.
– Packaged micro-services into docker for deployment in Kubernetes.
Reverse image search for information retrieval
– Developed parser for OCR response and utilized K-means to classify content to reduce false positive.
– Built keyword extraction using RAKE and graph-based algorithm for ranking meetings on search results.
Action Item Detection in Meeting Transcript
– Trained LSTM-RNN based model to classify the action items in the meeting transcript 95% accuracy.
– Deployed ML-flow for internal use and track experiments with different hyperparameters.
CoArtha Technosolution: Talent Acquisition platform Sep 2017 - Apr 2018
Associate Data Science Engineer Hyderabad, India
Semantic Understanding of Job Description for ranking resumes
– Trained model using Naive Bayes to classify sentences in job descriptions with 90%+ accuracy for matching resumes.
– Assisted in developing scoring logic to match job descriptions with resumes.
Candidate screening from audio interviews
– Employed Random Forest for classifying candidates using interview response audio with 90+% accuracy.
CoArtha Technosolution: Talent Acquisition platform Sep 2016 - Aug 2017
Associate Software Engineer Hyderabad, India
Knowledge Graph from job descriptions for ranking resumes
– Built a part of pipeline to scrape US job boards and pre-processed data using Selenium and Beautiful Soup.
– Built a part of Knowledge graph using Neo4j with skills, job titles & education entities from 100k+ job descriptions.
Semantic parsing of resumes
– Trained ensemble model to identify sections(contact, education, experience and skill)
– Implemented solution using regex for extracting entities and parsed table to correlate extracted entities.
DigiFledged: Digital Marketing Startup Jul 2015 - Aug 2016
Founder Bhilai, India
– Managed daily operation and acquired technical & functional requirements of projects from new clients.
– Led a team to deliver 5+ web development, 17+ freelancing projects and establish a blog with 130K+ page-views.
JSW Steel Ltd: Steel Manufacturing Company Feb 2014 - Apr 2015
Junior Manager Bellary, India
– Analyzed production reports discovering insights through exploratory data analysis using MS Excel and R.
PROJECTS
Question Answering on SQUAD 1.0 (LSTM | RNN | Self Attention | Pytorch)
– Preprocessed and extracted custom features along with pre-trained word embedding(Glove).
– Trained QA model(simplified Stanford Attentive Reader) with 70% F1 Score.
Language Model for Auto complete sentence
– Preprocess data for n-gram language model with smoothing for sentence auto complete
Named Entity Recognition on CONLL 2003 (RNN | GRU | Pytorch)
– Preprocess and prepare vocabulary for embedding layer
– Trained and compared Vanilla RNN(83%) and GRU RNN(86%)
Document search using approximate k-nearest neighbor
– Implemented local sensitive hashing(LSH) for multiple universes (different set of random planes)
– Developed document search using approximate k-nearest neighbor and LSH
Credit Card Transaction Fraud Detection(Random Forest | Logistic Regression | Feature Engineering | Sci-kit)
– Imputed missing data,created custom features,normalize and encode features
– Performed exploratory data analysis of features
– Handled imbalanced data using SMOTE and custom loss function.
– Train and evaluate linear/tree based classifier methods
– Analyzed feature importance w.r.t to dependent variable
Image Identification on CIFAR-10 dataset (Pytorch | Convolutions Neural Network )
– Augmented image data using transformation technique(random crop,vertical flip)
– Implemented and trained RESNET family of architecture for image detection with 86% accuracy
Ensemble Method and Decision Tree from scratch (Decision Tree | Bagging | Adaboost | Sci-Kit)
– Implemented fixed depth decision(ID3) tree from scratch for monk’s classification dataset
– Implemented Bagging and AdaBoost and compared with Sci-kit implementation for Mushroom bruises
CERTIFICATIONS AND ACTIVITIES
– Natural Language Processing, Machine Learning and Deep Learning by Coursera
– AWS Services by The University of Texas at Dallas and Linkedin Learning
– Linked Data Engineering by Hasso-Plattner Institute: Building Knowledge Graph,2016.
– M101: MongoDB for Developers by MongoDB University

More Related Content

Similar to Resume (20)

Prashant resume
Prashant resumePrashant resume
Prashant resume
 
Nikhil_Ayyagari_Resume
Nikhil_Ayyagari_ResumeNikhil_Ayyagari_Resume
Nikhil_Ayyagari_Resume
 
Shantanu Gupta
Shantanu GuptaShantanu Gupta
Shantanu Gupta
 
Resume Rishabh C
Resume Rishabh CResume Rishabh C
Resume Rishabh C
 
Resume(kaushik shakkari)
Resume(kaushik shakkari)Resume(kaushik shakkari)
Resume(kaushik shakkari)
 
HP resume
HP resumeHP resume
HP resume
 
SoumadeepMazumdarResume
SoumadeepMazumdarResumeSoumadeepMazumdarResume
SoumadeepMazumdarResume
 
Resume (kaushik shakkari)
Resume (kaushik shakkari)Resume (kaushik shakkari)
Resume (kaushik shakkari)
 
Shubhangi nov20
Shubhangi nov20Shubhangi nov20
Shubhangi nov20
 
Raghava Prasad S Resume
Raghava Prasad S ResumeRaghava Prasad S Resume
Raghava Prasad S Resume
 
Resume-Hpendyala
Resume-HpendyalaResume-Hpendyala
Resume-Hpendyala
 
Data science nlp_resume-2018-abridged
Data science nlp_resume-2018-abridgedData science nlp_resume-2018-abridged
Data science nlp_resume-2018-abridged
 
Data Scientist -Asish
Data Scientist -AsishData Scientist -Asish
Data Scientist -Asish
 
Resume
ResumeResume
Resume
 
nikhilAyyagari_Fulltime_Resume
nikhilAyyagari_Fulltime_ResumenikhilAyyagari_Fulltime_Resume
nikhilAyyagari_Fulltime_Resume
 
Resume
ResumeResume
Resume
 
SaiTejaDuthuluri
SaiTejaDuthuluriSaiTejaDuthuluri
SaiTejaDuthuluri
 
Yu's resume
Yu's resumeYu's resume
Yu's resume
 
RAJARAM R
RAJARAM RRAJARAM R
RAJARAM R
 
Sanmitra Ijeri Resume
Sanmitra Ijeri ResumeSanmitra Ijeri Resume
Sanmitra Ijeri Resume
 

Recently uploaded

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 

Recently uploaded (20)

Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 

Resume

  • 1. PARMANAND SAHU Dallas,TX | parmanand.sahu@utdallas.edu | 972-730-3967 | linkedin.com/in/parmanandsahu/ | https://parmanandsahu.com/ EDUCATION THE UNIVERSITY OF TEXAS AT DALLAS, Richardson, TX Aug 2019 - May 2021 Master of Science in Computer Science 3.66/4 NATIONAL INSTITUTE OF TECHNOLOGY, Raipur, IN Jul 2009 - Jul 2013 Bachelor of Technology (Hons.) in Metallurgical Engineering 8.35/10 TECHNICAL SKILLS Languages Python, C, Java, Scala, Matlab, Node.js Databases MongoDB, MySQL, DynamoDB, Neptune, ElasticSearch, Neo4j Libraries Numpy, Pandas, Matplotlib, Plotly, NLTK, Gensim, Sklearn, Spacy, Scipy, Tensorflow, PyTorch ML Algorithms Logistic Regression, Dimensional Reduction, SVM, Clustering, Tree Based Algorithms, Ensemble Techniques DL Algorithms RNN, LSTM, CNN, Attention Mechanism, Word Embeddings NLP task Sequence-to-Sequence, Sequence tagging and Classification, Named Entity Recognition, Question Answering Big Data Tools Hadoop, Spark, PySpark, MLlib, Hive, Impala, GraphX Technologies Linux, Git, Django, Rest-API, Flask, Docker, Kubernetes, ML-flow, AWS, ECS, EKR, ECR, Nextflow WORK EXPERIENCE Lantern Pharma : Clinical stage pharmaceutical company Jan 2021 - Apr 2021 Data Scientist and Platform Development Intern Dallas,TX Data Pipeline for Cancer Drug exploration platform – Develop module to ingest genomics data (TCGA) of 50k+ samples and 3 billion+ data points into cloud infrastructure – Designed and implemented microservice in AWS for ingesting genomics data Capital One: Bank Holding Company Jun 2020 - Aug 2020 Data Science Intern McLean,VA Neural Network Model to predict Mortgage Based Security Prepayment Rate – Performed EDA and preprocessesed a specific category of Mortgage Based Securities(MBS) in investment portfolio. – Implemented parallel processing for tuning hyperparameters of model to predict prepayment rate with extensive logging. – Built and analyzed automated performance report (PDP/ICE, SHAP, and S-Curves) for comparing models. VHSS Lab: Research Lab at UTD Sep 2019 - May 2020 Machine Learning Specialist Richardson,TX NSF funded Conversational Emotive Virtual Reality patient project – Researched and trained transformer-based model for virtual patient interacting with medical students using Pytorch. Huddl.ai : Video Communications Provider Apr 2018 - Jun 2019 Artificial Intelligence Engineer Hyderabad, India Named Entity Recognition for Voice Assistant – Supervised data preparation team and retrained custom spacy model for Named-Entity-Recognition.. – Built module using Levenshtein Distance and Phonetic similarity to fix incorrect transcription for recognized entities. – Developed micro-service using Node.js and DynamoDB to use as gazetteer in NER. – Designed module using regex for extracting entities like Time and Date from voice commands. – Packaged micro-services into docker for deployment in Kubernetes. Reverse image search for information retrieval – Developed parser for OCR response and utilized K-means to classify content to reduce false positive. – Built keyword extraction using RAKE and graph-based algorithm for ranking meetings on search results. Action Item Detection in Meeting Transcript – Trained LSTM-RNN based model to classify the action items in the meeting transcript 95% accuracy. – Deployed ML-flow for internal use and track experiments with different hyperparameters. CoArtha Technosolution: Talent Acquisition platform Sep 2017 - Apr 2018 Associate Data Science Engineer Hyderabad, India Semantic Understanding of Job Description for ranking resumes
  • 2. – Trained model using Naive Bayes to classify sentences in job descriptions with 90%+ accuracy for matching resumes. – Assisted in developing scoring logic to match job descriptions with resumes. Candidate screening from audio interviews – Employed Random Forest for classifying candidates using interview response audio with 90+% accuracy. CoArtha Technosolution: Talent Acquisition platform Sep 2016 - Aug 2017 Associate Software Engineer Hyderabad, India Knowledge Graph from job descriptions for ranking resumes – Built a part of pipeline to scrape US job boards and pre-processed data using Selenium and Beautiful Soup. – Built a part of Knowledge graph using Neo4j with skills, job titles & education entities from 100k+ job descriptions. Semantic parsing of resumes – Trained ensemble model to identify sections(contact, education, experience and skill) – Implemented solution using regex for extracting entities and parsed table to correlate extracted entities. DigiFledged: Digital Marketing Startup Jul 2015 - Aug 2016 Founder Bhilai, India – Managed daily operation and acquired technical & functional requirements of projects from new clients. – Led a team to deliver 5+ web development, 17+ freelancing projects and establish a blog with 130K+ page-views. JSW Steel Ltd: Steel Manufacturing Company Feb 2014 - Apr 2015 Junior Manager Bellary, India – Analyzed production reports discovering insights through exploratory data analysis using MS Excel and R. PROJECTS Question Answering on SQUAD 1.0 (LSTM | RNN | Self Attention | Pytorch) – Preprocessed and extracted custom features along with pre-trained word embedding(Glove). – Trained QA model(simplified Stanford Attentive Reader) with 70% F1 Score. Language Model for Auto complete sentence – Preprocess data for n-gram language model with smoothing for sentence auto complete Named Entity Recognition on CONLL 2003 (RNN | GRU | Pytorch) – Preprocess and prepare vocabulary for embedding layer – Trained and compared Vanilla RNN(83%) and GRU RNN(86%) Document search using approximate k-nearest neighbor – Implemented local sensitive hashing(LSH) for multiple universes (different set of random planes) – Developed document search using approximate k-nearest neighbor and LSH Credit Card Transaction Fraud Detection(Random Forest | Logistic Regression | Feature Engineering | Sci-kit) – Imputed missing data,created custom features,normalize and encode features – Performed exploratory data analysis of features – Handled imbalanced data using SMOTE and custom loss function. – Train and evaluate linear/tree based classifier methods – Analyzed feature importance w.r.t to dependent variable Image Identification on CIFAR-10 dataset (Pytorch | Convolutions Neural Network ) – Augmented image data using transformation technique(random crop,vertical flip) – Implemented and trained RESNET family of architecture for image detection with 86% accuracy Ensemble Method and Decision Tree from scratch (Decision Tree | Bagging | Adaboost | Sci-Kit) – Implemented fixed depth decision(ID3) tree from scratch for monk’s classification dataset – Implemented Bagging and AdaBoost and compared with Sci-kit implementation for Mushroom bruises CERTIFICATIONS AND ACTIVITIES – Natural Language Processing, Machine Learning and Deep Learning by Coursera – AWS Services by The University of Texas at Dallas and Linkedin Learning – Linked Data Engineering by Hasso-Plattner Institute: Building Knowledge Graph,2016. – M101: MongoDB for Developers by MongoDB University