SlideShare a Scribd company logo
1 of 2
Download to read offline
International School of Engineering
awards
Certificate of Completion
to
Sanna Reddy Bharath
for the 288-hour program in
Big Data Analytics and Optimization
conducted between August 09, 2014 and December 21, 2014.
This program is certified for quality of content, assessment and pedagogy by the Language Technologies Institute (LTI)
of Carnegie Mellon University (CMU). LTI also provided assistance in curriculum development for this program.
Dated this sixth day of February, two thousand and fifteen.
Dr. Dakshinamurthy V Kolluru Dr. Sridhar Pappu
President Executive VP - Academics
01CSE03/201408/392 Program details are on the back
Mode: Classroom Teaching
Topics Covered
Certificate Type
Certificate of Participation Assessment-based training program Professional certification
Essential Business Skills for a Data Scientist
Why build models or use data to run a business? What kind of models are built? Were do models not work? How do
you make predictions? When does big unstructured data become important? What do you need to build an analytics
group in your organization?
Developing a business plan; Case analysis
Planning and Thinking Skills for Architecting Data Science Solutions
Thinking tools: Approximations and estimations, Geometric visualization of data and models, Probabilistic analysis of
data and models, Analyzing networks and graphs: Analyzing transitions, Markov chains and unstructured data;
Estimating complexity of algorithms
Choosing the right models and architecting a solution: Structure and anatomy of models, Problematic data and
choosing the right experimentation
Sources of errors in predictive models and techniques to minimize them
Interacting with technical and business teams; Case study
Essential Engineering Skills in Big Data Analytics
Reading from Excel, CSV and other forms; Data exploration (histograms, bar charts, box plots, line graphs and scatter
graphs); Storytelling with data: The science, ggplot, bubble charts with multiple dimensions, gauge charts, tree maps,
heat maps and motion charts
Advanced data pre-processing using Excel
Data pre-processing of structured data: R, Handling missing values, Binning, Standardization, Outliers/Noise, PCA, Type
conversion
Statistical Modeling for Predictive Analytics in Engineering and Business
Computing the properties of an attribute: Central tendencies (Mean, Median, Mode, Range, Variance, Standard
Deviation); Expectations of a Variable; Moment Generating Functions; Describing an attribute: Probability distributions
(Discrete and Continuous) - Bernoulli, Geometric, Binomial, Poisson and Exponential distributions; Special emphasis on
Normal distribution; Central Limit Theorem
Describing the relationship between attributes: Covariance; Correlation; ChiSquare
Inferential statistics: How to learn about the population from a sample and vice-versa, Sampling distributions,
Confidence Intervals, Hypothesis Testing
ANOVA; SPC
Regression (Linear, Multivariate Regression) in forecasting; Analyzing and interpreting regression results; Logistic
Regression
Trend analysis and Time Series; Cyclical and Seasonal analysis; Box-Jenkins method; Smoothing; Moving averages; Auto
-correlation; ARIMA – Holt-Winters method
Bayesian analysis and Naïve Bayes classifier; Bayesian Belief Networks
Optimization and Decision Analysis
Genetic algorithms: The algorithm and the process, Representing data, Why and how do they work?
Linear Programming: Graphical analysis; Sensitivity and Duality analyses; Integer and Binary programming: Applications,
Problem formulation, Solving in R; Goal programming; Data development analysis; Quadratic programming
Engineering Big Data with R and Hadoop Ecosystem
Introduction—Big Data, Hadoop applications; Parallel and Distributed computing; Introduction to algorithms;
Concurrent algorithms; Linux and Java refresher; R and Python refresher; NoSQL; HDFS; CDH4-HDFS
Map Reduce: YARN
Map Reduce Applications: Text Mining, Page Rank, Graph processing
Hadoop ecosystem components: Pig, Hive, HBase, Sqoop, Mahout, Hama, Flume, Chukwa, Avro, Whirr, Hue, Oozie,
Zookeeper
R-Hadoop
Text Mining, Social Network Analysis and Natural Language Processing
Introduction to text mining and text pre-processing: Write a web crawler to collect data, R, Find unique words and
counts, Handling number, Punctuations, Stop words, Incorrect spellings, Stemming, Lemmatization and TxD
computation
Unstructured vs. semi-structured data; Fundamentals of information retrieval
Properties of words; Vector space models; Creating Term-Document (TxD) matrices; Similarity measures
Low-level processes (Sentence Splitting; Tokenization; Part-of-Speech Tagging; Stemming; Chunking)
Text classification and feature selection: How to use Naïve Bayes classifier for text classification
Evaluation systems on the accuracy of text mining
Sentiment Analysis
Natural Language Analysis
Discussion of text mining tools and applications
Methods and Algorithms in Machine Learning
Rule based knowledge: Logic of rules, Evaluating rules, Rule induction and Association rules
Construction of Decision Trees through simplified examples; Choosing the "best" attribute at each
non-leaf node; Entropy; Information Gain; Generalizing Decision Trees; Information Content and Gain Ratio; Dealing
with numerical variables; Other measures of randomness; Pruning a Decision Tree; Cost as a consideration; Unwrapping
Trees as rules
Specialized decision trees (oblique trees)
Ensemble and Hybrid models
AdaBoost, Random Forests and Gradient boosting machines
K-Nearest Neighbor method; Wilson editing and triangulations; K-nearest neighbors in collaborative filtering, digit
recognition
Motivation for Neural Networks and its applications; Perceptron and Single Layer Neural Network, and hand
calculations; Learning in a Neural Net: Back propagation and conjugant gradient techniques; Application of Neural Net
in Face and Digit Recognition
Deep Learning techniques
Connectivity models (hierarchical clustering); Centroid models (K-Means algorithm); Distribution models (Expectation
maximization); Spectral clustering
Linear learning machines and Kernel methods in learning
VC (Vapnik-Chervonenkis) dimension; Shattering power of models
Algorithm of Support Vector Machines (SVM)
Communication, Ethical and IP Challenges for Analytics Professionals
Why is Communication important?
How to communicate effectively: Telling stories
Communications issues from daily life using examples using audio, video, blogs, charts, email, etc.
Seeing the big picture; Paying attention to details; Seeing things from multiple perspectives
Challenges: Mix of stakeholders, Explicability of results, Visualization
Guiding Principles: Clarity, Transparency, Integrity, Humility
Framework for Effective Presentations; Examples of bad and good presentations
Writing effective technical reports
Difference between Legal and Ethical issues
Challenges in current laws, regulations and fair information practices: Data protection, Intellectual property rights,
Confidentiality, Contractual liability, Competition law, Licensing of Open Source software and Open Data
How to handle legal, ethical and IP issues at an organization and an individual level?
The “Ethics Check” questions

More Related Content

What's hot

438_AmeeruddinMohammed
438_AmeeruddinMohammed438_AmeeruddinMohammed
438_AmeeruddinMohammedAmeeruddin MD
 
776_AlluruMPranav_CEE
776_AlluruMPranav_CEE776_AlluruMPranav_CEE
776_AlluruMPranav_CEEPranav A
 
797_NaveenKKapoor_CEE
797_NaveenKKapoor_CEE797_NaveenKKapoor_CEE
797_NaveenKKapoor_CEENaveen Kapoor
 
Ds shipra sharan_resume
Ds shipra sharan_resumeDs shipra sharan_resume
Ds shipra sharan_resumeshiprasharan3
 
Big data Intro - Presentation to OCHackerz Meetup Group
Big data Intro - Presentation to OCHackerz Meetup GroupBig data Intro - Presentation to OCHackerz Meetup Group
Big data Intro - Presentation to OCHackerz Meetup GroupSri Kanajan
 
Bootcamp python-1
Bootcamp python-1Bootcamp python-1
Bootcamp python-1Era Wibowo
 
Data visualization in a Nutshell
Data visualization in a NutshellData visualization in a Nutshell
Data visualization in a NutshellWingChan46
 
Sivrama Sarma - Profile_July_2015
Sivrama Sarma - Profile_July_2015Sivrama Sarma - Profile_July_2015
Sivrama Sarma - Profile_July_2015Siva Rama Sarma
 
Machine Learning Real Life Applications By Examples
Machine Learning Real Life Applications By ExamplesMachine Learning Real Life Applications By Examples
Machine Learning Real Life Applications By ExamplesMario Cartia
 
20170110_IOuellette_CV
20170110_IOuellette_CV20170110_IOuellette_CV
20170110_IOuellette_CVIan Ouellette
 
How to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical FeaturesHow to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical FeaturesDomino Data Lab
 

What's hot (19)

671_JeevanRavula_CEE
671_JeevanRavula_CEE671_JeevanRavula_CEE
671_JeevanRavula_CEE
 
438_AmeeruddinMohammed
438_AmeeruddinMohammed438_AmeeruddinMohammed
438_AmeeruddinMohammed
 
566_SriramDandamudi_CEE
566_SriramDandamudi_CEE566_SriramDandamudi_CEE
566_SriramDandamudi_CEE
 
HiteshAgarwal_CPEE
HiteshAgarwal_CPEEHiteshAgarwal_CPEE
HiteshAgarwal_CPEE
 
776_AlluruMPranav_CEE
776_AlluruMPranav_CEE776_AlluruMPranav_CEE
776_AlluruMPranav_CEE
 
Miraj Vashi_CPEE
Miraj Vashi_CPEEMiraj Vashi_CPEE
Miraj Vashi_CPEE
 
797_NaveenKKapoor_CEE
797_NaveenKKapoor_CEE797_NaveenKKapoor_CEE
797_NaveenKKapoor_CEE
 
Ds shipra sharan_resume
Ds shipra sharan_resumeDs shipra sharan_resume
Ds shipra sharan_resume
 
Big data Intro - Presentation to OCHackerz Meetup Group
Big data Intro - Presentation to OCHackerz Meetup GroupBig data Intro - Presentation to OCHackerz Meetup Group
Big data Intro - Presentation to OCHackerz Meetup Group
 
Bootcamp python-1
Bootcamp python-1Bootcamp python-1
Bootcamp python-1
 
Data visualization in a Nutshell
Data visualization in a NutshellData visualization in a Nutshell
Data visualization in a Nutshell
 
Prashant resume
Prashant resumePrashant resume
Prashant resume
 
Sivrama Sarma - Profile_July_2015
Sivrama Sarma - Profile_July_2015Sivrama Sarma - Profile_July_2015
Sivrama Sarma - Profile_July_2015
 
Machine Learning Real Life Applications By Examples
Machine Learning Real Life Applications By ExamplesMachine Learning Real Life Applications By Examples
Machine Learning Real Life Applications By Examples
 
Data analytics
Data analyticsData analytics
Data analytics
 
Data Mining
Data MiningData Mining
Data Mining
 
Data Visualization
Data VisualizationData Visualization
Data Visualization
 
20170110_IOuellette_CV
20170110_IOuellette_CV20170110_IOuellette_CV
20170110_IOuellette_CV
 
How to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical FeaturesHow to Effectively Combine Numerical Features and Categorical Features
How to Effectively Combine Numerical Features and Categorical Features
 

Similar to 392_SannaReddyBharath (1)

848_VamsiKrishnaPenumadu_CEE
848_VamsiKrishnaPenumadu_CEE848_VamsiKrishnaPenumadu_CEE
848_VamsiKrishnaPenumadu_CEEVamsi Krishna
 
Data Science Course in Pune
Data Science Course in Pune Data Science Course in Pune
Data Science Course in Pune nmdfilmProduction
 
Big Data Conference
Big Data ConferenceBig Data Conference
Big Data ConferenceDataTactics
 
A Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics CorporationA Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics CorporationRich Heimann
 
Imtiaz khan data_science_analytics
Imtiaz khan data_science_analyticsImtiaz khan data_science_analytics
Imtiaz khan data_science_analyticsimtiaz khan
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxsumitkumar600840
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisYuanyuan Tian
 
Business intelligence data analytics-visualization
Business intelligence data analytics-visualizationBusiness intelligence data analytics-visualization
Business intelligence data analytics-visualizationMuthu Natarajan
 
Tips for Effective Data Science in the Enterprise
Tips for Effective Data Science in the EnterpriseTips for Effective Data Science in the Enterprise
Tips for Effective Data Science in the EnterpriseLisa Cohen
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxNagarajanG35
 
"Data Science: Insight & Analysis" and fundamental of data science?
"Data Science: Insight & Analysis" and fundamental of data science?"Data Science: Insight & Analysis" and fundamental of data science?
"Data Science: Insight & Analysis" and fundamental of data science?arjunnegi34
 
DILEEP DATA SCIERNCES PROJECT POWERPOINT PPT
DILEEP DATA SCIERNCES PROJECT POWERPOINT PPTDILEEP DATA SCIERNCES PROJECT POWERPOINT PPT
DILEEP DATA SCIERNCES PROJECT POWERPOINT PPTPatnalaVeenamadhuri
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION Elvis Muyanja
 
Business intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data VisualizationBusiness intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data VisualizationMuthu Natarajan
 
Python for Data Analysis: A Comprehensive Guide
Python for Data Analysis: A Comprehensive GuidePython for Data Analysis: A Comprehensive Guide
Python for Data Analysis: A Comprehensive GuideAivada
 
CourseWork
CourseWorkCourseWork
CourseWorksreya1
 

Similar to 392_SannaReddyBharath (1) (20)

848_VamsiKrishnaPenumadu_CEE
848_VamsiKrishnaPenumadu_CEE848_VamsiKrishnaPenumadu_CEE
848_VamsiKrishnaPenumadu_CEE
 
Data Science Course in Pune
Data Science Course in Pune Data Science Course in Pune
Data Science Course in Pune
 
Big Data Conference
Big Data ConferenceBig Data Conference
Big Data Conference
 
A Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics CorporationA Blended Approach to Analytics at Data Tactics Corporation
A Blended Approach to Analytics at Data Tactics Corporation
 
Imtiaz khan data_science_analytics
Imtiaz khan data_science_analyticsImtiaz khan data_science_analytics
Imtiaz khan data_science_analytics
 
resume_MH
resume_MHresume_MH
resume_MH
 
DataScience_RoadMap_2023.pdf
DataScience_RoadMap_2023.pdfDataScience_RoadMap_2023.pdf
DataScience_RoadMap_2023.pdf
 
What is Data Science?
What is Data Science?What is Data Science?
What is Data Science?
 
Data Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptxData Science Introduction: Concepts, lifecycle, applications.pptx
Data Science Introduction: Concepts, lifecycle, applications.pptx
 
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph AnalysisBig Data Analytics: From SQL to Machine Learning and Graph Analysis
Big Data Analytics: From SQL to Machine Learning and Graph Analysis
 
Business intelligence data analytics-visualization
Business intelligence data analytics-visualizationBusiness intelligence data analytics-visualization
Business intelligence data analytics-visualization
 
Introduction to BigData
Introduction to BigData Introduction to BigData
Introduction to BigData
 
Tips for Effective Data Science in the Enterprise
Tips for Effective Data Science in the EnterpriseTips for Effective Data Science in the Enterprise
Tips for Effective Data Science in the Enterprise
 
Data science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptxData science Nagarajan and madhav.pptx
Data science Nagarajan and madhav.pptx
 
"Data Science: Insight & Analysis" and fundamental of data science?
"Data Science: Insight & Analysis" and fundamental of data science?"Data Science: Insight & Analysis" and fundamental of data science?
"Data Science: Insight & Analysis" and fundamental of data science?
 
DILEEP DATA SCIERNCES PROJECT POWERPOINT PPT
DILEEP DATA SCIERNCES PROJECT POWERPOINT PPTDILEEP DATA SCIERNCES PROJECT POWERPOINT PPT
DILEEP DATA SCIERNCES PROJECT POWERPOINT PPT
 
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
DATA SCIENCE IS CATALYZING BUSINESS AND INNOVATION
 
Business intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data VisualizationBusiness intelligence, Data Analytics & Data Visualization
Business intelligence, Data Analytics & Data Visualization
 
Python for Data Analysis: A Comprehensive Guide
Python for Data Analysis: A Comprehensive GuidePython for Data Analysis: A Comprehensive Guide
Python for Data Analysis: A Comprehensive Guide
 
CourseWork
CourseWorkCourseWork
CourseWork
 

392_SannaReddyBharath (1)

  • 1. International School of Engineering awards Certificate of Completion to Sanna Reddy Bharath for the 288-hour program in Big Data Analytics and Optimization conducted between August 09, 2014 and December 21, 2014. This program is certified for quality of content, assessment and pedagogy by the Language Technologies Institute (LTI) of Carnegie Mellon University (CMU). LTI also provided assistance in curriculum development for this program. Dated this sixth day of February, two thousand and fifteen. Dr. Dakshinamurthy V Kolluru Dr. Sridhar Pappu President Executive VP - Academics 01CSE03/201408/392 Program details are on the back
  • 2. Mode: Classroom Teaching Topics Covered Certificate Type Certificate of Participation Assessment-based training program Professional certification Essential Business Skills for a Data Scientist Why build models or use data to run a business? What kind of models are built? Were do models not work? How do you make predictions? When does big unstructured data become important? What do you need to build an analytics group in your organization? Developing a business plan; Case analysis Planning and Thinking Skills for Architecting Data Science Solutions Thinking tools: Approximations and estimations, Geometric visualization of data and models, Probabilistic analysis of data and models, Analyzing networks and graphs: Analyzing transitions, Markov chains and unstructured data; Estimating complexity of algorithms Choosing the right models and architecting a solution: Structure and anatomy of models, Problematic data and choosing the right experimentation Sources of errors in predictive models and techniques to minimize them Interacting with technical and business teams; Case study Essential Engineering Skills in Big Data Analytics Reading from Excel, CSV and other forms; Data exploration (histograms, bar charts, box plots, line graphs and scatter graphs); Storytelling with data: The science, ggplot, bubble charts with multiple dimensions, gauge charts, tree maps, heat maps and motion charts Advanced data pre-processing using Excel Data pre-processing of structured data: R, Handling missing values, Binning, Standardization, Outliers/Noise, PCA, Type conversion Statistical Modeling for Predictive Analytics in Engineering and Business Computing the properties of an attribute: Central tendencies (Mean, Median, Mode, Range, Variance, Standard Deviation); Expectations of a Variable; Moment Generating Functions; Describing an attribute: Probability distributions (Discrete and Continuous) - Bernoulli, Geometric, Binomial, Poisson and Exponential distributions; Special emphasis on Normal distribution; Central Limit Theorem Describing the relationship between attributes: Covariance; Correlation; ChiSquare Inferential statistics: How to learn about the population from a sample and vice-versa, Sampling distributions, Confidence Intervals, Hypothesis Testing ANOVA; SPC Regression (Linear, Multivariate Regression) in forecasting; Analyzing and interpreting regression results; Logistic Regression Trend analysis and Time Series; Cyclical and Seasonal analysis; Box-Jenkins method; Smoothing; Moving averages; Auto -correlation; ARIMA – Holt-Winters method Bayesian analysis and Naïve Bayes classifier; Bayesian Belief Networks Optimization and Decision Analysis Genetic algorithms: The algorithm and the process, Representing data, Why and how do they work? Linear Programming: Graphical analysis; Sensitivity and Duality analyses; Integer and Binary programming: Applications, Problem formulation, Solving in R; Goal programming; Data development analysis; Quadratic programming Engineering Big Data with R and Hadoop Ecosystem Introduction—Big Data, Hadoop applications; Parallel and Distributed computing; Introduction to algorithms; Concurrent algorithms; Linux and Java refresher; R and Python refresher; NoSQL; HDFS; CDH4-HDFS Map Reduce: YARN Map Reduce Applications: Text Mining, Page Rank, Graph processing Hadoop ecosystem components: Pig, Hive, HBase, Sqoop, Mahout, Hama, Flume, Chukwa, Avro, Whirr, Hue, Oozie, Zookeeper R-Hadoop Text Mining, Social Network Analysis and Natural Language Processing Introduction to text mining and text pre-processing: Write a web crawler to collect data, R, Find unique words and counts, Handling number, Punctuations, Stop words, Incorrect spellings, Stemming, Lemmatization and TxD computation Unstructured vs. semi-structured data; Fundamentals of information retrieval Properties of words; Vector space models; Creating Term-Document (TxD) matrices; Similarity measures Low-level processes (Sentence Splitting; Tokenization; Part-of-Speech Tagging; Stemming; Chunking) Text classification and feature selection: How to use Naïve Bayes classifier for text classification Evaluation systems on the accuracy of text mining Sentiment Analysis Natural Language Analysis Discussion of text mining tools and applications Methods and Algorithms in Machine Learning Rule based knowledge: Logic of rules, Evaluating rules, Rule induction and Association rules Construction of Decision Trees through simplified examples; Choosing the "best" attribute at each non-leaf node; Entropy; Information Gain; Generalizing Decision Trees; Information Content and Gain Ratio; Dealing with numerical variables; Other measures of randomness; Pruning a Decision Tree; Cost as a consideration; Unwrapping Trees as rules Specialized decision trees (oblique trees) Ensemble and Hybrid models AdaBoost, Random Forests and Gradient boosting machines K-Nearest Neighbor method; Wilson editing and triangulations; K-nearest neighbors in collaborative filtering, digit recognition Motivation for Neural Networks and its applications; Perceptron and Single Layer Neural Network, and hand calculations; Learning in a Neural Net: Back propagation and conjugant gradient techniques; Application of Neural Net in Face and Digit Recognition Deep Learning techniques Connectivity models (hierarchical clustering); Centroid models (K-Means algorithm); Distribution models (Expectation maximization); Spectral clustering Linear learning machines and Kernel methods in learning VC (Vapnik-Chervonenkis) dimension; Shattering power of models Algorithm of Support Vector Machines (SVM) Communication, Ethical and IP Challenges for Analytics Professionals Why is Communication important? How to communicate effectively: Telling stories Communications issues from daily life using examples using audio, video, blogs, charts, email, etc. Seeing the big picture; Paying attention to details; Seeing things from multiple perspectives Challenges: Mix of stakeholders, Explicability of results, Visualization Guiding Principles: Clarity, Transparency, Integrity, Humility Framework for Effective Presentations; Examples of bad and good presentations Writing effective technical reports Difference between Legal and Ethical issues Challenges in current laws, regulations and fair information practices: Data protection, Intellectual property rights, Confidentiality, Contractual liability, Competition law, Licensing of Open Source software and Open Data How to handle legal, ethical and IP issues at an organization and an individual level? The “Ethics Check” questions