SlideShare a Scribd company logo
BIG DATA
BIG DATA is not the next big thing
It is Here And Now
BIG DATA
BENEFITS OF BIG DATA ANALYTICS
61%
45%
41%
38%
37%
35%
33%
30%
30%
30%
30%
29%
29%
27%
6%
4%
Better targeted social influencer marketing
More numerous and accurate business insights
Segmentation of customer base
Recognition of sales and market opportunities
Automated decisions for real-time processes
Definitions of churn and other customer behaviors
Detection of fraud
Greater leverage and ROI for big data
Quantification of risks
Trending for market sentiments
Understanding of business change
Better planning and forecasting
Identification of root causes of cost
Understanding consumer behavior from clickstreams
Manufacturing yield improvements
Other
•Profit growth
than their
competition
•CIOs have
visionary plans
that include
business
analytics
•Leaders don’t
have information
they need for
critical decisions
•More likely to
outperform their
competition
2X 1/3
2X83%
Big Data is about getting valuable information hidden in data
we weren’t able to analyze by traditional approaches
BENEFITS OF BIG DATA ANALYTICS
3V of Big Data
Structured
Unstructured
Semi structured
All the above
Batch
Near time
Real time
Streams
Terabytes
Records
Transactions
Tables, files
Volume
Velocity
Variety
Source: IBM Website
Typical Applications of Big Data
Sentiment
Analysis
Text Analytics
Volume
Trending
Influencer
Identification
Predictive
Analytics
In-Memory
Analytics
Massively
Scalable
Architectures
Forecasting estimating quarterly sales, product
demand. Neural networks can assess how likely it is that
a credit card transaction is being performed by
the cardholder.
Response models can predict how likely a
particular person is to respond to a particular
marketing offer, based on the success or failure
of offers made in the past.
Predictive scorecards can determine the
likelihood that someone will fail to make
payments on his or her loan in the coming
year.
MAJOR TOOLS USED
FOR BIG DATA ANALYTICS
Apache Hadoop is a
framework that allows for
the distributed processing
of large data sets across
clusters of computers
using simple programming
models.
Scalable, reliable, fault
tolerant
Provides storage layer and
execution layer
Heart of Hadoop
A programming paradigm
that allows for massive
scalability across
hundreds or thousands of
servers in a Hadoop
cluster
World’s most widely used
statistics programming
language
designed to handle big data
through a high-performance
disk-based data store called
XDF and high performance
computing across large
clusters
An environment for machine
learning, data mining, text
mining, predictive analysis and
business analytics
Provides data loading and
transformation (Extract,
transform, load ETL) data
preprocessing and
visualization, modelling,
evaluation, and deployment
It is written in the Java
programming language
MAJOR TOOLS USED
FOR BIG DATA ANALYTICS
BIG DATA ANALYTICS –
WHO ARE USING IT?
Science and Research
Government
Data Analytics in Education
A Case Study
The Study
• This study examines whether specific instructional
strategies are associated with incidence of off-task
behavior in elementary school children.
These findings can begin to form a foundation for development of
research-based guidelines for instructional design aimed to
optimize focused attention in classroom settings.
The Purpose
Methodology
– 22 classrooms participated
– 5 local charter schools
– 5 grade levels (K-4)
– Average class size: 21 students (10 males, 11 females)
– Each classroom was observed four times (total 84 observations)
– Each observation lasted for 1 hr apprx.
On-task: If the child was looking at the teacher (or classroom assistant), the
instructional activity, and/or the relevant instructional materials, they were
categorized as on-task.
Off-task: If the child was looking elsewhere, they were categorized as off-task
Off-task behaviour
• Self-distraction
• Peer distraction
• Environmental distraction
• Supplies
• Walking
• Other
• Unknown
Data Analysis: Variables
• Predictor Variables
– Student characteristics
• Gender
• Grade
– Instructional design
• Instructional format
– Individual work
– Small group or partner work
– whole-group instruction at desks
– whole-group instruction while sitting on the carpet
– dancing, and
– Testing
• Duration of Instructional format
Results
Data Analysis:
Approach
•Regression tree analysis than linear regression
•Resultant models were evaluated using six-fold
student level cross-validation.
Results
Conclusion
• Instructional format and instructional duration
both are related to the overall rate of off-task
behaviour.
• Certain types of instructional format are associated
with more on-task behavior than others.
• Instructional activities that take place individually
or at the students’ desks may be less engaging or
motivating than small-group activities
• Better attention in blocks of activities than an
activity for a longer duration.
Big Data Analytics - It is here and now!

More Related Content

What's hot

Data Science
Data ScienceData Science
Data Science
Amit Singh
 
Data Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itData Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using it
Domino Data Lab
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
VijayMohan Vasu
 
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
Daniel Westzaan
 
Predictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data miningPredictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data mining
SAS Asia Pacific
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive Industry
Domino Data Lab
 
Data Science Salon: Building a Data Science Culture
Data Science Salon: Building a Data Science CultureData Science Salon: Building a Data Science Culture
Data Science Salon: Building a Data Science Culture
Formulatedby
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
Rising Media, Inc.
 
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Domino Data Lab
 
Operational analytics overview
Operational analytics overviewOperational analytics overview
Operational analytics overview
pallavi pentapati
 
Business Analytics and Big Data
Business Analytics and Big DataBusiness Analytics and Big Data
Business Analytics and Big DataAbhishek Kapoor
 
The Business Analytics Value Proposition
The Business Analytics Value PropositionThe Business Analytics Value Proposition
The Business Analytics Value Proposition
Eric Stephens
 
Course - Machine Learning Basics with R
Course - Machine Learning Basics with R Course - Machine Learning Basics with R
Course - Machine Learning Basics with R
Persontyle
 
940 diamond sponsor sengupta
940 diamond sponsor sengupta940 diamond sponsor sengupta
940 diamond sponsor sengupta
Rising Media, Inc.
 
Scope and Career in Analytics
Scope and Career in AnalyticsScope and Career in Analytics
Scope and Career in Analytics
International School of Engineering
 
Leveraging Your Data Report
Leveraging Your Data ReportLeveraging Your Data Report
Leveraging Your Data Report
NAED_Org
 
Predictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advicePredictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and adviceThe Marketing Distillery
 
Hortonworks laurie maclachlan
Hortonworks laurie maclachlanHortonworks laurie maclachlan
Hortonworks laurie maclachlan
BigDataExpo
 
Ai2020 ai and or final
Ai2020 ai and or finalAi2020 ai and or final
Ai2020 ai and or final
Richard Vidgen
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
ANAND PRAKASH
 

What's hot (20)

Data Science
Data ScienceData Science
Data Science
 
Data Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using itData Quality Analytics: Understanding what is in your data, before using it
Data Quality Analytics: Understanding what is in your data, before using it
 
Data science & data scientist
Data science & data scientistData science & data scientist
Data science & data scientist
 
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
PoT - probeer de mogelijkheden van datamining zelf uit 30-10-2014
 
Predictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data miningPredictive Analytics: Advanced techniques in data mining
Predictive Analytics: Advanced techniques in data mining
 
Leveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive IndustryLeveraging Data Science in the Automotive Industry
Leveraging Data Science in the Automotive Industry
 
Data Science Salon: Building a Data Science Culture
Data Science Salon: Building a Data Science CultureData Science Salon: Building a Data Science Culture
Data Science Salon: Building a Data Science Culture
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
 
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...Moving Data Science from an Event to A Program: Considerations in Creating Su...
Moving Data Science from an Event to A Program: Considerations in Creating Su...
 
Operational analytics overview
Operational analytics overviewOperational analytics overview
Operational analytics overview
 
Business Analytics and Big Data
Business Analytics and Big DataBusiness Analytics and Big Data
Business Analytics and Big Data
 
The Business Analytics Value Proposition
The Business Analytics Value PropositionThe Business Analytics Value Proposition
The Business Analytics Value Proposition
 
Course - Machine Learning Basics with R
Course - Machine Learning Basics with R Course - Machine Learning Basics with R
Course - Machine Learning Basics with R
 
940 diamond sponsor sengupta
940 diamond sponsor sengupta940 diamond sponsor sengupta
940 diamond sponsor sengupta
 
Scope and Career in Analytics
Scope and Career in AnalyticsScope and Career in Analytics
Scope and Career in Analytics
 
Leveraging Your Data Report
Leveraging Your Data ReportLeveraging Your Data Report
Leveraging Your Data Report
 
Predictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advicePredictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advice
 
Hortonworks laurie maclachlan
Hortonworks laurie maclachlanHortonworks laurie maclachlan
Hortonworks laurie maclachlan
 
Ai2020 ai and or final
Ai2020 ai and or finalAi2020 ai and or final
Ai2020 ai and or final
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 

Viewers also liked

Supply Chain Analytics with Simulation
Supply Chain Analytics with SimulationSupply Chain Analytics with Simulation
Supply Chain Analytics with Simulation
Steve Haekler
 
Big Data
Big DataBig Data
Big Data
NGDATA
 
Big data presentation
Big data presentationBig data presentation
Big data presentation
Chinh Vo Wili
 
Innovations Manifesto
Innovations ManifestoInnovations Manifesto
Innovations Manifesto
Peter Dawson
 
Big Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreBig Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and more
Softweb Solutions
 
Big Data Overview
Big Data OverviewBig Data Overview
Big Data Overview
Keeyong Han
 
Innovaction point for busines 1 Slide, wow
Innovaction point for busines 1 Slide, wowInnovaction point for busines 1 Slide, wow
Innovaction point for busines 1 Slide, wow
B2B Group and AMICHI
 
Concepts of Intelligence Led Policing
Concepts of Intelligence Led PolicingConcepts of Intelligence Led Policing
Concepts of Intelligence Led Policinggroundskeeper20
 
Assignment 3 Major Essay
Assignment 3   Major EssayAssignment 3   Major Essay
Assignment 3 Major Essay
Julie Ambury
 
Pivotal role of Intelligence analysts in intelligence-led-policing
Pivotal role of Intelligence analysts in intelligence-led-policingPivotal role of Intelligence analysts in intelligence-led-policing
Pivotal role of Intelligence analysts in intelligence-led-policing
Dalene
 
Introduction to Ethics of Big Data
Introduction to Ethics of Big DataIntroduction to Ethics of Big Data
Introduction to Ethics of Big Data28 Burnside
 
Seven principles of outrageous customer service
Seven principles of outrageous customer serviceSeven principles of outrageous customer service
Seven principles of outrageous customer serviceEugene Moreau
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
Hortonworks
 
Five ways a Technology refresh strategy can lead to a Successful Digital Tran...
Five ways a Technology refresh strategy can lead to a Successful Digital Tran...Five ways a Technology refresh strategy can lead to a Successful Digital Tran...
Five ways a Technology refresh strategy can lead to a Successful Digital Tran...
Rahul Singh
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics CapabilityBala Iyer
 
Innovations :Technology unfolding
Innovations :Technology unfolding Innovations :Technology unfolding
Innovations :Technology unfolding
Centre for Application of renewable Energy
 
How to Create Compelling Value Propositions That Turns Prospects into Customers
How to Create Compelling Value Propositions That Turns Prospects into CustomersHow to Create Compelling Value Propositions That Turns Prospects into Customers
How to Create Compelling Value Propositions That Turns Prospects into Customers
Kissmetrics on SlideShare
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Global Business Solutions SME
 

Viewers also liked (20)

Supply Chain Analytics with Simulation
Supply Chain Analytics with SimulationSupply Chain Analytics with Simulation
Supply Chain Analytics with Simulation
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data
Big DataBig Data
Big Data
 
Big data presentation
Big data presentationBig data presentation
Big data presentation
 
Innovations Manifesto
Innovations ManifestoInnovations Manifesto
Innovations Manifesto
 
Big Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreBig Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and more
 
Big Data Overview
Big Data OverviewBig Data Overview
Big Data Overview
 
Innovaction point for busines 1 Slide, wow
Innovaction point for busines 1 Slide, wowInnovaction point for busines 1 Slide, wow
Innovaction point for busines 1 Slide, wow
 
Concepts of Intelligence Led Policing
Concepts of Intelligence Led PolicingConcepts of Intelligence Led Policing
Concepts of Intelligence Led Policing
 
Assignment 3 Major Essay
Assignment 3   Major EssayAssignment 3   Major Essay
Assignment 3 Major Essay
 
Pivotal role of Intelligence analysts in intelligence-led-policing
Pivotal role of Intelligence analysts in intelligence-led-policingPivotal role of Intelligence analysts in intelligence-led-policing
Pivotal role of Intelligence analysts in intelligence-led-policing
 
Introduction to Ethics of Big Data
Introduction to Ethics of Big DataIntroduction to Ethics of Big Data
Introduction to Ethics of Big Data
 
Seven principles of outrageous customer service
Seven principles of outrageous customer serviceSeven principles of outrageous customer service
Seven principles of outrageous customer service
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 
Five ways a Technology refresh strategy can lead to a Successful Digital Tran...
Five ways a Technology refresh strategy can lead to a Successful Digital Tran...Five ways a Technology refresh strategy can lead to a Successful Digital Tran...
Five ways a Technology refresh strategy can lead to a Successful Digital Tran...
 
Building the Analytics Capability
Building the Analytics CapabilityBuilding the Analytics Capability
Building the Analytics Capability
 
Innovations :Technology unfolding
Innovations :Technology unfolding Innovations :Technology unfolding
Innovations :Technology unfolding
 
How to Create Compelling Value Propositions That Turns Prospects into Customers
How to Create Compelling Value Propositions That Turns Prospects into CustomersHow to Create Compelling Value Propositions That Turns Prospects into Customers
How to Create Compelling Value Propositions That Turns Prospects into Customers
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 

Similar to Big Data Analytics - It is here and now!

Presentation For Gene S Revision 3
Presentation For Gene S Revision 3Presentation For Gene S Revision 3
Presentation For Gene S Revision 3WSU Cougars
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
CongChen35
 
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
eMadrid network
 
Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)
Wolfgang Greller
 
Learning Analytics
Learning AnalyticsLearning Analytics
Learning Analytics
James Little
 
Data fluency
Data fluencyData fluency
Data fluency
Martha Horler
 
Student Activity Hub community Meeting 10-25-2017
Student Activity Hub community Meeting 10-25-2017Student Activity Hub community Meeting 10-25-2017
Student Activity Hub community Meeting 10-25-2017
Brett Pollak
 
Learning Analytics – From Reactive to Predictive
Learning Analytics – From Reactive to PredictiveLearning Analytics – From Reactive to Predictive
Learning Analytics – From Reactive to Predictive
LearningCafe
 
Total Cost of Ownership, User Acceptance and *Perceived Success of ERP Software
Total Cost of Ownership, User Acceptance and *Perceived Success of ERP SoftwareTotal Cost of Ownership, User Acceptance and *Perceived Success of ERP Software
Total Cost of Ownership, User Acceptance and *Perceived Success of ERP Softwaremfryling
 
Data Fluency - AUA Conference
Data Fluency - AUA ConferenceData Fluency - AUA Conference
Data Fluency - AUA Conference
Martha Horler
 
Modeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based DecisionsModeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based Decisions
Albert Simard
 
Learning Analytics for SRL
Learning Analytics for SRLLearning Analytics for SRL
Learning Analytics for SRL
Wolfgang Greller
 
Learning Analytics for SRL
Learning Analytics for SRLLearning Analytics for SRL
Learning Analytics for SRL
slideshowproject
 
Implementation science and learning health systems: Connecting the dots
Implementation science and learning health systems:  Connecting the dotsImplementation science and learning health systems:  Connecting the dots
Implementation science and learning health systems: Connecting the dots
Department of Learning Health Sciences, University of Michigan Medical School
 
Open Education 2011: Openness and Learning Analytics
Open Education 2011: Openness and Learning AnalyticsOpen Education 2011: Openness and Learning Analytics
Open Education 2011: Openness and Learning AnalyticsJohn Rinderle
 
DC Salesforce1 Tour Data Governance Lunch Best Practices deck
DC Salesforce1 Tour Data Governance Lunch Best Practices deckDC Salesforce1 Tour Data Governance Lunch Best Practices deck
DC Salesforce1 Tour Data Governance Lunch Best Practices deck
Beth Fitzpatrick
 
Qualitative and quantitative analysis
Qualitative and quantitative analysisQualitative and quantitative analysis
Qualitative and quantitative analysis
Nellie Deutsch (Ed.D)
 
Images of Information Systems
Images of Information SystemsImages of Information Systems
Images of Information Systems
guestead93f3
 
Data carpentry replicathon_2017-03-24
Data carpentry replicathon_2017-03-24Data carpentry replicathon_2017-03-24
Data carpentry replicathon_2017-03-24
tracykteal
 
EDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINALEDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINALEllen Wagner
 

Similar to Big Data Analytics - It is here and now! (20)

Presentation For Gene S Revision 3
Presentation For Gene S Revision 3Presentation For Gene S Revision 3
Presentation For Gene S Revision 3
 
The Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer DatasetThe Simulacrum, a Synthetic Cancer Dataset
The Simulacrum, a Synthetic Cancer Dataset
 
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
27_06_2019 Wolfgang Greller, from University of Teacher Education (Viena), on...
 
Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)Learning Analytics for Self-Regulated Learning (2019)
Learning Analytics for Self-Regulated Learning (2019)
 
Learning Analytics
Learning AnalyticsLearning Analytics
Learning Analytics
 
Data fluency
Data fluencyData fluency
Data fluency
 
Student Activity Hub community Meeting 10-25-2017
Student Activity Hub community Meeting 10-25-2017Student Activity Hub community Meeting 10-25-2017
Student Activity Hub community Meeting 10-25-2017
 
Learning Analytics – From Reactive to Predictive
Learning Analytics – From Reactive to PredictiveLearning Analytics – From Reactive to Predictive
Learning Analytics – From Reactive to Predictive
 
Total Cost of Ownership, User Acceptance and *Perceived Success of ERP Software
Total Cost of Ownership, User Acceptance and *Perceived Success of ERP SoftwareTotal Cost of Ownership, User Acceptance and *Perceived Success of ERP Software
Total Cost of Ownership, User Acceptance and *Perceived Success of ERP Software
 
Data Fluency - AUA Conference
Data Fluency - AUA ConferenceData Fluency - AUA Conference
Data Fluency - AUA Conference
 
Modeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based DecisionsModeling Framework to Support Evidence-Based Decisions
Modeling Framework to Support Evidence-Based Decisions
 
Learning Analytics for SRL
Learning Analytics for SRLLearning Analytics for SRL
Learning Analytics for SRL
 
Learning Analytics for SRL
Learning Analytics for SRLLearning Analytics for SRL
Learning Analytics for SRL
 
Implementation science and learning health systems: Connecting the dots
Implementation science and learning health systems:  Connecting the dotsImplementation science and learning health systems:  Connecting the dots
Implementation science and learning health systems: Connecting the dots
 
Open Education 2011: Openness and Learning Analytics
Open Education 2011: Openness and Learning AnalyticsOpen Education 2011: Openness and Learning Analytics
Open Education 2011: Openness and Learning Analytics
 
DC Salesforce1 Tour Data Governance Lunch Best Practices deck
DC Salesforce1 Tour Data Governance Lunch Best Practices deckDC Salesforce1 Tour Data Governance Lunch Best Practices deck
DC Salesforce1 Tour Data Governance Lunch Best Practices deck
 
Qualitative and quantitative analysis
Qualitative and quantitative analysisQualitative and quantitative analysis
Qualitative and quantitative analysis
 
Images of Information Systems
Images of Information SystemsImages of Information Systems
Images of Information Systems
 
Data carpentry replicathon_2017-03-24
Data carpentry replicathon_2017-03-24Data carpentry replicathon_2017-03-24
Data carpentry replicathon_2017-03-24
 
EDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINALEDUCA Leveraging Analytics FINAL
EDUCA Leveraging Analytics FINAL
 

More from Farhan Khan

Multimodal Transport in an Indian Perspective
Multimodal Transport in an Indian PerspectiveMultimodal Transport in an Indian Perspective
Multimodal Transport in an Indian Perspective
Farhan Khan
 
Development of Cold Chain Supply in India
Development of Cold Chain Supply in IndiaDevelopment of Cold Chain Supply in India
Development of Cold Chain Supply in India
Farhan Khan
 
Atlantic computer case analysis
Atlantic computer case analysisAtlantic computer case analysis
Atlantic computer case analysisFarhan Khan
 
MIT Maine Medicaid
MIT Maine MedicaidMIT Maine Medicaid
MIT Maine Medicaid
Farhan Khan
 
Say no to smoke
Say no to smokeSay no to smoke
Say no to smoke
Farhan Khan
 
Samarthan Sehore
Samarthan SehoreSamarthan Sehore
Samarthan Sehore
Farhan Khan
 

More from Farhan Khan (6)

Multimodal Transport in an Indian Perspective
Multimodal Transport in an Indian PerspectiveMultimodal Transport in an Indian Perspective
Multimodal Transport in an Indian Perspective
 
Development of Cold Chain Supply in India
Development of Cold Chain Supply in IndiaDevelopment of Cold Chain Supply in India
Development of Cold Chain Supply in India
 
Atlantic computer case analysis
Atlantic computer case analysisAtlantic computer case analysis
Atlantic computer case analysis
 
MIT Maine Medicaid
MIT Maine MedicaidMIT Maine Medicaid
MIT Maine Medicaid
 
Say no to smoke
Say no to smokeSay no to smoke
Say no to smoke
 
Samarthan Sehore
Samarthan SehoreSamarthan Sehore
Samarthan Sehore
 

Recently uploaded

Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
vaibhavrinwa19
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
joachimlavalley1
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
kaushalkr1407
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 

Recently uploaded (20)

Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Acetabularia Information For Class 9 .docx
Acetabularia Information For Class 9  .docxAcetabularia Information For Class 9  .docx
Acetabularia Information For Class 9 .docx
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
Additional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdfAdditional Benefits for Employee Website.pdf
Additional Benefits for Employee Website.pdf
 
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 

Big Data Analytics - It is here and now!

  • 1. BIG DATA BIG DATA is not the next big thing It is Here And Now
  • 3. BENEFITS OF BIG DATA ANALYTICS 61% 45% 41% 38% 37% 35% 33% 30% 30% 30% 30% 29% 29% 27% 6% 4% Better targeted social influencer marketing More numerous and accurate business insights Segmentation of customer base Recognition of sales and market opportunities Automated decisions for real-time processes Definitions of churn and other customer behaviors Detection of fraud Greater leverage and ROI for big data Quantification of risks Trending for market sentiments Understanding of business change Better planning and forecasting Identification of root causes of cost Understanding consumer behavior from clickstreams Manufacturing yield improvements Other
  • 4. •Profit growth than their competition •CIOs have visionary plans that include business analytics •Leaders don’t have information they need for critical decisions •More likely to outperform their competition 2X 1/3 2X83% Big Data is about getting valuable information hidden in data we weren’t able to analyze by traditional approaches BENEFITS OF BIG DATA ANALYTICS
  • 5. 3V of Big Data Structured Unstructured Semi structured All the above Batch Near time Real time Streams Terabytes Records Transactions Tables, files Volume Velocity Variety
  • 7. Typical Applications of Big Data Sentiment Analysis Text Analytics Volume Trending Influencer Identification Predictive Analytics In-Memory Analytics Massively Scalable Architectures Forecasting estimating quarterly sales, product demand. Neural networks can assess how likely it is that a credit card transaction is being performed by the cardholder. Response models can predict how likely a particular person is to respond to a particular marketing offer, based on the success or failure of offers made in the past. Predictive scorecards can determine the likelihood that someone will fail to make payments on his or her loan in the coming year.
  • 8. MAJOR TOOLS USED FOR BIG DATA ANALYTICS Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. Scalable, reliable, fault tolerant Provides storage layer and execution layer Heart of Hadoop A programming paradigm that allows for massive scalability across hundreds or thousands of servers in a Hadoop cluster
  • 9. World’s most widely used statistics programming language designed to handle big data through a high-performance disk-based data store called XDF and high performance computing across large clusters An environment for machine learning, data mining, text mining, predictive analysis and business analytics Provides data loading and transformation (Extract, transform, load ETL) data preprocessing and visualization, modelling, evaluation, and deployment It is written in the Java programming language MAJOR TOOLS USED FOR BIG DATA ANALYTICS
  • 10. BIG DATA ANALYTICS – WHO ARE USING IT? Science and Research Government
  • 11. Data Analytics in Education A Case Study
  • 12. The Study • This study examines whether specific instructional strategies are associated with incidence of off-task behavior in elementary school children. These findings can begin to form a foundation for development of research-based guidelines for instructional design aimed to optimize focused attention in classroom settings. The Purpose
  • 13. Methodology – 22 classrooms participated – 5 local charter schools – 5 grade levels (K-4) – Average class size: 21 students (10 males, 11 females) – Each classroom was observed four times (total 84 observations) – Each observation lasted for 1 hr apprx. On-task: If the child was looking at the teacher (or classroom assistant), the instructional activity, and/or the relevant instructional materials, they were categorized as on-task. Off-task: If the child was looking elsewhere, they were categorized as off-task
  • 14. Off-task behaviour • Self-distraction • Peer distraction • Environmental distraction • Supplies • Walking • Other • Unknown
  • 15. Data Analysis: Variables • Predictor Variables – Student characteristics • Gender • Grade – Instructional design • Instructional format – Individual work – Small group or partner work – whole-group instruction at desks – whole-group instruction while sitting on the carpet – dancing, and – Testing • Duration of Instructional format
  • 16. Results Data Analysis: Approach •Regression tree analysis than linear regression •Resultant models were evaluated using six-fold student level cross-validation.
  • 18. Conclusion • Instructional format and instructional duration both are related to the overall rate of off-task behaviour. • Certain types of instructional format are associated with more on-task behavior than others. • Instructional activities that take place individually or at the students’ desks may be less engaging or motivating than small-group activities • Better attention in blocks of activities than an activity for a longer duration.

Editor's Notes

  1. Big science[edit] The Large Hadron Collider experiments represent about 150 million sensors delivering data 40 million times per second. There are nearly 600 million collisions per second. After filtering and refraining from recording more than 99.999% of these streams, there are 100 collisions of interest per second.[27][28][29] As a result, only working with less than 0.001% of the sensor stream data, the data flow from all four LHC experiments represents 25 petabytes annual rate before replication (as of 2012). This becomes nearly 200 petabytes after replication. If all sensor data were to be recorded in LHC, the data flow would be extremely hard to work with. The data flow would exceed 150 million petabytes annual rate, or nearly 500exabytes per day, before replication. To put the number in perspective, this is equivalent to 500 quintillion (5×1020) bytes per day, almost 200 times higher than all the other sources combined in the world Science and Research: Decoding the human genome originally took 10 years to process, now it can be achieved in less than a week : the DNA sequencers have divided the sequencing cost by 10,000 in the last ten years, which is 100 times faster than the reduction in cost predicted by Moore's Law.[30] Government[edit] In 2012, the Obama administration announced the Big Data Research and Development Initiative, which explored how big data could be used to address important problems faced by the government.[46] The initiative was composed of 84 different big data programs spread across six departments.[47] Big data analysis played a large role in Barack Obama's successful 2012 re-election campaign.[48] The United States Federal Government owns six of the ten most powerful supercomputers in the world.[49] Private sector[edit] Bus wrapped with SAP Big data parked outside IDF13. eBay.com uses two data warehouses at 7.5 petabytes and 40PB as well as a 40PB Hadoop cluster for search, consumer recommendations, and merchandising. Inside eBay’s 90PB data warehouse Amazon.com handles millions of back-end operations every day, as well as queries from more than half a million third-party sellers. The core technology that keeps Amazon running is Linux-based and as of 2005 they had the world’s three largest Linux databases, with capacities of 7.8 TB, 18.5 TB, and 24.7 TB.[53] Walmart handles more than 1 million customer transactions every hour, which is imported into databases estimated to contain more than 2.5 petabytes (2560 terabytes) of data – the equivalent of 167 times the information contained in all the books in the US Library of Congress.[5] Facebook handles 50 billion photos from its user base.[54]
  2. This study examines whether specific instructional strategies are associated with incidence of off-task behavior in elementary school children, both in terms of the overall amount of off-task behavior, and the form which off-task behavior takes. Towards this goal we recorded patterns of attention allocation in elementary school students during a variety of instructional activities (e.g., whole-group instruction, small-group work, etc.).