SlideShare a Scribd company logo
1 of 24
Bilwa Upadhye - FPM03
Chetna Chauhan – FPM04
Leon Dukkipati – PGP0686
Manzoor Ul Akram – FPM05
Soumya Soni – PGP06105
IIM Rohtak
• The exponential growth and availability of data, both
structured and unstructured.
Structured Data
• Data that resides in a fixed field within a record or file is
called structured data. This includes data contained in
relational databases and spreadsheets.
Unstructured Data
• Text and multimedia content like e-mail messages, word
processing documents, videos, photos, audio files,
presentations, webpages and many other kinds of
business documents. The data doesn't fit neatly in a
database
• 80 – 90% data in any organization is unstructured12/6/2015 Big Data 2
• eBay – 100 PB
• Google – 100 PB
• Facebook - 600 PB
• Twitter – 100 TB
• NSA – 29 TB
• 90% of the data in the world today has been created in
the last two years alone
Examples :
• Sensors used to gather climate information, posts to
social media sites, digital pictures and videos, purchase
transaction records, cell phone GPS signals, UID
information, patient information etc.
Source : http://wikibon.org12/6/2015 Big Data 3
12/6/2015 Big Data 4
• Organization, administration and governance of large
volumes of both structured and unstructured data
• Tools used:
Hadoop, NoSQL, Platfora
• Big data management is important to business, and
society, because more data may lead to more accurate
analyses.
12/6/2015 Big Data 5
RDBMS
• Structured data
• ER model defined
perfectly
• Less amount of data
• Relational data base
management system
• Applications: IIM Rohtak
Big data management
technologies
• Unstructured data, semi-
structured data,
unstructured data
• No perfect ER model
• Large amount of data
• Node based flat structure
• Healthcare, retail, Google,
IBM
12/6/2015 Big Data 6
• Open source software framework – JAVA
• Fundamental assumption
• Storage part: HDFS ( Hadoop distributed file system)
• Processing part: Map reduce
• Working of Hadoop
12/6/2015 Big Data 8
Map reduce
divides
application
into blocks
HDFS
creates
multiple
replicas of
data blocks
HDFS places
data blocks
on different
nodes around
cluster
Map reduce
accesses
data
Map reduce
processes
data
12/6/2015 Big Data 9
• Non SQL database
• Provides mechanism for storage and retrieval of data
• Horizontal scaling
Platfora
• Software works with open source software framework
Hadoop
• When user queries database, software delivers answer in
real time
12/6/2015 Big Data 10
• Highly fault - tolerant and is designed to be deployed on
low-cost hardware
• Provides high throughput access to application data and
is suitable for applications that have large data sets
• Relaxes a few POSIX requirements to enable streaming
access to file system data
12/6/2015 Big Data 11
Large: Thousands of
server machines
Replicated data
blocks
Failure is norm
Fast detection and
recovery of faults
Properties
of HDFS
12/6/2015 Big Data 12
• Programming model for processing large data sets
• Developed by Google for internal search applications
• Currently used by Yahoo, Amazon, IBM etc
• The run time partitions the input and provides it to
different Map instances
12/6/2015 Big Data 13
Partitioning
the input
Mapping of
instances
Map (key,
value) 
(key’, value’)
Collection
of the
(key’,
value’)
pairs
Distribution
to reduce
functions
Each
reduce
produces
single file
output
12/6/2015 Big Data 14
15
Users only provide the “Map” and “Reduce” functions
12/6/2015 Big Data
• $300 billion potential
annual value to US
health care.
• $600 billion potential
annual consumer
surplus from using
personal location data.
• 60% potential in
retailers’ operating
margins.
• Gaining attraction
• Huge market opportunities for IT services
(82.9% of revenues) and analytics firms
(17.1 % )
• Current market size is $200 million. By 2015 $1
billion
• The opportunity for Indian service providers lies
in offering services around Big Data
implementation and analytics for global
multinationals
18
Big Data Challenges
• Hard to quantify
value to the
enterprise
• Data Scientists
roles are difficult
to fill
• Difficult to
design effective
visualization and
reporting of
new data sets
• Goal of improving retention and
graduation rates
• Developing a more pro-active
relationship with students to help
them be more successful during
and after graduation
• Approach:
1. Online Applications for
Education
2. Forums
3. Help desk
4. Student Demographic and
Operational Information
12/6/2015 Big Data 21
Narendra Modi wins 2014 Lok Sabha Elections
12/6/2015 Big Data 22
What makes Modi’s use of big data so impressive
Volume of Data : 814 million voters
Variety of data – 12 different languages
-- 900,000 PDF’s amounting
-- 25 million pages
-- heterogeneous, non-uniform data
For what purpose did he use Big Data ?
-> To drive donations, enroll volunteers, and improve
the effectiveness of everything from door knocks…to social media
BJP’s website, planted cookies on all computers that visited its site - for customised
advertisements.
#IndiaVotes
Source : dataconomy.com
• http://dataconomy.com/narendra-modi-first-prime-
minister-use-big-data-analytics/
• http://dataconomy.com/narendra-modi-first-prime-
minister-use-big-data-analytics/
• http://blog.pivotal.io/data-science-pivotal/case-
studies/big-data-in-education-analyzing-student-clusters-
to-influence-success-and-retention
12/6/2015 Big Data 23
12/6/2015 Big Data 24

More Related Content

What's hot

Big data introduction
Big data introductionBig data introduction
Big data introductionChirag Ahuja
 
Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA Zeeshan Khan
 
Analysis of big data in pandemic case
Analysis of big data in pandemic case Analysis of big data in pandemic case
Analysis of big data in pandemic case Muh Saleh
 
The importance of data
The importance of dataThe importance of data
The importance of dataAPNIC
 
Big data ppt
Big data pptBig data ppt
Big data pptYash Raj
 
Hadoop Training Tutorial for Freshers
Hadoop Training Tutorial for FreshersHadoop Training Tutorial for Freshers
Hadoop Training Tutorial for Freshersrajkamaltibacademy
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research reportJULIO GONZALEZ SANZ
 
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...yashbheda
 
Big Data Projects Research Ideas
Big Data Projects Research IdeasBig Data Projects Research Ideas
Big Data Projects Research IdeasMatlab Simulation
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data AnalyticsTUSHAR GARG
 
Bigdata Analytics using Hadoop
Bigdata Analytics using HadoopBigdata Analytics using Hadoop
Bigdata Analytics using HadoopNagamani Gurram
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A reviewShilpa Soi
 
000 introduction to big data analytics 2021
000   introduction to big data analytics  2021000   introduction to big data analytics  2021
000 introduction to big data analytics 2021Dendej Sawarnkatat
 
Big data and its applications
Big data and its applicationsBig data and its applications
Big data and its applicationsali easazadeh
 

What's hot (20)

Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big data introduction
Big data introductionBig data introduction
Big data introduction
 
Introduction to BIG DATA
Introduction to BIG DATA Introduction to BIG DATA
Introduction to BIG DATA
 
Big Data
Big DataBig Data
Big Data
 
View on big data technologies
View on big data technologiesView on big data technologies
View on big data technologies
 
Analysis of big data in pandemic case
Analysis of big data in pandemic case Analysis of big data in pandemic case
Analysis of big data in pandemic case
 
The importance of data
The importance of dataThe importance of data
The importance of data
 
Introduction to BigData
Introduction to BigData Introduction to BigData
Introduction to BigData
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data
Big dataBig data
Big data
 
Hadoop Training Tutorial for Freshers
Hadoop Training Tutorial for FreshersHadoop Training Tutorial for Freshers
Hadoop Training Tutorial for Freshers
 
Big data analytics, research report
Big data analytics, research reportBig data analytics, research report
Big data analytics, research report
 
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
Big data (4Vs,history,concept,algorithm) analysis and applications #bigdata #...
 
Big Data Projects Research Ideas
Big Data Projects Research IdeasBig Data Projects Research Ideas
Big Data Projects Research Ideas
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Bigdata Analytics using Hadoop
Bigdata Analytics using HadoopBigdata Analytics using Hadoop
Bigdata Analytics using Hadoop
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
 
000 introduction to big data analytics 2021
000   introduction to big data analytics  2021000   introduction to big data analytics  2021
000 introduction to big data analytics 2021
 
Big data and its applications
Big data and its applicationsBig data and its applications
Big data and its applications
 
Big Data & Data Mining
Big Data & Data MiningBig Data & Data Mining
Big Data & Data Mining
 

Similar to Big data management

02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataSpringPeople
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigManish Chopra
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Big data seminor
Big data seminorBig data seminor
Big data seminorberasrujana
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxdickonsondorris
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big dataVedanand Singh
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPDr Geetha Mohan
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - IntroductionTomy Rhymond
 
Big data
Big dataBig data
Big dataRiya
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 

Similar to Big data management (20)

02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Big data and analytics
Big data and analyticsBig data and analytics
Big data and analytics
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data
Big dataBig data
Big data
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
Pres_Big Data for Finance_vsaini
Pres_Big Data for Finance_vsainiPres_Big Data for Finance_vsaini
Pres_Big Data for Finance_vsaini
 
Big data
Big dataBig data
Big data
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Big data seminor
Big data seminorBig data seminor
Big data seminor
 
Big data in telecom
Big data in telecomBig data in telecom
Big data in telecom
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big data
 
INTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOPINTRODUCTION TO BIG DATA AND HADOOP
INTRODUCTION TO BIG DATA AND HADOOP
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Big data with Hadoop - Introduction
Big data with Hadoop - IntroductionBig data with Hadoop - Introduction
Big data with Hadoop - Introduction
 
Big data
Big dataBig data
Big data
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 

Recently uploaded

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknowmakika9823
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationBoston Institute of Analytics
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationshipsccctableauusergroup
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一ffjhghh
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 

Recently uploaded (20)

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service LucknowAminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
Aminabad Call Girl Agent 9548273370 , Call Girls Service Lucknow
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Predicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project PresentationPredicting Employee Churn: A Data-Driven Approach Project Presentation
Predicting Employee Churn: A Data-Driven Approach Project Presentation
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships04242024_CCC TUG_Joins and Relationships
04242024_CCC TUG_Joins and Relationships
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一定制英国白金汉大学毕业证(UCB毕业证书)																			成绩单原版一比一
定制英国白金汉大学毕业证(UCB毕业证书) 成绩单原版一比一
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 

Big data management

  • 1. Bilwa Upadhye - FPM03 Chetna Chauhan – FPM04 Leon Dukkipati – PGP0686 Manzoor Ul Akram – FPM05 Soumya Soni – PGP06105 IIM Rohtak
  • 2. • The exponential growth and availability of data, both structured and unstructured. Structured Data • Data that resides in a fixed field within a record or file is called structured data. This includes data contained in relational databases and spreadsheets. Unstructured Data • Text and multimedia content like e-mail messages, word processing documents, videos, photos, audio files, presentations, webpages and many other kinds of business documents. The data doesn't fit neatly in a database • 80 – 90% data in any organization is unstructured12/6/2015 Big Data 2
  • 3. • eBay – 100 PB • Google – 100 PB • Facebook - 600 PB • Twitter – 100 TB • NSA – 29 TB • 90% of the data in the world today has been created in the last two years alone Examples : • Sensors used to gather climate information, posts to social media sites, digital pictures and videos, purchase transaction records, cell phone GPS signals, UID information, patient information etc. Source : http://wikibon.org12/6/2015 Big Data 3
  • 5. • Organization, administration and governance of large volumes of both structured and unstructured data • Tools used: Hadoop, NoSQL, Platfora • Big data management is important to business, and society, because more data may lead to more accurate analyses. 12/6/2015 Big Data 5
  • 6. RDBMS • Structured data • ER model defined perfectly • Less amount of data • Relational data base management system • Applications: IIM Rohtak Big data management technologies • Unstructured data, semi- structured data, unstructured data • No perfect ER model • Large amount of data • Node based flat structure • Healthcare, retail, Google, IBM 12/6/2015 Big Data 6
  • 7.
  • 8. • Open source software framework – JAVA • Fundamental assumption • Storage part: HDFS ( Hadoop distributed file system) • Processing part: Map reduce • Working of Hadoop 12/6/2015 Big Data 8
  • 9. Map reduce divides application into blocks HDFS creates multiple replicas of data blocks HDFS places data blocks on different nodes around cluster Map reduce accesses data Map reduce processes data 12/6/2015 Big Data 9
  • 10. • Non SQL database • Provides mechanism for storage and retrieval of data • Horizontal scaling Platfora • Software works with open source software framework Hadoop • When user queries database, software delivers answer in real time 12/6/2015 Big Data 10
  • 11. • Highly fault - tolerant and is designed to be deployed on low-cost hardware • Provides high throughput access to application data and is suitable for applications that have large data sets • Relaxes a few POSIX requirements to enable streaming access to file system data 12/6/2015 Big Data 11
  • 12. Large: Thousands of server machines Replicated data blocks Failure is norm Fast detection and recovery of faults Properties of HDFS 12/6/2015 Big Data 12
  • 13. • Programming model for processing large data sets • Developed by Google for internal search applications • Currently used by Yahoo, Amazon, IBM etc • The run time partitions the input and provides it to different Map instances 12/6/2015 Big Data 13
  • 14. Partitioning the input Mapping of instances Map (key, value)  (key’, value’) Collection of the (key’, value’) pairs Distribution to reduce functions Each reduce produces single file output 12/6/2015 Big Data 14
  • 15. 15 Users only provide the “Map” and “Reduce” functions 12/6/2015 Big Data
  • 16. • $300 billion potential annual value to US health care. • $600 billion potential annual consumer surplus from using personal location data. • 60% potential in retailers’ operating margins.
  • 17. • Gaining attraction • Huge market opportunities for IT services (82.9% of revenues) and analytics firms (17.1 % ) • Current market size is $200 million. By 2015 $1 billion • The opportunity for Indian service providers lies in offering services around Big Data implementation and analytics for global multinationals
  • 18. 18 Big Data Challenges • Hard to quantify value to the enterprise • Data Scientists roles are difficult to fill • Difficult to design effective visualization and reporting of new data sets
  • 19. • Goal of improving retention and graduation rates • Developing a more pro-active relationship with students to help them be more successful during and after graduation • Approach: 1. Online Applications for Education 2. Forums 3. Help desk 4. Student Demographic and Operational Information
  • 20.
  • 21. 12/6/2015 Big Data 21 Narendra Modi wins 2014 Lok Sabha Elections
  • 22. 12/6/2015 Big Data 22 What makes Modi’s use of big data so impressive Volume of Data : 814 million voters Variety of data – 12 different languages -- 900,000 PDF’s amounting -- 25 million pages -- heterogeneous, non-uniform data For what purpose did he use Big Data ? -> To drive donations, enroll volunteers, and improve the effectiveness of everything from door knocks…to social media BJP’s website, planted cookies on all computers that visited its site - for customised advertisements. #IndiaVotes Source : dataconomy.com
  • 23. • http://dataconomy.com/narendra-modi-first-prime- minister-use-big-data-analytics/ • http://dataconomy.com/narendra-modi-first-prime- minister-use-big-data-analytics/ • http://blog.pivotal.io/data-science-pivotal/case- studies/big-data-in-education-analyzing-student-clusters- to-influence-success-and-retention 12/6/2015 Big Data 23

Editor's Notes

  1. InformationWeek 2012 Big Data Survey Information Management being critical to big data