SlideShare a Scribd company logo
1 of 20
Big Data
SUBMITTED TO:-
MR.
CSE DEPARTMENT
Presented by:
Yash raj sharma(6CS-91)
B.Tech VI Sem.
Jaipur National University , Jaipur
Contents
Introduction
Problem of Data Explosion
Big Data Characteristics
Issues and Challenges in Big Data
Advantages of Big Data
Projects using Big Data
Conclusion
Introduction
Big Data is large volume of Data in structured or unstructured form.
The rate of data generation has increased exponentially by increasing use of
data intensive technologies.
Processing or analyzing the huge amount of data is a challenging task.
It requires new infrastructure and a new way of thinking about the way
business and IT industry works
Problem of Data Explosion
The International Data Corporation (IDC) study predicts that overall data
will grow by 50 times by 2020.
The digital universe is 1.8 trillion gigabytes (109) in size and stored in 500
quadrillion (1015) files.
Information Bits in the digital universe as stars in our physical universe.
90% Data is in unstructured form.
Volume
Velocity
Variety
Big data can be described by the following characteristics:
Volume – The quantity of data that is generated is very important in this context. It is the size of the data
which determines the value and potential of the data under consideration and whether it can actually be
considered Big Data or not. The name ‘Big Data’ itself contains a term which is related to size and hence
the characteristic.
Variety - The next aspect of Big Data is its variety. This means that the category to which Big Data belongs
to is
also a very essential fact that needs to be known by the data analysts. This helps the people, who are
closely analyzing the data and are associated with it, to effectively use the data to their advantage and thus
upholding the importance of the Big Data.
Velocity - The term ‘velocity’ in the context refers to the speed of generation of data or how fast the data is
generated and processed to meet the demands and the challenges which lie ahead in the path of growth
and development.
Veracity - The quality of the data being captured can vary greatly. Accuracy of analysis depends on the
veracity of the source data.
Complexity - Data management can become a very complex process, especially when large volumes
of data come from multiple sources. These data need to be linked, connected and correlated in order
to be able to grasp the information that is supposed to be conveyed by these data. This situation, is
therefore, termed as the ‘complexity’ of Big Data.
Factory work and Cyber Physical System may have a 6C system:
1.Connection (sensor and networks),
2.Cloud (computing and data on demand),
3.Cyber (model and memory),
4.content/context (meaning and correlation),
5.community (sharing and collaboration), and
6.customization (personalization and value).
In this scenario and in order to provide useful insight to the factory management and gain correct
content, data has to be processed with advanced tools (analytics and algorithms) to generate
meaningful information. Considering the presence of visible and invisible issues in an industrial
factory, the information generation algorithm has to be capable of detecting and addressing invisible
issues such as machine degradation, component wear, etc. in the factory floor
Issues in Big Data
Issues related to the Characteristics
Storage and Transfer Issues
Data Management Issues
Processing Issues
Issues in Characteristics
Data Volume Issues
Data Velocity Issues
Data Variety Issues
Worth of Data Issues
Data Complexity Issues
Storage and Transfer Issues
Current Storage Techniques and Storage Medium are not appropriate for effectively
handling Big Data.
Current Technology limits 4 Terabytes (1012) per disk, so 1 Exabyte (1018) size data
will take 25,000 Disks.
Accessing that data will also overwhelm network.
Assuming a sustained transfer of 1 Exabyte will take 2,800 hours with a 1 Gbps
capable network with 80% effective transfer rate and 100Mbps sustainable speed.
Data Management Issues
Resolving issues of access, utilization, updating, governance,
and reference (in publications) have proven to be major
stumbling blocks.
In such volume, it is impractical to validate every data item.
New approaches and research to data qualification and
validation are needed.
The richness of digital data representation prohibits a
personalized methodology for data collection.
Processing Issues
The Processing Issues are critical to handle.
Example:
1 Exabyte = 1000 Petabytes (1015).
Assuming a processor expends 100 instructions on one block
at 5 gigahertz, the time required for end to-end processing
would be 20 nanoseconds.
To process 1K petabytes would require a total end-to-end
processing time of roughly 635 years.
Effective processing of Exabyte of data will require extensive
parallel processing and new analytics algorithms
Challenges in Big Data
Privacy and Security
Data Access and Sharing of Information
Analytical Challenges
Human Resources and Manpower
Technical Challenges
Privacy and Security
Privacy and Security are sensitive and includes conceptual,
Technical as well as legal significance.
Most Peoples are vulnerable to Information Theft.
Privacy can be compromised in the large data sets.
The Security is also critical to handle in such large data.
Social stratification would be important arising consequence
Data Access and Sharing of Information
Data should be available in accurate, complete and timely
manner.
The data management and governance process bit complex
adding the necessity to make data open and make it available to
government agencies.
Expecting sharing of data between companies is awkward
Analytical Challenges
Big data brings along with it some huge analytical
challenges.
Analysis on such huge data, requires a large number of
advance skills.
The type of analysis which is needed to be done on the data
depends highly on the results to be obtained.
Technical Challenges
Fault Tolerance: If the failure occurs the damage done
should be within acceptable threshold rather than
beginning the whole task from the scratch.
Scalability: Requires a high level of sharing of
resources which is expensive and dealing with the
system failures in an efficient manner.
Quality of Data: Big data focuses on quality data
storage rather than having very large irrelevant
data.
Heterogeneous Data: Structured and Unstructured
Data
Advantages of Big Data
Understanding and Targeting Customers
Understanding and Optimizing Business Process
Improving Science and Research
Improving Healthcare and Public Health
Optimizing Machine and Device Performance
Financial Trading
Improving Sports Performance
Improving Security and Law Enforcement
Conclusions
The commercial impacts of the Big data have the potential
to generate significant productivity growth for a number of
vertical sectors.
Big Data presents opportunity to create unprecedented business
advantages and better service delivery.
All the challenges and issues are needed to be handle effectively
and in a efficient manner.
Growing talent and building teams to make analytic-based
decisions is the key to realize the value of Big Data.

More Related Content

What's hot

A Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremA Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremAnthonyOtuonye
 
5 Factors Impacting Your Big Data Project's Performance
5 Factors Impacting Your Big Data Project's Performance 5 Factors Impacting Your Big Data Project's Performance
5 Factors Impacting Your Big Data Project's Performance Qubole
 
Issues, challenges, and solutions
Issues, challenges, and solutionsIssues, challenges, and solutions
Issues, challenges, and solutionscsandit
 
Mining Big Data using Genetic Algorithm
Mining Big Data using Genetic AlgorithmMining Big Data using Genetic Algorithm
Mining Big Data using Genetic AlgorithmIRJET Journal
 
elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdfAkuhuruf
 
Data Mining Algorithm and New HRDSD Theory for Big Data
Data Mining Algorithm and New HRDSD Theory for Big DataData Mining Algorithm and New HRDSD Theory for Big Data
Data Mining Algorithm and New HRDSD Theory for Big DataKamleshKumar394
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big datakk1718
 
Data mining & big data presentation 01
Data mining & big data presentation 01Data mining & big data presentation 01
Data mining & big data presentation 01Aseem Chakrabarthy
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET Journal
 
Big Data and Classification
Big Data and ClassificationBig Data and Classification
Big Data and Classification303Computing
 

What's hot (19)

A Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE TheoremA Model Design of Big Data Processing using HACE Theorem
A Model Design of Big Data Processing using HACE Theorem
 
Big data
Big dataBig data
Big data
 
5 Factors Impacting Your Big Data Project's Performance
5 Factors Impacting Your Big Data Project's Performance 5 Factors Impacting Your Big Data Project's Performance
5 Factors Impacting Your Big Data Project's Performance
 
Issues, challenges, and solutions
Issues, challenges, and solutionsIssues, challenges, and solutions
Issues, challenges, and solutions
 
Big Data & Data Mining
Big Data & Data MiningBig Data & Data Mining
Big Data & Data Mining
 
Mining Big Data using Genetic Algorithm
Mining Big Data using Genetic AlgorithmMining Big Data using Genetic Algorithm
Mining Big Data using Genetic Algorithm
 
elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdf
 
Data Mining Algorithm and New HRDSD Theory for Big Data
Data Mining Algorithm and New HRDSD Theory for Big DataData Mining Algorithm and New HRDSD Theory for Big Data
Data Mining Algorithm and New HRDSD Theory for Big Data
 
Bigdata analytics
Bigdata analyticsBigdata analytics
Bigdata analytics
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big data
Big dataBig data
Big data
 
Data mining & big data presentation 01
Data mining & big data presentation 01Data mining & big data presentation 01
Data mining & big data presentation 01
 
Big Data: Issues and Challenges
Big Data: Issues and ChallengesBig Data: Issues and Challenges
Big Data: Issues and Challenges
 
Big Data for Ag (2019)
Big Data for Ag (2019)Big Data for Ag (2019)
Big Data for Ag (2019)
 
Big data, Big decision
Big data, Big decisionBig data, Big decision
Big data, Big decision
 
M.Florence Dayana
M.Florence DayanaM.Florence Dayana
M.Florence Dayana
 
IRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth EnhancementIRJET- Big Data Management and Growth Enhancement
IRJET- Big Data Management and Growth Enhancement
 
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
[IJET-V1I3P10] Authors : Kalaignanam.K, Aishwarya.M, Vasantharaj.K, Kumaresan...
 
Big Data and Classification
Big Data and ClassificationBig Data and Classification
Big Data and Classification
 

Similar to Big data

Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)NikitaRajbhoj
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Thingspateelhs
 
Seminar presentation
Seminar presentationSeminar presentation
Seminar presentationKlawal13
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxhartrobert670
 
Identifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big dataIdentifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big datasarfraznawaz
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big dataHari Priya
 
Smart Data for Smart Labs
Smart Data for Smart Labs Smart Data for Smart Labs
Smart Data for Smart Labs OSTHUS
 
A Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesA Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesEditor IJMTER
 
How do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdfHow do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdfSoumodeep Nanee Kundu
 
E content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMEN
E content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMENE content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMEN
E content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMENsenekapseneka
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallTrillium Software
 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalIIIT Allahabad
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 

Similar to Big data (20)

Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)Nikita rajbhoj(a 50)
Nikita rajbhoj(a 50)
 
big data Big Things
big data Big Thingsbig data Big Things
big data Big Things
 
Research paper on big data and hadoop
Research paper on big data and hadoopResearch paper on big data and hadoop
Research paper on big data and hadoop
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Seminar presentation
Seminar presentationSeminar presentation
Seminar presentation
 
Big data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docxBig data is a broad term for data sets so large or complex that tr.docx
Big data is a broad term for data sets so large or complex that tr.docx
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
Identifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big dataIdentifying and analyzing the transient and permanent barriers for big data
Identifying and analyzing the transient and permanent barriers for big data
 
Introduction to big data
Introduction to big dataIntroduction to big data
Introduction to big data
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Smart Data for Smart Labs
Smart Data for Smart Labs Smart Data for Smart Labs
Smart Data for Smart Labs
 
A Survey on Big Data Mining Challenges
A Survey on Big Data Mining ChallengesA Survey on Big Data Mining Challenges
A Survey on Big Data Mining Challenges
 
Unit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdfUnit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdf
 
Unit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdfUnit-1 introduction to Big data.pdf
Unit-1 introduction to Big data.pdf
 
How do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdfHow do data analysts work with big data and distributed computing frameworks.pdf
How do data analysts work with big data and distributed computing frameworks.pdf
 
E content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMEN
E content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMENE content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMEN
E content.1 - P.SENEKA II-MSC COMPUTER SCIENCE,BON SECOURS COLLEGE FOR WOMEN
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They Fall
 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar Semwal
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 

Recently uploaded

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
DATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage exampleDATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage examplePragyanshuParadkar1
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
Effects of rheological properties on mixing
Effects of rheological properties on mixingEffects of rheological properties on mixing
Effects of rheological properties on mixingviprabot1
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidNikhilNagaraju
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.eptoze12
 

Recently uploaded (20)

Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
DATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage exampleDATA ANALYTICS PPT definition usage example
DATA ANALYTICS PPT definition usage example
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
Effects of rheological properties on mixing
Effects of rheological properties on mixingEffects of rheological properties on mixing
Effects of rheological properties on mixing
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
main PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfidmain PPT.pptx of girls hostel security using rfid
main PPT.pptx of girls hostel security using rfid
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.Oxy acetylene welding presentation note.
Oxy acetylene welding presentation note.
 

Big data

  • 1. Big Data SUBMITTED TO:- MR. CSE DEPARTMENT Presented by: Yash raj sharma(6CS-91) B.Tech VI Sem. Jaipur National University , Jaipur
  • 2. Contents Introduction Problem of Data Explosion Big Data Characteristics Issues and Challenges in Big Data Advantages of Big Data Projects using Big Data Conclusion
  • 3. Introduction Big Data is large volume of Data in structured or unstructured form. The rate of data generation has increased exponentially by increasing use of data intensive technologies. Processing or analyzing the huge amount of data is a challenging task. It requires new infrastructure and a new way of thinking about the way business and IT industry works
  • 4.
  • 5. Problem of Data Explosion The International Data Corporation (IDC) study predicts that overall data will grow by 50 times by 2020. The digital universe is 1.8 trillion gigabytes (109) in size and stored in 500 quadrillion (1015) files. Information Bits in the digital universe as stars in our physical universe. 90% Data is in unstructured form.
  • 7. Big data can be described by the following characteristics: Volume – The quantity of data that is generated is very important in this context. It is the size of the data which determines the value and potential of the data under consideration and whether it can actually be considered Big Data or not. The name ‘Big Data’ itself contains a term which is related to size and hence the characteristic. Variety - The next aspect of Big Data is its variety. This means that the category to which Big Data belongs to is also a very essential fact that needs to be known by the data analysts. This helps the people, who are closely analyzing the data and are associated with it, to effectively use the data to their advantage and thus upholding the importance of the Big Data. Velocity - The term ‘velocity’ in the context refers to the speed of generation of data or how fast the data is generated and processed to meet the demands and the challenges which lie ahead in the path of growth and development. Veracity - The quality of the data being captured can vary greatly. Accuracy of analysis depends on the veracity of the source data.
  • 8. Complexity - Data management can become a very complex process, especially when large volumes of data come from multiple sources. These data need to be linked, connected and correlated in order to be able to grasp the information that is supposed to be conveyed by these data. This situation, is therefore, termed as the ‘complexity’ of Big Data. Factory work and Cyber Physical System may have a 6C system: 1.Connection (sensor and networks), 2.Cloud (computing and data on demand), 3.Cyber (model and memory), 4.content/context (meaning and correlation), 5.community (sharing and collaboration), and 6.customization (personalization and value). In this scenario and in order to provide useful insight to the factory management and gain correct content, data has to be processed with advanced tools (analytics and algorithms) to generate meaningful information. Considering the presence of visible and invisible issues in an industrial factory, the information generation algorithm has to be capable of detecting and addressing invisible issues such as machine degradation, component wear, etc. in the factory floor
  • 9. Issues in Big Data Issues related to the Characteristics Storage and Transfer Issues Data Management Issues Processing Issues
  • 10. Issues in Characteristics Data Volume Issues Data Velocity Issues Data Variety Issues Worth of Data Issues Data Complexity Issues
  • 11. Storage and Transfer Issues Current Storage Techniques and Storage Medium are not appropriate for effectively handling Big Data. Current Technology limits 4 Terabytes (1012) per disk, so 1 Exabyte (1018) size data will take 25,000 Disks. Accessing that data will also overwhelm network. Assuming a sustained transfer of 1 Exabyte will take 2,800 hours with a 1 Gbps capable network with 80% effective transfer rate and 100Mbps sustainable speed.
  • 12. Data Management Issues Resolving issues of access, utilization, updating, governance, and reference (in publications) have proven to be major stumbling blocks. In such volume, it is impractical to validate every data item. New approaches and research to data qualification and validation are needed. The richness of digital data representation prohibits a personalized methodology for data collection.
  • 13. Processing Issues The Processing Issues are critical to handle. Example: 1 Exabyte = 1000 Petabytes (1015). Assuming a processor expends 100 instructions on one block at 5 gigahertz, the time required for end to-end processing would be 20 nanoseconds. To process 1K petabytes would require a total end-to-end processing time of roughly 635 years. Effective processing of Exabyte of data will require extensive parallel processing and new analytics algorithms
  • 14. Challenges in Big Data Privacy and Security Data Access and Sharing of Information Analytical Challenges Human Resources and Manpower Technical Challenges
  • 15. Privacy and Security Privacy and Security are sensitive and includes conceptual, Technical as well as legal significance. Most Peoples are vulnerable to Information Theft. Privacy can be compromised in the large data sets. The Security is also critical to handle in such large data. Social stratification would be important arising consequence
  • 16. Data Access and Sharing of Information Data should be available in accurate, complete and timely manner. The data management and governance process bit complex adding the necessity to make data open and make it available to government agencies. Expecting sharing of data between companies is awkward
  • 17. Analytical Challenges Big data brings along with it some huge analytical challenges. Analysis on such huge data, requires a large number of advance skills. The type of analysis which is needed to be done on the data depends highly on the results to be obtained.
  • 18. Technical Challenges Fault Tolerance: If the failure occurs the damage done should be within acceptable threshold rather than beginning the whole task from the scratch. Scalability: Requires a high level of sharing of resources which is expensive and dealing with the system failures in an efficient manner. Quality of Data: Big data focuses on quality data storage rather than having very large irrelevant data. Heterogeneous Data: Structured and Unstructured Data
  • 19. Advantages of Big Data Understanding and Targeting Customers Understanding and Optimizing Business Process Improving Science and Research Improving Healthcare and Public Health Optimizing Machine and Device Performance Financial Trading Improving Sports Performance Improving Security and Law Enforcement
  • 20. Conclusions The commercial impacts of the Big data have the potential to generate significant productivity growth for a number of vertical sectors. Big Data presents opportunity to create unprecedented business advantages and better service delivery. All the challenges and issues are needed to be handle effectively and in a efficient manner. Growing talent and building teams to make analytic-based decisions is the key to realize the value of Big Data.