SlideShare a Scribd company logo
Big Data
&
Big DataTechnologies
Presenters
Swikar Bhandari
Yaju Shrestha
Yubraj Ghimire 7/12/20171
Overview
 Introduction to Big Data
 Characteristics of Big Data
 Challenges in Big Data
 Big DataTrends
 Data Scientist & their roles
7/12/20172
Introduction to Big Data
7/12/20173
 Information that can’t be processed or analyzed using
traditional processes or tools.
 Data sets that are so large and complex
 Data which are difficult to capture, store, process,
search and analysis
 ‘Big-data’ is similar to‘Small-data’, but bigger.
Characteristics of Big Data
Volume
• Data size
• Data is generated by machines, networks and human
interaction on systems like social media
• 2.5 Exabytes of data produced everyday which is equivalent
to 90 years of HD video
Velocity
• The pace of flow of data from sources like business,
machines, network and human interaction with social medias
or mobiles
• The data flow is massive and continuous
7/12/20174
Characteristics of Big Data
Variety
• Data heterogeneity: Structured and Unstructured data
• Many sources and types of data both structured and
unstructured.
• Data comes in the form of emails, photos, videos,
monitoring devices, PDFs, audio, etc
Veracity
• Uncertainty of accuracy and authenticity of data
• Biases, noise and abnormality in data.
• Data that is being stored, and mined meaningful to the
problem being analyzed or not.
7/12/20175
Characteristics of Big Data
Validity
• The issue of validity meaning is the data correct and accurate
for the intended use.
• Valid data is key to making the right decisions
Volatility
• How long is data valid and how long should it be stored.
• Data need to determine at what point is data no longer
relevant to the current analysis.
7/12/20176
Challenges in Big Data
 Fault tolerance: ability to handle failures
 Scalability : ability to handle data with time
 Heterogeneity: ability to handle various kinds of data
7/12/20177
Big Data in Information System
 Unstructured data handling capability
 Real time data processing
 Predictive analytics and in-memory analytics
7/12/20178
Big Data Trends
 NoSQL database: for handling unstructured data
 Cloud based analytics: migrating data to cloud platform
 Deep learning: algorithms are used for mining data
 In memory analytics: speed up analytical processing.
7/12/20179
Data Scientist
 Person that analyses and interprets data to assist in
decision making.
 The people who understand how to fish out answers to
important business questions from today's tsunami of
unstructured information
 A hybrid of data hacker, analyst, communicator, and
trusted adviser
.
7/12/201710
Roles and Skills Of Data Scientist
 Use technologies that make taming big data possible,
including Hadoop, and related open-source tools, cloud
computing, and data visualization.
 Make discoveries while swimming in pool of data
 Bring structure to large quantities of formless data and make
analysis possible
 Write code
7/12/201711
Roles and Skills Of Data Scientist
 Communicate what they’ve learned and suggest its
implications for new business directions
 Fashion their own tools and even conduct academic-style
research
 Be creative in displaying information visually and making the
patterns they find clear and compelling
7/12/201712
THANK YOU
7/12/201713

More Related Content

What's hot

Data science
Data scienceData science
Data science
shankar_radhakrishnan
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
Skillwise Consulting
 
How I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataHow I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked Data
Domino Data Lab
 
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data SolutionThe Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
DATAVERSITY
 
Prague data management meetup 2016-01-12 pub
Prague data management meetup 2016-01-12 pubPrague data management meetup 2016-01-12 pub
Prague data management meetup 2016-01-12 pub
Martin Bém
 
Kurukshetra - Big Data
Kurukshetra - Big DataKurukshetra - Big Data
Kurukshetra - Big Data
shankar_radhakrishnan
 
BIG DATA-Seminar Report
BIG DATA-Seminar ReportBIG DATA-Seminar Report
BIG DATA-Seminar Report
josnapv
 
Dell hans timmerman v1.1
Dell hans timmerman v1.1Dell hans timmerman v1.1
Dell hans timmerman v1.1
BigDataExpo
 
Big data
Big dataBig data
IBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes MatterIBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes Matter
Christine O'Connor
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
Srinath Perera
 
Reveelium Data Science as a Service - Datasheet EN
Reveelium Data Science as a Service - Datasheet ENReveelium Data Science as a Service - Datasheet EN
Reveelium Data Science as a Service - Datasheet EN
ITrust - Cybersecurity as a Service
 
Data Science towards the Digital Enterprise
Data Science towards the Digital EnterpriseData Science towards the Digital Enterprise
Data Science towards the Digital Enterprise
Jake Bouma
 
Pieter den Hamer Alliander
Pieter den Hamer Alliander Pieter den Hamer Alliander
Pieter den Hamer Alliander
BigDataExpo
 
Big Data and Harvesting Data from Social Media
Big Data and Harvesting Data from Social MediaBig Data and Harvesting Data from Social Media
Big Data and Harvesting Data from Social Media
R A Akerkar
 
The Role of Artificial Intelligence in Corporate Innovation
The Role of Artificial Intelligence in Corporate InnovationThe Role of Artificial Intelligence in Corporate Innovation
The Role of Artificial Intelligence in Corporate Innovation
Dickson Lukose
 
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.
Jari Koister
 
Mining Big Data in Real Time
Mining Big Data in Real TimeMining Big Data in Real Time
Mining Big Data in Real Time
Albert Bifet
 
Big data ppt
Big data pptBig data ppt
Big data ppt
AKASH SIHAG
 
Ds01 data science
Ds01   data scienceDs01   data science
Ds01 data science
DotNetCampus
 

What's hot (20)

Data science
Data scienceData science
Data science
 
SKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSISSKILLWISE-BIGDATA ANALYSIS
SKILLWISE-BIGDATA ANALYSIS
 
How I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked DataHow I Learned to Stop Worrying and Love Linked Data
How I Learned to Stop Worrying and Love Linked Data
 
The Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data SolutionThe Top 5 Factors to Consider When Choosing a Big Data Solution
The Top 5 Factors to Consider When Choosing a Big Data Solution
 
Prague data management meetup 2016-01-12 pub
Prague data management meetup 2016-01-12 pubPrague data management meetup 2016-01-12 pub
Prague data management meetup 2016-01-12 pub
 
Kurukshetra - Big Data
Kurukshetra - Big DataKurukshetra - Big Data
Kurukshetra - Big Data
 
BIG DATA-Seminar Report
BIG DATA-Seminar ReportBIG DATA-Seminar Report
BIG DATA-Seminar Report
 
Dell hans timmerman v1.1
Dell hans timmerman v1.1Dell hans timmerman v1.1
Dell hans timmerman v1.1
 
Big data
Big dataBig data
Big data
 
IBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes MatterIBM Analytics at Scale: Because Business Outcomes Matter
IBM Analytics at Scale: Because Business Outcomes Matter
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Reveelium Data Science as a Service - Datasheet EN
Reveelium Data Science as a Service - Datasheet ENReveelium Data Science as a Service - Datasheet EN
Reveelium Data Science as a Service - Datasheet EN
 
Data Science towards the Digital Enterprise
Data Science towards the Digital EnterpriseData Science towards the Digital Enterprise
Data Science towards the Digital Enterprise
 
Pieter den Hamer Alliander
Pieter den Hamer Alliander Pieter den Hamer Alliander
Pieter den Hamer Alliander
 
Big Data and Harvesting Data from Social Media
Big Data and Harvesting Data from Social MediaBig Data and Harvesting Data from Social Media
Big Data and Harvesting Data from Social Media
 
The Role of Artificial Intelligence in Corporate Innovation
The Role of Artificial Intelligence in Corporate InnovationThe Role of Artificial Intelligence in Corporate Innovation
The Role of Artificial Intelligence in Corporate Innovation
 
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.
Talk at IEEE Big Data/Cloud conference in Santa Clara, June 28th, 2013.
 
Mining Big Data in Real Time
Mining Big Data in Real TimeMining Big Data in Real Time
Mining Big Data in Real Time
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Ds01 data science
Ds01   data scienceDs01   data science
Ds01 data science
 

Similar to Big data presentation

UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdf
vvpadhu
 
Big Data Presentation
Big Data PresentationBig Data Presentation
Big Data Presentation
AbhijeetPandey71
 
TOPIC.pptx
TOPIC.pptxTOPIC.pptx
TOPIC.pptx
infinix8
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
IRJET Journal
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
Valarmathi V
 
1
11
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
Honey166829
 
Big data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and HadoopBig data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and Hadoop
SamiraChandan
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
Sandip Tipayle Patil
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
IJSRD
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
IJSRD
 
Data warehouse Vs Big Data
Data warehouse Vs Big Data Data warehouse Vs Big Data
Data warehouse Vs Big Data
Lisette ZOUNON
 
Big data
Big dataBig data
Big data
Young Alista
 
Big data
Big dataBig data
Big data
Hoang Nguyen
 
Big data
Big dataBig data
Big data
Tony Nguyen
 
Big data
Big dataBig data
Big data
Harry Potter
 
Big data
Big dataBig data
Big data
Fraboni Ec
 
Big data
Big dataBig data
Big data
James Wong
 
Big data
Big dataBig data
Big data
Luis Goldster
 
Seminarppt
SeminarpptSeminarppt
Seminarppt
Monali Akhare
 

Similar to Big data presentation (20)

UNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdfUNIT 1 -BIG DATA ANALYTICS Full.pdf
UNIT 1 -BIG DATA ANALYTICS Full.pdf
 
Big Data Presentation
Big Data PresentationBig Data Presentation
Big Data Presentation
 
TOPIC.pptx
TOPIC.pptxTOPIC.pptx
TOPIC.pptx
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
1
11
1
 
Big data.pptx
Big data.pptxBig data.pptx
Big data.pptx
 
Big data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and HadoopBig data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and Hadoop
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
 
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
Big Data Mining, Techniques, Handling Technologies and Some Related Issues: A...
 
Data warehouse Vs Big Data
Data warehouse Vs Big Data Data warehouse Vs Big Data
Data warehouse Vs Big Data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Seminarppt
SeminarpptSeminarppt
Seminarppt
 

Recently uploaded

Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
KrishnaveniKrishnara1
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
IJECEIAES
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
MDSABBIROJJAMANPAYEL
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
ramrag33
 
Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
Prakhyath Rai
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
171ticu
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
co23btech11018
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
bijceesjournal
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
ydzowc
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
PKavitha10
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
bjmsejournal
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
riddhimaagrawal986
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
cnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classicationcnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classication
SakkaravarthiShanmug
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
abbyasa1014
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
RamonNovais6
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
ElakkiaU
 

Recently uploaded (20)

Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.pptUnit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
Unit-III-ELECTROCHEMICAL STORAGE DEVICES.ppt
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
Electric vehicle and photovoltaic advanced roles in enhancing the financial p...
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Properties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptxProperties Railway Sleepers and Test.pptx
Properties Railway Sleepers and Test.pptx
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
 
Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...Software Engineering and Project Management - Introduction, Modeling Concepts...
Software Engineering and Project Management - Introduction, Modeling Concepts...
 
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样学校原版美国波士顿大学毕业证学历学位证书原版一模一样
学校原版美国波士顿大学毕业证学历学位证书原版一模一样
 
Computational Engineering IITH Presentation
Computational Engineering IITH PresentationComputational Engineering IITH Presentation
Computational Engineering IITH Presentation
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
原版制作(Humboldt毕业证书)柏林大学毕业证学位证一模一样
 
CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1CEC 352 - SATELLITE COMMUNICATION UNIT 1
CEC 352 - SATELLITE COMMUNICATION UNIT 1
 
Design and optimization of ion propulsion drone
Design and optimization of ion propulsion droneDesign and optimization of ion propulsion drone
Design and optimization of ion propulsion drone
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
cnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classicationcnn.pptx Convolutional neural network used for image classication
cnn.pptx Convolutional neural network used for image classication
 
Engineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdfEngineering Drawings Lecture Detail Drawings 2014.pdf
Engineering Drawings Lecture Detail Drawings 2014.pdf
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
 
An Introduction to the Compiler Designss
An Introduction to the Compiler DesignssAn Introduction to the Compiler Designss
An Introduction to the Compiler Designss
 

Big data presentation

  • 1. Big Data & Big DataTechnologies Presenters Swikar Bhandari Yaju Shrestha Yubraj Ghimire 7/12/20171
  • 2. Overview  Introduction to Big Data  Characteristics of Big Data  Challenges in Big Data  Big DataTrends  Data Scientist & their roles 7/12/20172
  • 3. Introduction to Big Data 7/12/20173  Information that can’t be processed or analyzed using traditional processes or tools.  Data sets that are so large and complex  Data which are difficult to capture, store, process, search and analysis  ‘Big-data’ is similar to‘Small-data’, but bigger.
  • 4. Characteristics of Big Data Volume • Data size • Data is generated by machines, networks and human interaction on systems like social media • 2.5 Exabytes of data produced everyday which is equivalent to 90 years of HD video Velocity • The pace of flow of data from sources like business, machines, network and human interaction with social medias or mobiles • The data flow is massive and continuous 7/12/20174
  • 5. Characteristics of Big Data Variety • Data heterogeneity: Structured and Unstructured data • Many sources and types of data both structured and unstructured. • Data comes in the form of emails, photos, videos, monitoring devices, PDFs, audio, etc Veracity • Uncertainty of accuracy and authenticity of data • Biases, noise and abnormality in data. • Data that is being stored, and mined meaningful to the problem being analyzed or not. 7/12/20175
  • 6. Characteristics of Big Data Validity • The issue of validity meaning is the data correct and accurate for the intended use. • Valid data is key to making the right decisions Volatility • How long is data valid and how long should it be stored. • Data need to determine at what point is data no longer relevant to the current analysis. 7/12/20176
  • 7. Challenges in Big Data  Fault tolerance: ability to handle failures  Scalability : ability to handle data with time  Heterogeneity: ability to handle various kinds of data 7/12/20177
  • 8. Big Data in Information System  Unstructured data handling capability  Real time data processing  Predictive analytics and in-memory analytics 7/12/20178
  • 9. Big Data Trends  NoSQL database: for handling unstructured data  Cloud based analytics: migrating data to cloud platform  Deep learning: algorithms are used for mining data  In memory analytics: speed up analytical processing. 7/12/20179
  • 10. Data Scientist  Person that analyses and interprets data to assist in decision making.  The people who understand how to fish out answers to important business questions from today's tsunami of unstructured information  A hybrid of data hacker, analyst, communicator, and trusted adviser . 7/12/201710
  • 11. Roles and Skills Of Data Scientist  Use technologies that make taming big data possible, including Hadoop, and related open-source tools, cloud computing, and data visualization.  Make discoveries while swimming in pool of data  Bring structure to large quantities of formless data and make analysis possible  Write code 7/12/201711
  • 12. Roles and Skills Of Data Scientist  Communicate what they’ve learned and suggest its implications for new business directions  Fashion their own tools and even conduct academic-style research  Be creative in displaying information visually and making the patterns they find clear and compelling 7/12/201712