Assoc. Prof. Abzetdin ADAMOV
CeDAWI - Center for Data Analytics and Web Insights
Qafqaz University
aadamov@qu.edu.az
http://ce.qu.edu.az/~aadamov
12 March 2015
Big Data Ecosystem for
Data-Driven Decision Making
Digital Universe
Volume of Digital Data
IDC's Digital Universe Study
• 2003 – 5 exabytes (EB) accumulated since the beginning of civilization
• 2005 – 130 EB
• 2008 – 480,000 petabytes (PB)
• 2009 – 800,000 PB
• 2010 – 1,200,000 PB, or 1.2 zettabytes (ZB)
• 2011 – 1.8 ZB
• 2012 – 2.7 ZB
• 2014 – about 6.2 ZB
• Expected to reach 44 ZB by 2020
Every day now we create as much information as we did from the
dawn of civilization up until 2003.
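The growth implied by these figures can be worked out directly. The sketch below is only an illustration: it takes the IDC numbers listed above at face value and assumes smooth compound growth between two data points, which the real series does not follow exactly. The function names are hypothetical.

```python
import math

# Digital Universe size in zettabytes, copied from the IDC figures above.
SIZES_ZB = {2010: 1.2, 2011: 1.8, 2012: 2.7, 2014: 6.2, 2020: 44.0}


def annual_growth_rate(start_year, end_year, sizes=SIZES_ZB):
    """Compound annual growth rate between two years of the table."""
    years = end_year - start_year
    return (sizes[end_year] / sizes[start_year]) ** (1 / years) - 1


def doubling_time_years(rate):
    """Years needed to double at a constant annual growth rate."""
    return math.log(2) / math.log(1 + rate)


if __name__ == "__main__":
    rate = annual_growth_rate(2010, 2020)
    print(f"2010-2020 growth: {rate:.1%} per year")
    print(f"doubling time: {doubling_time_years(rate):.2f} years")
```

Under this assumption, going from 1.2 ZB in 2010 to the projected 44 ZB in 2020 corresponds to roughly 43% growth per year, that is, the volume doubles about every two years.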
Big Measures for Big Data
• kilobyte (kB): 10^3 bytes (decimal) or 2^10 bytes (binary)
• megabyte (MB): 10^6 or 2^20
• gigabyte (GB): 10^9 or 2^30
• terabyte (TB): 10^12 or 2^40
• petabyte (PB): 10^15 or 2^50
• exabyte (EB): 10^18 or 2^60
• zettabyte (ZB): 10^21 or 2^70
• yottabyte (YB): 10^24 or 2^80
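As a small worked example of these units, the sketch below converts between decimal prefixes and shows how far the binary interpretation drifts from the decimal one as the units grow. The table contents come from the slide; the function names are hypothetical.

```python
# Decimal and binary exponents for each unit, as listed on the slide.
DECIMAL_EXP = {"kB": 3, "MB": 6, "GB": 9, "TB": 12,
               "PB": 15, "EB": 18, "ZB": 21, "YB": 24}
BINARY_EXP = {"kB": 10, "MB": 20, "GB": 30, "TB": 40,
              "PB": 50, "EB": 60, "ZB": 70, "YB": 80}


def convert(value, from_unit, to_unit):
    """Convert a quantity between two decimal units, e.g. PB to ZB."""
    return value * 10 ** DECIMAL_EXP[from_unit] / 10 ** DECIMAL_EXP[to_unit]


def binary_excess(unit):
    """Relative amount by which 2^(10n) exceeds 10^(3n) for a unit."""
    return 2 ** BINARY_EXP[unit] / 10 ** DECIMAL_EXP[unit] - 1


if __name__ == "__main__":
    # The 2010 figure: 1,200,000 PB expressed in zettabytes.
    print(convert(1_200_000, "PB", "ZB"))
    for unit in DECIMAL_EXP:
        print(unit, f"{binary_excess(unit):.1%}")
```

The gap between the two conventions is about 2.4% for a kilobyte but grows to roughly 21% for a yottabyte, so it matters which convention a storage figure uses.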
Why Does Data Grow So Fast?
Data sets are gathered by ubiquitous devices:
• Information-sensing mobile devices,
• Aerial sensory technologies (remote
sensing),
• Software logs,
• Cameras,
• Microphones,
• Radio-frequency identification readers,
• Wireless sensor networks
Internet Penetration
Note: Internet stats for December 2001
Average Internet usage in the world: 8% (500 million users), 2001
Foundations of the Web
Note: Internet stats for January 2014
Average Internet usage in the world: 42% (3.0 billion users), 2014
Social Networking
Top 15 Most Popular Social Networking Sites | January 2015
[Site names are missing from the extracted slide text. Each entry gives estimated unique monthly visitors and Compete rank.]
• 1,310,000,000 visitors | Compete rank 2
• 284,000,000 visitors | Compete rank 24
• 347,000,000 visitors | Compete rank 44
• 70,500,000 visitors | Compete rank 51
• 343,000,000 visitors | no Compete rank given
• 25,500,000 visitors | Compete rank 346
• 20,500,000 visitors | Compete rank 605
• 19,500,000 visitors | Compete rank 447
• 17,500,000 visitors | Compete rank NA
• 12,500,000 visitors | Compete rank 127
• 12,000,000 visitors | Compete rank 617
• 7,500,000 visitors | Compete rank 838
• 5,400,000 visitors | Compete rank 122
• 3,000,000 visitors | Compete rank 451
• 2,500,000 visitors | Compete rank 1,596
Problem with Moore’s Law
• The number of transistors that can be placed on an
integrated circuit doubles every 18 months to two years
• It’s predicted to reach its limit with existing technology in
2020
• Cutting the size of a transistor to a single atom may
defeat that concept
• The Digital Universe is growing much faster
than processing power
What Big Data Is and Isn’t
Computing + Internet = Big Data
• Big Data is not a new technology
• Big Data is not just about size
• Big Data is not Business Intelligence (BI)
• Big Data is not a solution by itself!
Is it time to move from Big Data 1 to Big Data 2?
Interdisciplinary Subfield of
Computer Science
• Artificial Intelligence,
• Machine Learning,
• Statistics,
• Applied Mathematics,
• Text Mining,
• Database Systems,
• Business Intelligence,
• Computational Linguistics,
• Natural Language Processing (NLP),
• ….
Jobs Derived from Big Data
• Chief Data Officer,
• Big Data Solution Architect,
• Big Data Platform Engineer,
• Big Data Analyst,
• Big Data Analytics Business Consultant,
• Big Data Software Designer,
• Big Data Consultant,
• Hadoop Architects,
• Consultant Hadoop Developer,
• Senior Analytics Manager,
• Data & Reporting Analyst,
• Analytics Analyst (Big Data)
Forbes - Where Big Data Jobs Will be in 2015
Data-Driven Decision Making
(DDD)
Data-driven decision making (DDD) refers to the practice of basing
decisions on the analysis of data rather than purely on intuition.
Data alone won’t change the world. It’s the people that use data
to make better decisions.
Data Science Applications
• Direct Marketing
• Online Advertising
• Credit Scoring and Risk Management
• Help Desk Management
• Fraud Detection
• Search Ranking
• Product Recommendation
• Predicting Unusual Behavior
• Customer Retention in Telecom
Big Data Management Life-Cycle
- Apache Hadoop
- HDFS
- Microsoft Azure
- ….
- Microsoft Analytics Platform System
- Excel
- R Programming
- Python
- ….
- Web Crawling
- Data Mining
- Information Retrieval
- ….
- Parsing
- Indexing
- Searching
- Ranking
- NLP
- ….
Big Data management draws on both Data Science and Data Engineering to
implement data mining techniques.
Big Data Infrastructure
Google’s First Data Centers
Google’s first data center
Google’s New Data Centers
Map of Google Data Centers Worldwide
An estimated 450,000 servers draw upwards of
20 megawatts, which costs on the
order of US$2 million per month in
electricity charges.
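A quick sanity check of these figures is to compute the electricity price they imply. The sketch below is a rough estimate only: it assumes a constant 20 MW load and a 30-day month, neither of which is stated on the slide, and the function name is hypothetical.

```python
def implied_price_per_kwh(power_mw, monthly_cost_usd, hours_per_month=24 * 30):
    """Electricity price implied by a constant load and a monthly bill.

    Assumes the load is constant over the month (an assumption, not a
    figure from the slide).
    """
    energy_kwh = power_mw * 1000 * hours_per_month  # MW -> kW, times hours
    return monthly_cost_usd / energy_kwh


if __name__ == "__main__":
    price = implied_price_per_kwh(power_mw=20, monthly_cost_usd=2_000_000)
    print(f"implied price: ${price:.3f} per kWh")
```

With these inputs the implied price comes out at about US$0.14 per kWh, so the power and cost figures are at least mutually consistent in order of magnitude.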
Big Data Terms and
Components
• Microsoft Azure
• Red Hat GFS - Global File System
• GoogleFS or GFS - Google File System
• HDFS - Hadoop Distributed File System
• SAN - Storage Area Network
• Google BigTable
• VFS - Virtual File System
• IBM GPFS - General Parallel File System
• HPSS - High Performance Storage System
Hadoop Distributed File System
Web Crawlers for Web Analytics
• Indexing
• Searching
• Ranking
• Analysis
• Crawling is an essential job for all Internet
giants: Google, Yahoo, Facebook, etc.
Some available open-source crawlers: Apache Nutch, Crawler4j,
Bixo, Heritrix, etc.
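The indexing, searching and ranking steps can be sketched on a toy scale. The example below is a minimal illustration, not how any of the crawlers named above work internally: it builds an in-memory inverted index over already-fetched page text and ranks matches with a simple TF-IDF-style score. The page URLs, texts and function names are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical crawled pages: URL -> extracted text.
PAGES = {
    "https://example.org/a": "big data hadoop hadoop",
    "https://example.org/b": "big data analytics",
    "https://example.org/c": "web crawler",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Indexing: map each term to {url: number of occurrences}."""
    index = defaultdict(dict)
    for url, text in pages.items():
        for term, count in Counter(tokenize(text)).items():
            index[term][url] = count
    return index


def search(index, query, total_pages):
    """Searching and ranking: score pages by term frequency times a
    smoothed inverse document frequency, best match first."""
    scores = Counter()
    for term in tokenize(query):
        postings = index.get(term, {})
        if not postings:
            continue
        idf = math.log(1 + total_pages / len(postings))
        for url, tf in postings.items():
            scores[url] += tf * idf
    return [url for url, _ in scores.most_common()]


if __name__ == "__main__":
    idx = build_index(PAGES)
    print(search(idx, "big hadoop", len(PAGES)))
```

Production search engines add link analysis, stemming and distributed storage on top of this idea, but the term-to-documents mapping remains the core data structure.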
Web Crawlers for Web Analytics
• Thanks to crawlers, any website can appear in
search results without doing any extra work.
• Crawling can be customized with META tags and
“robots.txt”
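How a well-behaved crawler honors robots.txt can be shown with Python's standard `urllib.robotparser` module. This is a minimal sketch: the robots.txt content, the crawler name and the URLs are made-up examples, and the rules are parsed from a string rather than fetched over the network.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: every crawler is kept out of /private/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in ("https://example.org/index.html",
                "https://example.org/private/report.html"):
        print(url, is_allowed(ROBOTS_TXT, "ExampleCrawler", url))
```

robots.txt is advisory: it works only because crawlers choose to check it before fetching. Page-level control is done separately through the robots META tag in the HTML head.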
Natural Language Processing
(NLP)
• Natural Language Processing (NLP)
• Computational Linguistics (CL)
• Machine Translation (MT)
Data Mining and Knowledge
Discovery
• Data collection
• Selection of useful data
• Data transformation: smoothing, aggregation,
normalization
• Discovery of interesting patterns:
classification, clustering, regression, anomaly
detection, association
• Knowledge visualization
Some available open-source data mining tools: RapidMiner,
RapidAnalytics, OpenNN, Carrot2, KNIME, etc.
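The data transformation step can be illustrated with plain Python. The sketch below shows one common choice for each of the three operations named above (moving-average smoothing, sum aggregation by key, min-max normalization). These are assumptions made for illustration, since the slide does not say which variants are meant, and the function names are hypothetical.

```python
from collections import defaultdict


def moving_average(values, window=3):
    """Smoothing: average each run of `window` consecutive values."""
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]


def aggregate_by_key(records):
    """Aggregation: sum the values of (key, value) pairs per key."""
    totals = defaultdict(float)
    for key, value in records:
        totals[key] += value
    return dict(totals)


def min_max_normalize(values):
    """Normalization: rescale values linearly into the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # all values equal, nothing to scale
    return [(v - lo) / (hi - lo) for v in values]


if __name__ == "__main__":
    print(moving_average([1, 2, 3, 4, 5]))
    print(aggregate_by_key([("a", 1), ("b", 2), ("a", 3)]))
    print(min_max_normalize([10, 20, 30]))
```

Transformations like these usually run before pattern discovery, because distance-based methods such as clustering are sensitive to noisy values and to features measured on different scales.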
Quotes on Big Data
“You can have data without information, but you cannot have
information without data.” – Daniel Keys Moran
“War is ninety percent information.” – Napoleon Bonaparte
“If you torture the data long enough, it will confess.” – Ronald Coase,
Economist
“He who would search for pearls must dive below.” – John Dryden
Thank you
www.CeDAWI.qu.edu.az
CeDAWI@qu.edu.az

More Related Content

What's hot

Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...
Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...
Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...
MicheleNati
 
GDPR and evolving international privacy regulations
GDPR and evolving international privacy regulationsGDPR and evolving international privacy regulations
GDPR and evolving international privacy regulations
Ulf Mattsson
 
Getting Started with GDPR Compliance
Getting Started with GDPR ComplianceGetting Started with GDPR Compliance
Getting Started with GDPR Compliance
DATAVERSITY
 
Keith prabhu global high on cloud summit
Keith prabhu  global high on cloud summitKeith prabhu  global high on cloud summit
Keith prabhu global high on cloud summit
administrator_confidis
 
Semantic Computing Executive Briefing
Semantic Computing Executive Briefing Semantic Computing Executive Briefing
Semantic Computing Executive Briefing
Graeme Wood
 
Big data
Big dataBig data
Big data
neharun07
 
Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...
Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...
Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...
MicheleNati
 
Legal certainty as a tool for the spread of the internet of things
Legal certainty as a tool for the spread of the internet of thingsLegal certainty as a tool for the spread of the internet of things
Legal certainty as a tool for the spread of the internet of things
Guido Noto La Diega
 
Privacy Regulations and Your Digital Setup
Privacy Regulations and Your Digital SetupPrivacy Regulations and Your Digital Setup
Privacy Regulations and Your Digital Setup
Piwik PRO
 
Making sense of big data
Making sense of big dataMaking sense of big data
Making sense of big data
bis_foresight
 
LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)
Wolfgang Greller
 
e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017
e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017
e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017
e-SIDES.eu
 
Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...
Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...
Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...
MicheleNati
 
Big Data & Implementasinya
Big Data & ImplementasinyaBig Data & Implementasinya
Big Data & Implementasinya
Anshar Abdullah
 
The Future Matters - Mike Maiorana
The Future Matters - Mike MaioranaThe Future Matters - Mike Maiorana
The Future Matters - Mike Maiorana
scoopnewsgroup
 
Web Analytics and Privacy
Web Analytics and Privacy Web Analytics and Privacy
Web Analytics and Privacy
Piwik PRO
 
What I learned at the Infosecurity ISACA North America Conference 2019
What I learned at the Infosecurity ISACA North America Conference 2019What I learned at the Infosecurity ISACA North America Conference 2019
What I learned at the Infosecurity ISACA North America Conference 2019
Ulf Mattsson
 
Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...
Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...
Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...
CIO Edge
 
Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...
Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...
Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...
scoopnewsgroup
 
Internet of Things (IotT) Legal Issues Privacy and Cybersecurity
Internet of Things (IotT) Legal Issues Privacy and Cybersecurity Internet of Things (IotT) Legal Issues Privacy and Cybersecurity
Internet of Things (IotT) Legal Issues Privacy and Cybersecurity
Darek Czuchaj
 

What's hot (20)

Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...
Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...
Personal data and blockchain: Opportunities and Challenges - Michele Nati - L...
 
GDPR and evolving international privacy regulations
GDPR and evolving international privacy regulationsGDPR and evolving international privacy regulations
GDPR and evolving international privacy regulations
 
Getting Started with GDPR Compliance
Getting Started with GDPR ComplianceGetting Started with GDPR Compliance
Getting Started with GDPR Compliance
 
Keith prabhu global high on cloud summit
Keith prabhu  global high on cloud summitKeith prabhu  global high on cloud summit
Keith prabhu global high on cloud summit
 
Semantic Computing Executive Briefing
Semantic Computing Executive Briefing Semantic Computing Executive Briefing
Semantic Computing Executive Briefing
 
Big data
Big dataBig data
Big data
 
Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...
Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...
Consent Receipts: The Future of Personal Data - Michele Nati - Lead Technolog...
 
Legal certainty as a tool for the spread of the internet of things
Legal certainty as a tool for the spread of the internet of thingsLegal certainty as a tool for the spread of the internet of things
Legal certainty as a tool for the spread of the internet of things
 
Privacy Regulations and Your Digital Setup
Privacy Regulations and Your Digital SetupPrivacy Regulations and Your Digital Setup
Privacy Regulations and Your Digital Setup
 
Making sense of big data
Making sense of big dataMaking sense of big data
Making sense of big data
 
LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)
 
e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017
e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017
e-SIDES workshop at ICE-IEEE Conference, Madeira 28/06/2017
 
Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...
Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...
Personal Data Receipts - Michele Nati - Lead Technologist Privacy and Trust -...
 
Big Data & Implementasinya
Big Data & ImplementasinyaBig Data & Implementasinya
Big Data & Implementasinya
 
The Future Matters - Mike Maiorana
The Future Matters - Mike MaioranaThe Future Matters - Mike Maiorana
The Future Matters - Mike Maiorana
 
Web Analytics and Privacy
Web Analytics and Privacy Web Analytics and Privacy
Web Analytics and Privacy
 
What I learned at the Infosecurity ISACA North America Conference 2019
What I learned at the Infosecurity ISACA North America Conference 2019What I learned at the Infosecurity ISACA North America Conference 2019
What I learned at the Infosecurity ISACA North America Conference 2019
 
Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...
Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...
Digital Enterprise Festival Birmingham 13/04/17 - Ian West Cognizant VP Data ...
 
Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...
Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...
Federal and Private Sector Joint Venture Partnership for Data Innovation - Av...
 
Internet of Things (IotT) Legal Issues Privacy and Cybersecurity
Internet of Things (IotT) Legal Issues Privacy and Cybersecurity Internet of Things (IotT) Legal Issues Privacy and Cybersecurity
Internet of Things (IotT) Legal Issues Privacy and Cybersecurity
 

Similar to Big Data Ecosystem for Data-Driven Decision Making

Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 
Overview of Bigdata Analytics
Overview of Bigdata Analytics Overview of Bigdata Analytics
Overview of Bigdata Analytics
Sankarapu Anjaneyulu
 
Big data
Big dataBig data
Big data
nikki135
 
Big Data and BI Best Practices
Big Data and BI Best PracticesBig Data and BI Best Practices
Big Data and BI Best Practices
Yellowfin
 
Big data and bi best practices slidedeck
Big data and bi best practices slidedeckBig data and bi best practices slidedeck
Big data and bi best practices slidedeck
Actian Corporation
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
Md. Salman Ahmed
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
VMware Tanzu
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
Doug Denton
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
Inside Analysis
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
Leanne Hwee
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
Bob Hardaway
 
Ds01 data science
Ds01   data scienceDs01   data science
Ds01 data science
DotNetCampus
 
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven RamageGeospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Steven Ramage
 
CDOVision - RJA Presentation FINAL
CDOVision - RJA Presentation FINALCDOVision - RJA Presentation FINAL
CDOVision - RJA Presentation FINAL
Robert J. Abate, CBIP, CDMP
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
dickonsondorris
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
Manish Chopra
 
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Neo4j
 
Graph tour keynote 2019
Graph tour keynote 2019Graph tour keynote 2019
Graph tour keynote 2019
Neo4j
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
bhavesh lande
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
Nasrin Hussain
 

Similar to Big Data Ecosystem for Data-Driven Decision Making (20)

Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
 
Overview of Bigdata Analytics
Overview of Bigdata Analytics Overview of Bigdata Analytics
Overview of Bigdata Analytics
 
Big data
Big dataBig data
Big data
 
Big Data and BI Best Practices
Big Data and BI Best PracticesBig Data and BI Best Practices
Big Data and BI Best Practices
 
Big data and bi best practices slidedeck
Big data and bi best practices slidedeckBig data and bi best practices slidedeck
Big data and bi best practices slidedeck
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Analytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data PlatformAnalytical Innovation: How to Build the Next Generation Data Platform
Analytical Innovation: How to Build the Next Generation Data Platform
 
Level Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentationLevel Seven - Expedient Big Data presentation
Level Seven - Expedient Big Data presentation
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusers
 
Ds01 data science
Ds01   data scienceDs01   data science
Ds01 data science
 
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven RamageGeospatial Intelligence Middle East 2013_Big Data_Steven Ramage
Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage
 
CDOVision - RJA Presentation FINAL
CDOVision - RJA Presentation FINALCDOVision - RJA Presentation FINAL
CDOVision - RJA Presentation FINAL
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
Graphs & Big Data - Philip Rathle and Andreas Kollegger @ Big Data Science Me...
 
Graph tour keynote 2019
Graph tour keynote 2019Graph tour keynote 2019
Graph tour keynote 2019
 
introduction to data science
introduction to data scienceintroduction to data science
introduction to data science
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 

More from Abzetdin Adamov

Understanding your Data - Data Analytics Lifecycle and Machine Learning
Understanding your Data - Data Analytics Lifecycle and Machine LearningUnderstanding your Data - Data Analytics Lifecycle and Machine Learning
Understanding your Data - Data Analytics Lifecycle and Machine Learning
Abzetdin Adamov
 
Big Data & Privacy
Big Data & PrivacyBig Data & Privacy
Big Data & Privacy
Abzetdin Adamov
 
Latest Trends in Technology: BigData Analytics, Virtualization, Cloud Computi...
Latest Trends in Technology:BigData Analytics, Virtualization, Cloud Computi...Latest Trends in Technology:BigData Analytics, Virtualization, Cloud Computi...
Latest Trends in Technology: BigData Analytics, Virtualization, Cloud Computi...
Abzetdin Adamov
 
Steps and Tips to Protect Yourself and your Private Information while Online....
Steps and Tips to Protect Yourself and your Private Information while Online....Steps and Tips to Protect Yourself and your Private Information while Online....
Steps and Tips to Protect Yourself and your Private Information while Online....
Abzetdin Adamov
 
Technical, Legal and Political Issues of Combating Terrorism on the Internet.
Technical, Legal and Political Issues of Combating Terrorism on the Internet.Technical, Legal and Political Issues of Combating Terrorism on the Internet.
Technical, Legal and Political Issues of Combating Terrorism on the Internet.
Abzetdin Adamov
 
Introduction to object oriented programming
Introduction to object oriented programmingIntroduction to object oriented programming
Introduction to object oriented programming
Abzetdin Adamov
 
Introduction to AJAX
Introduction to AJAXIntroduction to AJAX
Introduction to AJAX
Abzetdin Adamov
 
Introduction to HTML
Introduction to HTMLIntroduction to HTML
Introduction to HTML
Abzetdin Adamov
 
Qafqaz university-inegrated-management-information-system
Qafqaz university-inegrated-management-information-systemQafqaz university-inegrated-management-information-system
Qafqaz university-inegrated-management-information-system
Abzetdin Adamov
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
Abzetdin Adamov
 
Üniversite Bilgi Sistemi - Birimlerin İşbirliği Platformu
Üniversite Bilgi Sistemi - Birimlerin İşbirliği PlatformuÜniversite Bilgi Sistemi - Birimlerin İşbirliği Platformu
Üniversite Bilgi Sistemi - Birimlerin İşbirliği Platformu
Abzetdin Adamov
 
INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...
INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...
INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...
Abzetdin Adamov
 
e-Government Strategy. Government Transformation in Developing Countries of t...
e-Government Strategy. Government Transformation in Developing Countries of t...e-Government Strategy. Government Transformation in Developing Countries of t...
e-Government Strategy. Government Transformation in Developing Countries of t...
Abzetdin Adamov
 
The Truth about Cloud Computing as new Paradigm in IT
The Truth about Cloud Computing  as new Paradigm in ITThe Truth about Cloud Computing  as new Paradigm in IT
The Truth about Cloud Computing as new Paradigm in IT
Abzetdin Adamov
 
The Role of Business Process Management in Success of the e-Government Projec...
The Role of Business Process Management in Success of the e-Government Projec...The Role of Business Process Management in Success of the e-Government Projec...
The Role of Business Process Management in Success of the e-Government Projec...
Abzetdin Adamov
 
University Management Information System
University Management Information SystemUniversity Management Information System
University Management Information System
Abzetdin Adamov
 

More from Abzetdin Adamov (16)

Understanding your Data - Data Analytics Lifecycle and Machine Learning
Understanding your Data - Data Analytics Lifecycle and Machine LearningUnderstanding your Data - Data Analytics Lifecycle and Machine Learning
Understanding your Data - Data Analytics Lifecycle and Machine Learning
 
Big Data & Privacy
Big Data & PrivacyBig Data & Privacy
Big Data & Privacy
 
Latest Trends in Technology: BigData Analytics, Virtualization, Cloud Computi...
Latest Trends in Technology:BigData Analytics, Virtualization, Cloud Computi...Latest Trends in Technology:BigData Analytics, Virtualization, Cloud Computi...
Latest Trends in Technology: BigData Analytics, Virtualization, Cloud Computi...
 
Steps and Tips to Protect Yourself and your Private Information while Online....
Steps and Tips to Protect Yourself and your Private Information while Online....Steps and Tips to Protect Yourself and your Private Information while Online....
Steps and Tips to Protect Yourself and your Private Information while Online....
 
Technical, Legal and Political Issues of Combating Terrorism on the Internet.
Technical, Legal and Political Issues of Combating Terrorism on the Internet.Technical, Legal and Political Issues of Combating Terrorism on the Internet.
Technical, Legal and Political Issues of Combating Terrorism on the Internet.
 
Introduction to object oriented programming
Introduction to object oriented programmingIntroduction to object oriented programming
Introduction to object oriented programming
 
Introduction to AJAX
Introduction to AJAXIntroduction to AJAX
Introduction to AJAX
 
Introduction to HTML
Introduction to HTMLIntroduction to HTML
Introduction to HTML
 
Qafqaz university-inegrated-management-information-system
Qafqaz university-inegrated-management-information-systemQafqaz university-inegrated-management-information-system
Qafqaz university-inegrated-management-information-system
 
Grid Computing
Grid ComputingGrid Computing
Grid Computing
 
Üniversite Bilgi Sistemi - Birimlerin İşbirliği Platformu
Üniversite Bilgi Sistemi - Birimlerin İşbirliği PlatformuÜniversite Bilgi Sistemi - Birimlerin İşbirliği Platformu
Üniversite Bilgi Sistemi - Birimlerin İşbirliği Platformu
 
INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...
INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...
INFORMATION TECHNOLOGIES AS THE BASE OF THE BUSINESS PROCESS MANAGEMENT IMPLE...
 
e-Government Strategy. Government Transformation in Developing Countries of t...
e-Government Strategy. Government Transformation in Developing Countries of t...e-Government Strategy. Government Transformation in Developing Countries of t...
e-Government Strategy. Government Transformation in Developing Countries of t...
 
The Truth about Cloud Computing as new Paradigm in IT
The Truth about Cloud Computing  as new Paradigm in ITThe Truth about Cloud Computing  as new Paradigm in IT
The Truth about Cloud Computing as new Paradigm in IT
 
The Role of Business Process Management in Success of the e-Government Projec...
The Role of Business Process Management in Success of the e-Government Projec...The Role of Business Process Management in Success of the e-Government Projec...
The Role of Business Process Management in Success of the e-Government Projec...
 
University Management Information System
University Management Information SystemUniversity Management Information System
University Management Information System
 



Big Data Ecosystem for Data-Driven Decision Making

  • 1. Assoc. Prof. Abzetdin ADAMOV
       CeDAWI - Center for Data Analytics and Web Insights
       Qafqaz University
       aadamov@qu.edu.az
       http://ce.qu.edu.az/~aadamov
       12 March 2015
       Big Data Ecosystem for Data-Driven Decision Making
  • 2. Digital Universe: Volume of Digital Data
       IDC's Digital Universe Study
       • 2003 – 5 exabytes from the beginning of civilization
       • 2005 – 130 exabytes
       • 2008 – 480,000 petabytes (PB)
       • 2009 – 800,000 PB
       • 2010 – 1,200,000 PB, or 1.2 zettabytes (ZB)
       • 2011 – 1.8 ZB
       • 2012 – 2.7 ZB
       • 2014 – about 6.2 ZB
       • Expected to reach 44 ZB by 2020
       Every day now we create as much information as we did from the dawn of civilization up until 2003.
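As a rough check on the figures in slide 2, the sketch below computes the compound annual growth rate implied by two of the listed data points. The function name is illustrative and does not come from the slides; the input values are the 2010 and 2020 figures quoted there.

```python
def annual_growth_rate(start_zb: float, end_zb: float, years: int) -> float:
    """Compound annual growth rate implied by two volume estimates."""
    return (end_zb / start_zb) ** (1 / years) - 1

# Figures quoted in the slide: 1.2 ZB in 2010, 44 ZB expected by 2020.
rate = annual_growth_rate(1.2, 44.0, 2020 - 2010)
print(f"Implied growth: {rate:.1%} per year")
```

The result only describes the trend between those two estimates; it says nothing about how the individual yearly figures were measured.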
  • 3. Big Measures for Big Data
       • kilobyte (kB): 10^3 (binary: 2^10)
       • megabyte (MB): 10^6 (binary: 2^20)
       • gigabyte (GB): 10^9 (binary: 2^30)
       • terabyte (TB): 10^12 (binary: 2^40)
       • petabyte (PB): 10^15 (binary: 2^50)
       • exabyte (EB): 10^18 (binary: 2^60)
       • zettabyte (ZB): 10^21 (binary: 2^70)
       • yottabyte (YB): 10^24 (binary: 2^80)
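The decimal and binary readings of each prefix in slide 3 drift apart as the units grow. The following minimal sketch prints the gap for each unit; the helper name `prefix_sizes` is an assumption made for this example.

```python
# Decimal (power of 10) vs. binary (power of 2) size for each prefix in slide 3.
PREFIXES = ["kilo", "mega", "giga", "tera", "peta", "exa", "zetta", "yotta"]

def prefix_sizes(index: int) -> tuple[int, int]:
    """Return (decimal_bytes, binary_bytes) for the prefix at position index."""
    decimal = 10 ** (3 * (index + 1))
    binary = 2 ** (10 * (index + 1))
    return decimal, binary

for i, name in enumerate(PREFIXES):
    decimal, binary = prefix_sizes(i)
    gap = (binary - decimal) / decimal  # relative excess of the binary value
    print(f"{name}byte: 10^{3 * (i + 1)} vs 2^{10 * (i + 1)}, binary is {gap:.1%} larger")
```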
  • 4. Why Does Data Grow So Fast?
       Data sets are gathered by ubiquitous devices:
       • Information-sensing mobile devices
       • Aerial sensory technologies (remote sensing)
       • Software logs
       • Cameras
       • Microphones
       • Radio-frequency identification readers
       • Wireless sensor networks
  • 5. Internet Penetration
       Note: Internet stats for December 2001
       Average Internet usage in the world in 2001: 8% (500 million)
  • 6. Foundations of the Web
       Note: Internet stats for January 2014
       Average Internet usage in the world in 2014: 42% (3.0 billion)
  • 7. Social Networking
       Top 15 Most Popular Social Networking Sites | January 2015
       • 1,310,000,000 - Estimated Unique Monthly Visitors | 2 - Compete Rank
       • 284,000,000 - Estimated Unique Monthly Visitors | 24 - Compete Rank
       • 347,000,000 - Estimated Unique Monthly Visitors | 44 - Compete Rank
       • 70,500,000 - Estimated Unique Monthly Visitors | 51 - Compete Rank
       • 343,000,000 - Estimated Unique Monthly Visitors
       • 25,500,000 - Estimated Unique Monthly Visitors | 346 - Compete Rank
       • 20,500,000 - Estimated Unique Monthly Visitors | 605 - Compete Rank
       • 19,500,000 - Estimated Unique Monthly Visitors | 447 - Compete Rank
       • 17,500,000 - Estimated Unique Monthly Visitors | *NA* - Compete Rank
       • 12,500,000 - Estimated Unique Monthly Visitors | 127 - Compete Rank
       • 12,000,000 - Estimated Unique Monthly Visitors | 617 - Compete Rank
       • 7,500,000 - Estimated Unique Monthly Visitors | 838 - Compete Rank
       • 5,400,000 - Estimated Unique Monthly Visitors | 122 - Compete Rank
       • 3,000,000 - Estimated Unique Monthly Visitors | 451 - Compete Rank
       • 2,500,000 - Estimated Unique Monthly Visitors | 1,596 - Compete Rank
  • 8. Problem with Moore’s Law
       • The number of transistors that can be placed on an integrated circuit doubles every 18 months to two years.
       • It is predicted to reach its limit with existing technology in 2020.
       • Cutting the size of a transistor to a single atom may defeat that concept.
       • The Digital Universe is growing much faster than processing power.
  • 9. What Big Data Is and Isn’t
       Computing + Internet = Big Data
       • Big Data is not a new technology
       • Big Data is not just about size
       • Big Data is not Business Intelligence (BI)
       • Big Data is not a solution by itself!
       Is it time to move from Big Data 1 to Big Data 2?
  • 10. Interdisciplinary Subfield of Computer Science
       • Artificial Intelligence
       • Machine Learning
       • Statistics
       • Applied Mathematics
       • Text Mining
       • Database Systems
       • Business Intelligence
       • Computational Linguistics
       • Natural Language Processing (NLP)
       • ….
  • 11. Jobs Derived from Big Data
       • Chief Data Officer
       • Big Data Solution Architect
       • Big Data Platform Engineer
       • Big Data Analyst
       • Big Data Analytics Business Consultant
       • Big Data Software Designer
       • Big Data Consultant
       • Hadoop Architects
       • Consultant Hadoop Developer
       • Senior Analytics Manager
       • Data & Reporting Analyst
       • Analytics Analyst (Big Data)
       Source: Forbes - Where Big Data Jobs Will be in 2015
  • 12. Data-Driven Decision Making (DDD)
       Data-driven decision making (DDD) refers to the practice of basing decisions on the analysis of data rather than purely on intuition.
       Data alone won’t change the world. It’s the people who use data to make better decisions.
  • 13. Data Science Applications
       • Direct Marketing
       • Online Advertising
       • Credit Scoring and Risk Management
       • Help Desk Management
       • Fraud Detection
       • Search Ranking
       • Product Recommendation
       • Predicting Unusual Behavior
       • Customer Retention in Telecom
  • 14. Big Data Management Life-Cycle
       - Apache Hadoop, HDFS, Microsoft Azure, ….
       - Microsoft Analytics Platform System, Excel, R Programming, Python, ….
       - Web Crawling, Data Mining, Information Retrieval, ….
       - Parsing, Indexing, Searching, Ranking, NLP, ….
       Big Data Management involves the Data Science and Data Engineering areas for implementing Data Mining techniques.
  • 15. Big Data Infrastructure
  • 16. Google’s First Data Centers
       Image: Google’s first data center
  • 17. Google’s New Data Centers
       Image: Map of Google Data Centers Worldwide
       450,000 servers draw upwards of 20 megawatts of power, which costs on the order of US$2 million per month in electricity charges.
  • 18. Big Data Terms and Components
       • Microsoft Azure
       • Red Hat GFS - Global File System
       • GoogleFS or GFS - Google File System
       • HDFS - Hadoop Distributed File System
       • SAN - Storage Area Network
       • Google BigTable
       • VFS - Virtual File System
       • IBM GPFS - General Parallel File System
       • HPSS - High Performance Storage System
  • 19. Hadoop Distributed File System
  • 20. Web Crawlers for Web Analytics
       • Indexing
       • Searching
       • Ranking
       • Analysis
       • Crawling is an essential job for all Internet giants: Google, Yahoo, Facebook, etc.
       Some of the available open source crawlers: Apache Nutch, Crawler4j, Bixo, Heritrix, etc.
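To make the indexing, searching, and ranking steps listed in slide 20 concrete, here is a minimal sketch of an inverted index with count-based ranking. The documents, function names, and scoring rule are assumptions chosen for illustration; production crawlers and search engines use far more elaborate pipelines.

```python
from collections import Counter, defaultdict

def build_index(docs):
    """Indexing: map each term to a Counter of {doc_id: occurrences}."""
    index = defaultdict(Counter)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term][doc_id] += 1
    return index

def search(index, query):
    """Searching and ranking: order documents by summed term counts."""
    scores = Counter()
    for term in query.lower().split():
        for doc_id, count in index.get(term, {}).items():
            scores[doc_id] += count
    return [doc_id for doc_id, _ in scores.most_common()]

# Hypothetical crawled pages, invented for this example.
docs = {
    "a": "big data needs big storage",
    "b": "data mining finds patterns",
    "c": "web crawlers collect pages",
}
index = build_index(docs)
print(search(index, "big data"))
```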
  • 21. Web Crawlers for Web Analytics
       • Thanks to crawlers, any website can appear in search results without doing any extra work.
       • Crawling can be customized with meta tags and “robots.txt”.
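As an illustration of how a crawler might honor robots.txt rules, the sketch below uses Python's standard `urllib.robotparser` module. The robots.txt content, the bot name `ExampleBot`, and the example.com URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text that has already been fetched."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser

parser = build_parser(ROBOTS_TXT)
# A polite crawler checks each URL before requesting it.
print(parser.can_fetch("ExampleBot", "https://example.com/index.html"))
print(parser.can_fetch("ExampleBot", "https://example.com/private/report.html"))
```

Parsing from a string keeps the example offline; a real crawler would typically fetch the file from the site first.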
  • 22. Natural Language Processing (NLP)
       • Natural Language Processing (NLP)
       • Computational Linguistics (CL)
       • Machine Translation (MT)
  • 23. Data Mining and Knowledge Discovery
       • Data collection
       • Selection of useful data
       • Data transformation: smoothing, aggregation, normalization
       • Discovery of interesting patterns: classification, clustering, regression, anomaly detection, association
       • Knowledge visualization
       Some of the available open source data mining tools: RapidMiner, RapidAnalytics, OpenNN, Carrot2, KNIME, etc.
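The data transformation step in slide 23 names smoothing, aggregation, and normalization. A minimal sketch of one common variant of each follows (moving average, sum by key, min-max scaling); the function names and sample values are assumptions made for this example, and the slides do not say which variants the author had in mind.

```python
def moving_average(values, window=3):
    """Smoothing: average each run of `window` consecutive values."""
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]

def min_max_normalize(values):
    """Normalization: rescale values linearly into [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:  # all values equal, avoid division by zero
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]

def aggregate_by_key(records):
    """Aggregation: sum the values that share a key."""
    totals = {}
    for key, value in records:
        totals[key] = totals.get(key, 0) + value
    return totals

# Hypothetical daily page-view counts, invented for this example.
views = [10, 20, 30, 40, 50]
print(moving_average(views))
print(min_max_normalize(views))
print(aggregate_by_key([("mon", 3), ("tue", 5), ("mon", 4)]))
```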
  • 24. Quotes on Big Data
       • “You can have data without information, but you cannot have information without data.” – Daniel Keys Moran
       • “War is ninety percent information.” – Napoleon Bonaparte
       • “If you torture the data long enough, it will confess.” – Ronald Coase, Economist
       • “He who would search for pearls must dive below.” – John Dryden