SlideShare a Scribd company logo
1 of 15
Download to read offline
Notes
Workshop
AGENDA
BIG DATA !
BIG DATA: HYPE OR REALITY?
DEEP DIVE INTO THE INFRASTRUCTURE
BIG DATA SCENARIO A 2 Z
DATA ANALYTICS
DATA VISUALIZATION
EMOTION INTELIGENCE
WORD EMBEDDING IN NLP
DEEP LEARNING IN AUTONOMOUS CAR
PREDICTION MODELS IN OIL AND GAS
MICROSOFT AZURE
5G: IMT 2020
This Document Includes lecture/workshop notes regarding BIG
DATA SCIENCE workshop at NTI 6-7th of Dec 2017
https://www.linkedin.com/in/mrastro
BIGDATA !
Not About How BIG it is!
https://www.linkedin.com/in/mrastro
“Big data is a term that describes the large volume of data – both structured
and unstructured – that inundates a business on a day-to-day basis. But it’s not
the amount of data that’s important. It’s what organizations do with the data
that matters. Big data can be analyzed for insights that lead to better decisions
and strategic business moves.” 2
Definition
“Big data is about looking ahead, beyond
what everybody else sees.” 1
Peter Sondergaard, senior vice president and global head of research at Gartner
Although there’s no fixed number marking the beginning of “big”, we’re talking much bigger
than conventional tools like spreadsheets and relational databases can handle easily. Many
case studies of big data involve datasets of many petabytes—or even exabytes—
made possible only by using high-performance cloud-based computing.
Many big-data applications, such as cancer research, use historical data, but much attention
is being paid to how to leverage real-time data—not just collected in real time, but processed
and accessed in real time too. In many scenarios, users must be able to ask questions
iteratively and get answers in minutes, not days.
Big data covers not just “structured” data neatly normalized into a fixed schema
and exported from ERP or CRM systems. It also includes semi-structured data,
(which, although it has no fixed configuration, is categorized using tags or other
metadata) and unstructured data, such as email messages and videos.
MOST DEFINITIONS OF BIG DATA AGREE THAT IT INVOLVES THE “THREE VS” 4
Any technology is only useful if it solves a problem (or problems).
As we all know, there is data, lots of it: historical data, sure, but also new
data generated from social media apps, click stream data from web
applications, IoT sensor data, and on and on. The amount of data is larger
than ever, coming in at ever-increasing rates, and in many different formats.
3
The
Problem
Gartner published earlier this year 2017 5 on emerging technologies.
They mention Many of the emerging technologies, including virtual personal
assistants, machine learning, the IoT, and M2M, use data to track performance and
generate big data to define success.
A closer look to the peak, we can see IoT, machine/deep learning with about 2-5 years to
diverse (expected between 2020-22) which creates a world of connectivity
And HINT
The Connected World Amplifies Big Data AND ITS EXISTENCE EVERYWHERE
A DEEP DIVE INTO INFRASTRUCTURE
Traditional Data Management Systems [6]
SHARED I/O
SHARED PROCESSING
LIMITED SCALABILITY
SERVICE BOTTLENECKS
HIGH COST FACTOR
Abstraction of BIG DATA Platform [6]
PARALLEL PROCESSING
LINEAR SCALABILITY
DISTRIBUTED SERVICE
LOW COST FACTOR
Notes: The Main Key Advantages of Distributed Systems are being Software Defined
where cluster is optimized for software execution (e.g Hadoop). Files/DataSet can be
split in to segments and can be distributed across different nodes (Worker Nodes )
within the network to be processed in parallel which in turn gives more performance.
Reliability and Capability for to be upgradable where more resources can be added
easily, this also reduces the cost factor.
SHARED NOTHING
Notes: For any Big data File, Slice the File into blocks then those blocks will be spreaded into
the available worker nodes. Hint: n nodes (They are not necessary to be physical nodes but
we can deploy n-physical node with m-vm (virtual nodes/machines) to act finally as a single
Cluster. Hint: each node takes one or more block (depending on the size)
SCENARIO
Selecting a Modeling Technique [6]
DEVELOP YOUR USE CASE [6]
“Formulate a Data-Driven Use Case
Hi-level description and objectives of the use case
Challenges addressed by the use case
Pain points and impact of each challenge
Goals, success criteria, constraints and assumptions
Available data, data sources and required resources
Modeling approach for each challenge
Overall model structure & workflow
Application of the use case into operational solution”
STRUCTURED DATA [6]
“Commonly refers to Database Tables with well defined columns structure including
data types and specifications It might also include other non-database managed
formats like OLAP Cubes, csv files and fixed column files as long as they are
consistently generated. i.e. exported from database, generated by ATM
machine…etc”
UNSTRUCTURED DATA [6]
“Data NOT following well defined structure either because of the nature of data
generation or the nature of the data format. Most of the data generated around the
globe is unstructured data with different degree:
Semi-structured: XML log files, HTML content
Quasi-structured: query strings in websites URLs, log events/alerts
Unstructured: text, pdf, word, social feeds, web content, images, video”
Img src: http://bigdata.black/infrastructure/storage/unstructured-data
“Unfortunately, it’s often very difficult to analyze unstructured data. To help with the
problem, organizations have turned to a number of different software solutions designed
to search unstructured data and extract important information. The primary benefit of
these tools is the ability to glean actionable information that can help a business succeed
in a competitive environment. Because the volume of unstructured data is growing so
rapidly, many enterprises also turn to technological solutions to help them better manage
and store their unstructured data. These can include hardware or software solutions that
enable them to make the most efficient use of their available storage space. “ [7]
DESCRIPTIVE ANALYTICS -PRACTICAL TOUR
AUTOMOTIVE INDUSTRY
DEEP LEARNING IN AUTONOMOUS CAR
Self-Driving Cars to the 2020 Tokyo Olympics
Telecom: Case Study
Leveraging Data to better satisfy Understand Customers
needs ,Churn prevention
Monitor and Visualize all kind of site and services Alarms,
solve KPIs problems, and predict insights almost in realtime
Predictive Maintenance
Hisham Arafat
Digital Transformation Lead Consultant Solutions
Architect, Technology Strategist & Researcher
Linkedin
Thanks to
References
1:Gartner Says Big Data Creates Big Jobs
2:SaaS-Big Data! What it is and why it matters
3:IBM-What is big data? More than volume, velocity and variety
4:Verizon-BIG DATA: HYPE OR REALITY?
5:Top Trends in the Gartner Hype Cycle for Emerging Technologies, 2017
6: Digital Transformation Industry Perspective, Eng.Hisham
7:Unstructured Data: BIGDATA
https://www.linkedin.com/in/mrastro

More Related Content

What's hot

Dcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonDcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonAlan Morrison
 
Business_Analytics_Presentation_Luke_Caratan
Business_Analytics_Presentation_Luke_CaratanBusiness_Analytics_Presentation_Luke_Caratan
Business_Analytics_Presentation_Luke_CaratanLuke Caratan
 
Operationalize Your Linked Data
Operationalize Your Linked DataOperationalize Your Linked Data
Operationalize Your Linked DataMatt Turner
 
Big Data Trends - WorldFuture 2015 Conference
Big Data Trends - WorldFuture 2015 ConferenceBig Data Trends - WorldFuture 2015 Conference
Big Data Trends - WorldFuture 2015 ConferenceDavid Feinleib
 
Towards a big data roadmap for europe
Towards a big data roadmap for europeTowards a big data roadmap for europe
Towards a big data roadmap for europeBIG Project
 
Data Mining And Visualization of Large Databases
Data Mining And Visualization of Large DatabasesData Mining And Visualization of Large Databases
Data Mining And Visualization of Large DatabasesCSCJournals
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesT.S. Lim
 
BIG DATA(PPT)
BIG DATA(PPT)BIG DATA(PPT)
BIG DATA(PPT)josnapv
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONijmnct
 
What is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsWhat is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsShilpaKrishna6
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public CloudIMC Institute
 
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of thingsBig Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of thingsRamakant Gawande
 
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology ChallengesData Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology ChallengesBooz Allen Hamilton
 
Enabling Cloud Analytics with Data-Level Security
Enabling Cloud Analytics with Data-Level SecurityEnabling Cloud Analytics with Data-Level Security
Enabling Cloud Analytics with Data-Level SecurityBooz Allen Hamilton
 

What's hot (20)

Big Data
Big DataBig Data
Big Data
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Dcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonDcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrison
 
Business_Analytics_Presentation_Luke_Caratan
Business_Analytics_Presentation_Luke_CaratanBusiness_Analytics_Presentation_Luke_Caratan
Business_Analytics_Presentation_Luke_Caratan
 
Operationalize Your Linked Data
Operationalize Your Linked DataOperationalize Your Linked Data
Operationalize Your Linked Data
 
Big Data Trends - WorldFuture 2015 Conference
Big Data Trends - WorldFuture 2015 ConferenceBig Data Trends - WorldFuture 2015 Conference
Big Data Trends - WorldFuture 2015 Conference
 
Towards a big data roadmap for europe
Towards a big data roadmap for europeTowards a big data roadmap for europe
Towards a big data roadmap for europe
 
Data Mining And Visualization of Large Databases
Data Mining And Visualization of Large DatabasesData Mining And Visualization of Large Databases
Data Mining And Visualization of Large Databases
 
Big Data Trends
Big Data TrendsBig Data Trends
Big Data Trends
 
Applications of Big Data Analytics in Businesses
Applications of Big Data Analytics in BusinessesApplications of Big Data Analytics in Businesses
Applications of Big Data Analytics in Businesses
 
Visual Data Mining
Visual Data MiningVisual Data Mining
Visual Data Mining
 
BIG DATA(PPT)
BIG DATA(PPT)BIG DATA(PPT)
BIG DATA(PPT)
 
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATIONBRIDGING DATA SILOS USING BIG DATA INTEGRATION
BRIDGING DATA SILOS USING BIG DATA INTEGRATION
 
Fraud and Risk in Big Data
Fraud and Risk in Big DataFraud and Risk in Big Data
Fraud and Risk in Big Data
 
What is big data ? | Big Data Applications
What is big data ? | Big Data ApplicationsWhat is big data ? | Big Data Applications
What is big data ? | Big Data Applications
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public Cloud
 
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of thingsBig Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
Big Data & Future - Big Data, Analytics, Cloud, SDN, Internet of things
 
"Big Data Dreams"
"Big Data Dreams""Big Data Dreams"
"Big Data Dreams"
 
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology ChallengesData Lake-based Approaches to Regulatory-Driven Technology Challenges
Data Lake-based Approaches to Regulatory-Driven Technology Challenges
 
Enabling Cloud Analytics with Data-Level Security
Enabling Cloud Analytics with Data-Level SecurityEnabling Cloud Analytics with Data-Level Security
Enabling Cloud Analytics with Data-Level Security
 

Similar to Big Data Science Workshop Documentation V1.0

IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET Journal
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A ReviewIRJET Journal
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to knowJane Brewer
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big dataDigimark
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A StudyIRJET Journal
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big DataIRJET Journal
 
Notes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdfNotes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdfKarishma Chaudhary
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET Journal
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigDataValarmathi V
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analyticsAhmed Banafa
 
Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...
Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...
Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...YogeshIJTSRD
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAgeFriendlyEconomy
 
Real callenges in big data security
Real callenges in big data securityReal callenges in big data security
Real callenges in big data securitybalasahebcomp
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...IT Support Engineer
 

Similar to Big Data Science Workshop Documentation V1.0 (20)

IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
IRJET- A Scrutiny on Research Analysis of Big Data Analytical Method and Clou...
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A Review
 
Complete-SRS.doc
Complete-SRS.docComplete-SRS.doc
Complete-SRS.doc
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to know
 
Big Data.pdf
Big Data.pdfBig Data.pdf
Big Data.pdf
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big data
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A Study
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Notes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdfNotes on Current trends in IT (1) (1).pdf
Notes on Current trends in IT (1) (1).pdf
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
The ABCs of Big Data
The ABCs of Big DataThe ABCs of Big Data
The ABCs of Big Data
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
The future of big data analytics
The future of big data analyticsThe future of big data analytics
The future of big data analytics
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...
Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...
Cloud Analytics Ability to Design, Build, Secure, and Maintain Analytics Solu...
 
Age Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big DataAge Friendly Economy - Introduction to Big Data
Age Friendly Economy - Introduction to Big Data
 
Real callenges in big data security
Real callenges in big data securityReal callenges in big data security
Real callenges in big data security
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 

Recently uploaded

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxvipinkmenon1
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 

Recently uploaded (20)

College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Introduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptxIntroduction to Microprocesso programming and interfacing.pptx
Introduction to Microprocesso programming and interfacing.pptx
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 

Big Data Science Workshop Documentation V1.0

  • 2. AGENDA BIG DATA ! BIG DATA: HYPE OR REALITY? DEEP DIVE INTO THE INFRASTRUCTURE BIG DATA SCENARIO A 2 Z DATA ANALYTICS DATA VISUALIZATION EMOTION INTELIGENCE WORD EMBEDDING IN NLP DEEP LEARNING IN AUTONOMOUS CAR PREDICTION MODELS IN OIL AND GAS MICROSOFT AZURE 5G: IMT 2020 This Document Includes lecture/workshop notes regarding BIG DATA SCIENCE workshop at NTI 6-7th of Dec 2017 https://www.linkedin.com/in/mrastro
  • 3. BIGDATA ! Not About How BIG it is! https://www.linkedin.com/in/mrastro
  • 4. “Big data is a term that describes the large volume of data – both structured and unstructured – that inundates a business on a day-to-day basis. But it’s not the amount of data that’s important. It’s what organizations do with the data that matters. Big data can be analyzed for insights that lead to better decisions and strategic business moves.” 2 Definition “Big data is about looking ahead, beyond what everybody else sees.” 1 Peter Sondergaard, senior vice president and global head of research at Gartner Although there’s no fixed number marking the beginning of “big”, we’re talking much bigger than conventional tools like spreadsheets and relational databases can handle easily. Many case studies of big data involve datasets of many petabytes—or even exabytes— made possible only by using high-performance cloud-based computing. Many big-data applications, such as cancer research, use historical data, but much attention is being paid to how to leverage real-time data—not just collected in real time, but processed and accessed in real time too. In many scenarios, users must be able to ask questions iteratively and get answers in minutes, not days. Big data covers not just “structured” data neatly normalized into a fixed schema and exported from ERP or CRM systems. It also includes semi-structured data, (which, although it has no fixed configuration, is categorized using tags or other metadata) and unstructured data, such as email messages and videos. MOST DEFINITIONS OF BIG DATA AGREE THAT IT INVOLVES THE “THREE VS” 4 Any technology is only useful if it solves a problem (or problems). As we all know, there is data, lots of it: historical data, sure, but also new data generated from social media apps, click stream data from web applications, IoT sensor data, and on and on. The amount of data is larger than ever, coming in at ever-increasing rates, and in many different formats. 3 The Problem
  • 5. Gartner published earlier this year 2017 5 on emerging technologies. They mention Many of the emerging technologies, including virtual personal assistants, machine learning, the IoT, and M2M, use data to track performance and generate big data to define success. A closer look to the peak, we can see IoT, machine/deep learning with about 2-5 years to diverse (expected between 2020-22) which creates a world of connectivity And HINT The Connected World Amplifies Big Data AND ITS EXISTENCE EVERYWHERE
  • 6. A DEEP DIVE INTO INFRASTRUCTURE
  • 7. Traditional Data Management Systems [6] SHARED I/O SHARED PROCESSING LIMITED SCALABILITY SERVICE BOTTLENECKS HIGH COST FACTOR Abstraction of BIG DATA Platform [6] PARALLEL PROCESSING LINEAR SCALABILITY DISTRIBUTED SERVICE LOW COST FACTOR Notes: The Main Key Advantages of Distributed Systems are being Software Defined where cluster is optimized for software execution (e.g Hadoop). Files/DataSet can be split in to segments and can be distributed across different nodes (Worker Nodes ) within the network to be processed in parallel which in turn gives more performance. Reliability and Capability for to be upgradable where more resources can be added easily, this also reduces the cost factor. SHARED NOTHING Notes: For any Big data File, Slice the File into blocks then those blocks will be spreaded into the available worker nodes. Hint: n nodes (They are not necessary to be physical nodes but we can deploy n-physical node with m-vm (virtual nodes/machines) to act finally as a single Cluster. Hint: each node takes one or more block (depending on the size) SCENARIO
  • 8. Selecting a Modeling Technique [6] DEVELOP YOUR USE CASE [6] “Formulate a Data-Driven Use Case Hi-level description and objectives of the use case Challenges addressed by the use case Pain points and impact of each challenge Goals, success criteria, constraints and assumptions Available data, data sources and required resources Modeling approach for each challenge Overall model structure & workflow Application of the use case into operational solution”
  • 9. STRUCTURED DATA [6] “Commonly refers to Database Tables with well defined columns structure including data types and specifications It might also include other non-database managed formats like OLAP Cubes, csv files and fixed column files as long as they are consistently generated. i.e. exported from database, generated by ATM machine…etc” UNSTRUCTURED DATA [6] “Data NOT following well defined structure either because of the nature of data generation or the nature of the data format. Most of the data generated around the globe is unstructured data with different degree: Semi-structured: XML log files, HTML content Quasi-structured: query strings in websites URLs, log events/alerts Unstructured: text, pdf, word, social feeds, web content, images, video” Img src: http://bigdata.black/infrastructure/storage/unstructured-data “Unfortunately, it’s often very difficult to analyze unstructured data. To help with the problem, organizations have turned to a number of different software solutions designed to search unstructured data and extract important information. The primary benefit of these tools is the ability to glean actionable information that can help a business succeed in a competitive environment. Because the volume of unstructured data is growing so rapidly, many enterprises also turn to technological solutions to help them better manage and store their unstructured data. These can include hardware or software solutions that enable them to make the most efficient use of their available storage space. “ [7]
  • 11. AUTOMOTIVE INDUSTRY DEEP LEARNING IN AUTONOMOUS CAR Self-Driving Cars to the 2020 Tokyo Olympics
  • 12. Telecom: Case Study Leveraging Data to better satisfy Understand Customers needs ,Churn prevention Monitor and Visualize all kind of site and services Alarms, solve KPIs problems, and predict insights almost in realtime Predictive Maintenance
  • 13.
  • 14. Hisham Arafat Digital Transformation Lead Consultant Solutions Architect, Technology Strategist & Researcher Linkedin Thanks to
  • 15. References 1:Gartner Says Big Data Creates Big Jobs 2:SaaS-Big Data! What it is and why it matters 3:IBM-What is big data? More than volume, velocity and variety 4:Verizon-BIG DATA: HYPE OR REALITY? 5:Top Trends in the Gartner Hype Cycle for Emerging Technologies, 2017 6: Digital Transformation Industry Perspective, Eng.Hisham 7:Unstructured Data: BIGDATA https://www.linkedin.com/in/mrastro