SlideShare a Scribd company logo
What is Data
&
Information
?
Introduction to Analytics
And Architecture
Copyright 2021. Genese Cloud Academy
Data Analytics:
● handling data collected by systems
● generate the insights
● improve decision making
○ with facts based on data.
What is Data Analytics?🤔
Copyright 2021. Genese Cloud Academy
Types of Data Analytics
● Descriptive Analytics
● Diagnostic Analytics
● Predictive Analytics
● Prescriptive Analytics
What Are the Types of Data Analytics?🤔
Benefits Of Data Analytics
Credit card Fraud Detection Customer Personalization Security Threats Detection
Real time Alerting User Behaviour
Financial Modeling
Use Cases:
● Social Media
● Ecommerce
● Information Security
● Logistics
● Factory operations
● Internet of Things
What Might be the Use
Cases of Data Analytics?🤔
Challenges(5 V’s)
Data Analytics Tools 🤔🤔 and Use Cases
Amazon Athena
Interactive analysis
Amazon EMR
Big Data Processing
Amazon Elasticsearch
Operational Analytics
Amazon Kinesis
Real-Time Analysis
Amazon RedShift
Data WareHouse
Amazon QuickSight
Dashboards & Visualizations
Amazon Cloud Watch
monitoring & management
Introduction to High
Performance Computing
HPC Workshop
AWS
Copyright 2021. Genese Cloud Academy
High Speed computing:
● implemented in
○ Super Computers for Scientific Research.
Main Area Of Discipline:
● Developing Parallel Processing Algorithm
● Developing Software
shifted from supercomputers to computing Clusters.
Application Areas of HPC
Climate Modeling
Data Analysis
Drug Discovery
Protein Folding
Energy Research
Why Do We need Ever Increasing Performance?
When do you actually need an HPC ?🤔
● Complete time consuming operations in less time
● Complete an operation under a tight schedule
● Perform a high number of operations per second
What is Streaming Data?
Copyright 2021. Genese Cloud Academy
Streaming Data:
● generated continuously by thousands of data sources
Data Stream:
● Continuous
● Ordered
● Changing
● Fast
● huge amount
Traditional DBMS:
Data stored:
● Finite data sets
● Persistent data sets
Streaming data includes a wide variety of data
● log files
● ecommerce purchases,
● in-game player activity,
● information from social networks,
● financial trading floors,
● geospatial services,
● telemetry from connected devices or instrumentation in data centers.
Core Banking
● Improved Scalability
● Met HA and SHA needs
Online Gaming
● Increased reliability
● Accurate and real time data
● Ability to process data at scale
● Faster ramp time
Government Services
● Near real time events and
better data quality
● Increased efficiency
● Produce and store data
● Better privacy
Financial Services
● Enhanced Customer experience
● Improved fraud detection engine
Real-Time Fraud Detection
● Act in real time
● Detect Fraud
● Minimize risk
● Improve customer experience
Real-Time E-Commerce
● OnBoarding New Merchants Faster
● Enabled 360 view of customers
● Enhanced Performance & Monitoring
● Projected saving of Millions of dollars
Do You know About Amazon GO?
Benefits of streaming data
● Improve operational efficiencies
● Reduce infrastructure cost
● Provide faster insights and actions
How To Design Stream Analytics solution?
Challenges in working with streaming data
Streaming data processing requires two layers:
● a storage layer
○ record ordering
○ strong consistency
■ Fast
■ Inexpensive
■ replayable
● Reads
● Writes
● a processing layer.
○ consuming data from the storage layer
○ running computations on that data
○ notifying the storage layer to delete data that is no longer needed.
● Scalability
● data durability
● fault tolerance
Infrastructure to build streaming data applications:
● Amazon Kinesis Data Streams,
● Amazon Kinesis Data Firehose,
● Amazon Managed Streaming for Apache Kafka (Amazon MSK),
● Apache Flume,
● Apache Spark Streaming,
● Apache Storm.
Working with streaming data
on AWS
Amazon Kinesis is a platform for streaming data on AWS
● load and analyze streaming data
● custom streaming data applications
offers three services:
● Amazon Managed Streaming for Apache Kafka
(Amazon MSK).
● Amazon Kinesis Data Firehose,
● Amazon Kinesis Data Streams
run other streaming data platforms
● Apache Flume,
● Apache Spark Streaming, and
● Apache Storm
○ on Amazon EC2
What is Visualization?🤔🤔🤔
any technique for creating images, diagrams, or animations to communicate a message.
Copyright 2021. Genese Cloud Academy
Why is data visualization important?
Data visualization is a:
● Quick & easy way
○ to convey concepts in a universal manner
○ can experiment with different scenarios by making slight adjustments.
● Identify areas that need attention or improvement.
● Clarify which factors influence customer behavior.
● Help you understand which products to place where.
● Predict sales volumes.
Data visualization tools
business intelligence (BI) reporting tool.
set up visualization tools to:
● generate automatic dashboards that track company performance across key performance indicators
(KPIs)
● visually interpret the results.
Kibana Tableau Grafana QuickSight Power BI
What is Big Data?
Copyright 2021. Genese Cloud Academy
BIG DATA:
● collection of data
huge in volume
growing exponentially
with time.
Types Of Big Data
● Structured
● Unstructured
● Semi-structured
Why Learn Big Data?
● Gartner – Big Data is the new Oil.
● IDC – Its market will be growing 7 times faster than the overall IT market.
● IBM – It is not just a technology – it’s a Business Strategy for capitalizing on information resources.
● IBM – Big Data is the biggest buzz word because technology makes it possible to analyze all the
available data.
Characteristics Of Big Data
Your First Big Data
Application on AWS
Word Count example
Copyright 2021. Genese Cloud Academy
Introduction to Big Data using AWS Services

More Related Content

What's hot

Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingTapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
SingleStore
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Dataconomy Media
 
About Pragmatic Works
About Pragmatic WorksAbout Pragmatic Works
About Pragmatic Works
MILL5
 
Driving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive AnalyticsDriving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive Analytics
SingleStore
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
CCG
 
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...
Tyler Wishnoff
 
Comment transformer vos données en informations exploitables
Comment transformer vos données en informations exploitablesComment transformer vos données en informations exploitables
Comment transformer vos données en informations exploitables
Elasticsearch
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
Impetus Technologies
 
Disrupting Risk Management through Emerging Technologies
Disrupting Risk Management through Emerging TechnologiesDisrupting Risk Management through Emerging Technologies
Disrupting Risk Management through Emerging Technologies
Databricks
 
Opportunities derived by AI
Opportunities derived by AIOpportunities derived by AI
Opportunities derived by AI
Amazon Web Services
 
Pyramid vs QlikView
Pyramid vs QlikViewPyramid vs QlikView
Pyramid vs QlikView
Pyramid Analytics
 
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery PlatformWhy Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
syed_javed
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake
Matillion
 
Scaling Privacy in a Spark Ecosystem
Scaling Privacy in a Spark EcosystemScaling Privacy in a Spark Ecosystem
Scaling Privacy in a Spark Ecosystem
Databricks
 
Attributes of a Modern Data Warehouse - Gartner Catalyst
Attributes of a Modern Data Warehouse - Gartner CatalystAttributes of a Modern Data Warehouse - Gartner Catalyst
Attributes of a Modern Data Warehouse - Gartner Catalyst
Jack Mardack
 
Pyramid Analytics vs Sisense
Pyramid Analytics vs SisensePyramid Analytics vs Sisense
Pyramid Analytics vs Sisense
Pyramid Analytics
 
Infrastructure migration to azure cloud
Infrastructure migration to azure cloudInfrastructure migration to azure cloud
Infrastructure migration to azure cloud
Alletec
 
Why HR Should Consider Agile Modern Data Delivery Platform
Why HR Should Consider Agile Modern Data Delivery PlatformWhy HR Should Consider Agile Modern Data Delivery Platform
Why HR Should Consider Agile Modern Data Delivery Platform
syed_javed
 
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEBig Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Matt Stubbs
 

What's hot (20)

Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingTapjoy: Building a Real-Time Data Science Service for Mobile Advertising
Tapjoy: Building a Real-Time Data Science Service for Mobile Advertising
 
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
 
About Pragmatic Works
About Pragmatic WorksAbout Pragmatic Works
About Pragmatic Works
 
Analytics in the Cloud
Analytics in the CloudAnalytics in the Cloud
Analytics in the Cloud
 
Driving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive AnalyticsDriving the On-Demand Economy with Spark and Predictive Analytics
Driving the On-Demand Economy with Spark and Predictive Analytics
 
Analytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual WorkshopAnalytics in a Day Ft. Synapse Virtual Workshop
Analytics in a Day Ft. Synapse Virtual Workshop
 
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...
 
Comment transformer vos données en informations exploitables
Comment transformer vos données en informations exploitablesComment transformer vos données en informations exploitables
Comment transformer vos données en informations exploitables
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
 
Disrupting Risk Management through Emerging Technologies
Disrupting Risk Management through Emerging TechnologiesDisrupting Risk Management through Emerging Technologies
Disrupting Risk Management through Emerging Technologies
 
Opportunities derived by AI
Opportunities derived by AIOpportunities derived by AI
Opportunities derived by AI
 
Pyramid vs QlikView
Pyramid vs QlikViewPyramid vs QlikView
Pyramid vs QlikView
 
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery PlatformWhy Business Intelligence Should Consider Agile Modern Data Delivery Platform
Why Business Intelligence Should Consider Agile Modern Data Delivery Platform
 
Master the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - SnowflakeMaster the Multi-Clustered Data Warehouse - Snowflake
Master the Multi-Clustered Data Warehouse - Snowflake
 
Scaling Privacy in a Spark Ecosystem
Scaling Privacy in a Spark EcosystemScaling Privacy in a Spark Ecosystem
Scaling Privacy in a Spark Ecosystem
 
Attributes of a Modern Data Warehouse - Gartner Catalyst
Attributes of a Modern Data Warehouse - Gartner CatalystAttributes of a Modern Data Warehouse - Gartner Catalyst
Attributes of a Modern Data Warehouse - Gartner Catalyst
 
Pyramid Analytics vs Sisense
Pyramid Analytics vs SisensePyramid Analytics vs Sisense
Pyramid Analytics vs Sisense
 
Infrastructure migration to azure cloud
Infrastructure migration to azure cloudInfrastructure migration to azure cloud
Infrastructure migration to azure cloud
 
Why HR Should Consider Agile Modern Data Delivery Platform
Why HR Should Consider Agile Modern Data Delivery PlatformWhy HR Should Consider Agile Modern Data Delivery Platform
Why HR Should Consider Agile Modern Data Delivery Platform
 
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEBig Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
 

Similar to Introduction to Big Data using AWS Services

Integrating Web and Business Data
Integrating Web and Business DataIntegrating Web and Business Data
Integrating Web and Business Data
Safe Software
 
Analyzing Real-time Streaming Data with Amazon Kinesis
Analyzing Real-time Streaming Data with Amazon KinesisAnalyzing Real-time Streaming Data with Amazon Kinesis
Analyzing Real-time Streaming Data with Amazon Kinesis
Amazon Web Services
 
Accelerate ML Deployment with H2O Driverless AI on AWS
Accelerate ML Deployment with H2O Driverless AI on AWSAccelerate ML Deployment with H2O Driverless AI on AWS
Accelerate ML Deployment with H2O Driverless AI on AWS
Sri Ambati
 
Fast Data at ING – the why, what and how of the streaming analytics platform ...
Fast Data at ING – the why, what and how of the streaming analytics platform ...Fast Data at ING – the why, what and how of the streaming analytics platform ...
Fast Data at ING – the why, what and how of the streaming analytics platform ...
Bas Geerdink
 
Analytics in Your Enterprise
Analytics in Your EnterpriseAnalytics in Your Enterprise
Analytics in Your Enterprise
WSO2
 
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Lviv Startup Club
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
Big Data User Group Karlsruhe/Stuttgart
 
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward
 
Using Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemUsing Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation System
VMware Tanzu
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4Janani Eshwaran
 
Globant and Big Data on AWS
Globant and Big Data on AWSGlobant and Big Data on AWS
Globant and Big Data on AWS
Amazon Web Services LATAM
 
16h00 globant - aws globant-big-data_summit2012
16h00   globant - aws globant-big-data_summit201216h00   globant - aws globant-big-data_summit2012
16h00 globant - aws globant-big-data_summit2012infolive
 
Integrating Web and Business Data
Integrating Web and Business DataIntegrating Web and Business Data
Integrating Web and Business Data
Sterling Geo
 
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Amazon Web Services
 
Big Data for Smart City
Big Data for Smart CityBig Data for Smart City
Big Data for Smart CityKoltiva
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
deteo
 
OpenSymmetry - Business Intelligence Maturity
OpenSymmetry - Business Intelligence MaturityOpenSymmetry - Business Intelligence Maturity
OpenSymmetry - Business Intelligence Maturity
OpenSymmetry
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
tsigitnist02
 
Cloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastCloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and Fast
Databricks
 

Similar to Introduction to Big Data using AWS Services (20)

Integrating Web and Business Data
Integrating Web and Business DataIntegrating Web and Business Data
Integrating Web and Business Data
 
Analyzing Real-time Streaming Data with Amazon Kinesis
Analyzing Real-time Streaming Data with Amazon KinesisAnalyzing Real-time Streaming Data with Amazon Kinesis
Analyzing Real-time Streaming Data with Amazon Kinesis
 
Accelerate ML Deployment with H2O Driverless AI on AWS
Accelerate ML Deployment with H2O Driverless AI on AWSAccelerate ML Deployment with H2O Driverless AI on AWS
Accelerate ML Deployment with H2O Driverless AI on AWS
 
Fast Data at ING – the why, what and how of the streaming analytics platform ...
Fast Data at ING – the why, what and how of the streaming analytics platform ...Fast Data at ING – the why, what and how of the streaming analytics platform ...
Fast Data at ING – the why, what and how of the streaming analytics platform ...
 
Analytics in Your Enterprise
Analytics in Your EnterpriseAnalytics in Your Enterprise
Analytics in Your Enterprise
 
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
Vitalii Bondarenko and Eugene Berko "Cloud AI Platform as an accelerator of e...
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...
 
Using Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation SystemUsing Data Science to Build an End-to-End Recommendation System
Using Data Science to Build an End-to-End Recommendation System
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
2016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V42016 DSG Webinar Azure HDInsight 2 V4
2016 DSG Webinar Azure HDInsight 2 V4
 
Globant and Big Data on AWS
Globant and Big Data on AWSGlobant and Big Data on AWS
Globant and Big Data on AWS
 
16h00 globant - aws globant-big-data_summit2012
16h00   globant - aws globant-big-data_summit201216h00   globant - aws globant-big-data_summit2012
16h00 globant - aws globant-big-data_summit2012
 
Integrating Web and Business Data
Integrating Web and Business DataIntegrating Web and Business Data
Integrating Web and Business Data
 
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...
 
Big Data for Smart City
Big Data for Smart CityBig Data for Smart City
Big Data for Smart City
 
Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise Deteo. Data science, Big Data expertise
Deteo. Data science, Big Data expertise
 
OpenSymmetry - Business Intelligence Maturity
OpenSymmetry - Business Intelligence MaturityOpenSymmetry - Business Intelligence Maturity
OpenSymmetry - Business Intelligence Maturity
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 
Cloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and FastCloud Experience: Data-driven Applications Made Simple and Fast
Cloud Experience: Data-driven Applications Made Simple and Fast
 

Recently uploaded

Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
PedroFerreira53928
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
Steve Thomason
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
Vivekanand Anglo Vedic Academy
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
AzmatAli747758
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
Jisc
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
Excellence Foundation for South Sudan
 

Recently uploaded (20)

Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
Basic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumersBasic phrases for greeting and assisting costumers
Basic phrases for greeting and assisting costumers
 
The Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve ThomasonThe Art Pastor's Guide to Sabbath | Steve Thomason
The Art Pastor's Guide to Sabbath | Steve Thomason
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Sectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdfSectors of the Indian Economy - Class 10 Study Notes pdf
Sectors of the Indian Economy - Class 10 Study Notes pdf
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
The approach at University of Liverpool.pptx
The approach at University of Liverpool.pptxThe approach at University of Liverpool.pptx
The approach at University of Liverpool.pptx
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 

Introduction to Big Data using AWS Services

  • 2.
  • 3.
  • 4.
  • 5.
  • 6.
  • 7. Introduction to Analytics And Architecture Copyright 2021. Genese Cloud Academy
  • 8. Data Analytics: ● handling data collected by systems ● generate the insights ● improve decision making ○ with facts based on data. What is Data Analytics?🤔 Copyright 2021. Genese Cloud Academy
  • 9. Types of Data Analytics ● Descriptive Analytics ● Diagnostic Analytics ● Predictive Analytics ● Prescriptive Analytics What Are the Types of Data Analytics?🤔
  • 10. Benefits Of Data Analytics Credit card Fraud Detection Customer Personalization Security Threats Detection Real time Alerting User Behaviour Financial Modeling
  • 11. Use Cases: ● Social Media ● Ecommerce ● Information Security ● Logistics ● Factory operations ● Internet of Things What Might be the Use Cases of Data Analytics?🤔
  • 13. Data Analytics Tools 🤔🤔 and Use Cases Amazon Athena Interactive analysis Amazon EMR Big Data Processing Amazon Elasticsearch Operational Analytics Amazon Kinesis Real-Time Analysis Amazon RedShift Data WareHouse Amazon QuickSight Dashboards & Visualizations Amazon Cloud Watch monitoring & management
  • 14.
  • 15. Introduction to High Performance Computing HPC Workshop AWS
  • 16. Copyright 2021. Genese Cloud Academy
  • 17.
  • 18. High Speed computing: ● implemented in ○ Super Computers for Scientific Research. Main Area Of Discipline: ● Developing Parallel Processing Algorithm ● Developing Software shifted from supercomputers to computing Clusters. Application Areas of HPC
  • 19. Climate Modeling Data Analysis Drug Discovery Protein Folding Energy Research Why Do We need Ever Increasing Performance?
  • 20. When do you actually need an HPC ?🤔 ● Complete time consuming operations in less time ● Complete an operation under a tight schedule ● Perform a high number of operations per second
  • 21. What is Streaming Data? Copyright 2021. Genese Cloud Academy
  • 22. Streaming Data: ● generated continuously by thousands of data sources Data Stream: ● Continuous ● Ordered ● Changing ● Fast ● huge amount Traditional DBMS: Data stored: ● Finite data sets ● Persistent data sets Streaming data includes a wide variety of data ● log files ● ecommerce purchases, ● in-game player activity, ● information from social networks, ● financial trading floors, ● geospatial services, ● telemetry from connected devices or instrumentation in data centers.
  • 23. Core Banking ● Improved Scalability ● Met HA and SHA needs Online Gaming ● Increased reliability ● Accurate and real time data ● Ability to process data at scale ● Faster ramp time
  • 24. Government Services ● Near real time events and better data quality ● Increased efficiency ● Produce and store data ● Better privacy Financial Services ● Enhanced Customer experience ● Improved fraud detection engine
  • 25. Real-Time Fraud Detection ● Act in real time ● Detect Fraud ● Minimize risk ● Improve customer experience Real-Time E-Commerce ● OnBoarding New Merchants Faster ● Enabled 360 view of customers ● Enhanced Performance & Monitoring ● Projected saving of Millions of dollars Do You know About Amazon GO?
  • 26.
  • 27. Benefits of streaming data ● Improve operational efficiencies ● Reduce infrastructure cost ● Provide faster insights and actions
  • 28. How To Design Stream Analytics solution?
  • 29. Challenges in working with streaming data Streaming data processing requires two layers: ● a storage layer ○ record ordering ○ strong consistency ■ Fast ■ Inexpensive ■ replayable ● Reads ● Writes ● a processing layer. ○ consuming data from the storage layer ○ running computations on that data ○ notifying the storage layer to delete data that is no longer needed. ● Scalability ● data durability ● fault tolerance
  • 30. Infrastructure to build streaming data applications: ● Amazon Kinesis Data Streams, ● Amazon Kinesis Data Firehose, ● Amazon Managed Streaming for Apache Kafka (Amazon MSK), ● Apache Flume, ● Apache Spark Streaming, ● Apache Storm.
  • 31.
  • 32. Working with streaming data on AWS Amazon Kinesis is a platform for streaming data on AWS ● load and analyze streaming data ● custom streaming data applications offers three services: ● Amazon Managed Streaming for Apache Kafka (Amazon MSK). ● Amazon Kinesis Data Firehose, ● Amazon Kinesis Data Streams run other streaming data platforms ● Apache Flume, ● Apache Spark Streaming, and ● Apache Storm ○ on Amazon EC2
  • 33. What is Visualization?🤔🤔🤔 any technique for creating images, diagrams, or animations to communicate a message. Copyright 2021. Genese Cloud Academy
  • 34.
  • 35. Why is data visualization important? Data visualization is a: ● Quick & easy way ○ to convey concepts in a universal manner ○ can experiment with different scenarios by making slight adjustments. ● Identify areas that need attention or improvement. ● Clarify which factors influence customer behavior. ● Help you understand which products to place where. ● Predict sales volumes.
  • 36. Data visualization tools business intelligence (BI) reporting tool. set up visualization tools to: ● generate automatic dashboards that track company performance across key performance indicators (KPIs) ● visually interpret the results. Kibana Tableau Grafana QuickSight Power BI
  • 37. What is Big Data? Copyright 2021. Genese Cloud Academy
  • 38. BIG DATA: ● collection of data huge in volume growing exponentially with time. Types Of Big Data ● Structured ● Unstructured ● Semi-structured
  • 39. Why Learn Big Data? ● Gartner – Big Data is the new Oil. ● IDC – Its market will be growing 7 times faster than the overall IT market. ● IBM – It is not just a technology – it’s a Business Strategy for capitalizing on information resources. ● IBM – Big Data is the biggest buzz word because technology makes it possible to analyze all the available data.
  • 41.
  • 42. Your First Big Data Application on AWS Word Count example Copyright 2021. Genese Cloud Academy

Editor's Notes

  1. Data Analytics is Vital to every Business. -helps decision makers (based on analytics and data) -critical tasks (launching New Product,Offering Discounts,marketing New Areas) requires time sensitive decision and experience. Organizations spend millions of dollars on data storage. The problem isn’t finding the data — the problem is failing to do anything with it, AWS
  2. Volume : total amount of data that is coming in and will be ingested into the system. Velocity : speed at which data is flowing in, the challenge consists of processing the data in near real-time and return results as quickly as possible. validating a credit card transaction must instantaneous (near real-time) Variety : Data to be ingested in the system can have different formats,. Veracity : accuracy of incoming data. Value :Decisions makers seek to extract meaningful information and insights from systems to have a competitive edge.
  3. Area of discipline can be divided into small independent parts and can be executed simultaneously by separate processors.
  4. Bank Pubg
  5. Amazon Go LInk to video: https://youtu.be/NrmMk1Myrxc
  6. applying machine learning algorithms, and extract deeper insights from the data. Over time, complex, stream and event processing algorithms, like decaying time windows to find the most recent popular movies, are applied, further enriching the insights.
  7. applying machine learning algorithms, and extract deeper insights from the data. Over time, complex, stream and event processing algorithms, like decaying time windows to find the most recent popular movies, are applied, further enriching the insights.
  8. Drift : The ability to detect and adapt to changes in the distribution of examples is paramount for data stream mining algorithms One pass:a one-pass or single-pass is a streaming algorithm which reads its input exactly once. It does so by processing items in order, without unbounded buffering; it reads a block into an input buffer, processes it, and moves the result into an output buffer for each step in the process. Real-time: Streaming data is data that is continuously generated and delivered rather than processed in batches or micro-batches. ... The terms “real-time” and “stream” converge in “real-time stream processing” to describe streams of real-time data that are gathered and processed as they are generated. Bound data is finite and unchanging data, where everything is known about the set of data. Typically Bound data has a known ending point and is relatively fixed.
  9. applying machine learning algorithms, and extract deeper insights from the data. Over time, complex, stream and event processing algorithms, like decaying time windows to find the most recent popular movies, are applied, further enriching the insights.
  10. deploy and manage your own streaming data solution in the cloud on Amazon EC2.
  11. Because of the way the human brain processes information, using charts or graphs to visualize large amounts of complex data is easier than poring over spreadsheets or reports. Data visualization is a quick, easy way to convey concepts in a universal manner – and can experiment with different scenarios by making slight adjustments.
  12. data with so large size and complexity that none of traditional data management tools can store it or process it efficiently. Big data is also a data but with huge size.