This document provides an introduction to key concepts in data analytics including data, information, the types of data analytics, benefits and use cases of data analytics, challenges, and common data analytics tools. It also covers related topics like streaming data, data visualization, and big data.
AWS is hosting the first FSI Cloud Symposium in Hong Kong, which will take place on Thursday, March 23, 2017 at Grand Hyatt Hotel. The event will bring together FSI customers, industry professional and AWS experts, to explore how to turn the dream of transformation, innovation and acceleration into reality by exploiting Cloud, Voice to Text and IoT technologies. The packed agenda includes expert sessions on a host of pressing issues, such as security and compliance, as well as customer experience sharing on how cloud computing is benefiting the industry.
Speaker: Lijia Xu, Big Data Practice Lead, Professional Services, AWS
Modernizing to a Cloud Data ArchitectureDatabricks
Organizations with on-premises Hadoop infrastructure are bogged down by system complexity, unscalable infrastructure, and the increasing burden on DevOps to manage legacy architectures. Costs and resource utilization continue to go up while innovation has flatlined. In this session, you will learn why, now more than ever, enterprises are looking for cloud alternatives to Hadoop and are migrating off of the architecture in large numbers. You will also learn how elastic compute models’ benefits help one customer scale their analytics and AI workloads and best practices from their experience on a successful migration of their data and workloads to the cloud.
Transforming data into actionable insightsElasticsearch
Learn about the strategic feature areas of the Elastic Stack—Elasticsearch, a data engine like no other, and Kibana, the window into the Elastic Stack.
The session will cover:
Bringing data into the Elastic Stack
Storing data
Analyzing data
Acting on data
AWS is hosting the first FSI Cloud Symposium in Hong Kong, which will take place on Thursday, March 23, 2017 at Grand Hyatt Hotel. The event will bring together FSI customers, industry professional and AWS experts, to explore how to turn the dream of transformation, innovation and acceleration into reality by exploiting Cloud, Voice to Text and IoT technologies. The packed agenda includes expert sessions on a host of pressing issues, such as security and compliance, as well as customer experience sharing on how cloud computing is benefiting the industry.
Speaker: Lijia Xu, Big Data Practice Lead, Professional Services, AWS
Modernizing to a Cloud Data ArchitectureDatabricks
Organizations with on-premises Hadoop infrastructure are bogged down by system complexity, unscalable infrastructure, and the increasing burden on DevOps to manage legacy architectures. Costs and resource utilization continue to go up while innovation has flatlined. In this session, you will learn why, now more than ever, enterprises are looking for cloud alternatives to Hadoop and are migrating off of the architecture in large numbers. You will also learn how elastic compute models’ benefits help one customer scale their analytics and AI workloads and best practices from their experience on a successful migration of their data and workloads to the cloud.
Transforming data into actionable insightsElasticsearch
Learn about the strategic feature areas of the Elastic Stack—Elasticsearch, a data engine like no other, and Kibana, the window into the Elastic Stack.
The session will cover:
Bringing data into the Elastic Stack
Storing data
Analyzing data
Acting on data
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingSingleStore
Robin Li, Director of Data Engineering and Yohan Chin, VP Data Science at Tapjoy share how to architect the best application experience for mobile users using technologies including Apache Kafka, Apache Spark, and MemSQL.
Speaker: Robin Li - Director of Data Engineering, Tapjoy and Yohan Chin - VP Data Science, Tapjoy
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...Tyler Wishnoff
Simplify data lake governance, no matter how much data you work with and how many data sources and BI tools you manage. This presentation offers all you need to develop your own strategy for smarter data lake governance. Learn more at: https://kyligence.io/
Comment transformer vos données en informations exploitablesElasticsearch
Découvrez des fonctionnalités stratégiques de la Suite Elastic, notamment Elasticsearch, un moteur de données incomparable, et Kibana, véritable fenêtre ouverte sur la Suite Elastic.
Dans cette session, vous apprendrez à :
injecter des données dans la Suite Elastic ;
stocker des données ;
analyser des données ;
exploiter des données.
StreamAnalytix is a software platform that enables enterprises to analyze and respond to events in real-time at Big Data scale. It is designed to rapidly build and deploy streaming analytics applications for any industry vertical, any data format, and any use case.
Disrupting Risk Management through Emerging TechnologiesDatabricks
In the Financial markets with credit card companies there is always a need to measure the risk optimally and understand the performance of products before we could invest and make strategic decisions.
Speaker: Vince Leat, Industry Consulting Executive, Teradata
Large enterprises need a partner who has done it before. Teradata has successfully implemented AI across multiple industries, proving the technology as well as producing material business outcomes. Teradata continues to channel IP from successful, field-based AI client engagements into accelerators that lead to faster time to value and reduce the risk of custom AI initiatives. Hear how Teradata helps customers build opportunities derived from AI.
Comparison between top BI & Data Discovery Solutions.
This is a short deck comparing Pyramid Analytics to QlikView, focusing on the core differences showing strengths and weaknesses.
Why Business Intelligence Should Consider Agile Modern Data Delivery Platformsyed_javed
Modern data solution like Lyftron provides high availability and concurrency at all scales for modern analytical and business intelligence applications such as Looker, Tableau, PowerBI, Sisence, PeriscopeData etc. and can deliver timely results for you.
Master the Multi-Clustered Data Warehouse - SnowflakeMatillion
Snowflake is one of the most powerful, efficient data warehouses on the market today—and we joined forces with the Snowflake team to show you how it works!
In this webinar:
- Learn how to optimize Snowflake
- Hear insider tips and tricks on how to improve performance
- Get expert insights from Craig Collier, Technical Architect from Snowflake, and Kalyan Arangam, Solution Architect from Matillion
- Find out how leading brands like Converse, Duo Security, and Pets at Home use Snowflake and Matillion ETL to make data-driven decisions
- Discover how Matillion ETL and Snowflake work together to modernize your data world
- Learn how to utilize the impressive scalability of Snowflake and Matillion
Privacy has become one of the most important critical topics in data today. It is more than how do we ingest and consume data but the important factors about how you protect your customer’s rights while balancing the business need. In our session, we will bring CTO, Privacera, Don Bosco Durai together with Northwestern Mutual to detail an important use case in privacy and then show how to scale Privacy with a focus on the business needs. We will make the ability to scale effortless.
Attributes of a Modern Data Warehouse - Gartner CatalystJack Mardack
Most data-driven enterprises continue to struggle to generate the insights they need from their data. More data volumes from more data sources, combined with escalating user concurrency, have led to declining query throughput performance and skyrocketing data warehouse costs. Moreover, modern use cases such as customer-360 and hyper-personalization have blurred the boundaries between operational and analytics systems, making even greater demands on data warehouse solutions.
Customer engaged Alletec as the Microsoft experts to perform the migration of existing on premise servers to the Microsoft Azure cloud environment within
two months.
Why HR Should Consider Agile Modern Data Delivery Platformsyed_javed
Modern data delivery platform like Lyftron provides universal data model capability to HR departments that enables changes from the source dynamically in the semantic layer and allows enterprises to avoid manual semantic data model changes.
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEMatt Stubbs
Date: 14th November 2018
Location: Customer Experience Theatre
Time: 12:30 - 13:00
Speaker: David Maitland
Organisation: Redis Labs
About: This session will cover the technology underpinning at the software infrastructure level required to deliver the instant experience to the end user and enterprises alike. Use cases and value derived by major brands will be shared in this insightful session based the world's most loved database REDIS.
Go beyond spatial data and connect to a range of web and business formats. Plus, learn techniques for generating reports, dashboards, and analytics — whether you prefer Tableau, PDFs, spreadsheets, or web interfaces. We’ll look at what people have been doing to make their data more readable and how you can do it too.
Amazon Kinesis is a platform for streaming data on AWS, offering powerful services to make it easy to load and analyze streaming data. In this session, you'll learn about how AWS customers are transitioning from batch to real-time processing using Amazon Kinesis, and how to get started. We will provide an overview of streaming data applications and introduce the Amazon Kinesis platform and its services. We will walk through a production use case to demonstrate how to ingest streaming data, prepare it, and analyze it to gain actionable insights in real time using Amazon Kinesis. We will also provide pointers to tutorials and other resources so you can quickly get started with your streaming data application.
Tapjoy: Building a Real-Time Data Science Service for Mobile AdvertisingSingleStore
Robin Li, Director of Data Engineering and Yohan Chin, VP Data Science at Tapjoy share how to architect the best application experience for mobile users using technologies including Apache Kafka, Apache Spark, and MemSQL.
Speaker: Robin Li - Director of Data Engineering, Tapjoy and Yohan Chin - VP Data Science, Tapjoy
Analytics in a Day Ft. Synapse Virtual WorkshopCCG
Say goodbye to data silos! Analytics in a Day will simplify and accelerate your journey towards the modern data warehouse. Join CCG and Microsoft for a half-day virtual workshop, hosted by James McAuliffe.
Hassle-Free Data Lake Governance: Automating Your Analytics with a Semantic L...Tyler Wishnoff
Simplify data lake governance, no matter how much data you work with and how many data sources and BI tools you manage. This presentation offers all you need to develop your own strategy for smarter data lake governance. Learn more at: https://kyligence.io/
Comment transformer vos données en informations exploitablesElasticsearch
Découvrez des fonctionnalités stratégiques de la Suite Elastic, notamment Elasticsearch, un moteur de données incomparable, et Kibana, véritable fenêtre ouverte sur la Suite Elastic.
Dans cette session, vous apprendrez à :
injecter des données dans la Suite Elastic ;
stocker des données ;
analyser des données ;
exploiter des données.
StreamAnalytix is a software platform that enables enterprises to analyze and respond to events in real-time at Big Data scale. It is designed to rapidly build and deploy streaming analytics applications for any industry vertical, any data format, and any use case.
Disrupting Risk Management through Emerging TechnologiesDatabricks
In the Financial markets with credit card companies there is always a need to measure the risk optimally and understand the performance of products before we could invest and make strategic decisions.
Speaker: Vince Leat, Industry Consulting Executive, Teradata
Large enterprises need a partner who has done it before. Teradata has successfully implemented AI across multiple industries, proving the technology as well as producing material business outcomes. Teradata continues to channel IP from successful, field-based AI client engagements into accelerators that lead to faster time to value and reduce the risk of custom AI initiatives. Hear how Teradata helps customers build opportunities derived from AI.
Comparison between top BI & Data Discovery Solutions.
This is a short deck comparing Pyramid Analytics to QlikView, focusing on the core differences showing strengths and weaknesses.
Why Business Intelligence Should Consider Agile Modern Data Delivery Platformsyed_javed
Modern data solution like Lyftron provides high availability and concurrency at all scales for modern analytical and business intelligence applications such as Looker, Tableau, PowerBI, Sisence, PeriscopeData etc. and can deliver timely results for you.
Master the Multi-Clustered Data Warehouse - SnowflakeMatillion
Snowflake is one of the most powerful, efficient data warehouses on the market today—and we joined forces with the Snowflake team to show you how it works!
In this webinar:
- Learn how to optimize Snowflake
- Hear insider tips and tricks on how to improve performance
- Get expert insights from Craig Collier, Technical Architect from Snowflake, and Kalyan Arangam, Solution Architect from Matillion
- Find out how leading brands like Converse, Duo Security, and Pets at Home use Snowflake and Matillion ETL to make data-driven decisions
- Discover how Matillion ETL and Snowflake work together to modernize your data world
- Learn how to utilize the impressive scalability of Snowflake and Matillion
Privacy has become one of the most important critical topics in data today. It is more than how do we ingest and consume data but the important factors about how you protect your customer’s rights while balancing the business need. In our session, we will bring CTO, Privacera, Don Bosco Durai together with Northwestern Mutual to detail an important use case in privacy and then show how to scale Privacy with a focus on the business needs. We will make the ability to scale effortless.
Attributes of a Modern Data Warehouse - Gartner CatalystJack Mardack
Most data-driven enterprises continue to struggle to generate the insights they need from their data. More data volumes from more data sources, combined with escalating user concurrency, have led to declining query throughput performance and skyrocketing data warehouse costs. Moreover, modern use cases such as customer-360 and hyper-personalization have blurred the boundaries between operational and analytics systems, making even greater demands on data warehouse solutions.
Customer engaged Alletec as the Microsoft experts to perform the migration of existing on premise servers to the Microsoft Azure cloud environment within
two months.
Why HR Should Consider Agile Modern Data Delivery Platformsyed_javed
Modern data delivery platform like Lyftron provides universal data model capability to HR departments that enables changes from the source dynamically in the semantic layer and allows enterprises to avoid manual semantic data model changes.
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCEMatt Stubbs
Date: 14th November 2018
Location: Customer Experience Theatre
Time: 12:30 - 13:00
Speaker: David Maitland
Organisation: Redis Labs
About: This session will cover the technology underpinning at the software infrastructure level required to deliver the instant experience to the end user and enterprises alike. Use cases and value derived by major brands will be shared in this insightful session based the world's most loved database REDIS.
Go beyond spatial data and connect to a range of web and business formats. Plus, learn techniques for generating reports, dashboards, and analytics — whether you prefer Tableau, PDFs, spreadsheets, or web interfaces. We’ll look at what people have been doing to make their data more readable and how you can do it too.
Amazon Kinesis is a platform for streaming data on AWS, offering powerful services to make it easy to load and analyze streaming data. In this session, you'll learn about how AWS customers are transitioning from batch to real-time processing using Amazon Kinesis, and how to get started. We will provide an overview of streaming data applications and introduce the Amazon Kinesis platform and its services. We will walk through a production use case to demonstrate how to ingest streaming data, prepare it, and analyze it to gain actionable insights in real time using Amazon Kinesis. We will also provide pointers to tutorials and other resources so you can quickly get started with your streaming data application.
To view recording of this webinar please use below URL:
http://wso2.com/library/webinars/2016/06/analytics-in-your-enterprise/
Big data spans many fields and brings together technologies like distributed systems, machine learning, statistics and Internet of Things (IoT). It has now become a multi-billion dollar industry with use cases ranging from targeted advertising and fraud detection to product recommendations and market surveys.
Some use cases such as urban planning can be slower (done in batch mode), while others such as the stock market needs results in milliseconds (done is a streaming fashion). Different technologies are used for each case; MapReduce for batch analytics, complex event processing for real-time analytics and machine learning for predictive analytics. Furthermore, the type of analysis ranges from basic statistics to complicated prediction models.
This webinar will discuss the big data landscape including
Concepts, use cases and technologies
Capabilities and applications of the WSO2 analytics platform
WSO2 Data Analytics Server
WSO2 Complex Event Processor
WSO2 Machine Learner
Big Data kennen sehr viele IT-Experten, wenigstens haben Sie eine Vorstellung davon. In der Praxis arbeiten damit in Deutschland derzeit nur wenige. Dabei bringt Big Data ein ganz neues Momentum in moderne Softwarelösungen und ist im Kontext der Mobil-, Cloud- und Social-Veränderungen nicht wegzudenken. Big Data macht Software intelligent und damit auf eine ganz neue Art für die Benutzer erlebbar. Mit Big Data entstehen neue Softwarearchitekturen, weil Informationen völlig anders verarbeitet werden - nämlich schneller, differenzierter und oft mit dem Ziel, Schlüsse zu ziehen und Vorhersagen zu treffen.
In diesem Vortrag wird erläutert, wie moderne Softwarearchitekturen gestaltet werden, sodass Sie Big Data Paradigmen erfolgreich umsetzen und welche Vorteile sich für die zunehmend mobilen Softwarelösungen ergeben. Wir werfen zudem einen Blick auf die Potentiale und Optionen in Branchen wie Banken, Versicherung oder Handel.
Flink Forward Berlin 2017: Bas Geerdink, Martijn Visser - Fast Data at ING - ...Flink Forward
ING is using Apache Flink for creating streaming analytics ('fast data') solutions. We created a platform with Flink and Kafka that offers high-throughput and low-latency, ideally suited for complex and demanding use cases in the international bank such as customer notifications and fraud detection. These use cases require fast data processing and a business rules engine and/or machine learning evaluation system. Integrating these components together in a always-on, distributed architecture can be challenging. In this talk, we'll start with a brief overview of the use cases. You'll learn why ING chose Flink for these use cases, and see the architecture of the streaming data platform in depth. Finally, we'll share some lessons learned and useful insights for organizations who embark on a similar journey.
Using Data Science to Build an End-to-End Recommendation SystemVMware Tanzu
We get recommendations everyday: Facebook recommends people we should connect with; Amazon recommends products we should buy; and Google Maps recommends routes to take. What all these recommendation systems have in common are data science and modern software development.
Recommendation systems are also valuable for companies in industries as diverse as retail, telecommunications, and energy. In a recent engagement, for example, Pivotal data scientists and developers worked with a large energy company to build a machine learning-based product recommendation system to deliver intelligent and targeted product recommendations to customers to increase revenue.
In this webinar, Pivotal data scientist Ambarish Joshi will take you step-by-step through the engagement, explaining how he and his Pivotal colleagues worked with the customer to collect and analyze data, develop predictive models, and operationalize the resulting insights and surface them via APIs to customer-facing applications. In addition, you will learn how to:
- Apply agile practices to data science and analytics.
- Use test-driven development for feature engineering, model scoring, and validating scripts.
- Automate data science pipelines using pyspark scripts to generate recommendations.
- Apply a microservices-based architecture to integrate product recommendations into mobile applications and call center systems.
Presenters: Ambarish Joshi and Jeff Kelly, Pivotal
Emerging Prevalence of Data Streaming in Analytics and it's Business Signific...Amazon Web Services
Learning Objectives:
- Get an overview of streaming data and it's application in analytics and big data.
- Understand the factors driving the accelerating transformation of batch processing to real-time.
- Learn how you should plan for incorporating data streaming in your analytics and processing workloads.
Business can now easily perform real-time analytics on data that has been traditionally analyzed using batch processing in data warehouses or using Hadoop frameworks, and react to new information in minutes or seconds instead of hours or days. In this webinar, Forrester analyst Mike Gualtieri and Amazon Kinesis GM Roger Barga will discuss this prevalent trend, it's business significance, and how you should plan for it. You will also learn about the AWS services that can help you get started quickly with real-time, streaming applications fore your analytics and big data workloads.
OpenSymmetry - Business Intelligence MaturityOpenSymmetry
OpenSymmetry Breakout Session during the Callidus Customer Connections Conference in Las Vegas - May 2013. Presenter: Trevor Dunham, Director of Business Intelligence with OpenSymmetry
Cloud Experience: Data-driven Applications Made Simple and FastDatabricks
A complex real-time data workflow implementation is very challenging. This session will describe the architecture of a data platform that provides a single, secure, high-performance system that can be deployed in a hybrid cloud architectures. We will present how to support simultaneous, consistent and high-performance access through multiple industry open source and cloud compatible standards of streaming, table, TSDB, object, and file APIs. A new serverless technology is also used in the architecture to support a dynamic and flexible implementations. The presenter will also outline how the platform was integrated with the Spark eco-system, including AI and ML tools, to simplify the development process
Similar to Introduction to Big Data using AWS Services (20)
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Palestine last event orientationfvgnh .pptxRaedMohamed3
An EFL lesson about the current events in Palestine. It is intended to be for intermediate students who wish to increase their listening skills through a short lesson in power point.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
The Art Pastor's Guide to Sabbath | Steve ThomasonSteve Thomason
What is the purpose of the Sabbath Law in the Torah. It is interesting to compare how the context of the law shifts from Exodus to Deuteronomy. Who gets to rest, and why?
The Indian economy is classified into different sectors to simplify the analysis and understanding of economic activities. For Class 10, it's essential to grasp the sectors of the Indian economy, understand their characteristics, and recognize their importance. This guide will provide detailed notes on the Sectors of the Indian Economy Class 10, using specific long-tail keywords to enhance comprehension.
For more information, visit-www.vavaclasses.com
How to Split Bills in the Odoo 17 POS ModuleCeline George
Bills have a main role in point of sale procedure. It will help to track sales, handling payments and giving receipts to customers. Bill splitting also has an important role in POS. For example, If some friends come together for dinner and if they want to divide the bill then it is possible by POS bill splitting. This slide will show how to split bills in odoo 17 POS.
This is a presentation by Dada Robert in a Your Skill Boost masterclass organised by the Excellence Foundation for South Sudan (EFSS) on Saturday, the 25th and Sunday, the 26th of May 2024.
He discussed the concept of quality improvement, emphasizing its applicability to various aspects of life, including personal, project, and program improvements. He defined quality as doing the right thing at the right time in the right way to achieve the best possible results and discussed the concept of the "gap" between what we know and what we do, and how this gap represents the areas we need to improve. He explained the scientific approach to quality improvement, which involves systematic performance analysis, testing and learning, and implementing change ideas. He also highlighted the importance of client focus and a team approach to quality improvement.
8. Data Analytics:
● handling data collected by systems
● generate the insights
● improve decision making
○ with facts based on data.
What is Data Analytics?🤔
Copyright 2021. Genese Cloud Academy
9. Types of Data Analytics
● Descriptive Analytics
● Diagnostic Analytics
● Predictive Analytics
● Prescriptive Analytics
What Are the Types of Data Analytics?🤔
10. Benefits Of Data Analytics
Credit card Fraud Detection Customer Personalization Security Threats Detection
Real time Alerting User Behaviour
Financial Modeling
11. Use Cases:
● Social Media
● Ecommerce
● Information Security
● Logistics
● Factory operations
● Internet of Things
What Might be the Use
Cases of Data Analytics?🤔
18. High Speed computing:
● implemented in
○ Super Computers for Scientific Research.
Main Area Of Discipline:
● Developing Parallel Processing Algorithm
● Developing Software
shifted from supercomputers to computing Clusters.
Application Areas of HPC
20. When do you actually need an HPC ?🤔
● Complete time consuming operations in less time
● Complete an operation under a tight schedule
● Perform a high number of operations per second
22. Streaming Data:
● generated continuously by thousands of data sources
Data Stream:
● Continuous
● Ordered
● Changing
● Fast
● huge amount
Traditional DBMS:
Data stored:
● Finite data sets
● Persistent data sets
Streaming data includes a wide variety of data
● log files
● ecommerce purchases,
● in-game player activity,
● information from social networks,
● financial trading floors,
● geospatial services,
● telemetry from connected devices or instrumentation in data centers.
23. Core Banking
● Improved Scalability
● Met HA and SHA needs
Online Gaming
● Increased reliability
● Accurate and real time data
● Ability to process data at scale
● Faster ramp time
24. Government Services
● Near real time events and
better data quality
● Increased efficiency
● Produce and store data
● Better privacy
Financial Services
● Enhanced Customer experience
● Improved fraud detection engine
25. Real-Time Fraud Detection
● Act in real time
● Detect Fraud
● Minimize risk
● Improve customer experience
Real-Time E-Commerce
● OnBoarding New Merchants Faster
● Enabled 360 view of customers
● Enhanced Performance & Monitoring
● Projected saving of Millions of dollars
Do You know About Amazon GO?
26.
27. Benefits of streaming data
● Improve operational efficiencies
● Reduce infrastructure cost
● Provide faster insights and actions
29. Challenges in working with streaming data
Streaming data processing requires two layers:
● a storage layer
○ record ordering
○ strong consistency
■ Fast
■ Inexpensive
■ replayable
● Reads
● Writes
● a processing layer.
○ consuming data from the storage layer
○ running computations on that data
○ notifying the storage layer to delete data that is no longer needed.
● Scalability
● data durability
● fault tolerance
30. Infrastructure to build streaming data applications:
● Amazon Kinesis Data Streams,
● Amazon Kinesis Data Firehose,
● Amazon Managed Streaming for Apache Kafka (Amazon MSK),
● Apache Flume,
● Apache Spark Streaming,
● Apache Storm.
31.
32. Working with streaming data
on AWS
Amazon Kinesis is a platform for streaming data on AWS
● load and analyze streaming data
● custom streaming data applications
offers three services:
● Amazon Managed Streaming for Apache Kafka
(Amazon MSK).
● Amazon Kinesis Data Firehose,
● Amazon Kinesis Data Streams
run other streaming data platforms
● Apache Flume,
● Apache Spark Streaming, and
● Apache Storm
○ on Amazon EC2
33. What is Visualization?🤔🤔🤔
any technique for creating images, diagrams, or animations to communicate a message.
Copyright 2021. Genese Cloud Academy
34.
35. Why is data visualization important?
Data visualization is a:
● Quick & easy way
○ to convey concepts in a universal manner
○ can experiment with different scenarios by making slight adjustments.
● Identify areas that need attention or improvement.
● Clarify which factors influence customer behavior.
● Help you understand which products to place where.
● Predict sales volumes.
36. Data visualization tools
business intelligence (BI) reporting tool.
set up visualization tools to:
● generate automatic dashboards that track company performance across key performance indicators
(KPIs)
● visually interpret the results.
Kibana Tableau Grafana QuickSight Power BI
37. What is Big Data?
Copyright 2021. Genese Cloud Academy
38. BIG DATA:
● collection of data
huge in volume
growing exponentially
with time.
Types Of Big Data
● Structured
● Unstructured
● Semi-structured
39. Why Learn Big Data?
● Gartner – Big Data is the new Oil.
● IDC – Its market will be growing 7 times faster than the overall IT market.
● IBM – It is not just a technology – it’s a Business Strategy for capitalizing on information resources.
● IBM – Big Data is the biggest buzz word because technology makes it possible to analyze all the
available data.
42. Your First Big Data
Application on AWS
Word Count example
Copyright 2021. Genese Cloud Academy
Editor's Notes
Data Analytics is Vital to every Business.
-helps decision makers (based on analytics and data)
-critical tasks (launching New Product,Offering Discounts,marketing New Areas) requires time sensitive decision and experience.
Organizations spend millions of dollars on data storage. The problem isn’t finding the data — the problem is failing to do anything with it, AWS
Volume : total amount of data that is coming in and will be ingested into the system.
Velocity : speed at which data is flowing in, the challenge consists of processing the data in near real-time and return results as quickly as possible. validating a credit card transaction must instantaneous (near real-time)
Variety : Data to be ingested in the system can have different formats,.
Veracity : accuracy of incoming data.
Value :Decisions makers seek to extract meaningful information and insights from systems to have a competitive edge.
Area of discipline can be divided into small independent parts and can be executed simultaneously by separate processors.
Bank
Pubg
Amazon Go
LInk to video: https://youtu.be/NrmMk1Myrxc
applying machine learning algorithms,
and extract deeper insights from the data.
Over time, complex, stream and event processing algorithms, like decaying time windows
to find the most recent popular movies, are applied, further enriching the insights.
applying machine learning algorithms,
and extract deeper insights from the data.
Over time, complex, stream and event processing algorithms, like decaying time windows
to find the most recent popular movies, are applied, further enriching the insights.
Drift : The ability to detect and adapt to changes in the distribution of examples is paramount for data stream mining algorithms
One pass:a one-pass or single-pass is a streaming algorithm which reads its input exactly once. It does so by processing items in order, without unbounded buffering; it reads a block into an input buffer, processes it, and moves the result into an output buffer for each step in the process.
Real-time:
Streaming data is data that is continuously generated and delivered rather than processed in batches or micro-batches. ... The terms “real-time” and “stream” converge in “real-time stream processing” to describe streams of real-time data that are gathered and processed as they are generated.
Bound data is finite and unchanging data, where everything is known about the set of data. Typically Bound data has a known ending point and is relatively fixed.
applying machine learning algorithms,
and extract deeper insights from the data.
Over time, complex, stream and event processing algorithms, like decaying time windows
to find the most recent popular movies, are applied, further enriching the insights.
deploy and manage your own streaming data solution in the cloud on Amazon EC2.
Because of the way the human brain processes information,
using charts or graphs to visualize large amounts of complex data is easier than poring over spreadsheets or reports.
Data visualization is a quick, easy way to convey concepts in a universal manner – and
can experiment with different scenarios by making slight adjustments.
data with so large size and complexity that none of traditional data management tools can store it or process it efficiently. Big data is also a data but with huge size.