When the world is moving fast with digital transformation, organisations are relying heavily on the data getting generated through multiple operations, machine learning, edge computing data, and many more. So what's the exact role of big data in this? How can we leverage big data? Is it powerful enough to transform industries as a whole? Learn more
3. “Big data is at the foundation of all of the
megatrends that are happening today,
from social to mobile to the cloud to
gaming”
4. Introducing Big Data
• Big Data not only refers to the data itself, but also the set of
technologies that capture, store, manage and analyse large and
variable collections of data in order to solve the complex problems
• Big Data solutions have enabled the organizations to effectively
manage the large data volumes thereby reducing costs
• The definition of Big Data depends on whether the data can be
processed and examined in a time that has met a particular
business’ requirements
6. 4 V’s of Big Data
• The 4 V’s of Big Data are a set of specific attributes that define
Big Data
• The 4 V’s of Big Data includes the following-
i. Volume
ii. Variety
iii. Veracity
iv. Velocity
7. Explaining the 4 V’s
1. Volume:
Big Data implies large volumes of data. This data is now
generated by machines, networks and human interactions
on social media and the volume of data that has to be
analysed is large
2. Variety:
It refers to the sources and types of data- both structured
and unstructured. Generally, the variety of an unstructured
data creates problems for storage, mining and analysing
data
8. Explaining the 4 V’s
(contd…)
3. Veracity:
Big Data veracity refers to the biases, noise and abnormality
that is present in the data. It could also be the data being
stored and mined meaningful to the problem that needs to
be analysed
4. Velocity
The Big Data velocity refers to the pace at which the data
flows in from the various sources such as business processes,
machines, networks and human interactions with social
media, mobile devices etc. The flow of data is massive and
continuous
9. Interesting Stats on Big Data
1. It is predicted that by 2020, the amount of data will rise to 40
zettabytes
2. 90% of all the data has been created in the last two years
3. In the year 2012, only 0.5% of data was analysed
4. Today, Internet is responsible for generating about 2.5 quintillion bytes
of data every day
5. About 97.2% of the organizations are heavily investing towards Big
Data and AI
10. Global Big Data Market
• The global Big Data Market has accounted for an increase from $42
billion in the year 2018
• It is expected to reach $103 billion in the year 2027 at a CAGR of 10.48%
• The Big Data market is growing at a CAGR of 10.48%
• The combined market of Big Data and Hadoop is projected to grow from
$17.1 Bn to $99.31 Bn by 2022 with a CAGR of 28.5%
• The Big Data applications and analytics is expected to grow from $5.3 Bn
in 2018 to $19.4 Bn in 2026 with a CAGR of 15.49%
11.
12.
13. Integration With Other Technologies
• Today, there are devices that “talk” to each other over a connected
network and share and generate the data fed by the user
• There are algorithms learning patterns to process the information
from the generated data
• The simplest example of Internet of Things or IoT can be the smart
TV present in a home, connected to the user’s home network and
generating data based on the user viewing patterns, interests and
more
• The social media applications installed, are also considering the
personal tastes and preferences and cumulatively working on the
user persona for better delivery of the online content and
streaming options
16. Future Trends
1. Dark Data:
Dark data simply refers to the data that is generated from the non-digital sources and digital data
has always been undermined by the value that it holds. The data sets are usually untapped,
unstructured and untagged and may also be referred as dusty data. In future, such kind of data sets
are about come into limelight and will revolutionize this industry
2. Privacy:
Privacy continues to be one of the major shortcomings in today’s time. We, users of data barely
have any idea regarding how the data generated by us, is going to be used and shared amongst
the firms. New policies have and will be introduced in context of data consumption and analytics, by
paving a way for a safer ecosystem where consumers generate data
3. Data on the Rise:
The Big Data Hadoop future is always on the rise and the amount of data generated per day will
also continue to grow. In today’s time, the amount of data generated per day is 2.3 trillion gigabytes
of data and will continue to rise ever. The presence of smartwatches, smart TVs and smart wearables
present in the market are continuously collecting user data