SlideShare a Scribd company logo
1 of 9
Y:BhavanaORMBlogs
PRESENTED BY
RAVI NAMBOORI
1. Introduction
2. What is Big Data?
3. Characteristic of Big Data
4. Storing, selecting and processing of Big Data
5. Benefits of Big Data
6. Future of Big Data
Contents
• Big Data may well be the Next Big Thing in the IT world.
• The first organizations to embrace it were online and
startup firms.
• Firms like Google, eBay, LinkedIn, and Facebook were
built around big data from the beginning.
• Like many new information technologies, big data can
bring about dramatic cost reductions, substantial
improvements in the time required to perform a
computing task, or new product and service offerings.
• Big data burst upon the scene in the first decade of the
21st century.
Introduction
• Big Data is similar to ‘small data’, but bigger in size.
• But having data bigger it requires different approaches: –
Techniques, tools and architecture .
• An aim to solve new problems or old problems in a better
way.
• Big Data generates value from the storage and processing
of very large quantities of digital information that cannot
be analyzed with traditional computing techniques.
What is Big Data
• V3s Volume Velocity Variety
• Big Data Volume - A typical PC might have had 10 gigabytes
of storage in 2000. •Today, Facebook ingests 500 terabytes
of new data every day. •Boeing 737 will generate 240
terabytes of flight data during a single flight across the US.
• Big Data Velocity - Clickstreams and ad impressions capture
user behavior at millions of events per second high-
frequency stock trading algorithms reflect market changes
within microseconds. Machine to machine processes
exchange data between billions of devices.
• Big Data Variety - Big Data isn't just numbers, dates, and
strings. Big Data is also geospatial data, 3D data, audio and
video, and unstructured text, including log files and social
media.
Characteristics of Big Data
• Storing Big Data - Selecting data sources for analysis
• Eliminating redundant data.
• Establishing the role of No SQL.
• Overview of Big Data stores .
• Data models: key value, graph, document, column-family .Hadoop Distributed
File System
• Selecting Big Data stores - Choosing the correct data stores based on your
data characteristics. Moving code to data .Implementing polyglot data store
solutions .Aligning business goals to the appropriate data store
• Processing Big Data - Integrating disparate data stores . Mapping data to the
programming framework. Connecting and extracting data from storage .
• Transforming data for processing. Subdividing data in preparation for Hadoop
MapReduce .
Storing, selecting and processing of Big Data
• Real-time big data isn’t just a process for storing petabytes or exabytes of data in a
data warehouse, It’s about the ability to make better decisions and take meaningful
actions at the right time.
• Fast forward to the present and technologies like Hadoop give you the scale and
flexibility to store data before you know how you are going to process it.
• Technologies such as MapReduce,Hive and Impala enable you to run queries without
changing the data structures underneath
Benefits
• The Future Data sets will continue to grow Storage unit costs will continue to decrease
Processing costs will decrease Network capacity will continue to grow Data growth may
exceed processing capacity
• $15 billion on software firms only specializing in data management and analytics.
• This industry on its own is worth more than $100 billion and growing at almost 10% a year
which is roughly twice as fast as the software business as a whole.
Future
THANK YOU

More Related Content

Viewers also liked

Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield
Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez BlanchfieldBig Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield
Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez BlanchfieldDez Blanchfield
 
Presentation Big Data
Presentation Big DataPresentation Big Data
Presentation Big DataRené Kuipers
 
Big data presentation at Data Driven congres
Big data presentation at Data Driven congresBig data presentation at Data Driven congres
Big data presentation at Data Driven congresHans Smellinckx
 
Global Partner PPT Presentation Template | Export Business PowerPoint Themes
Global Partner PPT Presentation Template | Export Business PowerPoint ThemesGlobal Partner PPT Presentation Template | Export Business PowerPoint Themes
Global Partner PPT Presentation Template | Export Business PowerPoint ThemesFPPT.com
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data AnalyticsS P Sajjan
 
Deep learning presentation
Deep learning presentationDeep learning presentation
Deep learning presentationBaptiste Wicht
 
Big Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBig Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBernard Marr
 
Deep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsDeep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsBuhwan Jeong
 
Big Data
Big DataBig Data
Big DataNGDATA
 

Viewers also liked (14)

Big data(1st presentation)
Big data(1st presentation)Big data(1st presentation)
Big data(1st presentation)
 
Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield
Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez BlanchfieldBig Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield
Big Data Presentation - Data Center Dynamics Sydney 2014 - Dez Blanchfield
 
Presentation Big Data
Presentation Big DataPresentation Big Data
Presentation Big Data
 
Big data presentation at Data Driven congres
Big data presentation at Data Driven congresBig data presentation at Data Driven congres
Big data presentation at Data Driven congres
 
Global Partner PPT Presentation Template | Export Business PowerPoint Themes
Global Partner PPT Presentation Template | Export Business PowerPoint ThemesGlobal Partner PPT Presentation Template | Export Business PowerPoint Themes
Global Partner PPT Presentation Template | Export Business PowerPoint Themes
 
Presentation on Big Data Analytics
Presentation on Big Data AnalyticsPresentation on Big Data Analytics
Presentation on Big Data Analytics
 
Deep learning presentation
Deep learning presentationDeep learning presentation
Deep learning presentation
 
Big Data simplified
Big Data simplifiedBig Data simplified
Big Data simplified
 
Big Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must KnowBig Data: The 4 Layers Everyone Must Know
Big Data: The 4 Layers Everyone Must Know
 
Big Idea For Big Data
Big Idea For Big DataBig Idea For Big Data
Big Idea For Big Data
 
Deep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applicationsDeep learning - Conceptual understanding and applications
Deep learning - Conceptual understanding and applications
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big Data
Big DataBig Data
Big Data
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 

Recently uploaded

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Recently uploaded (20)

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

Ravi Namboori Big Data Presentation

  • 2. 1. Introduction 2. What is Big Data? 3. Characteristic of Big Data 4. Storing, selecting and processing of Big Data 5. Benefits of Big Data 6. Future of Big Data Contents
  • 3. • Big Data may well be the Next Big Thing in the IT world. • The first organizations to embrace it were online and startup firms. • Firms like Google, eBay, LinkedIn, and Facebook were built around big data from the beginning. • Like many new information technologies, big data can bring about dramatic cost reductions, substantial improvements in the time required to perform a computing task, or new product and service offerings. • Big data burst upon the scene in the first decade of the 21st century. Introduction
  • 4. • Big Data is similar to ‘small data’, but bigger in size. • But having data bigger it requires different approaches: – Techniques, tools and architecture . • An aim to solve new problems or old problems in a better way. • Big Data generates value from the storage and processing of very large quantities of digital information that cannot be analyzed with traditional computing techniques. What is Big Data
  • 5. • V3s Volume Velocity Variety • Big Data Volume - A typical PC might have had 10 gigabytes of storage in 2000. •Today, Facebook ingests 500 terabytes of new data every day. •Boeing 737 will generate 240 terabytes of flight data during a single flight across the US. • Big Data Velocity - Clickstreams and ad impressions capture user behavior at millions of events per second high- frequency stock trading algorithms reflect market changes within microseconds. Machine to machine processes exchange data between billions of devices. • Big Data Variety - Big Data isn't just numbers, dates, and strings. Big Data is also geospatial data, 3D data, audio and video, and unstructured text, including log files and social media. Characteristics of Big Data
  • 6. • Storing Big Data - Selecting data sources for analysis • Eliminating redundant data. • Establishing the role of No SQL. • Overview of Big Data stores . • Data models: key value, graph, document, column-family .Hadoop Distributed File System • Selecting Big Data stores - Choosing the correct data stores based on your data characteristics. Moving code to data .Implementing polyglot data store solutions .Aligning business goals to the appropriate data store • Processing Big Data - Integrating disparate data stores . Mapping data to the programming framework. Connecting and extracting data from storage . • Transforming data for processing. Subdividing data in preparation for Hadoop MapReduce . Storing, selecting and processing of Big Data
  • 7. • Real-time big data isn’t just a process for storing petabytes or exabytes of data in a data warehouse, It’s about the ability to make better decisions and take meaningful actions at the right time. • Fast forward to the present and technologies like Hadoop give you the scale and flexibility to store data before you know how you are going to process it. • Technologies such as MapReduce,Hive and Impala enable you to run queries without changing the data structures underneath Benefits
  • 8. • The Future Data sets will continue to grow Storage unit costs will continue to decrease Processing costs will decrease Network capacity will continue to grow Data growth may exceed processing capacity • $15 billion on software firms only specializing in data management and analytics. • This industry on its own is worth more than $100 billion and growing at almost 10% a year which is roughly twice as fast as the software business as a whole. Future