SlideShare a Scribd company logo
1 of 4
Download to read offline
Author:-
Neeraj Negi
What is Big Data?
Big data is basically a term for large data-sets, these data sets are so complex and large
in size that it becomes difficult to store, access or process in traditional database
applications or tools. It exceeds the processing capacity of conventional database
systems. Big data is too big (petabytes or exabytes), it moves too fast, or it doesn’t fit
structures of database architectures. The data is typically loosely structured data that is
often incomplete and inaccessible.
Specifically, Big Data is related to data creation, retrieval, manipulation and analysis of
data that is exceptional in terms of volume, velocity and variety: -
1. Volume – Facebook consumes more than 500 TB of data in one day. Google
receives 2 million search queries per minute. 40 terabytes of data is generated
every second from nuclear physics experiments at the Large Hadron Collider at
CERN.
This volume presents the most immediate challenge to traditional IT structures. It
demands scalable storage, and a distributed approach to querying.
2. Velocity – It represents the frequency of data processing or data generation.
Many MNCs and organizations have capturing click streams of data from
websites (Google, Yahoo, Facebook, Microsoft, etc.), using that streaming data
these corporations make purchase recommendations in form of ads to web
visitors. Streaming data also have to make sense to analysis that goes with it, at
the same time it also have to produce results and take actions – all in real time.
3. Variety – Big data is not just in form of strings or numbers. It also includes 3d
data, audio, video, pictures, log files, GPS data, etc. Conventional databases were
designed to address smaller volumes of structured data and predictable and
Author:-
Neeraj Negi
consistent data structures. With increasing number of users, traditional RDBMS
has become liability for organizations, making it harder to serve their users.
Every enterprise needs to understand Big data, and how it affects them. Standard
tools and procedures are not designed to analyze and search massive datasets.
Big Data requires exceptional technology to efficiently process large amount of data in
acceptable amount of time lapse. Technologies like massively parallel processing
databases, search-based applications, data mining grids, distributed file systems and
databases, cloud based infrastructure are suitable.
Author:-
Neeraj Negi
Big Data Softwares:-
1. Hadoop - Apache Foundation.
It is an open source software project that enables the distributed processing of
large data sets across clusters of commodity servers. Hadoop makes it possible to
run applications on systems with thousands of nodes involving thousands of
terabytes. Rather than relying on high-end hardware, the resiliency of these clusters
comes from the software’s ability to detect and handle failures at the application
layer.
The Hadoop framework is used by major players including Google, Yahoo and IBM,
largely for applications involving search engines and advertising. The preferred
operating systems are Windows and Linux but Hadoop can also work with BSD and
OS X.
2. MongoDB - MongoDB, Inc.
It is a document-oriented database system classified as NoSQL* database. MySQL
is written using SQL queries, while MongoDB is focused on BSON (Binary
JSON**).
It is a handy tool for smaller database requirements. MongoDB supports complex
operations like join, indexing much easily and efficiently as compared to
traditional RDBMSs
*A NoSQL or Not Only SQL database provides a mechanism for storage and
retrieval of data that is modeled in means other than the tabular relations used in
relational databases.
**JSON an open standard format that uses human-readable text to transmit data
objects consisting of attribute–value pairs.
Author:-
Neeraj Negi
3. Splunk - Splunk Inc.
Splunk is an advanced IT search tool that offers users, administrators, and
developers the ability to instantly search all data generated by applications, servers,
and network devices in the IT infrastructure. It generates reports, graphs, alerts and
visualizations from the data which it captures and correlates in a repository. Splunk
turns machine data into valuable insights no matter what business you're in.

More Related Content

Viewers also liked

Rcc marketing plan part 3
Rcc marketing plan part 3Rcc marketing plan part 3
Rcc marketing plan part 3Bryan Griffith
 
Mobile and the Path to Purchase
Mobile and the Path to PurchaseMobile and the Path to Purchase
Mobile and the Path to PurchasePlanimedia
 
Power point basketball
Power point basketballPower point basketball
Power point basketballjuulaa
 
History of mathematics in India
History of mathematics in IndiaHistory of mathematics in India
History of mathematics in IndiaAbhishek Das
 
Basic Princibles of International Organizations of United Nations
Basic Princibles of International Organizations of United NationsBasic Princibles of International Organizations of United Nations
Basic Princibles of International Organizations of United NationsOzan Bayındırlı
 
Guide to marketing on mobile
Guide to marketing on mobile Guide to marketing on mobile
Guide to marketing on mobile Planimedia
 
Containers and security
Containers and securityContainers and security
Containers and securitysriram_rajan
 
Top Drivers of Marketing Success – What to Budget for in 2015
Top Drivers of Marketing Success – What to Budget for in 2015Top Drivers of Marketing Success – What to Budget for in 2015
Top Drivers of Marketing Success – What to Budget for in 2015Planimedia
 
Guía Social Media for Lead Generation
Guía Social Media for Lead GenerationGuía Social Media for Lead Generation
Guía Social Media for Lead GenerationPlanimedia
 
Comscore state of the_mobile_market_final
Comscore state of the_mobile_market_finalComscore state of the_mobile_market_final
Comscore state of the_mobile_market_finalPlanimedia
 
Thomas Rowley - Welcome to the Punch (1)
Thomas Rowley - Welcome to the Punch (1)Thomas Rowley - Welcome to the Punch (1)
Thomas Rowley - Welcome to the Punch (1)Thomas Rowley
 
Forrester. Impacto del Customer Ecperience en el negocio
Forrester. Impacto del Customer Ecperience en el negocioForrester. Impacto del Customer Ecperience en el negocio
Forrester. Impacto del Customer Ecperience en el negocioPlanimedia
 
Drive Potential Customers With Effective Mobile Marketing
Drive Potential Customers With Effective Mobile Marketing Drive Potential Customers With Effective Mobile Marketing
Drive Potential Customers With Effective Mobile Marketing Planimedia
 

Viewers also liked (14)

Rcc marketing plan part 3
Rcc marketing plan part 3Rcc marketing plan part 3
Rcc marketing plan part 3
 
Mobile and the Path to Purchase
Mobile and the Path to PurchaseMobile and the Path to Purchase
Mobile and the Path to Purchase
 
Power point basketball
Power point basketballPower point basketball
Power point basketball
 
History of mathematics in India
History of mathematics in IndiaHistory of mathematics in India
History of mathematics in India
 
Basic Princibles of International Organizations of United Nations
Basic Princibles of International Organizations of United NationsBasic Princibles of International Organizations of United Nations
Basic Princibles of International Organizations of United Nations
 
Guide to marketing on mobile
Guide to marketing on mobile Guide to marketing on mobile
Guide to marketing on mobile
 
My little pony
My little ponyMy little pony
My little pony
 
Containers and security
Containers and securityContainers and security
Containers and security
 
Top Drivers of Marketing Success – What to Budget for in 2015
Top Drivers of Marketing Success – What to Budget for in 2015Top Drivers of Marketing Success – What to Budget for in 2015
Top Drivers of Marketing Success – What to Budget for in 2015
 
Guía Social Media for Lead Generation
Guía Social Media for Lead GenerationGuía Social Media for Lead Generation
Guía Social Media for Lead Generation
 
Comscore state of the_mobile_market_final
Comscore state of the_mobile_market_finalComscore state of the_mobile_market_final
Comscore state of the_mobile_market_final
 
Thomas Rowley - Welcome to the Punch (1)
Thomas Rowley - Welcome to the Punch (1)Thomas Rowley - Welcome to the Punch (1)
Thomas Rowley - Welcome to the Punch (1)
 
Forrester. Impacto del Customer Ecperience en el negocio
Forrester. Impacto del Customer Ecperience en el negocioForrester. Impacto del Customer Ecperience en el negocio
Forrester. Impacto del Customer Ecperience en el negocio
 
Drive Potential Customers With Effective Mobile Marketing
Drive Potential Customers With Effective Mobile Marketing Drive Potential Customers With Effective Mobile Marketing
Drive Potential Customers With Effective Mobile Marketing
 

Recently uploaded

%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyviewmasabamasaba
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfayushiqss
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastPapp Krisztián
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan
 
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...masabamasaba
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfkalichargn70th171
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providermohitmore19
 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Hararemasabamasaba
 
%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durbanmasabamasaba
 
SHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions PresentationSHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions PresentationShrmpro
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesVictorSzoltysek
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...masabamasaba
 

Recently uploaded (20)

%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
%in Hazyview+277-882-255-28 abortion pills for sale in Hazyview
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfontein
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...
 
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Vancouver Psychic Readings, Attraction spells,Br...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare%in Harare+277-882-255-28 abortion pills for sale in Harare
%in Harare+277-882-255-28 abortion pills for sale in Harare
 
%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban%in Durban+277-882-255-28 abortion pills for sale in Durban
%in Durban+277-882-255-28 abortion pills for sale in Durban
 
SHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions PresentationSHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions Presentation
 
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM TechniquesAI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
AI Mastery 201: Elevating Your Workflow with Advanced LLM Techniques
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 

Big Data and Hadoop

  • 1. Author:- Neeraj Negi What is Big Data? Big data is basically a term for large data-sets, these data sets are so complex and large in size that it becomes difficult to store, access or process in traditional database applications or tools. It exceeds the processing capacity of conventional database systems. Big data is too big (petabytes or exabytes), it moves too fast, or it doesn’t fit structures of database architectures. The data is typically loosely structured data that is often incomplete and inaccessible. Specifically, Big Data is related to data creation, retrieval, manipulation and analysis of data that is exceptional in terms of volume, velocity and variety: - 1. Volume – Facebook consumes more than 500 TB of data in one day. Google receives 2 million search queries per minute. 40 terabytes of data is generated every second from nuclear physics experiments at the Large Hadron Collider at CERN. This volume presents the most immediate challenge to traditional IT structures. It demands scalable storage, and a distributed approach to querying. 2. Velocity – It represents the frequency of data processing or data generation. Many MNCs and organizations have capturing click streams of data from websites (Google, Yahoo, Facebook, Microsoft, etc.), using that streaming data these corporations make purchase recommendations in form of ads to web visitors. Streaming data also have to make sense to analysis that goes with it, at the same time it also have to produce results and take actions – all in real time. 3. Variety – Big data is not just in form of strings or numbers. It also includes 3d data, audio, video, pictures, log files, GPS data, etc. Conventional databases were designed to address smaller volumes of structured data and predictable and
  • 2. Author:- Neeraj Negi consistent data structures. With increasing number of users, traditional RDBMS has become liability for organizations, making it harder to serve their users. Every enterprise needs to understand Big data, and how it affects them. Standard tools and procedures are not designed to analyze and search massive datasets. Big Data requires exceptional technology to efficiently process large amount of data in acceptable amount of time lapse. Technologies like massively parallel processing databases, search-based applications, data mining grids, distributed file systems and databases, cloud based infrastructure are suitable.
  • 3. Author:- Neeraj Negi Big Data Softwares:- 1. Hadoop - Apache Foundation. It is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. Hadoop makes it possible to run applications on systems with thousands of nodes involving thousands of terabytes. Rather than relying on high-end hardware, the resiliency of these clusters comes from the software’s ability to detect and handle failures at the application layer. The Hadoop framework is used by major players including Google, Yahoo and IBM, largely for applications involving search engines and advertising. The preferred operating systems are Windows and Linux but Hadoop can also work with BSD and OS X. 2. MongoDB - MongoDB, Inc. It is a document-oriented database system classified as NoSQL* database. MySQL is written using SQL queries, while MongoDB is focused on BSON (Binary JSON**). It is a handy tool for smaller database requirements. MongoDB supports complex operations like join, indexing much easily and efficiently as compared to traditional RDBMSs *A NoSQL or Not Only SQL database provides a mechanism for storage and retrieval of data that is modeled in means other than the tabular relations used in relational databases. **JSON an open standard format that uses human-readable text to transmit data objects consisting of attribute–value pairs.
  • 4. Author:- Neeraj Negi 3. Splunk - Splunk Inc. Splunk is an advanced IT search tool that offers users, administrators, and developers the ability to instantly search all data generated by applications, servers, and network devices in the IT infrastructure. It generates reports, graphs, alerts and visualizations from the data which it captures and correlates in a repository. Splunk turns machine data into valuable insights no matter what business you're in.