Hadoopppt.pptx

ssuser552a8f

Hadoop

Technology

• Introduction
• What is Hadoop?
• Hadoop Applications
• Hadoop Architecture
• Importance
• Advantages
• Disadvantages
• When to use Hadoop?
• Reference

• Hadoop is an open-source Apache framework, written in Java, that allows distributed processing of large datasets across clusters of computers using simple programming models.
• An application built on the Hadoop framework runs in an environment that provides distributed storage and computation across clusters of computers.

• Hadoop began as a sub-project of Lucene (a collection of industrial-strength search tools) and is now a top-level project under the umbrella of the Apache Software Foundation.
• Hadoop parallelizes data processing across many nodes (computers) in a compute cluster, speeding up large computations and hiding I/O latency through increased concurrency.
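The parallel processing described above can be sketched as a tiny in-memory word count. This is only an illustration of the map, shuffle, and reduce idea, not Hadoop's actual API: the function names are hypothetical, and a thread pool stands in for the nodes of a cluster.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_phase(chunk):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word.lower(), 1) for line in chunk for word in line.split()]


def shuffle(mapped_outputs):
    """Shuffle: group all emitted values by key across every mapper's output."""
    groups = defaultdict(list)
    for pairs in mapped_outputs:
        for key, value in pairs:
            groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: collapse each key's list of values into a single count."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines, num_workers=3):
    # Split the input into one chunk per simulated node (assumption: threads
    # stand in for cluster nodes; a real cluster splits files into blocks).
    chunks = [lines[i::num_workers] for i in range(num_workers)]
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        mapped = list(pool.map(map_phase, chunks))
    return reduce_phase(shuffle(mapped))


sample = ["hadoop stores data", "hadoop processes data", "data is big"]
counts = word_count(sample)
```

Each chunk is mapped independently, which is what lets the work spread across machines; only the shuffle step needs to see every mapper's output.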

• Making Hadoop Applications More Widely Accessible
• A Graphical Abstraction Layer on Top of Hadoop Applications

• Ability to store and process huge amounts of any kind of data, quickly
• Computing power
• Fault tolerance
• Flexibility
• Low cost
• Scalability

• Scalable
• Cost effective
• Flexible
• Fast
• Resilient to failure

• Security Concerns
• Vulnerable By Nature
• Not Fit for Small Data
• Potential Stability Issues
• General Limitations

• Hadoop Common (formerly Hadoop Core)
• Hadoop MapReduce
• Hadoop YARN (MapReduce 2.0)
• Hadoop Distributed File System (HDFS)
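To give a feel for how HDFS spreads a file across a cluster, here is a rough back-of-the-envelope sketch. It assumes a 128 MB block size and a replication factor of 3, which are the usual defaults in recent Hadoop releases but are configurable per cluster; the function name is hypothetical and nothing here calls Hadoop itself.

```python
import math

BLOCK_SIZE_MB = 128  # assumption: typical default block size
REPLICATION = 3      # assumption: typical default replication factor


def hdfs_footprint(file_size_mb, block_size_mb=BLOCK_SIZE_MB, replication=REPLICATION):
    """Return (blocks, block_replicas, raw_storage_mb) for a single file."""
    # A file is cut into fixed-size blocks; the last block may be partial.
    blocks = math.ceil(file_size_mb / block_size_mb)
    # Every block is stored on several nodes, so losing one node loses no data.
    block_replicas = blocks * replication
    # A partial last block only occupies its actual size on disk.
    raw_storage_mb = file_size_mb * replication
    return blocks, block_replicas, raw_storage_mb


blocks, replicas, raw_mb = hdfs_footprint(1000)
```

Under these assumptions a 1000 MB file becomes 8 blocks, stored as 24 block replicas and using about 3000 MB of raw disk across the cluster.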

• Ambari, ZooKeeper (management and monitoring)
• HBase, Cassandra (databases)
• Hive, Pig (data warehouse and query language)
• Mahout (machine learning)
• Chukwa, Avro, Oozie, Giraph, and many more

• Generally, whenever “standard tools” no longer work because of sheer data size (rule of thumb: if your data fits on a regular hard drive, you're better off sticking to Python/SQL/Bash/etc.!)
• Aggregation across large data sets: use the power of Reducers!
• Large-scale ETL (extract, transform, load) operations
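The aggregation case above can be sketched as a reducer in the style used by Hadoop Streaming, where a reducer receives tab-separated key/value lines already sorted by key. The sample data and function names below are hypothetical, and this is a minimal sketch rather than a production job.

```python
from itertools import groupby


def parse(line):
    """Split one tab-separated 'key<TAB>value' record."""
    key, value = line.rstrip("\n").split("\t", 1)
    return key, float(value)


def reduce_sorted(lines):
    """Sum values per key, assuming the lines arrive already sorted by key."""
    records = (parse(line) for line in lines if line.strip())
    # groupby only merges adjacent equal keys, which is why sorting matters.
    for key, group in groupby(records, key=lambda kv: kv[0]):
        yield key, sum(value for _, value in group)


# Hypothetical, pre-sorted mapper output: region<TAB>sale amount
mapper_output = [
    "east\t10.0\n",
    "east\t5.5\n",
    "north\t7.0\n",
    "west\t1.0\n",
    "west\t2.0\n",
]
totals = dict(reduce_sorted(mapper_output))
```

In an actual streaming job the same function would typically be fed from standard input instead of a list, with the framework handling the sort between the map and reduce steps. Because the reducer holds only one key's running total at a time, it can aggregate far more data than fits in memory.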

• www.google.com
• www.wikipedia.com
• www.studymafia.org
• www.projectsreports.org


1. Seminar on Hadoop (www.studymafia.org)

15. Thank You ALL
