Java/J2EE Capabilities
What is Apache Hadoop?
• A data management framework for Big Data
• Open source software for distributed processing of large data sets
• Offers distributed parallel processing that scales from a single server to many machines
• Processes and analyzes thousands of terabytes of data
• A framework suited to increasing business efficiency and maximizing ROI
• Latest release: 2.6.0, on 18 November 2014
Main Modules of Hadoop
• Hadoop Common
• HDFS (Hadoop Distributed File System)
• Hadoop YARN
• Hadoop MapReduce
Main Modules of Hadoop (contd.)
• Hadoop Common
  Common utilities that support the other Hadoop modules and subprojects
  Includes file system, RPC and serialization libraries
• Hadoop Distributed File System (HDFS)
  Distributed file system that gives applications access to their data
  Spans all nodes in a Hadoop cluster, linking them into one large file system
  Java based, providing scalable and reliable data storage
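As a rough illustration of how a file system can span several nodes, the sketch below cuts a file into fixed-size blocks and copies each block to more than one node. It is a toy model under stated assumptions, not the HDFS API: the function names, the tiny block size, the node names and the round-robin placement are all hypothetical (real HDFS uses much larger blocks and a rack-aware placement policy).

```python
from itertools import cycle


def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut a byte string into fixed-size blocks (the last one may be shorter)."""
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int) -> dict[int, list[str]]:
    """Assign every block to `replication` distinct nodes, round-robin.

    Hypothetical placement rule chosen for simplicity; assumes node names are unique.
    """
    if replication > len(nodes):
        raise ValueError("replication factor exceeds number of nodes")
    ring = cycle(nodes)
    return {
        block_id: [next(ring) for _ in range(replication)]
        for block_id in range(num_blocks)
    }


# Toy input: a 10-byte "file", 4-byte blocks, 4 nodes, 3 copies of each block.
data = b"abcdefghij"
blocks = split_into_blocks(data, block_size=4)
placement = place_replicas(len(blocks), ["node1", "node2", "node3", "node4"], replication=3)

for block_id, holders in placement.items():
    print(block_id, blocks[block_id], holders)
```

Because every block lives on several nodes, losing one node in this model still leaves at least two copies of each block, which is the intuition behind the "reliable data storage" claim.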
Main Modules of Hadoop (contd.)
• Hadoop YARN
  Handles job scheduling and cluster resource management
  Splits the two roles of the JobTracker, resource management and job scheduling, into separate daemons
• Hadoop MapReduce
  System for parallel processing of large data sets
  Framework that assigns work to the nodes in a cluster
  Lets applications process large amounts of data reliably on multiple nodes of hardware
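The MapReduce programming model can be sketched with a word count. The example below runs in a single process and only mimics the map, shuffle and reduce steps; it does not use Hadoop, and the function names are hypothetical. On a cluster, the framework would run many map and reduce tasks on different nodes.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: collapse the values of one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over an iterable of lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


counts = word_count(["big data big cluster", "data node"])
print(counts)
```

Each call to `map_phase` and `reduce_phase` depends only on its own input, which is what allows a framework to spread those calls over many nodes.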
Other Hadoop Related Projects at Apache
• Avro
• Cassandra
• HBase
• Hive
• Pig
• Spark
• Ambari
• Chukwa
• Mahout
• Tez
• ZooKeeper
Why Hadoop?
• Next-generation real-time analytics
• Rich ecosystem
• Scale-out storage
• Reduced cost of ownership
• Scalability, flexibility and reliability
• Fault tolerance
• Simple programming models
Looking forward to a mutually beneficial association.
Assuring you of our best services always.
SPEC INDIA
"SPEC House", Parth Complex, Swastik Cross Road, Navrangpura, Ahmedabad-380 009, INDIA.
Tel.: +91-79-26404031 to 34
VoIP: +1-908-450-9862
Instant Messengers: spec.bd | spec_india | bd.spec | specindia2009 | specindia.bd
e-mail: lead@spec-india.com
URL: http://www.spec-india.com
THANK YOU

Apache Hadoop – The Big Name In The Big Data World
