Hadoop hdfs
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

Hadoop hdfs

on

  • 212 views

Distributed File System

Distributed File System

Statistics

Views

Total Views
212
Views on SlideShare
212
Embed Views
0

Actions

Likes
0
Downloads
17
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Hadoop hdfs Presentation Transcript

  • 1. BE Seminar Topic on- HADOOP HDFS Presented By – SUDIPTA GHOSH (2010-3016) University Institute Of Technology , Burdwan University
  • 2. What is Big Data ?
  • 3. Why DFS ?
  • 4. What is Hadoop ? Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of commodity computers using a simple programming model.
  • 5. Companies using Hadoop  Yahoo  Google  Facebook  Amazon  AOL  IBM  LinkedIn  And many more at http://wiki.apache.org/hadoop/PoweredBy
  • 6. What is HDFS ? • HDFS – Hadoop Distributed File System • HDFS is a file system designed for storing very large files with streaming data access patterns, running clusters on commodity hardware.
  • 7. Why HDFS ? – – – – – Highly fault-tolerant High throughput Suitable for applications with large data sets Streaming access to file system data Can be built out of commodity hardware
  • 8. Main Components Of Hadoop : Name Node:  master of the system  maintains and manages the blocks which are present in the data node Data Nodes:  Slaves which are deployed in each machine and provide the actual storage  Responsible for serving read and write requests for the client
  • 9. HDFS Architecture
  • 10. Job Tracker
  • 11. Job Tracker( Contnd.)
  • 12. Running Jobs On Hadoop
  • 13. Areas Where Hadoop Is Not A Good Fit Today : • Low-Latency data access • Lots of small files • Multiple writers, arbitrary file modifications
  • 14. References 1. http://hadoop.apache.org/core/docs 2. https://www.youtube.com/watch?v=A02SRd yoshM
  • 15. THANK YOU