Getting Started with HDFS

Posted at  08:51  |  in  HDFS

      Hadoop is most popularly known for its storage.  It uses HDFS (HadoopDistribution File System) for storing such huge amount of data. HDFS has some unique features which made Hadoop to stand first to store BIGDATA. We will go through those features in our further posts.

       HDFS is a clustered structure which consists of one Master Node and some Slave Nodes. The following is the typical structure of HDFS Cluster.


HDFS cluster
    

      The above is the 5 node cluster, in which Master node is referred as Name Node and Slave node is referred as Data Node. Each node should consists of two components (each node should be able to store the data as well as process the data), then only the node is the part of the HDFS Cluster. No node is supposed only for storage and no one is supposed only for processing. There is no limit for the number of nodes in the cluster, it is based on the amount of data stored on top of HDFS Cluster. As the data increases, the number of nodes also gets increased. 



 

Share this post

About-Privacy Policy-Contact us
Copyright © 2013 Hadoop Tutor. Blogger Template by Bloggertheme9
Proudly Powered by Blogger.
back to top