Getting Started with HDFS

Hadoop is most popularly known for its storage. It uses HDFS (HadoopDistribution File System) for storing such huge amount of data. HDFS has some unique features which made Hadoop to stand first to store BIGDATA. We will go through those features in our further posts.

HDFS is a clustered structure which consists of one Master Node and some Slave Nodes. The following is the typical structure of HDFS Cluster.

The above is the 5 node cluster, in which Master node is referred as Name Node and Slave node is referred as Data Node. Each node should consists of two components (each node should be able to store the data as well as process the data), then only the node is the part of the HDFS Cluster. No node is supposed only for storage and no one is supposed only for processing. There is no limit for the number of nodes in the cluster, it is based on the amount of data stored on top of HDFS Cluster. As the data increases, the number of nodes also gets increased.

Getting Started with HDFS

Search This Blog

Follow in Facebook

Popular Posts

About Me

Getting Started with HDFS

Share this post

Search This Blog

Follow in Facebook

Popular Posts

About Me