Hadoop is most popularly known for its storage. It uses HDFS (HadoopDistribution File System) for storing such huge amount of data. HDFS has some unique features which made Hadoop to stand first to store BIGDATA. We will go through those features in our further posts.
HDFS is a clustered structure which consists of one Master
Node and some Slave Nodes. The following is the typical structure of HDFS
Cluster.
The above is the
5 node cluster, in which Master node is referred as Name Node and Slave node is
referred as Data Node. Each node should consists of two components (each node
should be able to store the data as well as process the data), then only the
node is the part of the HDFS Cluster. No node is supposed only for storage and
no one is supposed only for processing. There is no limit for the number of
nodes in the cluster, it is based on the amount of data stored on top of HDFS
Cluster. As the data increases, the number of nodes also gets increased.