Since file metadata is stored in memory, the amount of memory in the NameNodes defines the number of files that can be available on an Hadoop cluster. To overcome the limitation of a single NameNode memory and being able to scale the name service horizontally, Hadoop has introduced HDFS Federation, which is based on multiple independent NameNodes/namespaces. Following are the main benefits of HDFS Federation:
Under federation, each namenode manages a namespace volume, which is made up of the metadata for the namespace, and a block pool containing all the blocks for the files in the namespace. Namespace volumes are independent of each other, which means namenodes do not communicate with one another, and furthermore the failure of one namenode does not affect the availability of the namespaces managed by other namenodes.
Although HDFS Federation solves the problem of HDFS scalability, it does not solve the NameNode reliability issue.