Implementing CTE on HDFS on a Single Host
It is possible, though not recommended, that an HDFS NameNode and DataNode exist as separate processes on the same host. If this is your deployment, use the following CTE deployment guidelines:
-
Configure the HDFS NameNodes (see Configure NameNodes):
The directories specified by
dfs.datanode.data.dir
already exist on the local file system so you do not have to create them. -
Pass the following information to the Administrator:
-
The
dfs.datanode.data.dir
directory list and instructions to create a GuardPoint for the HDFS Host Group on each of these directories. -
Instructions to add the NameNodes IP addresses, or host names, to the HDFS Host Group.
-
-
Create an HDFS host/client group and host/client group GuardPoint:
-
Administrator must create an HDFS Host Group to contain the HDFS nodes.
-
The Administrator must create a GuardPoint for the Host Group on each of the directories specified by
dfs.datanode.data.dir
-
-
Take the DataNode offline and perform a data transformation.
-
Add the NameNode/DataNode host/client to the host/client group.