10-22-2013 05:11 PM
When using curl to put data via the webhdfs restful api to a cluster with a high availability name node that has failed over there is an error message that the name node is on standby and the write fails.
Basically the curl put request goes directly to the standby name node and it gets a referral to a data node that also includes itself as the name node rather than the active namenode. It would be nice the referral included the active name node so the subsequent write would work even though the original request went to the standby node.
We are using curl because it simpler and a lighter weight than running a full hdfs client or using flume. Obviously we can do our own failover on the client, although that needs knowledge of name nodes. Is there a better way to do this using the webhfds restful api?
12-28-2013 09:13 PM