Hello,
Any help would be appreciated.
I have the following setup for High Availability:
NN1 (active) + ZKFC1
NN2 (standby) + ZKFC2
Then I kill the active NameNode with kill -9 on the NN1 process.
NN2 stays on standby instead of changing to ACTIVE status.
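One way to double-check the state after the kill is to probe the NameNode RPC port (8020 in this setup) on each host. The following is only a minimal Python sketch of such a probe; the function name rpc_port_open is hypothetical and is not part of Hadoop, and the host names are placeholders to be replaced with the real NN1 and NN2 addresses.

```python
import socket


def rpc_port_open(host: str, port: int = 8020, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and attempts a TCP connect.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


# Example usage (hypothetical host names, substitute your own):
#   rpc_port_open("nn1.example.com")   # expected False once NN1 is killed
#   rpc_port_open("nn2.example.com")   # expected True while NN2 is running
```

A refused connection on NN1 only confirms that the process is gone; it says nothing about whether NN2 has been promoted, so it complements the failover controller logs rather than replacing them.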
I see the error below in the log of the Failover Controller ZKFC1, which runs on NN1.
The error is printed in the log endlessly:
6:45:51.232 PM INFO org.apache.hadoop.ipc.Client
Retrying connect to server: vm-F0CD-5B46.nam.nsroot.net/10.49.216.121:8020. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=1, sleepTime=1 SECONDS)
6:45:51.233 PM WARN org.apache.hadoop.ha.HealthMonitor
Transport-level exception trying to monitor health of NameNode at vm-F0CD-5B46.nam.nsroot.net/10.49.216.121:8020: Call From vm-F0CD-5B46.nam.nsroot.net/10.49.216.121 to vm-F0CD-5B46.nam.nsroot.net:8020 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: http://wiki.apache.org/hadoop/ConnectionRefused
6:45:53.237 PM INFO org.apache.hadoop.ipc.Client
Retrying connect to server: vm-F0CD-5B46.nam.nsroot.net/10.49.216.121:8020. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=1, sleepTime=1 SECONDS)
6:45:53.237 PM WARN org
Thanks
Madhu