During a general routine MapReduce job, my map tasks start to fail on only
one regionserver until the entire job fails. The error tasks are getting
is related to Lease Timeouts, so I decided to raise my timeout to 600
seconds (via CDH Manager) and ran it again. I once again found the same
issue persisting.
I investigated the regionserver logs and found something very odd:
Aborting call [...truncated...] after 61741 ms, since caller disconnected
Notice that the call is being aborted after only 60 seconds. 1/10 the time
I had just set as the lease timeout.
I looked inside my hbase-site.xml in the path "/etc/hbase/conf" to find
that no lease.period was actually set there.
Am I missing something? Or is this a bug of sorts?
Environment:
Cloudera Manager 4.5.2 (#327 built by jenkins on 20130429-1453 git:
16cab2c7b76194b7877d64a4215494daa387a266)
CDH 4.2.1-1.cdh4.2.1.p0.5