Hi,
I tried the new snapshotfeatures with a cluster installed with the brand
new version of Cloudera Manager 4.6 like so, from the hbase shell logged in
as user hdfs:
hbase shell> snapshot 'myTable', 'myTable-snapshot'
but got a timeout. Has anybody any idea why this happend, respective what
is wrong. The only thing i changed in the cluster is to set the
snapshot.enabled property to true.
Thanks
Karl-Heinz
Here is the stack trace from the HBaes-master:
--------------------------------------------------------------------
2013-06-24 16:16:13,939 INFO
org.apache.hadoop.hbase.master.snapshot.TakeSnapshotHandler:
Running FLUSH table snapshot myTable-snapshot C_M_SNAPSHOT_TABLE on
table myTable
2013-06-24 16:16:14,021 INFO org.apache.hadoop.hbase.procedure.Procedure:
Starting procedure 'myTable-snapshot'
2013-06-24 16:17:14,037 ERROR org.apache.hadoop.hbase.procedure.Procedure:
Procedure 'myTable-snapshot' execution failed!
org.apache.hadoop.hbase.errorhandling.TimeoutException via
timer-java.util.Timer@3feef1eb:org.apache.hadoop.hbase.errorhandling.TimeoutException:
Timeout elapsed! Source:Timeout caused Foreign Exception
Start:1372083374021, End:1372083434021, diff:60000, max:60000 ms
at
org.apache.hadoop.hbase.errorhandling.ForeignExceptionDispatcher.rethrowException(ForeignExceptionDispatcher.java:85)
at
org.apache.hadoop.hbase.procedure.Procedure.waitForLatch(Procedure.java:369)
...
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout
elapsed!
Source:Timeout caused Foreign Exception
Start:1372083374021, End:1372083434021, diff:60000, max:60000 ms
at
org.apache.hadoop.hbase.errorhandling.TimeoutExceptionInjector$1.run(TimeoutExceptionInjector.java:68)
at java.util.TimerThread.mainLoop(Timer.java:512)
at java.util.TimerThread.run(Timer.java:462)
2013-06-24 16:17:14,038 INFO
org.apache.hadoop.hbase.procedure.ZKProcedureUtil:
Clearing all znodes for procedure myTable-snapshotincluding nodes
/hbase/online-snapshot/acquired /hbase/online-snapshot/reached
/hbase/online-snapshot/abort
2013-06-24 16:17:14,045 WARN
org.apache.hadoop.hbase.errorhandling.TimeoutExceptionInjector:
Timer already marked completed, ignoring!
2013-06-24 16:17:14,060 ERROR
org.apache.hadoop.hbase.master.snapshot.TakeSnapshotHandler:
Failed taking snapshot { ss=myTable-snapshot table=myTable type=FLUSH }
due to exception:
No region directory found for region:
{NAME => 'myTable,,1371712315618.fce74b75bbff1b61d2ae1bc604b46877.',
STARTKEY => '', ENDKEY => '5',
ENCODED => fce74b75bbff1b61d2ae1bc604b46877,}
org.apache.hadoop.hbase.snapshot.CorruptedSnapshotException: No region
directory found for region:
{NAME => 'myTable,,1371712315618.fce74b75bbff1b61d2ae1bc604b46877.',
STARTKEY => '',
ENDKEY => '5',
ENCODED => fce74b75bbff1b61d2ae1bc604b46877,}