Hairong Kuang commented on HDFS-1496:
-------------------------------------
This turns out to be a bug in storage directory restoration. Image validation exposes the error.
Currently NN uses rollFSEdits to trigger storage directory recovery. The recovery may trigger a saving of the namespace to the newly restored directory which as a result changes in memory image digest. However later on image & edits were fetched from an old storage directory, thus causing the checksum mismatch.
The problem with this storage restoration scheme is that it makes the on-disk state of all storage directories inconsistent.
TestStorageRestore is failing after HDFS-903 fix
------------------------------------------------
Key: HDFS-1496
URL: https://issues.apache.org/jira/browse/HDFS-1496
Project: Hadoop HDFS
Issue Type: Bug
Components: test
Reporter: Konstantin Boudnik
Assignee: Hairong Kuang
TestStorageRestore seems to be failing after HDFS-903 commit. Running git bisect confirms it.
--------------------------------------------------
Key: HDFS-1496
URL: https://issues.apache.org/jira/browse/HDFS-1496
Project: Hadoop HDFS
Issue Type: Bug
Components: test
Reporter: Konstantin Boudnik
Assignee: Hairong Kuang
TestStorageRestore seems to be failing after HDFS-903 commit. Running git bisect confirms it.
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
